Основен преглед
Searchengineinclusionreferstothespecificnumberofpagesincludedinawebsitebysearchengines.Themoreincluded,thefastertheindexingtime,whichprovesthatthiswebsiteismorefriendlytosearchengines..
Themorecommonlyusedsearchenginesincludebaidu(Baidu),google(Google),yahoo(Yahoo),sogou,youdao(有道),soso(searchsearch),bing(必应),360(360).
Принцип на изключване
Съберете URL адресите на уеб страниците, които да бъдат индексирани
ThenumberofwebpagesontheInternetisabsolutelyastronomical,andtherearecountlessnewwebpageseveryday.Searchenginesneedtofindtheobjectstobeindexedfirst.
AsfarasGoogleisconcerned,althoughthereiscontroversyoverwhetherthereisadifferencebetweenDeepBotandFreshBotonGoogleBot-asforwhethertocallthesetwonames,therearedifferentopinions.Ofcourse,thenameitselfisnotimportant-atleastfornowuntil.
ThemainstreamviewisthatinGoogle’srobots,thereareindeedquiteafewrobotsthatprepare"materials"fortheactualindexedpages—let'scallitFreshBothere.
——TheirtaskistoconstantlyscantheInterneteverydaytodiscoverandmaintainahugelistofURLsforDeepBottouse.Inotherwords,whenitvisitsandreadsoneofitswebpages,thepurposeisnotItisaboutindexingthispage,butfindingallthelinksinthispage.
——Разбира се, това изглежда противоречиво неефективност, което е малко невероятно. Въпреки това, можем просто да съдим по следния начин: FreshBotis не е "изключителен" при сканиране на уеб страници.
Inotherwords,multiplerobotslocatedindifferentGoogledatacentersmayvisitthesamepageinashortperiodoftime,suchasonedayorevenanhour,andDeepBotisindexingandcachingWhenthepageisnotsimilar,itwillnotappear.
Thatis,Googlewillrestrictrobotsinacertaindatacentertocompletethiswork,insteadoftwodatacentersindexingthesameversionofthewebpageatthesametime,ifthereisnoflawinthisstatement,Itseemsthatfromtheserveraccesslog,youcanoftenseethatGoogleBotsoriginatingfromdifferentIPshavevisitedthesamewebpagemultipletimesinashortperiodoftimetoprovetheexistenceofFreshBot.
Therefore,sometimesifyoufindthatGoogleBotfrequentlyvisitsthewebsite,don’tbetoohappytooearly.Maybeit’snotindexingwebpagesatallbutjustscanningURLs.
TheinformationrecordedbyFreshBotincludestheurlofthewebpage,TimeStamp(thetimestampofwhenthewebpagewascreatedorupdated),andtheheaderinformationofthewebpage(Note:Thispointiscontroversial,andmanypeoplebelievethatFreshBotwillnotreadit.Togettheinformationofthetargetwebpage,DeepBotwillcompletethispartofthework.
However,theauthorpreferstheformerstatement,becauseintheurllistsubmittedbyFreshBottoDeepBot,thewebsitesettingwillbeprohibited.Indexedandincludedpagesareexcludedtoimproveefficiency.Inadditiontorobots.txt,aconsiderablepartofthewebsiteisimplementedthroughthe"noindex"inthematatagwhensettingupthistypeofwebsite.TheheadofthetargetpagethatdoesnotreadthetargetpageseemstobeIfthisisnotpossible),ifthewebpageisnotaccessible,suchasnetworkinterruptionorserverfailure,FreshBotwillrecordtheurlandtryagainattheappropriatetime,butwillnotaddittotheurlsubmittedtoDeepBotuntiltheurlisaccessibleList.
Ingeneral,FreshBotoccupiesarelativelysmallamountofserverbandwidthandresources.Finally,FreshBotclassifiestherecordedinformationaccordingtodifferentprioritiesandsubmitsittoDeepBot.Accordingtothedifferentpriorities,therearemainlythefollowing:
A:Нова уеб страница;
B:Стара уебстраница/новTimeStamp,това е,има актуализирана уебстраница;
C:Use301/302redirectionwebpage;
D:ComplexdynamicURL:suchasusingmultipleparametersDynamicURL,Googlemayneedadditionalworktocorrectlyanalyzeitscontent.——WiththeimprovementofGoogle’sabilitytosupportdynamicwebpages,thisclassificationmayhavebeencancelled;
E:Други видове файлове, като връзки към PDF и DOC файлове, индексиране на тези файлове и може да се изисква допълнителна работа;
F:Oldwebpage/oldTimeStamp,thatis,webpagethathasnotbeenupdated.NotethatthetimestamphereisnotbasedonthedatedisplayedintheGooglesearchresults,butwithGoogleDatecomparisonintheindexdatabase;
G: Wrongurl, това е страницата, която връща 404 отговор при достъп.
ThepriorityisarrangedintheorderfromAtoG,decreasinginorder.Itneedstobeemphasizedthattheprioritymentionedhereisrelative.Forexample,itisalsoanewwebpage.Accordingtothequalityandquantityofthelinkstoit,thepriorityisalsoverydifferent.Ithaslinksfromrelatedauthoritativewebsites.'Spageshaveahigherpriority.
Inaddition,thepriorityreferredtohereisonlyforpageswithinthesamewebsite.Infact,differentwebsiteshavedifferentpriorities.Inotherwords,forpagesinauthoritativewebsites,eventhelowestpriorityLevel404urlmayalsohaveadvantagesovermanyothersiteswiththehighestprioritynewlycreatedwebpages.
Индекс и включване на уеб страници
Onlythenentertheactualprocessofindexingandinclusionofwebpages.Ascanbeseenfromtheaboveintroduction,theURLlistsubmittedbyFreshBotisquitelarge.Accordingtothelanguage,websitelocation,etc.,theindexingofspecificwebsiteswillbeallocatedtodifferentdatacenters.
Theentireindexingprocess,duetothehugeamountofdata,maytakeseveralweeksorevenlongertocomplete.
Asmentionedabove,DeepBotwillfirstindexhigherprioritywebsites/webpages.Thehigherthepriority,thefasteritwillappearintheGoogleindexdatabaseandeventuallyappearintheGooglesearchresultspage..
Foranewwebpage,aslongasitentersthisstage,eveniftheentireindexingprocessisnotcompleted,thecorrespondingwebpagehasthepossibilitytoappearintheGoogleindexlibrary.Ibelievemanyfriendsuse"site"inGoogle."Whensearching,Ioftenseepagesmarkedassupplementaryresultsthatonlydisplaythewebpageurloronlydisplaythepagetitleandurlwithoutdescription.Thisisthenormalresultofthewebpageatthisstage.
WhenGoogleactuallyreads,analyzes,andcachesthispage,itwillpickoutthesupplementaryresultsanddisplaynormalinformation.
——Ofcourse,thepremiseisthatthewebpagehasenoughlinks,especiallylinksfromauthoritativewebsites,andtherearenorecordsthatarethesameorsimilartothewebpagecontentintheindexlibrary(DuplicateContentfiltering).
FordynamicURLs,althoughGooglenowclaimsthattherearenoobstaclestoitsprocessing,theobservablefactsstillshowthatdynamicURLsaremorelikelytoappearinsupplementaryresultsthanstaticURLs.Webpagesoftenrequiremoreandmorevaluablelinkstoescapefromsupplementaryresults.
Forthe"F"categoryabove,thatis,webpagesthathavenotbeenupdated,DeepBotwillcompareitstimestampwiththedateintheGoogleindexdatabasetoconfirmthatalthoughthecorrespondingpageinformationinthesearchresultsmaybeavailableinthefutureUpdatebutaslongasthelatestversionisindexed-considerthesituationofmultipleupdatesandmodificationsofthewebpage-;asforthe"G"category,whichis404url,itwilllookupwhetherthereisacorrespondingrecordintheindexlibrary,anddeleteitifithas.
Синхронизация между центрове за данни
Aswementionedearlier,whenDeepBotindexesawebpage,itwillbecompletedbyaspecificdatacenterinsteadofmultipledatacentersreadingatthesametime.Thewebpageobtainsthelatestversionofthewebpagerespectively.Inthisway,aftertheindexingprocessiscompleted,adatasynchronizationprocessisrequiredtoupdatethelatestversionofthewebpageinmultipledatacenters.
ThisisthefamousGoogleDancebefore.However,aftertheBigDaddyupdate,thesynchronizationbetweendatacentersisnolongerconcentratedinaspecifictimeperiodlikethat,butinacontinuousandmoretime-sensitivemanner.
Влияние върху включването
Заглавие на сайта
Thewritingofsitetitle,description,andkeywordshasalwaysbeenaverycautiousthinginthemindsofwebmasters.Itisdirectlyrelatedtotherankingandtrafficofthewebsite,andthesethreetagscannotbeeasilymodifiedafterthewebsiteisonline.Thisrequireswebmasterstoprepareinadvance.Ifyoudonotconsideritinadvanceandmodifyitaftergoingonline,BaiduwillthinkyouYourwebsiteisunstable,youmodifythekeytagsassoonasyougoonline,andyouaresuspectedofcheating,andthenthrowyourwebsiteintothesandbox,andslowlyinvestigate.Atthistime,ifyouwantBaidutoincludethewebsiteatleastonemonthlater,andguaranteethisperiodoftimeAddhigh-qualityarticlestothewebsiteeveryday.
Външни връзки
Addingexternallinkscanallowsearchenginestoefficientlycrawlandincludewebpages.
Съдържание на уебсайта
Originalwebsitecontentiseasiertobeincluded,andmethodssuchascollectingandcopyingotherpeople'sinformationaregenerallydifficulttoinclude.
Thebiggestadvantageoforiginalarticlesisthattheycanservemultiplepurposes,increasetheprobabilityofawebsitebeingincludedbysearchengines,andimprovewebsiteoptimizationrankings.
Характеристики на Baidu
1.TheinformationprocessingmethodbasedonwordcombinationcleverlysolvestheproblemofunderstandingChineseinformation,andgreatlyimprovestheaccuracyandrecallofsearch.
2. Поддръжка на основно китайско кодиране, включително gbk (спецификация на разширение на код за китайски знаци), gb2312 (опростен), big5 (традиционен) и може да се преобразува между различни кодировки."
3.Theintelligentrelevancealgorithmusesacombinationofcontent-basedandhyperlink-basedanalysismethodsforrelevanceevaluation,whichcanobjectivelyanalyzetheinformationcontainedinwebpages,therebymaximizingtherelevanceofsearchresults.
4. Тези резултати от търсенето са по-интуитивни и могат да посочват богати атрибути на страница (като заглавие, URL, време, размер, кодиране, резюме и т.н.) и да подчертават низа за заявка на потребителя, което е удобно за потребителя да прецени дали да прочете оригиналния текст.
5.Baidusearchsupportssecondarysearch,whichcancontinuetosearchinthelastsearchresults,andgraduallynarrowthesearchscopeuntilitreachesthesmallestandmostaccurateresultset.ItismoreconvenientforuserstofindinthemassiveinformationThecontentthatyouarereallyinterestedin.
6.Theintelligentrecommendationtechnologyofrelatedsearchtermswillprompttherelatedsearchtermsaftertheusersearchesforthefirsttimetohelpusersfindmorerelevantresults.StatisticsshowthatitcanpromotethesearchIncreasedvolumeby10-20%.
7.High-performanceserversandlocalizedserversusemulti-threadingtechnology,efficientsearchalgorithms,stableunixplatforms,andlocalizedserverstoensurethefastestresponseSpeed.BaidusearchengineprovidessearchservicesinChina,whichcangreatlyshortentheresponsetimeofretrieval(theaverageresponsetimeofaretrievalislessthan0.5seconds).
8.Itcanprovidemultipleservicemethodswithin7days.ItistheChinesesearchenginewiththefastestupdatetimeandthelargestamountofdataatpresent.9.Thesearchresultoutputcategoryaggregationsupportscontentaggregation,websiteaggregation,contentaggregation+websitecategory.Avarietyofmethodssuchasgathering.Supportuserstoselecttimerangeandimproveuserretrievalefficiency.
10.Intelligentandscalablesearchtechnologyhastheworld’slargestChineseinformationdatabase,providinguserswiththemostaccurate,Themostextensiveandtime-sensitiveinformationprovidesasolidfoundation.
11.Theoptimizeddistributedstructureofstructureandalgorithm,thewell-designedoptimizationalgorithm,andthefault-tolerantdesignensurethesystem'shighperformanceunderalargenumberofvisits.Usability,highscalability,highperformanceandhighstability.
12.Highconfigurabilityenablesthesearchservicetomeettheneedsofdifferentusers.
13.AdvancedwebpagedynamicsummaryDisplaytechnology.
14.Уникална снимка на Baidus.
15.Поддържа разнообразие от синтаксис за разширено търсене, което прави заявката на потребителя по-ефективна и по-точни резултати."+"(и),"-"(не),"|"(или),"сайт:","домейн:","заглавие:","inurl",и друг ефективен синтаксис за търсене ще продължи да се добавя.
Увеличаване на включването
Basically,afterthesearchenginehasincludedthesite,andyoucanalreadyseethenumberofsearchenginesincluded,thehopemustbetoallowthesearchenginetoincludemorepages.IfyouwanttoincreaseThenumberofsearchenginesincluded,alargeincreaseinthecontentofthewebsiteisoneofthem.MoreneedstobedoneforthespidersofsearchenginesTheprogramcreatesagoodwebsitestructure.Toincreasethesite’sinclusionrate,youcantakethefollowingmethods:
Подобрете външната верига
TheexternalchainisagoodmedicineforSEO,whetheritistoimprovethesearchenginerankingorincreasethewebsite’sinclusionVolume,especiallyhigh-qualityexternallinks.Theworkoflinkbuildingmustaccompanythesearchengineoptimizationprogramfromthebeginningtotheend.
Допълнително оригинално съдържание
Onceoriginalcontentisincludedbysearchengines,suchcontentpagesarenotsoeasytobedeletedbysearchengines.Ifthecontentofawebsitehasahighrepetitionrate,evenafteritisincludedbysearchengines,itiseasytobecleanedupbysearchenginesonaregularbasis.Keepingacertainpercentageoforiginalcontentonthewebsitecancultivatetheweightofthewebsiteandensurethatsearchengineswillnotincludeanddeletethesepages.
Оптимизирайте структурата
Optimizetheinternallinksofthewebsite.Agoodwebsitestructurewillallowspiderstofollowthelinksandreadthecontentofthewebsitelayerbylayer.Websiteswithpoorwebsitestructurewillmakespidersfeelliketheyhaveenteredamaze.Ifyourwebsiteisverylarge,itisbesttoestablishuserexperienceapplicationssuchasclearwebsitenavigation,comprehensivesitemaps,etc.,whichcanguidetheinclusionandfacilitatetheusersofthewebsite.
ResearchCollection
Thecollectionprocedureofthesearchengineisacollectionwithonlythinkinganddistinguishingability.Let'snottreatitasasimplewebsitecontentporter.Whenitreadsyourcontent,itwilldistinguishthevalueofthesecontentandotheraspects.Asawebsiteadministrator,youhavetostudytherulesofinclusion,crawlingrules,etc.,anddealingwiththeinclusionofsearchenginesisalsoanimportantsubject.Forincreasingthenumberofpagesincludedonthewebsite,wehavetomakeourselvesmoreproactive.Inotherwords,itmeanstotaketheinitiative.Insteadofwaitingforthecollectiontocome,itisbettertoguidethecollection.
Карта на сайта
Asitemapisalsocalledasitemap.Itisapageonwhichlinkstoallpagesonthewebsiteareplaced.Whenmostpeoplecannotfindtheinformationtheyneedonthewebsite,theymayusethesitemapasaremedy.Thesearchengineindexlikesthesitemapverymuch.
Whybuildasitemap?Mostpeopleknowthatsitemapsaregoodforimprovingtheuserexperience:theyprovidedirectionstositevisitorsandhelplostvisitorsfindthepagetheywanttosee.Forsearchengineoptimization,thebenefitsofthesitemapareevenmore:
1.Providelinksforbrowsingtheentirewebsiteforsearchengines.
2.Providesomelinksforsearchenginestoincludelinkstodynamicpagesorpagesthataredifficulttoreachbyothermethods.
3. Като потенциална целева страница, тя може да бъде оптимизирана за трафик от търсене.
4.IfavisitortriestoaccessaURLthatdoesnotexistinthedomainwherethewebsiteislocated,thevisitorwillberedirectedtoanerrorpageof"Filecannotbefound",andthesitemapcanbeusedasthe"Quasi"content.
Включен новинарски сайт
Baidu не включва решението за нов сайт:
(1)ItisbesttowaitforallthecontentsofthewebsitetobecompletedbeforeuploadingtothewebsiteSpace
(2)Afterthewebsiteisuploaded,submitthewebsitetoBaidu:loginportalsofseveralmajorsearchengines
(3) Регистрирайте 3-5 акаунта в BaiduSoucang, след това предпочитайте URL адреси
(4) GotoLeshou, CapeofGoodHope и любими URL адреси от друга мрежа
(5) Отидете на BaiduTieba, A5 и друг сайт с висока тежест, за да публикувате примамка за връзки (със собствен уебсайт), за да накарате Baidu да го включи и обхожда
(6)Regularlyupdate2-5originalarticleseverydayforthefirstmonth
(7) Не използвайте SEOcheatingMethodoptimization
Basicallyfollowtheabovesteps,thehomepagecanbeincludedwithin1-30days.IfonemonthhaspassedandtheURLhasnotbeenincluded,youcantrytomodifythelayoutofthehomepage.