Търсачката е индексирана

Основен преглед

Searchengineinclusionreferstothespecificnumberofpagesincludedinawebsitebysearchengines.Themoreincluded,thefastertheindexingtime,whichprovesthatthiswebsiteismorefriendlytosearchengines..

Themorecommonlyusedsearchenginesincludebaidu(Baidu),google(Google),yahoo(Yahoo),sogou,youdao(有道),soso(searchsearch),bing(必应),360(360).

Принцип на изключване

Съберете URL адресите на уеб страниците, които да бъдат индексирани

ThenumberofwebpagesontheInternetisabsolutelyastronomical,andtherearecountlessnewwebpageseveryday.Searchenginesneedtofindtheobjectstobeindexedfirst.

AsfarasGoogleisconcerned,althoughthereiscontroversyoverwhetherthereisadifferencebetweenDeepBotandFreshBotonGoogleBot-asforwhethertocallthesetwonames,therearedifferentopinions.Ofcourse,thenameitselfisnotimportant-atleastfornowuntil.

ThemainstreamviewisthatinGoogle’srobots,thereareindeedquiteafewrobotsthatprepare"materials"fortheactualindexedpages—let'scallitFreshBothere.

——TheirtaskistoconstantlyscantheInterneteverydaytodiscoverandmaintainahugelistofURLsforDeepBottouse.Inotherwords,whenitvisitsandreadsoneofitswebpages,thepurposeisnotItisaboutindexingthispage,butfindingallthelinksinthispage.

——Разбира се, това изглежда противоречиво неефективност, което е малко невероятно. Въпреки това, можем просто да съдим по следния начин: FreshBotis не е "изключителен" при сканиране на уеб страници.

Inotherwords,multiplerobotslocatedindifferentGoogledatacentersmayvisitthesamepageinashortperiodoftime,suchasonedayorevenanhour,andDeepBotisindexingandcachingWhenthepageisnotsimilar,itwillnotappear.

Thatis,Googlewillrestrictrobotsinacertaindatacentertocompletethiswork,insteadoftwodatacentersindexingthesameversionofthewebpageatthesametime,ifthereisnoflawinthisstatement,Itseemsthatfromtheserveraccesslog,youcanoftenseethatGoogleBotsoriginatingfromdifferentIPshavevisitedthesamewebpagemultipletimesinashortperiodoftimetoprovetheexistenceofFreshBot.

Therefore,sometimesifyoufindthatGoogleBotfrequentlyvisitsthewebsite,don’tbetoohappytooearly.Maybeit’snotindexingwebpagesatallbutjustscanningURLs.

TheinformationrecordedbyFreshBotincludestheurlofthewebpage,TimeStamp(thetimestampofwhenthewebpagewascreatedorupdated),andtheheaderinformationofthewebpage(Note:Thispointiscontroversial,andmanypeoplebelievethatFreshBotwillnotreadit.Togettheinformationofthetargetwebpage,DeepBotwillcompletethispartofthework.

However,theauthorpreferstheformerstatement,becauseintheurllistsubmittedbyFreshBottoDeepBot,thewebsitesettingwillbeprohibited.Indexedandincludedpagesareexcludedtoimproveefficiency.Inadditiontorobots.txt,aconsiderablepartofthewebsiteisimplementedthroughthe"noindex"inthematatagwhensettingupthistypeofwebsite.TheheadofthetargetpagethatdoesnotreadthetargetpageseemstobeIfthisisnotpossible),ifthewebpageisnotaccessible,suchasnetworkinterruptionorserverfailure,FreshBotwillrecordtheurlandtryagainattheappropriatetime,butwillnotaddittotheurlsubmittedtoDeepBotuntiltheurlisaccessibleList.

Ingeneral,FreshBotoccupiesarelativelysmallamountofserverbandwidthandresources.Finally,FreshBotclassifiestherecordedinformationaccordingtodifferentprioritiesandsubmitsittoDeepBot.Accordingtothedifferentpriorities,therearemainlythefollowing:

A:Нова уеб страница;

B:Стара уебстраница/новTimeStamp,това е,има актуализирана уебстраница;

C:Use301/302redirectionwebpage;

D:ComplexdynamicURL:suchasusingmultipleparametersDynamicURL,Googlemayneedadditionalworktocorrectlyanalyzeitscontent.——WiththeimprovementofGoogle’sabilitytosupportdynamicwebpages,thisclassificationmayhavebeencancelled;

E:Други видове файлове, като връзки към PDF и DOC файлове, индексиране на тези файлове и може да се изисква допълнителна работа;

F:Oldwebpage/oldTimeStamp,thatis,webpagethathasnotbeenupdated.NotethatthetimestamphereisnotbasedonthedatedisplayedintheGooglesearchresults,butwithGoogleDatecomparisonintheindexdatabase;

G: Wrongurl, това е страницата, която връща 404 отговор при достъп.

ThepriorityisarrangedintheorderfromAtoG,decreasinginorder.Itneedstobeemphasizedthattheprioritymentionedhereisrelative.Forexample,itisalsoanewwebpage.Accordingtothequalityandquantityofthelinkstoit,thepriorityisalsoverydifferent.Ithaslinksfromrelatedauthoritativewebsites.'Spageshaveahigherpriority.

Inaddition,thepriorityreferredtohereisonlyforpageswithinthesamewebsite.Infact,differentwebsiteshavedifferentpriorities.Inotherwords,forpagesinauthoritativewebsites,eventhelowestpriorityLevel404url​​mayalsohaveadvantagesovermanyothersiteswiththehighestprioritynewlycreatedwebpages.

Индекс и включване на уеб страници

Onlythenentertheactualprocessofindexingandinclusionofwebpages.Ascanbeseenfromtheaboveintroduction,theURLlistsubmittedbyFreshBotisquitelarge.Accordingtothelanguage,websitelocation,etc.,theindexingofspecificwebsiteswillbeallocatedtodifferentdatacenters.

Theentireindexingprocess,duetothehugeamountofdata,maytakeseveralweeksorevenlongertocomplete.

Asmentionedabove,DeepBotwillfirstindexhigherprioritywebsites/webpages.Thehigherthepriority,thefasteritwillappearintheGoogleindexdatabaseandeventuallyappearintheGooglesearchresultspage..

Foranewwebpage,aslongasitentersthisstage,eveniftheentireindexingprocessisnotcompleted,thecorrespondingwebpagehasthepossibilitytoappearintheGoogleindexlibrary.Ibelievemanyfriendsuse"site"inGoogle."Whensearching,Ioftenseepagesmarkedassupplementaryresultsthatonlydisplaythewebpageurloronlydisplaythepagetitleandurlwithoutdescription.Thisisthenormalresultofthewebpageatthisstage.

WhenGoogleactuallyreads,analyzes,andcachesthispage,itwillpickoutthesupplementaryresultsanddisplaynormalinformation.

——Ofcourse,thepremiseisthatthewebpagehasenoughlinks,especiallylinksfromauthoritativewebsites,andtherearenorecordsthatarethesameorsimilartothewebpagecontentintheindexlibrary(DuplicateContentfiltering).

FordynamicURLs,althoughGooglenowclaimsthattherearenoobstaclestoitsprocessing,theobservablefactsstillshowthatdynamicURLsaremorelikelytoappearinsupplementaryresultsthanstaticURLs.Webpagesoftenrequiremoreandmorevaluablelinkstoescapefromsupplementaryresults.

Forthe"F"categoryabove,thatis,webpagesthathavenotbeenupdated,DeepBotwillcompareitstimestampwiththedateintheGoogleindexdatabasetoconfirmthatalthoughthecorrespondingpageinformationinthesearchresultsmaybeavailableinthefutureUpdatebutaslongasthelatestversionisindexed-considerthesituationofmultipleupdatesandmodificationsofthewebpage-;asforthe"G"category,whichis404url,itwilllookupwhetherthereisacorrespondingrecordintheindexlibrary,anddeleteitifithas.

Синхронизация между центрове за данни

Search engine indexed

Aswementionedearlier,whenDeepBotindexesawebpage,itwillbecompletedbyaspecificdatacenterinsteadofmultipledatacentersreadingatthesametime.Thewebpageobtainsthelatestversionofthewebpagerespectively.Inthisway,aftertheindexingprocessiscompleted,adatasynchronizationprocessisrequiredtoupdatethelatestversionofthewebpageinmultipledatacenters.

ThisisthefamousGoogleDancebefore.However,aftertheBigDaddyupdate,thesynchronizationbetweendatacentersisnolongerconcentratedinaspecifictimeperiodlikethat,butinacontinuousandmoretime-sensitivemanner.

Влияние върху включването

Заглавие на сайта

Thewritingofsitetitle,description,andkeywordshasalwaysbeenaverycautiousthinginthemindsofwebmasters.Itisdirectlyrelatedtotherankingandtrafficofthewebsite,andthesethreetagscannotbeeasilymodifiedafterthewebsiteisonline.Thisrequireswebmasterstoprepareinadvance.Ifyoudonotconsideritinadvanceandmodifyitaftergoingonline,BaiduwillthinkyouYourwebsiteisunstable,youmodifythekeytagsassoonasyougoonline,andyouaresuspectedofcheating,andthenthrowyourwebsiteintothesandbox,andslowlyinvestigate.Atthistime,ifyouwantBaidutoincludethewebsiteatleastonemonthlater,andguaranteethisperiodoftimeAddhigh-qualityarticlestothewebsiteeveryday.

Външни връзки

Addingexternallinkscanallowsearchenginestoefficientlycrawlandincludewebpages.

Съдържание на уебсайта

Originalwebsitecontentiseasiertobeincluded,andmethodssuchascollectingandcopyingotherpeople'sinformationaregenerallydifficulttoinclude.

Thebiggestadvantageoforiginalarticlesisthattheycanservemultiplepurposes,increasetheprobabilityofawebsitebeingincludedbysearchengines,andimprovewebsiteoptimizationrankings.

Характеристики на Baidu

1.TheinformationprocessingmethodbasedonwordcombinationcleverlysolvestheproblemofunderstandingChineseinformation,andgreatlyimprovestheaccuracyandrecallofsearch.

2. Поддръжка на основно китайско кодиране, включително gbk (спецификация на разширение на код за китайски знаци), gb2312 (опростен), big5 (традиционен) и може да се преобразува между различни кодировки."

3.Theintelligentrelevancealgorithmusesacombinationofcontent-basedandhyperlink-basedanalysismethodsforrelevanceevaluation,whichcanobjectivelyanalyzetheinformationcontainedinwebpages,therebymaximizingtherelevanceofsearchresults.

4. Тези резултати от търсенето са по-интуитивни и могат да посочват богати атрибути на страница (като заглавие, URL, време, размер, кодиране, резюме и т.н.) и да подчертават низа за заявка на потребителя, което е удобно за потребителя да прецени дали да прочете оригиналния текст.

5.Baidusearchsupportssecondarysearch,whichcancontinuetosearchinthelastsearchresults,andgraduallynarrowthesearchscopeuntilitreachesthesmallestandmostaccurateresultset.ItismoreconvenientforuserstofindinthemassiveinformationThecontentthatyouarereallyinterestedin.

6.Theintelligentrecommendationtechnologyofrelatedsearchtermswillprompttherelatedsearchtermsaftertheusersearchesforthefirsttimetohelpusersfindmorerelevantresults.StatisticsshowthatitcanpromotethesearchIncreasedvolumeby10-20%.

7.High-performanceserversandlocalizedserversusemulti-threadingtechnology,efficientsearchalgorithms,stableunixplatforms,andlocalizedserverstoensurethefastestresponseSpeed.BaidusearchengineprovidessearchservicesinChina,whichcangreatlyshortentheresponsetimeofretrieval(theaverageresponsetimeofaretrievalislessthan0.5seconds).

8.Itcanprovidemultipleservicemethodswithin7days.ItistheChinesesearchenginewiththefastestupdatetimeandthelargestamountofdataatpresent.9.Thesearchresultoutputcategoryaggregationsupportscontentaggregation,websiteaggregation,contentaggregation+websitecategory.Avarietyofmethodssuchasgathering.Supportuserstoselecttimerangeandimproveuserretrievalefficiency.

10.Intelligentandscalablesearchtechnologyhastheworld’slargestChineseinformationdatabase,providinguserswiththemostaccurate,Themostextensiveandtime-sensitiveinformationprovidesasolidfoundation.

11.Theoptimizeddistributedstructureofstructureandalgorithm,thewell-designedoptimizationalgorithm,andthefault-tolerantdesignensurethesystem'shighperformanceunderalargenumberofvisits.Usability,highscalability,highperformanceandhighstability.

12.Highconfigurabilityenablesthesearchservicetomeettheneedsofdifferentusers.

13.AdvancedwebpagedynamicsummaryDisplaytechnology.

14.Уникална снимка на Baidus.

15.Поддържа разнообразие от синтаксис за разширено търсене, което прави заявката на потребителя по-ефективна и по-точни резултати."+"(и),"-"(не),"|"(или),"сайт:","домейн:","заглавие:","inurl",и друг ефективен синтаксис за търсене ще продължи да се добавя.

Увеличаване на включването

Basically,afterthesearchenginehasincludedthesite,andyoucanalreadyseethenumberofsearchenginesincluded,thehopemustbetoallowthesearchenginetoincludemorepages.IfyouwanttoincreaseThenumberofsearchenginesincluded,alargeincreaseinthecontentofthewebsiteisoneofthem.MoreneedstobedoneforthespidersofsearchenginesTheprogramcreatesagoodwebsitestructure.Toincreasethesite’sinclusionrate,youcantakethefollowingmethods:

Подобрете външната верига

TheexternalchainisagoodmedicineforSEO,whetheritistoimprovethesearchenginerankingorincreasethewebsite’sinclusionVolume,especiallyhigh-qualityexternallinks.Theworkoflinkbuildingmustaccompanythesearchengineoptimizationprogramfromthebeginningtotheend.

Допълнително оригинално съдържание

Onceoriginalcontentisincludedbysearchengines,suchcontentpagesarenotsoeasytobedeletedbysearchengines.Ifthecontentofawebsitehasahighrepetitionrate,evenafteritisincludedbysearchengines,itiseasytobecleanedupbysearchenginesonaregularbasis.Keepingacertainpercentageoforiginalcontentonthewebsitecancultivatetheweightofthewebsiteandensurethatsearchengineswillnotincludeanddeletethesepages.

Оптимизирайте структурата

Optimizetheinternallinksofthewebsite.Agoodwebsitestructurewillallowspiderstofollowthelinksandreadthecontentofthewebsitelayerbylayer.Websiteswithpoorwebsitestructurewillmakespidersfeelliketheyhaveenteredamaze.Ifyourwebsiteisverylarge,itisbesttoestablishuserexperienceapplicationssuchasclearwebsitenavigation,comprehensivesitemaps,etc.,whichcanguidetheinclusionandfacilitatetheusersofthewebsite.

ResearchCollection

Thecollectionprocedureofthesearchengineisacollectionwithonlythinkinganddistinguishingability.Let'snottreatitasasimplewebsitecontentporter.Whenitreadsyourcontent,itwilldistinguishthevalueofthesecontentandotheraspects.Asawebsiteadministrator,youhavetostudytherulesofinclusion,crawlingrules,etc.,anddealingwiththeinclusionofsearchenginesisalsoanimportantsubject.Forincreasingthenumberofpagesincludedonthewebsite,wehavetomakeourselvesmoreproactive.Inotherwords,itmeanstotaketheinitiative.Insteadofwaitingforthecollectiontocome,itisbettertoguidethecollection.

Карта на сайта

Asitemapisalsocalledasitemap.Itisapageonwhichlinkstoallpagesonthewebsiteareplaced.Whenmostpeoplecannotfindtheinformationtheyneedonthewebsite,theymayusethesitemapasaremedy.Thesearchengineindexlikesthesitemapverymuch.

Whybuildasitemap?Mostpeopleknowthatsitemapsaregoodforimprovingtheuserexperience:theyprovidedirectionstositevisitorsandhelplostvisitorsfindthepagetheywanttosee.Forsearchengineoptimization,thebenefitsofthesitemapareevenmore:

1.Providelinksforbrowsingtheentirewebsiteforsearchengines.

2.Providesomelinksforsearchenginestoincludelinkstodynamicpagesorpagesthataredifficulttoreachbyothermethods.

3. Като потенциална целева страница, тя може да бъде оптимизирана за трафик от търсене.

4.IfavisitortriestoaccessaURLthatdoesnotexistinthedomainwherethewebsiteislocated,thevisitorwillberedirectedtoanerrorpageof"Filecannotbefound",andthesitemapcanbeusedasthe"Quasi"content.

Включен новинарски сайт

Baidu не включва решението за нов сайт:

(1)ItisbesttowaitforallthecontentsofthewebsitetobecompletedbeforeuploadingtothewebsiteSpace

(2)Afterthewebsiteisuploaded,submitthewebsitetoBaidu:loginportalsofseveralmajorsearchengines

(3) Регистрирайте 3-5 акаунта в BaiduSoucang, след това предпочитайте URL адреси

(4) GotoLeshou, CapeofGoodHope и любими URL адреси от друга мрежа

(5) Отидете на BaiduTieba, A5 и друг сайт с висока тежест, за да публикувате примамка за връзки (със собствен уебсайт), за да накарате Baidu да го включи и обхожда

(6)Regularlyupdate2-5originalarticleseverydayforthefirstmonth

(7) Не използвайте SEOcheatingMethodoptimization

Basicallyfollowtheabovesteps,thehomepagecanbeincludedwithin1-30days.IfonemonthhaspassedandtheURLhasnotbeenincluded,youcantrytomodifythelayoutofthehomepage.

Related Articles
TOP