Vyhledávač indexován

Základní přehled

Searchengineinclusionreferstothespecificnumberofpagesincludedinawebsitebysearchengines.Themoreincluded,thefastertheindexingtime,whichprovesthatthiswebsiteismorefriendlytosearchengines..

Themorecommonlyusedsearchenginesincludebaidu(Baidu),google(Google),yahoo(Yahoo),sogou,youdao(有道),soso(searchsearch),bing(必应),360(360).

Princip začlenění

Sbírejte adresy URL webových stránek, které mají být indexovány

ThenumberofwebpagesontheInternetisabsolutelyastronomical,andtherearecountlessnewwebpageseveryday.Searchenginesneedtofindtheobjectstobeindexedfirst.

AsfarasGoogleisconcerned,althoughthereiscontroversyoverwhetherthereisadifferencebetweenDeepBotandFreshBotonGoogleBot-asforwhethertocallthesetwonames,therearedifferentopinions.Ofcourse,thenameitselfisnotimportant-atleastfornowuntil.

ThemainstreamviewisthatinGoogle’srobots,thereareindeedquiteafewrobotsthatprepare"materials"fortheactualindexedpages—let'scallitFreshBothere.

——TheirtaskistoconstantlyscantheInterneteverydaytodiscoverandmaintainahugelistofURLsforDeepBottouse.Inotherwords,whenitvisitsandreadsoneofitswebpages,thepurposeisnotItisaboutindexingthispage,butfindingallthelinksinthispage.

——Samozřejmě, zdá se, že je to v rozporu s účinností, což je až neuvěřitelné. Při skenování webových stránek však můžeme jednoduše posoudit podle následujícího postupu:FreshBonení"exkluzivní".

Inotherwords,multiplerobotslocatedindifferentGoogledatacentersmayvisitthesamepageinashortperiodoftime,suchasonedayorevenanhour,andDeepBotisindexingandcachingWhenthepageisnotsimilar,itwillnotappear.

Thatis,Googlewillrestrictrobotsinacertaindatacentertocompletethiswork,insteadoftwodatacentersindexingthesameversionofthewebpageatthesametime,ifthereisnoflawinthisstatement,Itseemsthatfromtheserveraccesslog,youcanoftenseethatGoogleBotsoriginatingfromdifferentIPshavevisitedthesamewebpagemultipletimesinashortperiodoftimetoprovetheexistenceofFreshBot.

Therefore,sometimesifyoufindthatGoogleBotfrequentlyvisitsthewebsite,don’tbetoohappytooearly.Maybeit’snotindexingwebpagesatallbutjustscanningURLs.

TheinformationrecordedbyFreshBotincludestheurlofthewebpage,TimeStamp(thetimestampofwhenthewebpagewascreatedorupdated),andtheheaderinformationofthewebpage(Note:Thispointiscontroversial,andmanypeoplebelievethatFreshBotwillnotreadit.Togettheinformationofthetargetwebpage,DeepBotwillcompletethispartofthework.

However,theauthorpreferstheformerstatement,becauseintheurllistsubmittedbyFreshBottoDeepBot,thewebsitesettingwillbeprohibited.Indexedandincludedpagesareexcludedtoimproveefficiency.Inadditiontorobots.txt,aconsiderablepartofthewebsiteisimplementedthroughthe"noindex"inthematatagwhensettingupthistypeofwebsite.TheheadofthetargetpagethatdoesnotreadthetargetpageseemstobeIfthisisnotpossible),ifthewebpageisnotaccessible,suchasnetworkinterruptionorserverfailure,FreshBotwillrecordtheurlandtryagainattheappropriatetime,butwillnotaddittotheurlsubmittedtoDeepBotuntiltheurlisaccessibleList.

Ingeneral,FreshBotoccupiesarelativelysmallamountofserverbandwidthandresources.Finally,FreshBotclassifiestherecordedinformationaccordingtodifferentprioritiesandsubmitsittoDeepBot.Accordingtothedifferentpriorities,therearemainlythefollowing:

A:Nová webová stránka;

B:Stará webová stránka/nové časové razítko,tato,existujeaktualizovaná webová stránka;

C:Use301/302redirectionwebpage;

D:ComplexdynamicURL:suchasusingmultipleparametersDynamicURL,Googlemayneedadditionalworktocorrectlyanalyzeitscontent.——WiththeimprovementofGoogle’sabilitytosupportdynamicwebpages,thisclassificationmayhavebeencancelled;

E:Jiné typy souborů, jako jsou odkazy na soubory PDF a DOC, indexování těchto souborů a může být vyžadována další práce;

F:Oldwebpage/oldTimeStamp,thatis,webpagethathasnotbeenupdated.NotethatthetimestamphereisnotbasedonthedatedisplayedintheGooglesearchresults,butwithGoogleDatecomparisonintheindexdatabase;

G: Chybná, to je, stránka, která se po přístupu vrátí, odpovídá 404.

ThepriorityisarrangedintheorderfromAtoG,decreasinginorder.Itneedstobeemphasizedthattheprioritymentionedhereisrelative.Forexample,itisalsoanewwebpage.Accordingtothequalityandquantityofthelinkstoit,thepriorityisalsoverydifferent.Ithaslinksfromrelatedauthoritativewebsites.'Spageshaveahigherpriority.

Inaddition,thepriorityreferredtohereisonlyforpageswithinthesamewebsite.Infact,differentwebsiteshavedifferentpriorities.Inotherwords,forpagesinauthoritativewebsites,eventhelowestpriorityLevel404url​​mayalsohaveadvantagesovermanyothersiteswiththehighestprioritynewlycreatedwebpages.

Index a zahrnutí webových stránek

Onlythenentertheactualprocessofindexingandinclusionofwebpages.Ascanbeseenfromtheaboveintroduction,theURLlistsubmittedbyFreshBotisquitelarge.Accordingtothelanguage,websitelocation,etc.,theindexingofspecificwebsiteswillbeallocatedtodifferentdatacenters.

Theentireindexingprocess,duetothehugeamountofdata,maytakeseveralweeksorevenlongertocomplete.

Asmentionedabove,DeepBotwillfirstindexhigherprioritywebsites/webpages.Thehigherthepriority,thefasteritwillappearintheGoogleindexdatabaseandeventuallyappearintheGooglesearchresultspage..

Foranewwebpage,aslongasitentersthisstage,eveniftheentireindexingprocessisnotcompleted,thecorrespondingwebpagehasthepossibilitytoappearintheGoogleindexlibrary.Ibelievemanyfriendsuse"site"inGoogle."Whensearching,Ioftenseepagesmarkedassupplementaryresultsthatonlydisplaythewebpageurloronlydisplaythepagetitleandurlwithoutdescription.Thisisthenormalresultofthewebpageatthisstage.

WhenGoogleactuallyreads,analyzes,andcachesthispage,itwillpickoutthesupplementaryresultsanddisplaynormalinformation.

——Ofcourse,thepremiseisthatthewebpagehasenoughlinks,especiallylinksfromauthoritativewebsites,andtherearenorecordsthatarethesameorsimilartothewebpagecontentintheindexlibrary(DuplicateContentfiltering).

FordynamicURLs,althoughGooglenowclaimsthattherearenoobstaclestoitsprocessing,theobservablefactsstillshowthatdynamicURLsaremorelikelytoappearinsupplementaryresultsthanstaticURLs.Webpagesoftenrequiremoreandmorevaluablelinkstoescapefromsupplementaryresults.

Forthe"F"categoryabove,thatis,webpagesthathavenotbeenupdated,DeepBotwillcompareitstimestampwiththedateintheGoogleindexdatabasetoconfirmthatalthoughthecorrespondingpageinformationinthesearchresultsmaybeavailableinthefutureUpdatebutaslongasthelatestversionisindexed-considerthesituationofmultipleupdatesandmodificationsofthewebpage-;asforthe"G"category,whichis404url,itwilllookupwhetherthereisacorrespondingrecordintheindexlibrary,anddeleteitifithas.

Synchronizace mezi datovými centry

Search engine indexed

Aswementionedearlier,whenDeepBotindexesawebpage,itwillbecompletedbyaspecificdatacenterinsteadofmultipledatacentersreadingatthesametime.Thewebpageobtainsthelatestversionofthewebpagerespectively.Inthisway,aftertheindexingprocessiscompleted,adatasynchronizationprocessisrequiredtoupdatethelatestversionofthewebpageinmultipledatacenters.

ThisisthefamousGoogleDancebefore.However,aftertheBigDaddyupdate,thesynchronizationbetweendatacentersisnolongerconcentratedinaspecifictimeperiodlikethat,butinacontinuousandmoretime-sensitivemanner.

Ovlivňování inkluze

Titulek webové stránky

Thewritingofsitetitle,description,andkeywordshasalwaysbeenaverycautiousthinginthemindsofwebmasters.Itisdirectlyrelatedtotherankingandtrafficofthewebsite,andthesethreetagscannotbeeasilymodifiedafterthewebsiteisonline.Thisrequireswebmasterstoprepareinadvance.Ifyoudonotconsideritinadvanceandmodifyitaftergoingonline,BaiduwillthinkyouYourwebsiteisunstable,youmodifythekeytagsassoonasyougoonline,andyouaresuspectedofcheating,andthenthrowyourwebsiteintothesandbox,andslowlyinvestigate.Atthistime,ifyouwantBaidutoincludethewebsiteatleastonemonthlater,andguaranteethisperiodoftimeAddhigh-qualityarticlestothewebsiteeveryday.

Externí odkazy

Addingexternallinkscanallowsearchenginestoefficientlycrawlandincludewebpages.

Obsah webových stránek

Originalwebsitecontentiseasiertobeincluded,andmethodssuchascollectingandcopyingotherpeople'sinformationaregenerallydifficulttoinclude.

Thebiggestadvantageoforiginalarticlesisthattheycanservemultiplepurposes,increasetheprobabilityofawebsitebeingincludedbysearchengines,andimprovewebsiteoptimizationrankings.

Vlastnosti Baidu

1.TheinformationprocessingmethodbasedonwordcombinationcleverlysolvestheproblemofunderstandingChineseinformation,andgreatlyimprovestheaccuracyandrecallofsearch.

2.Podporujte mainstreamové čínské kódování, včetně gbk (specifikace rozšíření čínského znaku), gb2312 (zjednodušené), big5 (tradiční) a lze je převést mezi různá kódování.“

3.Theintelligentrelevancealgorithmusesacombinationofcontent-basedandhyperlink-basedanalysismethodsforrelevanceevaluation,whichcanobjectivelyanalyzetheinformationcontainedinwebpages,therebymaximizingtherelevanceofsearchresults.

4.Výsledky hledání jsou intuitivnější a mohou uvádět bohaté atributy stránky (jako je název, adresa URL, čas, velikost, kódování, abstrakt atd.) a zvýraznit řetězec dotazů uživatele, což je vhodné pro uživatele, kteří poznají, zda číst původní text.

5.Baidusearchsupportssecondarysearch,whichcancontinuetosearchinthelastsearchresults,andgraduallynarrowthesearchscopeuntilitreachesthesmallestandmostaccurateresultset.ItismoreconvenientforuserstofindinthemassiveinformationThecontentthatyouarereallyinterestedin.

6.Theintelligentrecommendationtechnologyofrelatedsearchtermswillprompttherelatedsearchtermsaftertheusersearchesforthefirsttimetohelpusersfindmorerelevantresults.StatisticsshowthatitcanpromotethesearchIncreasedvolumeby10-20%.

7.High-performanceserversandlocalizedserversusemulti-threadingtechnology,efficientsearchalgorithms,stableunixplatforms,andlocalizedserverstoensurethefastestresponseSpeed.BaidusearchengineprovidessearchservicesinChina,whichcangreatlyshortentheresponsetimeofretrieval(theaverageresponsetimeofaretrievalislessthan0.5seconds).

8.Itcanprovidemultipleservicemethodswithin7days.ItistheChinesesearchenginewiththefastestupdatetimeandthelargestamountofdataatpresent.9.Thesearchresultoutputcategoryaggregationsupportscontentaggregation,websiteaggregation,contentaggregation+websitecategory.Avarietyofmethodssuchasgathering.Supportuserstoselecttimerangeandimproveuserretrievalefficiency.

10.Intelligentandscalablesearchtechnologyhastheworld’slargestChineseinformationdatabase,providinguserswiththemostaccurate,Themostextensiveandtime-sensitiveinformationprovidesasolidfoundation.

11.Theoptimizeddistributedstructureofstructureandalgorithm,thewell-designedoptimizationalgorithm,andthefault-tolerantdesignensurethesystem'shighperformanceunderalargenumberofvisits.Usability,highscalability,highperformanceandhighstability.

12.Highconfigurabilityenablesthesearchservicetomeettheneedsofdifferentusers.

13.Pokročilý přehled dynamiky webové stránky Technologie zobrazení.

14.UniqueBaidusnapshot.

15.Podporuje různé syntaxe pokročilého vyhledávání, díky čemuž je uživatelský dotaz efektivnější a přesnější."+"(a),"-"(ne),"|"(nebo),"site:","doména:","intitle:","inurl"a další efektivní syntaxe vyhledávání bude i nadále doplněna.

Zvýšení inkluze

Basically,afterthesearchenginehasincludedthesite,andyoucanalreadyseethenumberofsearchenginesincluded,thehopemustbetoallowthesearchenginetoincludemorepages.IfyouwanttoincreaseThenumberofsearchenginesincluded,alargeincreaseinthecontentofthewebsiteisoneofthem.MoreneedstobedoneforthespidersofsearchenginesTheprogramcreatesagoodwebsitestructure.Toincreasethesite’sinclusionrate,youcantakethefollowingmethods:

Vylepšete vnější řetězec

TheexternalchainisagoodmedicineforSEO,whetheritistoimprovethesearchenginerankingorincreasethewebsite’sinclusionVolume,especiallyhigh-qualityexternallinks.Theworkoflinkbuildingmustaccompanythesearchengineoptimizationprogramfromthebeginningtotheend.

Addoriginal content

Onceoriginalcontentisincludedbysearchengines,suchcontentpagesarenotsoeasytobedeletedbysearchengines.Ifthecontentofawebsitehasahighrepetitionrate,evenafteritisincludedbysearchengines,itiseasytobecleanedupbysearchenginesonaregularbasis.Keepingacertainpercentageoforiginalcontentonthewebsitecancultivatetheweightofthewebsiteandensurethatsearchengineswillnotincludeanddeletethesepages.

Optimalizujte strukturu

Optimizetheinternallinksofthewebsite.Agoodwebsitestructurewillallowspiderstofollowthelinksandreadthecontentofthewebsitelayerbylayer.Websiteswithpoorwebsitestructurewillmakespidersfeelliketheyhaveenteredamaze.Ifyourwebsiteisverylarge,itisbesttoestablishuserexperienceapplicationssuchasclearwebsitenavigation,comprehensivesitemaps,etc.,whichcanguidetheinclusionandfacilitatetheusersofthewebsite.

Research Collection

Thecollectionprocedureofthesearchengineisacollectionwithonlythinkinganddistinguishingability.Let'snottreatitasasimplewebsitecontentporter.Whenitreadsyourcontent,itwilldistinguishthevalueofthesecontentandotheraspects.Asawebsiteadministrator,youhavetostudytherulesofinclusion,crawlingrules,etc.,anddealingwiththeinclusionofsearchenginesisalsoanimportantsubject.Forincreasingthenumberofpagesincludedonthewebsite,wehavetomakeourselvesmoreproactive.Inotherwords,itmeanstotaketheinitiative.Insteadofwaitingforthecollectiontocome,itisbettertoguidethecollection.

Mapa stránek

Asitemapisalsocalledasitemap.Itisapageonwhichlinkstoallpagesonthewebsiteareplaced.Whenmostpeoplecannotfindtheinformationtheyneedonthewebsite,theymayusethesitemapasaremedy.Thesearchengineindexlikesthesitemapverymuch.

Whybuildasitemap?Mostpeopleknowthatsitemapsaregoodforimprovingtheuserexperience:theyprovidedirectionstositevisitorsandhelplostvisitorsfindthepagetheywanttosee.Forsearchengineoptimization,thebenefitsofthesitemapareevenmore:

1.Providelinksforbrowsingtheentirewebsiteforsearchengines.

2.Providesomelinksforsearchenginestoincludelinkstodynamicpagesorpagesthataredifficulttoreachbyothermethods.

3.Jako potenciální vstupní stránka může být optimalizována pro vyhledávací provoz.

4.IfavisitortriestoaccessaURLthatdoesnotexistinthedomainwherethewebsiteislocated,thevisitorwillberedirectedtoanerrorpageof"Filecannotbefound",andthesitemapcanbeusedasthe"Quasi"content.

Včetně novinek

Baidu nezahrnuje řešení nového webu:

(1)ItisbesttowaitforallthecontentsofthewebsitetobecompletedbeforeuploadingtothewebsiteSpace

(2)Afterthewebsiteisuploaded,submitthewebsitetoBaidu:loginportalsofseveralmajorsearchengines

(3)Zaregistrujte 3–5 účtů v BaiduSoucangu, poté zařaďte mezi oblíbené adresy URL

(4) Přejít na Leshou, Capeof GoodHope a další oblíbené adresy URL sítě

(5) Přejděte na BaiduTieba, A5 a další webové stránky s vysokou gramáží a publikujte návnadu s odkazem (s vlastní webovou stránkou), toleran Baidu, který zahrnuje i procházení

(6)Regularlyupdate2-5originalarticleseverydayforthefirstmonth

(7)Nepoužívejte metodu optimalizace SEOcheating

Basicallyfollowtheabovesteps,thehomepagecanbeincludedwithin1-30days.IfonemonthhaspassedandtheURLhasnotbeenincluded,youcantrytomodifythelayoutofthehomepage.

Související články
HORNÍ