Základní přehled
Searchengineinclusionreferstothespecificnumberofpagesincludedinawebsitebysearchengines.Themoreincluded,thefastertheindexingtime,whichprovesthatthiswebsiteismorefriendlytosearchengines..
Themorecommonlyusedsearchenginesincludebaidu(Baidu),google(Google),yahoo(Yahoo),sogou,youdao(有道),soso(searchsearch),bing(必应),360(360).
Princip začlenění
Sbírejte adresy URL webových stránek, které mají být indexovány
ThenumberofwebpagesontheInternetisabsolutelyastronomical,andtherearecountlessnewwebpageseveryday.Searchenginesneedtofindtheobjectstobeindexedfirst.
AsfarasGoogleisconcerned,althoughthereiscontroversyoverwhetherthereisadifferencebetweenDeepBotandFreshBotonGoogleBot-asforwhethertocallthesetwonames,therearedifferentopinions.Ofcourse,thenameitselfisnotimportant-atleastfornowuntil.
ThemainstreamviewisthatinGoogle’srobots,thereareindeedquiteafewrobotsthatprepare"materials"fortheactualindexedpages—let'scallitFreshBothere.
——TheirtaskistoconstantlyscantheInterneteverydaytodiscoverandmaintainahugelistofURLsforDeepBottouse.Inotherwords,whenitvisitsandreadsoneofitswebpages,thepurposeisnotItisaboutindexingthispage,butfindingallthelinksinthispage.
——Samozřejmě, zdá se, že je to v rozporu s účinností, což je až neuvěřitelné. Při skenování webových stránek však můžeme jednoduše posoudit podle následujícího postupu:FreshBonení"exkluzivní".
Inotherwords,multiplerobotslocatedindifferentGoogledatacentersmayvisitthesamepageinashortperiodoftime,suchasonedayorevenanhour,andDeepBotisindexingandcachingWhenthepageisnotsimilar,itwillnotappear.
Thatis,Googlewillrestrictrobotsinacertaindatacentertocompletethiswork,insteadoftwodatacentersindexingthesameversionofthewebpageatthesametime,ifthereisnoflawinthisstatement,Itseemsthatfromtheserveraccesslog,youcanoftenseethatGoogleBotsoriginatingfromdifferentIPshavevisitedthesamewebpagemultipletimesinashortperiodoftimetoprovetheexistenceofFreshBot.
Therefore,sometimesifyoufindthatGoogleBotfrequentlyvisitsthewebsite,don’tbetoohappytooearly.Maybeit’snotindexingwebpagesatallbutjustscanningURLs.
TheinformationrecordedbyFreshBotincludestheurlofthewebpage,TimeStamp(thetimestampofwhenthewebpagewascreatedorupdated),andtheheaderinformationofthewebpage(Note:Thispointiscontroversial,andmanypeoplebelievethatFreshBotwillnotreadit.Togettheinformationofthetargetwebpage,DeepBotwillcompletethispartofthework.
However,theauthorpreferstheformerstatement,becauseintheurllistsubmittedbyFreshBottoDeepBot,thewebsitesettingwillbeprohibited.Indexedandincludedpagesareexcludedtoimproveefficiency.Inadditiontorobots.txt,aconsiderablepartofthewebsiteisimplementedthroughthe"noindex"inthematatagwhensettingupthistypeofwebsite.TheheadofthetargetpagethatdoesnotreadthetargetpageseemstobeIfthisisnotpossible),ifthewebpageisnotaccessible,suchasnetworkinterruptionorserverfailure,FreshBotwillrecordtheurlandtryagainattheappropriatetime,butwillnotaddittotheurlsubmittedtoDeepBotuntiltheurlisaccessibleList.
Ingeneral,FreshBotoccupiesarelativelysmallamountofserverbandwidthandresources.Finally,FreshBotclassifiestherecordedinformationaccordingtodifferentprioritiesandsubmitsittoDeepBot.Accordingtothedifferentpriorities,therearemainlythefollowing:
A:Nová webová stránka;
B:Stará webová stránka/nové časové razítko,tato,existujeaktualizovaná webová stránka;
C:Use301/302redirectionwebpage;
D:ComplexdynamicURL:suchasusingmultipleparametersDynamicURL,Googlemayneedadditionalworktocorrectlyanalyzeitscontent.——WiththeimprovementofGoogle’sabilitytosupportdynamicwebpages,thisclassificationmayhavebeencancelled;
E:Jiné typy souborů, jako jsou odkazy na soubory PDF a DOC, indexování těchto souborů a může být vyžadována další práce;
F:Oldwebpage/oldTimeStamp,thatis,webpagethathasnotbeenupdated.NotethatthetimestamphereisnotbasedonthedatedisplayedintheGooglesearchresults,butwithGoogleDatecomparisonintheindexdatabase;
G: Chybná, to je, stránka, která se po přístupu vrátí, odpovídá 404.
ThepriorityisarrangedintheorderfromAtoG,decreasinginorder.Itneedstobeemphasizedthattheprioritymentionedhereisrelative.Forexample,itisalsoanewwebpage.Accordingtothequalityandquantityofthelinkstoit,thepriorityisalsoverydifferent.Ithaslinksfromrelatedauthoritativewebsites.'Spageshaveahigherpriority.
Inaddition,thepriorityreferredtohereisonlyforpageswithinthesamewebsite.Infact,differentwebsiteshavedifferentpriorities.Inotherwords,forpagesinauthoritativewebsites,eventhelowestpriorityLevel404urlmayalsohaveadvantagesovermanyothersiteswiththehighestprioritynewlycreatedwebpages.
Index a zahrnutí webových stránek
Onlythenentertheactualprocessofindexingandinclusionofwebpages.Ascanbeseenfromtheaboveintroduction,theURLlistsubmittedbyFreshBotisquitelarge.Accordingtothelanguage,websitelocation,etc.,theindexingofspecificwebsiteswillbeallocatedtodifferentdatacenters.
Theentireindexingprocess,duetothehugeamountofdata,maytakeseveralweeksorevenlongertocomplete.
Asmentionedabove,DeepBotwillfirstindexhigherprioritywebsites/webpages.Thehigherthepriority,thefasteritwillappearintheGoogleindexdatabaseandeventuallyappearintheGooglesearchresultspage..
Foranewwebpage,aslongasitentersthisstage,eveniftheentireindexingprocessisnotcompleted,thecorrespondingwebpagehasthepossibilitytoappearintheGoogleindexlibrary.Ibelievemanyfriendsuse"site"inGoogle."Whensearching,Ioftenseepagesmarkedassupplementaryresultsthatonlydisplaythewebpageurloronlydisplaythepagetitleandurlwithoutdescription.Thisisthenormalresultofthewebpageatthisstage.
WhenGoogleactuallyreads,analyzes,andcachesthispage,itwillpickoutthesupplementaryresultsanddisplaynormalinformation.
——Ofcourse,thepremiseisthatthewebpagehasenoughlinks,especiallylinksfromauthoritativewebsites,andtherearenorecordsthatarethesameorsimilartothewebpagecontentintheindexlibrary(DuplicateContentfiltering).
FordynamicURLs,althoughGooglenowclaimsthattherearenoobstaclestoitsprocessing,theobservablefactsstillshowthatdynamicURLsaremorelikelytoappearinsupplementaryresultsthanstaticURLs.Webpagesoftenrequiremoreandmorevaluablelinkstoescapefromsupplementaryresults.
Forthe"F"categoryabove,thatis,webpagesthathavenotbeenupdated,DeepBotwillcompareitstimestampwiththedateintheGoogleindexdatabasetoconfirmthatalthoughthecorrespondingpageinformationinthesearchresultsmaybeavailableinthefutureUpdatebutaslongasthelatestversionisindexed-considerthesituationofmultipleupdatesandmodificationsofthewebpage-;asforthe"G"category,whichis404url,itwilllookupwhetherthereisacorrespondingrecordintheindexlibrary,anddeleteitifithas.
Synchronizace mezi datovými centry
Aswementionedearlier,whenDeepBotindexesawebpage,itwillbecompletedbyaspecificdatacenterinsteadofmultipledatacentersreadingatthesametime.Thewebpageobtainsthelatestversionofthewebpagerespectively.Inthisway,aftertheindexingprocessiscompleted,adatasynchronizationprocessisrequiredtoupdatethelatestversionofthewebpageinmultipledatacenters.
ThisisthefamousGoogleDancebefore.However,aftertheBigDaddyupdate,thesynchronizationbetweendatacentersisnolongerconcentratedinaspecifictimeperiodlikethat,butinacontinuousandmoretime-sensitivemanner.
Ovlivňování inkluze
Titulek webové stránky
Thewritingofsitetitle,description,andkeywordshasalwaysbeenaverycautiousthinginthemindsofwebmasters.Itisdirectlyrelatedtotherankingandtrafficofthewebsite,andthesethreetagscannotbeeasilymodifiedafterthewebsiteisonline.Thisrequireswebmasterstoprepareinadvance.Ifyoudonotconsideritinadvanceandmodifyitaftergoingonline,BaiduwillthinkyouYourwebsiteisunstable,youmodifythekeytagsassoonasyougoonline,andyouaresuspectedofcheating,andthenthrowyourwebsiteintothesandbox,andslowlyinvestigate.Atthistime,ifyouwantBaidutoincludethewebsiteatleastonemonthlater,andguaranteethisperiodoftimeAddhigh-qualityarticlestothewebsiteeveryday.
Externí odkazy
Addingexternallinkscanallowsearchenginestoefficientlycrawlandincludewebpages.
Obsah webových stránek
Originalwebsitecontentiseasiertobeincluded,andmethodssuchascollectingandcopyingotherpeople'sinformationaregenerallydifficulttoinclude.
Thebiggestadvantageoforiginalarticlesisthattheycanservemultiplepurposes,increasetheprobabilityofawebsitebeingincludedbysearchengines,andimprovewebsiteoptimizationrankings.
Vlastnosti Baidu
1.TheinformationprocessingmethodbasedonwordcombinationcleverlysolvestheproblemofunderstandingChineseinformation,andgreatlyimprovestheaccuracyandrecallofsearch.
2.Podporujte mainstreamové čínské kódování, včetně gbk (specifikace rozšíření čínského znaku), gb2312 (zjednodušené), big5 (tradiční) a lze je převést mezi různá kódování.“
3.Theintelligentrelevancealgorithmusesacombinationofcontent-basedandhyperlink-basedanalysismethodsforrelevanceevaluation,whichcanobjectivelyanalyzetheinformationcontainedinwebpages,therebymaximizingtherelevanceofsearchresults.
4.Výsledky hledání jsou intuitivnější a mohou uvádět bohaté atributy stránky (jako je název, adresa URL, čas, velikost, kódování, abstrakt atd.) a zvýraznit řetězec dotazů uživatele, což je vhodné pro uživatele, kteří poznají, zda číst původní text.
5.Baidusearchsupportssecondarysearch,whichcancontinuetosearchinthelastsearchresults,andgraduallynarrowthesearchscopeuntilitreachesthesmallestandmostaccurateresultset.ItismoreconvenientforuserstofindinthemassiveinformationThecontentthatyouarereallyinterestedin.
6.Theintelligentrecommendationtechnologyofrelatedsearchtermswillprompttherelatedsearchtermsaftertheusersearchesforthefirsttimetohelpusersfindmorerelevantresults.StatisticsshowthatitcanpromotethesearchIncreasedvolumeby10-20%.
7.High-performanceserversandlocalizedserversusemulti-threadingtechnology,efficientsearchalgorithms,stableunixplatforms,andlocalizedserverstoensurethefastestresponseSpeed.BaidusearchengineprovidessearchservicesinChina,whichcangreatlyshortentheresponsetimeofretrieval(theaverageresponsetimeofaretrievalislessthan0.5seconds).
8.Itcanprovidemultipleservicemethodswithin7days.ItistheChinesesearchenginewiththefastestupdatetimeandthelargestamountofdataatpresent.9.Thesearchresultoutputcategoryaggregationsupportscontentaggregation,websiteaggregation,contentaggregation+websitecategory.Avarietyofmethodssuchasgathering.Supportuserstoselecttimerangeandimproveuserretrievalefficiency.
10.Intelligentandscalablesearchtechnologyhastheworld’slargestChineseinformationdatabase,providinguserswiththemostaccurate,Themostextensiveandtime-sensitiveinformationprovidesasolidfoundation.
11.Theoptimizeddistributedstructureofstructureandalgorithm,thewell-designedoptimizationalgorithm,andthefault-tolerantdesignensurethesystem'shighperformanceunderalargenumberofvisits.Usability,highscalability,highperformanceandhighstability.
12.Highconfigurabilityenablesthesearchservicetomeettheneedsofdifferentusers.
13.Pokročilý přehled dynamiky webové stránky Technologie zobrazení.
14.UniqueBaidusnapshot.
15.Podporuje různé syntaxe pokročilého vyhledávání, díky čemuž je uživatelský dotaz efektivnější a přesnější."+"(a),"-"(ne),"|"(nebo),"site:","doména:","intitle:","inurl"a další efektivní syntaxe vyhledávání bude i nadále doplněna.
Zvýšení inkluze
Basically,afterthesearchenginehasincludedthesite,andyoucanalreadyseethenumberofsearchenginesincluded,thehopemustbetoallowthesearchenginetoincludemorepages.IfyouwanttoincreaseThenumberofsearchenginesincluded,alargeincreaseinthecontentofthewebsiteisoneofthem.MoreneedstobedoneforthespidersofsearchenginesTheprogramcreatesagoodwebsitestructure.Toincreasethesite’sinclusionrate,youcantakethefollowingmethods:
Vylepšete vnější řetězec
TheexternalchainisagoodmedicineforSEO,whetheritistoimprovethesearchenginerankingorincreasethewebsite’sinclusionVolume,especiallyhigh-qualityexternallinks.Theworkoflinkbuildingmustaccompanythesearchengineoptimizationprogramfromthebeginningtotheend.
Addoriginal content
Onceoriginalcontentisincludedbysearchengines,suchcontentpagesarenotsoeasytobedeletedbysearchengines.Ifthecontentofawebsitehasahighrepetitionrate,evenafteritisincludedbysearchengines,itiseasytobecleanedupbysearchenginesonaregularbasis.Keepingacertainpercentageoforiginalcontentonthewebsitecancultivatetheweightofthewebsiteandensurethatsearchengineswillnotincludeanddeletethesepages.
Optimalizujte strukturu
Optimizetheinternallinksofthewebsite.Agoodwebsitestructurewillallowspiderstofollowthelinksandreadthecontentofthewebsitelayerbylayer.Websiteswithpoorwebsitestructurewillmakespidersfeelliketheyhaveenteredamaze.Ifyourwebsiteisverylarge,itisbesttoestablishuserexperienceapplicationssuchasclearwebsitenavigation,comprehensivesitemaps,etc.,whichcanguidetheinclusionandfacilitatetheusersofthewebsite.
Research Collection
Thecollectionprocedureofthesearchengineisacollectionwithonlythinkinganddistinguishingability.Let'snottreatitasasimplewebsitecontentporter.Whenitreadsyourcontent,itwilldistinguishthevalueofthesecontentandotheraspects.Asawebsiteadministrator,youhavetostudytherulesofinclusion,crawlingrules,etc.,anddealingwiththeinclusionofsearchenginesisalsoanimportantsubject.Forincreasingthenumberofpagesincludedonthewebsite,wehavetomakeourselvesmoreproactive.Inotherwords,itmeanstotaketheinitiative.Insteadofwaitingforthecollectiontocome,itisbettertoguidethecollection.
Mapa stránek
Asitemapisalsocalledasitemap.Itisapageonwhichlinkstoallpagesonthewebsiteareplaced.Whenmostpeoplecannotfindtheinformationtheyneedonthewebsite,theymayusethesitemapasaremedy.Thesearchengineindexlikesthesitemapverymuch.
Whybuildasitemap?Mostpeopleknowthatsitemapsaregoodforimprovingtheuserexperience:theyprovidedirectionstositevisitorsandhelplostvisitorsfindthepagetheywanttosee.Forsearchengineoptimization,thebenefitsofthesitemapareevenmore:
1.Providelinksforbrowsingtheentirewebsiteforsearchengines.
2.Providesomelinksforsearchenginestoincludelinkstodynamicpagesorpagesthataredifficulttoreachbyothermethods.
3.Jako potenciální vstupní stránka může být optimalizována pro vyhledávací provoz.
4.IfavisitortriestoaccessaURLthatdoesnotexistinthedomainwherethewebsiteislocated,thevisitorwillberedirectedtoanerrorpageof"Filecannotbefound",andthesitemapcanbeusedasthe"Quasi"content.
Včetně novinek
Baidu nezahrnuje řešení nového webu:
(1)ItisbesttowaitforallthecontentsofthewebsitetobecompletedbeforeuploadingtothewebsiteSpace
(2)Afterthewebsiteisuploaded,submitthewebsitetoBaidu:loginportalsofseveralmajorsearchengines
(3)Zaregistrujte 3–5 účtů v BaiduSoucangu, poté zařaďte mezi oblíbené adresy URL
(4) Přejít na Leshou, Capeof GoodHope a další oblíbené adresy URL sítě
(5) Přejděte na BaiduTieba, A5 a další webové stránky s vysokou gramáží a publikujte návnadu s odkazem (s vlastní webovou stránkou), toleran Baidu, který zahrnuje i procházení
(6)Regularlyupdate2-5originalarticleseverydayforthefirstmonth
(7)Nepoužívejte metodu optimalizace SEOcheating
Basicallyfollowtheabovesteps,thehomepagecanbeincludedwithin1-30days.IfonemonthhaspassedandtheURLhasnotbeenincluded,youcantrytomodifythelayoutofthehomepage.