spectral analysis of ranking algorithms · spectral analysis of ranking algorithms social and...
TRANSCRIPT
Spectralanalysisofrankingalgorithms
SocialandTechnologicalNetworks
Rik Sarkar
UniversityofEdinburgh,2018.
Recap:HITSalgorithm
• Evaluatehubandauthorityscores• ApplyAuthorityupdatetoallnodes:– auth(p)=sumofallhub(q)whereq->pisalink
• ApplyHubupdatetoallnodes:– hub(p)=sumofallauth(r)wherep->risalink
• Repeatforkrounds
Hubsandauthorityscores
• Canbewrittenasvectorshanda
• Thedimension(numberofelements)ofthevectorsaren
• Hubrulefori :sumofa-valuesofnodesthatipointsto:
• Authorityrulefori :sumofh-valuesofnodesthatpointtoi:
Convergence
• Rememberthathkeepsincreasing• Wewanttoshowthatthenormalizedvalue
• Convergestoavectoroffiniterealnumbersaskgoestoinfinity
• Ifconvergencehappens,thenthereisac:
Proofofconvergencetoeigen vectors
• UsefulTheorem:Asymmetricmatrixhasorthogonaleigen vectors.– Theyformabasisofn-Dspace– Anyvectorcanbewrittenasalinearcombination
• issymmetric
• FormatrixPwithallpositivevalues,Perron’stheoremsays:– Auniquepositiverealvaluedlargesteigen valuecexists
– Correspondingeigen vectoryisuniqueandhaspositiverealcoordinates
– Ifc=1,thenconvergestoy
Nowtoproveconvergence:• Supposesortedeigen valuesare:
• Correspondingeigen vectorsare:
• Wecanwriteanyvectorxas
• So:
Properties
• Thevectorq1z1 isasimplemultipleofz1– Avectoressentiallysimilartothefirsteigen vector– Thereforeindependentofstartingvaluesofh
• q1canbeshowntobenon-zeroalways,sothescoresarenotzero
• Authorityscoreanalysisisanalogous
• Scaledpagerank:
• Overkiterations:
• Pagerank doesnotneednormalization.
• Wearelookingforaneigen vectorwitheigenvalue=1
Randomwalks
• Arandomwalkerismovingalongrandomdirectededges
• Supposevectorbshowstheprobabilitiesofwalkercurrentlybeingatdifferentnodes
• Thenvectorgivestheprobabilitiesforthenextstep
Randomwalks
• Thus,pagerank valuesofnodesafterkiterationsisequivalentto:– Theprobabilitiesofthewalkerbeingatthenodesafterksteps
• Thefinalvaluesgivenbytheeigen vectorarethesteadystateprobabilities– Notethatthesedependonlyonthenetworkandareindependentofthestartingpoints
Historyofwebsearch
• YAHOO:Adirectory(hierarchiclist)ofwebsites– JerryYang,DavidFilo,Stanford1995
• 1998:Authoritativesourcesinhyperlinkedenvironment(HITS),symposiumondiscretealgorithms– JonKleinberg,Cornell
• 1998:Pagerank citationranking:Bringingordertotheweb– LarryPage,SergeyBrin,Rajeev Motwani,TerryWinograd,Stanfordtechreport