developing recommendation techniques for scholarly papers

16
Developing Recommendation Techniques for Scholarly Papers Kazunari Sugiyama National University of Singapore

Upload: chavez

Post on 13-Jan-2016

48 views

Category:

Documents


0 download

DESCRIPTION

Developing Recommendation Techniques for Scholarly Papers. Kazunari Sugiyama. National University of Singapore. Previous Research Topics. Web Information Retrieval (@NAIST) How to Characterize Web page User Adaptive Information Retrieval Disambiguation (@TITECH) - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Developing Recommendation Techniques for Scholarly Papers

Developing Recommendation Techniques for Scholarly Papers

Kazunari Sugiyama

National University of Singapore

Page 2: Developing Recommendation Techniques for Scholarly Papers

Previous Research Topics• Web Information Retrieval (@NAIST)

• How to Characterize Web page

• User Adaptive Information Retrieval

• Disambiguation (@TITECH)• Personal Name Disambiguation in Web Search Results

• Word Sense Disambiguation in Japanese Texts

2

Page 3: Developing Recommendation Techniques for Scholarly Papers

Scholarly Paper Recommendation (@NUS)

3

• Senior researchers

• Junior researchersOnly one recently published paper without citations

Multiple published papers with citation papers

Page 4: Developing Recommendation Techniques for Scholarly Papers

User Profile Construction (Junior Researchers)

4

Weighting schemeCosine similarity

Page 5: Developing Recommendation Techniques for Scholarly Papers

User Profile Construction (Senior Researchers)

5

Weighting schemeCosine similarity

Forgetting factor

Page 6: Developing Recommendation Techniques for Scholarly Papers

Feature Vector Construction for Candidate Papers• Basically, TF-IDF• Also use information about citation and reference papers

6

recpcp 1

References

recp

1refrecp

recpcrecpcrecrecpppp W 11 ffF

11 refrecrefrec ppW f

Weighting schemeCosine similarity

Page 7: Developing Recommendation Techniques for Scholarly Papers

Is Pruning of Citation and Reference Papers Effective?

7

References

ip

1refip 2refip 3refip 4refip lrefip

sim:0.18 sim:0.58 sim:0.22 sim:0.36 sim:0.45

ipcp 1

sim:0.32 sim:0.27 sim:0.42 sim:0.25 sim:0.13

Threshold: 0.3

ipcp 2 ipcp 3 ipcp 4 ik pcp

Page 8: Developing Recommendation Techniques for Scholarly Papers

Is Pruning of Citation and Reference Papers Effective?

8

References

ip

1refip 2refip 3refip 4refip lrefip

sim:0.18 sim:0.58 sim:0.22 sim:0.36 sim:0.45

ipcp 1

sim:0.33 sim:0.27 sim:0.42 sim:0.25 sim:0.13

ipcp 2 ipcp 3 ipcp 4 ik pcp

ipcpW 1

ipcpW 3

2refipW 4refipW lrefipW

Threshold: 0.3

Weighting schemeCosine similarity

Page 9: Developing Recommendation Techniques for Scholarly Papers

ExperimentsExperimental Data• Researchers

• 15 junior researchers

• 13 senior researchersNLP and IR researchers who have publication

lists

in DBLP

• Candidate Papers to Recommend• ACL Anthology Reference Corpus

Includes information about citation and reference papers

9

Page 10: Developing Recommendation Techniques for Scholarly Papers

Junior ResearchersThe most recent paper with pruning its reference papers

10

[NDCG@5]

Pruning is effective!

Page 11: Developing Recommendation Techniques for Scholarly Papers

Senior ResearchersPast published papers with forgetting factor

11

[NDCG@5]

When and are small,FF is effective!

d

Page 12: Developing Recommendation Techniques for Scholarly Papers

ExtensionsCharacterize the target paper using potential papers

Serendipitous recommendation

12

Page 13: Developing Recommendation Techniques for Scholarly Papers

tgtp

(‘06) (‘07)(‘09)

tgtk pcp

(‘05)

tgtpcp 1

13

Potential paper that should cite the target paper

Characterize the target paper using potential papers

Page 14: Developing Recommendation Techniques for Scholarly Papers

Finding potential papers with collaborative filtering

14

pc1 pc2 pc3 pci pcn-1 pcN

p10.212 0.735 0.687

p20.656 0.328 0.436

p30.764 0.527

ptgt0.581 0.330

pN-10.248

pN0.654 0.525

Pi (i=1, … ,N):All papers in the dataset

Pcj (j=1, … ,N):Papers as citation papersin the dataset0.536 0.4720.368 0.211

Page 15: Developing Recommendation Techniques for Scholarly Papers

tgtp

(‘06) (‘07)(‘09)

tgtk pcp

(‘05)

tgtpcp 1

15

Potential paper that should cite the target paper

Characterize the target paper using potential papers

tgtpcp 3tgtN pcp

Page 16: Developing Recommendation Techniques for Scholarly Papers

User 1

User 2

User 3

User n

User profile generated from history of contents

User profile for serendipitous

recommendationUser 4 (Sim: 0.16)Weight: 1/(0.16+1)

User 10 (Sim: 0.26)Weight: 1/(0.26+1)

User 5 (Sim: 0.21)Weight: 1/(0.21+1)

User 1 (Sim: 0.32)Weight: 1/(0.32+1)

User 1 (Sim: 0.14)Weight: 1/(0.14+1)

User profile for serendipitous

recommendation

User 7 (Sim: 0.25)Weight: 1/(0.25+1)

User profile for serendipitous

recommendation

User 6 (Sim: 0.07)Weight: 1/(0.07+1)

User 2 (Sim: 0.12)Weight: 1/(0.12+1)

User profile for serendipitous

recommendation

Serendipitous Recommendation