towards exploratory relationship search: a clustering-based approach

23
Towards Exploratory Relationship Search: A Clustering-Based Approach Yanan Zhang, Gong Cheng, Yuzhong Qu Nanjing University, China

Upload: gong-cheng

Post on 15-Jun-2015

450 views

Category:

Technology


0 download

DESCRIPTION

Presented at JIST2013, Seoul, Korea.

TRANSCRIPT

Page 1: Towards Exploratory Relationship Search: A Clustering-based Approach

Towards Exploratory Relationship Search: A Clustering-Based Approach

Yanan Zhang, Gong Cheng, Yuzhong QuNanjing University, China

Page 2: Towards Exploratory Relationship Search: A Clustering-based Approach

Outline

• Motivation• Challenges• Approach• Evaluation• Conclusion

Page 3: Towards Exploratory Relationship Search: A Clustering-based Approach

Outline

• Motivation• Challenges• Approach• Evaluation• Conclusion

Page 4: Towards Exploratory Relationship Search: A Clustering-based Approach

Relationship search

Page 5: Towards Exploratory Relationship Search: A Clustering-based Approach

Searching graph-structured data

relatonship = path

Page 6: Towards Exploratory Relationship Search: A Clustering-based Approach

Too many results!

Page 7: Towards Exploratory Relationship Search: A Clustering-based Approach

Exploratory relationship search

• Exploring a set of relationships interactively and continuously

clustering(our solution: RelClus)

faceted categories(RelFinder)

Page 8: Towards Exploratory Relationship Search: A Clustering-based Approach

Outline

• Motivation• Challenges• Approach• Evaluation• Conclusion

Page 9: Towards Exploratory Relationship Search: A Clustering-based Approach

Challenges

• How to meaningfully label a cluster?• How to make sense of a cluster hierarchy?• How to measure similarity between clusters?

Agglomerative hierarchical clustering• Initially: relationships singleton clusters• Then: progressively merge the most similar pair

Page 10: Towards Exploratory Relationship Search: A Clustering-based Approach

Outline

• Motivation• Challenges• Approach• Evaluation• Conclusion

Page 11: Towards Exploratory Relationship Search: A Clustering-based Approach

Relationship pattern

• High-level abstraction of relationships– Vertices: entities or classes– Edges: properties (undirected)

Page 12: Towards Exploratory Relationship Search: A Clustering-based Approach

How to meaningfully label a cluster?

• Using a leastest common relationship pattern– Vertices: leastest common classes (or entities)– Edges: leastest common properties

Person

label({R4, R5}) = P1

P1

R4

R5

Page 13: Towards Exploratory Relationship Search: A Clustering-based Approach

How to make sense of a cluster hierarchy?

• subPatternOf ( )⊑– Vertices: s.t. subClassOf (or instance-type)– Edges: s.t. subPropertyOf

P3

P2

P1

P2 P⊑ 3, P1 P⊑ 3

Page 14: Towards Exploratory Relationship Search: A Clustering-based Approach

How to measure similarity between clusters?

• sim(Ci,Cj) = how many commonalities they share

which are exactly captured by label(Ci C∪ j)– Measure: -log (probability of seeing label(Ci C∪ j))

i.e. the information content associated with label(Ci C∪ j)– Probability estimation: based on the data set

P3

P2

P1

Page 15: Towards Exploratory Relationship Search: A Clustering-based Approach

A running exampleP3

P2

P1

R4

R5

R1

R2

R3

Page 16: Towards Exploratory Relationship Search: A Clustering-based Approach

Outline

• Motivation• Challenges• Approach• Evaluation• Conclusion

Page 17: Towards Exploratory Relationship Search: A Clustering-based Approach

Design• Data set: DBpedia• Systems

– RList: just a list of all results– RFacet: w/ faceted categories (similar to RelFinder)– RClus: w/ hierarchical clustering (our solution)

• Participants and tasks– 2 participants provide searh tasks

• 3 (well-defined) lookup tasks• 3 (open) exploratory search tasks

– 15 participants carry out tasks

• Metrics– Questionnaire– SUS– User feedback

Page 18: Towards Exploratory Relationship Search: A Clustering-based Approach

Questionnaire results

Page 19: Towards Exploratory Relationship Search: A Clustering-based Approach

Some inspiring user feedback

• Dislike deep hierarchies• Expect more concise visualization• Need more cognitive support

Page 20: Towards Exploratory Relationship Search: A Clustering-based Approach

Performance testing

Page 21: Towards Exploratory Relationship Search: A Clustering-based Approach

Outline

• Motivation• Challenges• Approach• Evaluation• Conclusion

Page 22: Towards Exploratory Relationship Search: A Clustering-based Approach

Conclusion

• Goal: clustering-based exploratory relationship search• Approach: pattern-centric

• Future work– Combining faceted categories and hierarchical clustering– Going beyond them

Page 23: Towards Exploratory Relationship Search: A Clustering-based Approach