beyond basic faceted search ben-yitzhak, et al
DESCRIPTION
Beyond Basic Faceted Search Ben-Yitzhak, et al. Fahimeh Fakour CS 572 Summer 2010. Introduction. Importance and significance Background Information Objective Related work Approach and Solutions Enhancements Contributions Pros & Cons. 1. Importance and Significance. Too much info - PowerPoint PPT PresentationTRANSCRIPT
Beyond Basic Faceted SearchBen-Yitzhak, et al.
Fahimeh FakourCS 572Summer 2010
Introduction1. Importance and significance2. Background Information3. Objective4. Related work5. Approach and Solutions6. Enhancements7. Contributions8. Pros & Cons
7/7/2010 2Beyond Basic Faceted Search
1. Importance and Significance
• Too much info• Transactions
7/7/2010 3Beyond Basic Faceted Search
1. Importance and Significance (cont)
• Categories, lists, and the human mind
7/7/2010 4Beyond Basic Faceted Search
7/7/2010 Beyond Basic Faceted Search 5
• Research done in IBM & Yahoo Research labs
• Facets, buckets, and categories– Navigate multiple paths for different
ordering
• Free text queries• List of matching
documents with count
2. Background Information
3. Objective
• Extend traditional facet – Beyond numbersNumbers
Words
• Search & Index correlated documents
• Similarity to OLAP: multi-dimensional data
7/7/2010 6Beyond Basic Faceted Search
4. Related Work• Multifaceted search– Lexical subsumption– Synsets and hypernym– RawSugar social tagging
• Online Analytical Processing (OLAP)– Multi-dimensional data– Aggregation of data: Cube
• N-dimensional “group by”
Exciting new technique
7/7/2010 7Beyond Basic Faceted Search
5. Approach & Solutions5.1 Technologies: Lucene & Solr5.2 Data Model5.3 Facet hierarchy: Forest5.4 Creating the facet paths5.5 Running the facet query5.6 Example
7/7/2010 8Beyond Basic Faceted Search
5.1. Technologies: Lucene & Solr
• Posting element:
docID, offset, payload
• Matching document processing
byte array of additional info (runtime accessible)
7/7/2010 9Beyond Basic Faceted Search
5.2. Data Model• Taxonomy: hierarchical
relationships among facets– Predefined taxonomy– Acquired/Learned through
documents
• Facet-path forest– Tree: top-level facet
7/7/2010 10Beyond Basic Faceted Search
5.3. Facet hierarchy: Forest
7/7/2010 11Beyond Basic Faceted Search
5.4. Creating the facet paths
• Posting element for document for each prefix of Pi
• Add path to taxonomy index• Encode all k paths related to this
document
7/7/2010 12Beyond Basic Faceted Search
5.5. Running the facet query
• Terms:– Faceted query string + taxonomy
subtrees– Faceted result set ranked list of
documents matching query + counters
• Lucene: use the Taxonomy Index function to determine ordinal number of paths
7/7/2010 13Beyond Basic Faceted Search
5.6. Example
Facet$clothing: doc1,doc2Facet$clothing$children’s: doc1Facet$clothing: doc1,doc2Facet$clothing$children’s: doc1
7/7/2010 14Beyond Basic Faceted Search
6. Enhancements
7/7/2010 15Beyond Basic Faceted Search
6.1. Business Intelligence• Qualitative rather than quantitative– Best sellers rather than number of
books published by author
7/7/2010 16Beyond Basic Faceted Search
6.2. Dynamic Facets: Welcome to the real world
• Not always independent data
• Example:– Running shorts • Different sizes per color• Location & price
7/7/2010 17Beyond Basic Faceted Search
6.2. Dynamic Facets: Solution
• Use tree over the data
7/7/2010 18Beyond Basic Faceted Search
6.2. Dynamic Facets: Solution (cont)
7/7/2010 19Beyond Basic Faceted Search
7. Contributions• “rich” aggregation : qualitative• Engineering details• Correlation in facet values
7/7/2010 20Beyond Basic Faceted Search
8.1. Pros • Detailed description of engineering
aspects & design decisions• Use of implemented technologies• Clearly defines the scope of the paper• Give foundation/background
information• Compatible with real life data
7/7/2010 21Beyond Basic Faceted Search
8.2. Cons • Experiments and testing: No
qualitative measurement– effectiveness of “qualitative” facets
• Not explain relevance of some of the previous work
• Criteria for display/grouping?– Key use cases & known user access
patterns not explained
• Build taxonomy: depth/breadth?
7/7/2010 22Beyond Basic Faceted Search
Thank You
7/7/2010 23Beyond Basic Faceted Search
References
247/7/2010 24Beyond Basic Faceted Search
Ben-Yitzhak, et al. “Beyond Basic Faceted Search”. Proceedings of the international conference on Web search and web data mining. Pp.33-44, 2008. <http://nadav.harel.org.il/papers/p33-ben-yitzhak.pdf>
“Faceted Search with Solr” Lucid Imagination. July 1, 2010. <http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Faceted-Search-Solr >
“Faceted classification” Wikipedia. July 7, 2010 <http://en.wikipedia.org/wiki/Faceted_classification >
Lemieux, Earley, and Associates. “Designing for Faceted Search” User Interface Engineering. July 6, 2010 <http://www.uie.com/articles/faceted_search/>(Originally in KM World, March 2009)
Mattman, Chris. “Query Models” (presentation slides for class)