august 2014 hug : this ain't your father's search engine
DESCRIPTION
This ain't your Father's Search EngineTRANSCRIPT
This ain’t your Parent’s Search Engine!!
!
!
!
!
Grant Ingersoll CTO
Search is dead.
Long Live Search!
Search Tech has evolved• Traditional fuzzy keyword lookup is faster than ever at
ever increasing scale
• Richer data modeling capabilities
• “light relational”
• Advanced types
• Faceting, Aggregations, Analytics
• Spatial, Record linkage, alerting
• Top N problems
(R)Evolutionary Changes in Lucene/Solr• Reduced Memory Usage
• FS(A|T)
• Pluggable Formats and Similarity
• Column-oriented storage (optional)
• Time/Space Integration
• Cursors
• Advanced distributed capabilities
• Joins/Grouping/Pivots
Search + Hadoop• What’s Old is New Again
!
• “Traditional” Use Cases: • Build/Store indexes • https://cwiki.apache.org/confluence/display/solr/
Running+Solr+on+HDFS
!
• Enrichment and Signal processing • PageRank, Statistically Interesting Phrases, etc.
LucidWorks + Hadoop• Ingestion Help
• Flexible Map-Reduce content ingestion supporting: • Directory of files • CSV, Writable, etc. • LogStash • Build Your Own
•Pig Load/Store and UDFs •Hive 2-way support •http://www.lucidworks.com/search-for-hadoop/
Demos
Time Series Search
• Time series search, analysis and visualization
• Data:
• S&P 500 historical data
• Research
Cure what ails you
Signal Processing• Signals power modern relevance!
• Clicks, conversions, sharing, history, signatures and more
• Make it easy to capture and leverage signals
• Power recommendations, analytics, discovery
• Simplify:
• Data workflow
• Operational footprint
Search and Recommendations
• eCommerce data set
• ~1.2M products
• ~4M clicks
Meta
• http://www.lucidworks.com
!
• Twitter: @gsingers