august 2014 hug : this ain't your father's search engine

13
This ain’t your Parent’s Search Engine Grant Ingersoll CTO

Upload: yahoo-developer-network

Post on 15-Jan-2015

197 views

Category:

Education


0 download

DESCRIPTION

This ain't your Father's Search Engine

TRANSCRIPT

Page 1: August 2014 HUG : This ain't your Father's Search Engine

This ain’t your Parent’s Search Engine!!

!

!

!

!

Grant Ingersoll CTO

Page 2: August 2014 HUG : This ain't your Father's Search Engine

Search is dead.

Page 3: August 2014 HUG : This ain't your Father's Search Engine

Long Live Search!

Page 4: August 2014 HUG : This ain't your Father's Search Engine

Search Tech has evolved• Traditional fuzzy keyword lookup is faster than ever at

ever increasing scale

• Richer data modeling capabilities

• “light relational”

• Advanced types

• Faceting, Aggregations, Analytics

• Spatial, Record linkage, alerting

• Top N problems

Page 5: August 2014 HUG : This ain't your Father's Search Engine

(R)Evolutionary Changes in Lucene/Solr• Reduced Memory Usage

• FS(A|T)

• Pluggable Formats and Similarity

• Column-oriented storage (optional)

• Time/Space Integration

• Cursors

• Advanced distributed capabilities

• Joins/Grouping/Pivots

Page 6: August 2014 HUG : This ain't your Father's Search Engine

Search + Hadoop• What’s Old is New Again

!

• “Traditional” Use Cases: • Build/Store indexes • https://cwiki.apache.org/confluence/display/solr/

Running+Solr+on+HDFS

!

• Enrichment and Signal processing • PageRank, Statistically Interesting Phrases, etc.

Page 7: August 2014 HUG : This ain't your Father's Search Engine

LucidWorks + Hadoop• Ingestion Help

• Flexible Map-Reduce content ingestion supporting: • Directory of files • CSV, Writable, etc. • LogStash • Build Your Own

•Pig Load/Store and UDFs •Hive 2-way support •http://www.lucidworks.com/search-for-hadoop/

Page 8: August 2014 HUG : This ain't your Father's Search Engine

Demos

Page 9: August 2014 HUG : This ain't your Father's Search Engine

Time Series Search

• Time series search, analysis and visualization

• Data:

• S&P 500 historical data

• Twitter

• Research

Page 10: August 2014 HUG : This ain't your Father's Search Engine

Cure what ails you

Page 11: August 2014 HUG : This ain't your Father's Search Engine

Signal Processing• Signals power modern relevance!

• Clicks, conversions, sharing, history, signatures and more

• Make it easy to capture and leverage signals

• Power recommendations, analytics, discovery

• Simplify:

• Data workflow

• Operational footprint

Page 12: August 2014 HUG : This ain't your Father's Search Engine

Search and Recommendations

• eCommerce data set

• ~1.2M products

• ~4M clicks

Page 13: August 2014 HUG : This ain't your Father's Search Engine

Meta

• http://www.lucidworks.com

[email protected]

!

• Twitter: @gsingers