building saas solutions for online media using apache solr

36
Building SaaS solutions with Apache Solr Alberto Mijares, Canoo Engineering AG [email protected], 26/05/2011 Twitter: @lemaiol

Upload: lucidworks-archived

Post on 11-May-2015

1.388 views

Category:

Documents


3 download

DESCRIPTION

In the last years, the idea of building applications that can be used remotely by mean of the Web, has coined a new concept called "Software as a Service". Such applications, have the advantage of a remote web deployment that can be instantaneously be used by potentially any consumer in internet or of the cost reduction that a Web-based deployment provides.

TRANSCRIPT

Page 1: Building SaaS Solutions for Online Media Using Apache Solr

Building SaaS solutions with Apache Solr

Alberto Mijares, Canoo Engineering AG [email protected], 26/05/2011

Twitter: @lemaiol

Page 2: Building SaaS Solutions for Online Media Using Apache Solr

Bullet point time!

3

Page 3: Building SaaS Solutions for Online Media Using Apache Solr

What I Will Cover §  Practical applications of Apache Solr and

Apache Lucene: how to increase the time spent by a user in an website and do website “cross-selling”.

§  Use case: how Canoo helped Axel Springer Switzerland to increased the page impressions, user permanence time and traffic in their financial online newspapers.

§  Key concepts: •  How to achieve this using Lucene & Solr •  How to profit from a SaaS business model

4

Page 4: Building SaaS Solutions for Online Media Using Apache Solr

Who I am §  Alberto Mijares §  Canoo Engineering AG §  Background in web applications and standards:

•  Participated in W3C Semantic Web interest group (SWEO)

•  Led web standards compliance tools development in the past (Web Accessibility and Mobile Web)

•  Led enterprise information retrieval projects in the recent past

•  Actually coaching Google Web Toolkit projects’ development

5

Page 5: Building SaaS Solutions for Online Media Using Apache Solr

Who is Canoo §  People:

•  Dirk Koenig: Groovy founder •  Andres Almiray: Griffon project lead and Java

Champion •  Hamlet D’Arcy: Groovy committer and enthusiast •  … almost 40 more top software engineers

6

§  Products: •  WebTest: framework for web functional testing •  RIA Suite (aka ULC): Java based RIA framework •  FindIT: information retrieval and search tools •  WMTrans: language analysis tools

Page 6: Building SaaS Solutions for Online Media Using Apache Solr

Canoo FindIT

http://www.canoo.com/videos/FindIT.html

7

Page 7: Building SaaS Solutions for Online Media Using Apache Solr

Stop “bullet-pointing”!

8

Page 8: Building SaaS Solutions for Online Media Using Apache Solr

The facts

9

Axel Springer group is a market leader

Bilanz, Handelszeitung and Stocks

In Switzerland financials are important!

Financial language is German

Online media is the future

Page 9: Building SaaS Solutions for Online Media Using Apache Solr

The facts

10

Axel Springer group is a market leader

Bilanz, Handelszeitung and Stocks

In Switzerland financials are important!

Financial language is German

Online media is the future

Page 10: Building SaaS Solutions for Online Media Using Apache Solr

The gap

Make the online versions more profitable

11

Make all newspapers “market leaders”

Page 11: Building SaaS Solutions for Online Media Using Apache Solr

The gap

Make the online versions more profitable

12

Make all newspapers “market leaders”

Page 12: Building SaaS Solutions for Online Media Using Apache Solr

The how

Workshop

13

“Related articles”

“Cross-selling”

Page 13: Building SaaS Solutions for Online Media Using Apache Solr

The how

Workshop

14

“Related articles”

“Cross-selling”

Page 14: Building SaaS Solutions for Online Media Using Apache Solr

The analysis

Find a funding model

15

Use Lucene’s “More like this”

Integrate back the suggestions

Implement a selection mechanism

Page 15: Building SaaS Solutions for Online Media Using Apache Solr

The analysis

Find a funding model

16

Use Lucene’s “More like this”

Integrate back the suggestions

Implement a selection mechanism

Page 16: Building SaaS Solutions for Online Media Using Apache Solr

The issues “More like this” was “experimental”

17

Works out-of-the-box only in English

Without “semantics” not always makes sense

Indexing full pages produces noise

Page 17: Building SaaS Solutions for Online Media Using Apache Solr

The issues “More like this” was “experimental”

18

Works out-of-the-box only in English

Without “semantics” not always makes sense

Indexing full pages produces noise

Page 18: Building SaaS Solutions for Online Media Using Apache Solr

The key

19

Page 19: Building SaaS Solutions for Online Media Using Apache Solr

The key

20

Page 20: Building SaaS Solutions for Online Media Using Apache Solr

The functional requirements

Discover and index articles

21

Extract only content

Simple and flexible query service

Page 21: Building SaaS Solutions for Online Media Using Apache Solr

The functional requirements

Discover and index articles

22

Extract only content

Simple and flexible query service

Page 22: Building SaaS Solutions for Online Media Using Apache Solr

The funding model

23

Page 23: Building SaaS Solutions for Online Media Using Apache Solr

The business model

24

SaaS

Page 24: Building SaaS Solutions for Online Media Using Apache Solr

The “other” requirements Lucene-based analysis pipeline

25

Web oriented platform

Multi-application platform

Reliable, fast and scalable

Plan B?

Page 25: Building SaaS Solutions for Online Media Using Apache Solr

The “other” requirements Lucene-based analysis pipeline

26

Web oriented platform

Multi-application platform

Reliable, fast and scalable

Plan B?

Page 26: Building SaaS Solutions for Online Media Using Apache Solr

The search Wraps Lucene in a nice way

27

It is mature and Open Source

Supports scheduling, REST API, DIH,…

Scalability out-of-the-box

Well documented and has professional support

Page 27: Building SaaS Solutions for Online Media Using Apache Solr

The search Wraps Lucene in a nice way

28

It is mature and Open Source

Supports scheduling, REST API, DIH…

Scalability out-of-the-box

Well documented and has professional support

Page 28: Building SaaS Solutions for Online Media Using Apache Solr

The plan

From POC to PROD in “80 days”

29

Page 29: Building SaaS Solutions for Online Media Using Apache Solr

The plan

From POC to PROD in “80 days”

30

Page 30: Building SaaS Solutions for Online Media Using Apache Solr

The results

Google analytics

31

Page 31: Building SaaS Solutions for Online Media Using Apache Solr

The results

Google analytics

32

Page 32: Building SaaS Solutions for Online Media Using Apache Solr

The conclusions

33

Page 33: Building SaaS Solutions for Online Media Using Apache Solr

The Q&A

34

Thanks!

Page 34: Building SaaS Solutions for Online Media Using Apache Solr

Sources §  Links

•  http://people.canoo.com/share •  http://www.canoo.com •  http://www.canoo.net •  http://www.leo.org •  http://www.bilanz.ch •  http://www.handelszeitung.ch •  http://www.stocks.ch

35

Page 35: Building SaaS Solutions for Online Media Using Apache Solr

Contact §  Alberto Mijares

•  [email protected] •  Twitter: @lemaiol

36

Page 36: Building SaaS Solutions for Online Media Using Apache Solr

Architecture

Platform: Apache Solr 1.4.1 Architecture:

Solr container Web container

Springer Solr Springer WebApp

Customer 2 Solr Customer 2 WebApp

Customer 3 Solr Customer 3 WebApp

Extern access Intern access

Requests