![Page 1: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/1.jpg)
21 May 2010Apache Lucene EuroCon
1
From publisher to platformHow the guardian used content, search, and open source to build a powerful new business modelStephen Dunn, Guardian News and Media
![Page 2: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/2.jpg)
21 May 2010Apache Lucene EuroCon 2
The publishing era
![Page 3: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/3.jpg)
21 May 2010Apache Lucene EuroCon
We started a long time ago:
![Page 4: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/4.jpg)
21 May 2010Apache Lucene EuroCon
To secure the financial and editorial independence of the Guardian in perpetuity. To promote freedom in the press and liberal journalism globally.
To become the world's leading liberal voice.
“To secure the financial and editorial independence of The Guardian in perpetuity.”
“To promote freedom in the press and liberal journalism globally.”
![Page 5: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/5.jpg)
21 May 2010Apache Lucene EuroCon
2010
![Page 6: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/6.jpg)
21 May 2010Apache Lucene EuroCon
Swine flu
Keyword page
Twitter updates
Content partnerships
Audio
Video Data API
Live blogs
Comment
Mobile siteiPhone app
Newspapers
2010
![Page 7: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/7.jpg)
21 May 2010Apache Lucene EuroCon 6
1996
![Page 8: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/8.jpg)
21 May 2010Apache Lucene EuroCon
1999
7
![Page 9: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/9.jpg)
21 May 2010Apache Lucene EuroCon
1999
7
![Page 10: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/10.jpg)
21 May 2010Apache Lucene EuroCon 8
01-> 06
![Page 11: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/11.jpg)
21 May 2010Apache Lucene EuroCon 9
2009★ 1.5M pages
and counting
★ 250M+ pages/month
★ 30M visitors/month
★ 4x Webby award winner (best newspaper site)
![Page 12: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/12.jpg)
21 May 2010Apache Lucene EuroCon 9
2009★ 1.5M pages
and counting
★ 250M+ pages/month
★ 30M visitors/month
★ 4x Webby award winner (best newspaper site)
![Page 13: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/13.jpg)
21 May 2010Apache Lucene EuroCon 9
2009★ 1.5M pages
and counting
★ 250M+ pages/month
★ 30M visitors/month
★ 4x Webby award winner (best newspaper site)
![Page 14: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/14.jpg)
21 May 2010Apache Lucene EuroCon 9
2009★ 1.5M pages
and counting
★ 250M+ pages/month
★ 30M visitors/month
★ 4x Webby award winner (best newspaper site)
![Page 15: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/15.jpg)
21 May 2010Apache Lucene EuroCon 10
Part of the Web
![Page 16: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/16.jpg)
21 May 2010Apache Lucene EuroCon
• “A cool URI is one that does not change” Tim Berners-Lee 1998• 1.5 million resources redirected to new scheme
11
1. Permanent
http://www.flickr.com/photos/fstorr/
![Page 17: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/17.jpg)
21 May 2010Apache Lucene EuroCon 12
2. Addressable★ Resources are “about” something - ready for the
social web.
★ We live in “the age of point-at-things” (Coates 2005)
![Page 18: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/18.jpg)
21 May 2010Apache Lucene EuroCon 13
★ Multiple routes to content
★ Tagging drives discovery
3. Discoverable
![Page 19: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/19.jpg)
21 May 2010Apache Lucene EuroCon 13
★ Multiple routes to content
★ Tagging drives discovery
3. Discoverable
![Page 20: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/20.jpg)
21 May 2010Apache Lucene EuroCon 13
★ Multiple routes to content
★ Tagging drives discovery
3. Discoverable
![Page 21: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/21.jpg)
21 May 2010Apache Lucene EuroCon 13
★ Multiple routes to content
★ Tagging drives discovery
3. Discoverable
![Page 22: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/22.jpg)
21 May 2010Apache Lucene EuroCon 14
![Page 23: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/23.jpg)
21 May 2010Apache Lucene EuroCon
The hackable guardian.co.ukhttp://www.guardian.co.uk/....
![Page 24: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/24.jpg)
21 May 2010Apache Lucene EuroCon
/technology/internet
/technology/all
/environment/climatechange
The hackable guardian.co.ukhttp://www.guardian.co.uk/....
![Page 25: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/25.jpg)
21 May 2010Apache Lucene EuroCon
/technology/internet
/technology/all
/environment/climatechange
The hackable guardian.co.ukhttp://www.guardian.co.uk/....
+business/globaleconomy
![Page 26: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/26.jpg)
21 May 2010Apache Lucene EuroCon
/technology/internet
/technology/all
/environment/climatechange
The hackable guardian.co.ukhttp://www.guardian.co.uk/....
+business/globaleconomy
![Page 27: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/27.jpg)
21 May 2010Apache Lucene EuroCon
/technology/internet
/technology/all
/environment/climatechange
The hackable guardian.co.ukhttp://www.guardian.co.uk/....
/rss
/rss
+business/globaleconomy/rss
![Page 28: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/28.jpg)
21 May 2010Apache Lucene EuroCon
Results...
16
![Page 29: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/29.jpg)
21 May 2010Apache Lucene EuroCon 17
First release
Final ReleaseSite traffic growthUnique Users
![Page 30: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/30.jpg)
21 May 2010Apache Lucene EuroCon 17
3,750,000
7,500,000
11,250,000
15,000,000
18,750,000
22,500,000
26,250,000
30,000,000
Sep 2005 Feb 2006 Jul 2006 Dec 2006 May 2007 Oct 2007 Mar 2008 Aug 2008 Jan 2009
Uni
que
Use
rs
First release
Final ReleaseSite traffic growthUnique Users
![Page 31: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/31.jpg)
21 May 2010Apache Lucene EuroCon 17
3,750,000
7,500,000
11,250,000
15,000,000
18,750,000
22,500,000
26,250,000
30,000,000
Sep 2005 Feb 2006 Jul 2006 Dec 2006 May 2007 Oct 2007 Mar 2008 Aug 2008 Jan 2009
Uni
que
Use
rs Pre - project
First release
Final ReleaseSite traffic growthUnique Users
![Page 32: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/32.jpg)
21 May 2010Apache Lucene EuroCon 17
3,750,000
7,500,000
11,250,000
15,000,000
18,750,000
22,500,000
26,250,000
30,000,000
Sep 2005 Feb 2006 Jul 2006 Dec 2006 May 2007 Oct 2007 Mar 2008 Aug 2008 Jan 2009
Uni
que
Use
rs Pre - project
First release
Final ReleaseSite traffic growthUnique Users
36M
![Page 33: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/33.jpg)
21 May 2010Apache Lucene EuroCon
However...
18
![Page 34: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/34.jpg)
21 May 2010Apache Lucene EuroCon 19
1 Billion+Internet Users!
![Page 35: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/35.jpg)
21 May 2010Apache Lucene EuroCon 20
![Page 36: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/36.jpg)
21 May 2010Apache Lucene EuroCon 21
![Page 37: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/37.jpg)
21 May 2010Apache Lucene EuroCon 22
![Page 38: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/38.jpg)
21 May 2010Apache Lucene EuroCon 23
....”How I stopped worrying about my website and learned to love the whole Internet.”
Matt McAlister
![Page 39: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/39.jpg)
21 May 2010Apache Lucene EuroCon 24
OPEN IN
Bring in data and apps from the Internet
OPEN OUT
Enable partners to build applications using Guardian content and services for other digital platforms
The Open Strategy
![Page 40: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/40.jpg)
21 May 2010Apache Lucene EuroCon 25
![Page 41: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/41.jpg)
21 May 2010Apache Lucene EuroCon 26
![Page 42: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/42.jpg)
21 May 2010Apache Lucene EuroCon 27
![Page 43: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/43.jpg)
21 May 2010Apache Lucene EuroCon 28
"Our most interesting experiments lie in combining what we know with the experience, opinions and expertise of the people who want to participate rather than passively receive.”
![Page 44: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/44.jpg)
21 May 2010Apache Lucene EuroCon 29
BETA
The Open Platform
![Page 45: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/45.jpg)
21 May 2010Apache Lucene EuroCon 30
OPEN OUT
Allow partners to build applications using Guardian content and services for other digital platforms
OPEN IN
Bring in data and apps from the Internet
BETA
OPEN IN
Bring in data and apps from the Internet
![Page 46: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/46.jpg)
21 May 2010Apache Lucene EuroCon 30
OPEN OUT
Allow partners to build applications using Guardian content and services for other digital platforms
OPEN IN
Bring in data and apps from the Internet
BETA
![Page 47: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/47.jpg)
21 May 2010Apache Lucene EuroCon 31
The suite of services enabling partners to build
applications with the Guardian
BETA
![Page 48: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/48.jpg)
21 May 2010Apache Lucene EuroCon
32
BETA
![Page 49: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/49.jpg)
21 May 2010Apache Lucene EuroCon
32
CONTENT API
A service for selecting and
collecting content from the Guardian
for re-use
DATA STORE
A directory of useful data curated by Guardian editors
POLITICS API
Open database of candidates, voting
records, constituencies, election results,
live data on election day
BETA
![Page 50: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/50.jpg)
21 May 2010Apache Lucene EuroCon
Guardian database
CMSSearch engine
REST API
Your App Here!BETA
CONTENT APIA service for selecting and collecting content from the Guardian for
re-use
![Page 51: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/51.jpg)
21 May 2010Apache Lucene EuroCon 34
BETA
![Page 52: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/52.jpg)
21 May 2010Apache Lucene EuroCon 35• Stamen Design - APIMaps.org
![Page 53: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/53.jpg)
21 May 2010Apache Lucene EuroCon 36
Text
![Page 54: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/54.jpg)
21 May 2010Apache Lucene EuroCon
BETA
DATA STOREA directory of
useful data curated by Guardian
editors
![Page 55: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/55.jpg)
21 May 2010Apache Lucene EuroCon
BETA
POLITICS APIOpen database of candidates, voting
records, constituencies, election results, live data on election day
![Page 56: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/56.jpg)
21 May 2010Apache Lucene EuroCon 39
POLITICS APIOpen database of candidates, voting
records, constituencies, election results, live data on election day
BETA
![Page 57: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/57.jpg)
21 May 2010Apache Lucene EuroCon 40
Open for Business
BETA
![Page 58: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/58.jpg)
21 May 2010Apache Lucene EuroCon 40
Open for Business
![Page 59: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/59.jpg)
21 May 2010Apache Lucene EuroCon 41
3 Tiers of access, 3 Revenue models
BESPOKE: Take, reformat, augment our content. Same access as Guardian. Revenue model to be negotiated. Combination of Media, Fees, Downloads.
APPROVED: Take our full article content, with an advert. Guardian keeps ad revenue, you keep rest-of-page revenue
KEYLESS: Take our headlines. You keep associated revenues
1
![Page 60: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/60.jpg)
21 May 2010Apache Lucene EuroCon 42
![Page 61: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/61.jpg)
21 May 2010Apache Lucene EuroCon 43
OPEN OUT: Developers can now access our full content APIs on demand with keys post-approved.
We are now positioning the platform as a place to do business with us.
So, rapid scalability, reliability, performance, are now core requirements
What this means
![Page 62: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/62.jpg)
21 May 2010Apache Lucene EuroCon
44
CONTENT APIA service for selecting and collecting content from the Guardian for
re-use
DATA STOREA directory of
useful data curated by Guardian
editors
POLITICS APIOpen database of candidates, voting
records, constituencies,
election results, live data on election day
2 Open In
![Page 63: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/63.jpg)
21 May 2010Apache Lucene EuroCon
44
CONTENT APIA service for selecting and collecting content from the Guardian for
re-use
DATA STOREA directory of
useful data curated by Guardian
editors
POLITICS APIOpen database of candidates, voting
records, constituencies,
election results, live data on election day
MICROAPPSA framework for
integrating 3rd party applications into guardian.co.uk.
2 Open In
![Page 64: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/64.jpg)
21 May 2010Apache Lucene EuroCon 45
OPEN OUT
Allow partners to build applications using Guardian content and services for other digital platforms
OPEN IN
Bring in data and apps from the Internet
![Page 65: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/65.jpg)
21 May 2010Apache Lucene EuroCon 46
![Page 66: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/66.jpg)
21 May 2010Apache Lucene EuroCon 47
![Page 67: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/67.jpg)
21 May 2010Apache Lucene EuroCon 48
App showcase
![Page 68: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/68.jpg)
21 May 2010Apache Lucene EuroCon 49
What this meansOpen In: Partners can now more easily integrate into our core
The Open Platform will become key to our commercial future.
![Page 69: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/69.jpg)
21 May 2010Apache Lucene EuroCon 50
Evolving the architecture
![Page 70: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/70.jpg)
21 May 2010Apache Lucene EuroCon 51
From Publisher to Platform
★Seeking massive growth, but no longer only broadcasting content
★User/partner engagement & contribution on★journalism★data★software★applications★revenue and ads
★ Support developers and partners with data and APIs, need scalability, reliability, speed
![Page 71: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/71.jpg)
21 May 2010Apache Lucene EuroCon
App server App server App server
Web server Web server Web server
CMS
Oracle
Memcached
![Page 72: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/72.jpg)
21 May 2010Apache Lucene EuroCon
App server App server App server
Web server Web server Web server
CMS Data feeds
Oracle
Memcached
Why RDBMS?
5 years ago, fewer alternatives
Understand operations procedures
Can easily recruit DBAs / devs
Developer/ops tools
Business critical system: a safe choice
![Page 73: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/73.jpg)
21 May 2010Apache Lucene EuroCon 54
Scaling
![Page 74: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/74.jpg)
21 May 2010Apache Lucene EuroCon 55
Unique Users
![Page 75: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/75.jpg)
21 May 2010Apache Lucene EuroCon 55
3,750,000
7,500,000
11,250,000
15,000,000
18,750,000
22,500,000
26,250,000
30,000,000
Sep 2005 Feb 2006 Jul 2006 Dec 2006 May 2007 Oct 2007 Mar 2008 Aug 2008 Jan 2009
Uni
que
Use
rs
Unique Users
![Page 76: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/76.jpg)
21 May 2010Apache Lucene EuroCon 56
Unique Users
![Page 77: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/77.jpg)
21 May 2010Apache Lucene EuroCon
12,250,00014,500,00016,750,00019,000,00021,250,00023,500,00025,750,00028,000,000
May 2008 Jul 2008 Sep 2008 Nov 2008 Jan 200956
Unique Users
![Page 78: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/78.jpg)
21 May 2010Apache Lucene EuroCon
Whatʼs going on?
57
★We tag our content (multifaceted)
★Guardian.co.uk is a faceted browse through our tag-space, with editorial teams “spotlighting” key resources on selected nodes.
★Can apply multiple facets in queries faster in a search-like architecture, than an RDBMS
![Page 79: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/79.jpg)
21 May 2010Apache Lucene EuroCon
Whatʼs going on?
57
★We tag our content (multifaceted)
★Guardian.co.uk is a faceted browse through our tag-space, with editorial teams “spotlighting” key resources on selected nodes.
★Can apply multiple facets in queries faster in a search-like architecture, than an RDBMS
![Page 80: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/80.jpg)
21 May 2010Apache Lucene EuroCon
Whatʼs going on?
57
★We tag our content (multifaceted)
★Guardian.co.uk is a faceted browse through our tag-space, with editorial teams “spotlighting” key resources on selected nodes.
★Can apply multiple facets in queries faster in a search-like architecture, than an RDBMS
![Page 81: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/81.jpg)
21 May 2010Apache Lucene EuroCon 58
“Related content” from search engine
![Page 82: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/82.jpg)
21 May 2010Apache Lucene EuroCon
59
![Page 83: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/83.jpg)
21 May 2010Apache Lucene EuroCon
Guardian database
CMSSearch engine
REST API
Your App Here!
CONTENT APIA service for selecting and collecting content from the Guardian for
re-use
![Page 84: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/84.jpg)
21 May 2010Apache Lucene EuroCon 61
![Page 85: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/85.jpg)
21 May 2010Apache Lucene EuroCon
We used Solr/LuceneCan perform complex queries, including full text search
We can change the schema with no downtime.
On our dataset most queries are of a similar cost
Scales very well horizontally
Replication makes it easy to work in the cloud
62
![Page 86: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/86.jpg)
21 May 2010Apache Lucene EuroCon
App server
Web servers
CMS
Memcached
Core
rdbms
63
![Page 87: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/87.jpg)
Solr
Content API
Cloud, EC2
21 May 2010Apache Lucene EuroCon
App server
Web servers
CMS
Memcached
Core
Solr
Solr
Solr
Solr
Solr
rdbms
63
![Page 88: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/88.jpg)
21 May 2010Apache Lucene EuroCon
MICROAPPSA framework for
integrating 3rd party applications into guardian.co.uk.
Simple REST/ HTTP framework allows lightweight development
Applications proxied for performance
Apps generally hosted in the cloud, hot deployment into production
Open in?
![Page 89: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/89.jpg)
21 May 2010Apache Lucene EuroCon
MICROAPPSA framework for
integrating 3rd party applications into guardian.co.uk.
Simple REST/ HTTP framework allows lightweight development
Applications proxied for performance
Apps generally hosted in the cloud, hot deployment into production
Open in?
![Page 90: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/90.jpg)
21 May 2010Apache Lucene EuroCon
App server
Web servers
CMS
Memcached
Core
App
App
App
App
App
App
Apps
Proxy
external hostingapp engine etc
rdbms
65
![Page 91: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/91.jpg)
21 May 2010Apache Lucene EuroCon
App servers
Web servers
CMS
Memcached
Solr
Solr
Solr
Solr
Solr
Solr
Cloud, EC2
App
App
App
App
App
App
Proxyexternal hostingapp engine etc
rdbms
OPEN IN OPEN OUT
![Page 92: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/92.jpg)
21 May 2010Apache Lucene EuroCon
C
Clo
OI
external r
C
Clo
OI
external r
CONTENT
???????
![Page 93: From Publisher To Platform: How The Guardian Used Content, Search, and Open Source To Build a Powerful New Business Model](https://reader034.vdocuments.us/reader034/viewer/2022051515/554d87a2b4c9053e0c8b5342/html5/thumbnails/93.jpg)
21 May 2010Apache Lucene EuroCon 68
Thank you
http://www.guardian.co.uk/open-platform
Twitter: @openplatform @cuica (Stephen Dunn)