The Emergence of Big Data

Big Data, Internet Research Group, April 2012, John Katsaros [email protected]

Upload: johnkatsaros

Post on 22-Apr-2015

DESCRIPTION

Overview of the emerging Big Data market and the growth of the Hadoop Ecosystem - forecast for growth, important segments and start-up funding

TRANSCRIPT

Page 1: The Emergence of Big Data

Big Data
Internet Research Group
April 2012
John Katsaros [email protected]

Page 2: The Emergence of Big Data

In 2011, organizations realized that they were sitting on an information goldmine. Rather than discarding datasets that seemed too costly to analyze, businesses reached the point where they could affordably analyze vast amounts of information and unlock valuable insights.

Page 3: The Emergence of Big Data

A new industry was launched – Big Data

Page 4: The Emergence of Big Data

Big Data can trace its roots to Google: in 2004 two Google engineers, Jeffrey Dean and Sanjay Ghemawat, published a USENIX paper, "MapReduce: Simplified Data Processing on Large Clusters."

Subsequently, Doug Cutting at Yahoo! developed an open source version named Hadoop.
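
As a rough illustration of the MapReduce idea, here is a single-process sketch in Python that counts words across a set of documents. It is not Google's or Hadoop's implementation; the function names (map_phase, shuffle, reduce_phase, word_count) and the sample input are assumptions made up for this example.

```python
from collections import defaultdict


def map_phase(document):
    """Map step: emit a (word, 1) pair for every word in one document."""
    for word in document.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: combine the values for one key into a single count."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce in sequence over all documents."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    # Hypothetical input: three short web-server log lines.
    logs = ["GET /home", "GET /cart", "POST /cart"]
    print(word_count(logs))
```

In a real cluster the map and reduce steps would run in parallel on many machines and the shuffle would move data across the network; this sketch only shows how the three steps fit together.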

Page 5: The Emergence of Big Data

The Big Data Market hit the radar in 2011

Chart: Big Data Market ($M), 2011 to 2016; vertical axis from $0 to $1,600. Source: IRG Research

Page 6: The Emergence of Big Data

Big Data is often machine generated and includes, but is not limited to, click stream data, log files (servers, network equipment, apps…), alerts (network, security devices…), and social media content.

Figure labels: Click Stream, Logs, Alerts, Social Media

Page 7: The Emergence of Big Data

While MapReduce was being developed, server prices continued to drop and Amazon's AWS service began offering servers priced by the hour. Together, these developments made computing more affordable, especially for Big Data applications.

Server Prices

Server = 8 cents/hour
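
To make the hourly pricing concrete, here is a small arithmetic sketch in Python. The 8 cents/hour rate comes from the slide; the job size (100 servers for 10 hours) and the function name cluster_cost are hypothetical and chosen only for illustration.

```python
HOURLY_RATE_USD = 0.08  # from the slide: one server = 8 cents/hour


def cluster_cost(servers, hours, hourly_rate=HOURLY_RATE_USD):
    """Return the rental cost in dollars for `servers` machines over `hours`."""
    return servers * hours * hourly_rate


if __name__ == "__main__":
    # Hypothetical job: 100 servers rented for 10 hours.
    print(f"${cluster_cost(100, 10):.2f}")  # prints "$80.00"
```

At that rate, a job that would tie up a purchased cluster can instead be rented for about the price of a business lunch, which is the affordability point the slide is making.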

Page 8: The Emergence of Big Data

The early adopters of Big Data ran large Web properties like eBay and Facebook: organizations that had a lot of click stream data and large numbers of registered users.

1. Personalizing the visit: leads to longer visits
2. Spam mitigation: reduces annoyance
3. Suggesting friends you may know: increases member interconnections

Page 9: The Emergence of Big Data

Recently, enterprises that operate large Web sites have begun working with Hadoop and Big Data.

1. Customizing the visit: increases brand awareness
2. Suggesting products (e.g., financial services)
3. Cross-selling/multi-channel CRM (e.g., Disney)

Page 10: The Emergence of Big Data

Enterprise adoption of Big Data will grow quickly, and within 18 months enterprise spending will exceed the amount spent by large Web sites.

Chart: Big Data Segment Spending Comparison, 2011 to 2016 (Enterprises vs. Large Web Sites). Source: IRG Research

Page 11: The Emergence of Big Data

In 2013 Enterprise Big Data Spending will surpass Large Website Spending

Chart: Big Data Segment Spending Forecast ($M), 2011 to 2016 (Enterprises vs. Large Web Sites); vertical axis from $0 to $1,200. Source: IRG Research

Page 12: The Emergence of Big Data

Finally, when (and how) will SMBs and enterprises not running large Web properties adopt Big Data technology?

1. When Business Intelligence products become usable and widely available

2. Through platforms like Splunk, which already has 3,500 IT user organizations

3. When SMBs are presented with compelling value propositions

Page 13: The Emergence of Big Data

Meanwhile, Big Data developers are extending functionality, ranging from better, easier-to-manage systems to higher-performance systems to simplified Business Intelligence platforms.

Source: Hortonworks

Page 14: The Emergence of Big Data

There seems to be plenty of money available to fund Big Data companies.

Accel Partners Launches $100M Big Data Fund

Accel's Big Data Fund aims to fund transformative early stage and growth companies throughout the Big Data ecosystem, from next generation storage and data management platforms to a wide range of revolutionary software applications and services – i.e. data analytics, business intelligence, collaboration, mobile, vertical applications and many more. We believe the future multi-billion software companies will emerge from the Big Data ecosystem.

Page 15: The Emergence of Big Data

The Hadoop Ecology

Diagram segments: Data Presentation, NoSQL, Hadoop Releases, Hadoop Infrastructure Management, Big Data Analytics, HStreaming, Data Integration

Page 16: The Emergence of Big Data

Our list of interesting Big Data startups

Company | Investment | Investors | Location
10gen | $31M | Sequoia, Flybridge and Union Square Ventures | Redwood Shores, CA
BackType | $1.32M | Y Combinator, True Ventures, lowercase, Freestyle | San Francisco, CA
Cloudera | $76M | Greylock, Meritech Capital, Accel, Ignition Partners | Palo Alto, CA
Couchbase | $30M | Accel Partners, Ignition Partners, Mayfield, North Bridge, Docomo | Mountain View, CA
DataStax | $13.7M | Lightspeed, Crosslink Capital | Burlingame, CA
Datameer | $11.75M | Kleiner Perkins, Redpoint | San Mateo, CA
Hadapt | $9.5M | Norwest Venture Partners, Bessemer Venture Partners | Cambridge, MA
Hortonworks | | Benchmark Capital | Sunnyvale, CA
HStreaming | | | Chicago, IL
Karmasphere | $5M | Hummer Winblad, USVP | Cupertino, CA
Kitenga | | | Santa Clara, CA
MapR Technologies | $9M | Lightspeed, NEA | San Jose, CA
Mintigo | $9M | Sequoia, Giza | Menlo Park, CA
Neo Technologies | $10.6M | Fidelity, Sunstone, Conor | Menlo Park, CA
Pentaho | $32M | Benchmark, Index, NEA | Orlando, FL
StackIQ | $3M | Antham, Avalon | La Jolla, CA
Talend | $28M | Balderton Capital, AGF Private Equity, Galileo Partners | Los Altos, CA
Tableau Software | $15M | NEA, Meritech Capital Partners | Seattle, WA
Total Investment | $284M+ | |

Page 17: The Emergence of Big Data

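And the Venture Companies that Funded these startups

Investors (matrix columns): Sequoia, Flybridge, Union Square, Y Combinator, True, lowercase, Freestyle, Greylock, Meritech, Accel, Ignition, North Bridge, Docomo, Mayfield, Lightspeed, Crosslink, Kleiner, Redpoint, Norwest, Bessemer, Benchmark, Hummer Winblad, USVP, NEA, Antham, Avalon, Balderton, AGF, Galileo, Index, Fidelity, Sunstone, Conor, Giza

Investors marked (X) for each company:

10gen: Sequoia, Flybridge, Union Square
BackType: Y Combinator, True, lowercase, Freestyle
Cloudera: Greylock, Meritech, Accel, Ignition
Couchbase: Accel, Ignition, Mayfield, North Bridge, Docomo
DataStax: Lightspeed, Crosslink
Datameer: Kleiner, Redpoint
Hadapt: Norwest, Bessemer
Hortonworks: Benchmark
HStreaming: (none marked)
Karmasphere: Hummer Winblad, USVP
Kitenga: (none marked)
MapR Technologies: Lightspeed, NEA
Mintigo: Sequoia, Giza
Neo Technologies: Fidelity, Sunstone, Conor
Pentaho: Benchmark, Index, NEA
StackIQ: Antham, Avalon
Talend: Balderton, AGF, Galileo
Tableau Software: NEA, Meritech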

Page 18: The Emergence of Big Data

Thank You

John Katsaros [email protected]