acquiring big data - oracle...acquiring big data •right place for your data –hdfs –nosql...
TRANSCRIPT
Acquiring Big Data
Xavier Verhaeghe | Vice-President Emerging Technologies & Security Solutions Leader |
| Oracle Europe, Middle-East & Africa |
MOVING OPPORTUNITY FOR IT
MOVING FROM BEING TOO LATE RESPONDING TO REQUESTED URGENT BUSINESS NEEDS
TO BEING PROACTIVE IN DRIVING NEW
OPPORTUNITIES
DATA EXPLOSION: REAL USERS & INTERNET OF THINGS
TRADITIONAL DATA SOURCES
Billing
engines
Custom
developed
NEW DATA SOURCES
Billing
engines
Custom
developed
What Makes it Big Data: The 4 Vs Principle
6
VOLUME VELOCITY VARIETY VALUE
SOCIAL
BLOG
SMART
METER
1011001010010
0100110101010
1011100101010
100100101
• NETWORK USAGE
• CALL DATA RECORDS
• SMS DATA
• SURFING BEHAVIOR
• BANDWIDTH USAGE
• COMMUNICATION FAULTS
• USER PROFILES
• SENSORS (EG CARS)
• PORTALS
• CRM DATA
• BILLING DATA
• ORDER MANAGEMENT
• DIGITAL TV DATA
• WIFI-HOTSPOTS
• DEVICE PROFILING
• MOBILE PAYMENTS
• AUGMENTED REALITY SOURCES
• LOCATION BASED DATA
• GPS DRIVING DATA
• HEALTHCARE DATA
• CONTRACTS
• BLOGS
• SUPPORT LOGS
• INTERNAL SOCIAL MEDIA
• EXTERNAL SOCIAL MEDIA
• QR CODE DATA
• CALL CENTER LOGS ...
ALL DATA SOURCES AND
THE BLEND OF ALL THIS DATA
Make Better Decisions Using Big Data
Big Data In Action
ANALYZE
DECIDE ACQUIRE
ORGANIZE
Acquire all available data
Big Data In Action
ANALYZE
DECIDE
ORGANIZE
ACQUIRE
Two Sets of Characteristics
Batch-Oriented Real-Time
Process data to use Deliver a service
Bulk storage Fast access to specific record
Write once, read all Read, write, delete update
Best Choices
Hadoop Distributed File
System (HDFS)
Oracle NoSQL Database
File System Database
Parallel scanning Indexed storage
No inherent structure Simple data structure
High volume writes High volume random reads
and writes
Hadoop Architecture
Management/Monitoring
HADOOP DISTRIBUTED FILE SYSTEM (HDFS)
MapReduce
Distributed file system Map/Reduce programming paradigm Highly scalable data processing
HDFS Overview
Distributes Data on Cluster Multiple Copies Add Nodes to Scale
HDFS Use Cases
• Click stream storage and analysis
– Number of web sessions lasting more than X minutes
– Most/Least frequently browsed pages
– Group session times by hour of day and source location
• Sentiment analysis
– How many comments contain the word(s) or phrase(s)
• Relationship discovery
– What items appear to be related in time or proximity
– How many times does X and Y happen in proximity
Oracle NoSQL Database
Nodes East
Nodes West
Nodes Central
NoSQL Driver
Application
NoSQL Driver
Application R
ea
d
Dele
te
Rea
d
Upd
ate
Distributed key-value database
Simple programming model
Scalable throughput
Commercial software and support
Easy management
Oracle NoSQL Database
• Replicated Application Servers
• Driver linked into each Application
• Data Nodes kept current
• Storage Nodes across Data Centers
• Automatic Storage Node failure handling
– Graceful degradation
– Automatic recovery
• No Single point of failure
Enterprise Topology
Oracle NoSQL Database Use Cases
• Data capture
– Sensor data capture (i.e. IA, SmartGrid, Earth Sc., BioMedical Sc.)
– Statistics & network capture (QOS Network Mgmt)
– Web applications (click-through capture)
– Backup services for mobile devices
• Data services
– NoSQL data sharing (Earth Sci, BioMedical)
– Scalable authentication
– Real-time communication (MMS, SMS, routing)
– Social Networks, Personalization
Oracle NoSQL Database Differentiation
• Seamless integration with Oracle stack
• Commercial grade
• Scalable
• Simple programming model
• Easy management
BIG DATA APPLIANCE
•Oracle Linux 5.6
•Java Hotspot VM
•Cloudera Hadoop Distribution
•R Distribution
•Oracle NoSQL Database Community Edition
•Oracle Big Data Connectors
Oracle Big Data Appliance Software
RELIABILITY PERFORMANCE
MANAGEABILITY
SECURITY
SUPPORT
Oracle NoSQL
Database
HDFS
Enterprise
Applications Oracle Data Integrator
Oracle Big Data Connectors
Hadoop (MapReduce)
Oracle Integrated Solution Stack for Big Data
ACQUIRE ORGANIZE DECIDE
Analytic
Applications
ANALYZE
In-D
ata
base
An
aly
tics
Data
Warehouse
Exadata Exalytics
Usage Model
ACQUIRE ORGANIZE DECIDE ANALYZE
Big Data
Appliance
Acquiring Big Data
• Right place for your data
– HDFS
– NoSQL
– Relational
• Uncover value with analysis
TELCO DIFFERENT POTENTIAL USE
CASES
B2C MARKET OPPORTUNITIES
BETTER MARKET KNOWLEDGE
• USER PROFILING
• BETTER AND REAL TIME CHURN ANALYSIS
• CUSTOMER EXTENDED NETWORK VALUE
• SENTIMENT ANALYSIS
BETTER USER EXPERIENCE
• BETTER TARGETED CUSTOMER SERVICE
• SUPER FAST INFO ACCESS
• BETTER INTEGRATED SERVICES WHERE, WHEN, WHY
• RELEVANT REAL TIME COMMERCIAL OFFERS
MORE SECURITY
• NETWORK FRAUD ANALYSIS
• USER SECURITY SERVICES
• CONTENT, CONTEXT AND DEVICE AWARE ALERTS
B2B MARKET OPPORTUNITIES
DATA VALORISATION SERVICES
• SELLING USER PROFILE DATA
• CUSTOMER EXTENDED NETWORK VALUE
• SENTIMENT ANALYSIS
• BLENDED LOCATION BASED & PROFILE DATA
• GPS ENHANCEMENT SERVICES
• MARKETING AND STORE LOCATION BASED ADVISORY
• MARKET & TREND ANALYSIS
• DEVICE USABILITY ANALYSIS
IT & BUSINESS SERVICES
• BIG DATA INFRASTRUCTURE HOSTER
• BIG DATA SYSTEM INTEGRATOR
• BIG DATA MANAGED SERVICES (EG FOR HEALTHCARE)
• METERING SERVICES
SECURITY
EQUIPMENT
MANUFACTURER
FRAUD ANALYSIS
HEALTH
INSURER
INCREASED SERVICE
-
FRAUD ANALYSIS
UTILITIES SMART METERING ANALYSIS
-
TRADING SUPPORT
MEDIA COMPANY
REAL TIME EXPERIENCE
-
SOCIAL NETWORK TRENDING
-
AD VIEWING ANALYSIS
MEDIA COMPANY
FINANCIAL
SERVICES
REAL TIME EXPERIENCE
-
FRAUD DETECTION
MANUFACTURING PRODUCT FAILURE DETECTION
-
PATENTS RESEARCH
OIL & GAS ANALYSIS OF RIG SENSOR DATA,
SEISMIC & GEOLOGICAL DATA,
MANAGING COMPRESSOR DATA
TELCO
•Customer buys a new smartphone -> this goes
into regular systems
•Same customer starts playing with his new
smartphone, downloads an app. This info does
not end up in traditional systems.
•Consolidate this type of info in a NoSQL DB to
have a complete customer view (eg for call
centres)
(ONLINE)
RETAILERS
NEXT PURCHASE
RECOMMENDATIONS
-
STORE TRENDING