big data & real-time campaign_gse - external

Upload: komalkothari

Post on 19-Feb-2018

218 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    1/29

    2011 IBM Corporation

    Telecom Solutions Lab

    BAO & Big Data OverviewApplied to Real-time Campaign

    GSE

    Joel Viale

    Telecom Solutions Lab Solution Architect

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    2/29

    2011 IBM Corporation

    Telecom Solutions Lab

    Agenda

    2

    BAO & Big Data - Overview

    Customer use-cases

    Live Prototypes:

    Streams for Real-time Campaign

    Big Insights for Web Log Analysis

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    3/29

    2011 IBM Corporation

    Telecom Solutions Lab

    3

    The Business Analytics Journey: Transform data into actionableinsight!

    AnalyzeIntegrate

    Transactional

    & CollaborativeApplications

    Manage

    Business AnalyticsApplications

    Internal & External

    formation Sources

    Cubes

    Streams

    Big DataMaster

    Data

    Content

    Data

    Streaming

    Information

    DataWarehouses

    Govern

    QualitySecurity &

    PrivacyLifecycle

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    4/29

    2011 IBM Corporation

    Telecom Solutions Lab

    4

    What is Big Data? How could it impact your organization?

    Big Data technologies describe a newgeneration of technologies and architectures,designed to economically extract value fromvery large volumes of a wide variety of data,

    by enabling velocity capture, discovery and/oranalysis.

    - Matt Eastwood, IDChttp://www.tweetdeck.com/twitter/matteastwood/~hHgsU

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    5/29

    2011 IBM Corporation

    Telecom Solutions Lab

    5

    The Big Data Challenge Manage and benefit from massive and growing amounts of data

    Handle varied data formats (structured, unstructured, semi-

    structured) and increased data velocity Exploit BIG Data in a timely and cost effective fashion

    Collect Manage

    Integrate Analyze

    COLLECT MANAGE

    INTEGRATE ANALYZE

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    6/29

    2011 IBM Corporation

    Telecom Solutions Lab

    6

    Bring Together a Large Volume and Variety of Data to Find NewInsights

    Identify criminals and threats from

    disparate video, audio, and data

    feeds

    Make risk decisions based on real-time transactional data

    Predict weather patterns to planoptimal wind turbine usage, and

    optimize capital expenditure on

    asset placement

    Detect life-threatening

    conditions at hospitals in time

    to intervene

    Leverage Multi-channel customer

    sentiment and experience

    analysis to upsell in real time

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    7/29

    2011 IBM Corporation

    Telecom Solutions Lab

    7

    The Solution IBMs Big Data PlatformBring together any data source, at any velocity, to generate insight

    Analyzing a variety of data atenormous volumes

    Insights on streaming data Large volume structured data

    analysis

    IBM Big DataPlatform

    Variety

    Velocity

    Volume

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    8/29

    2011 IBM Corporation

    Telecom Solutions Lab

    8

    2 ways of creating insights: Streaming and Storing Big Data

    Real time analysis of data-in-motion

    Structured or unstructured

    Analytic operations on streaming data in real-time

    Data QueriesResults

    b) streaming data

    Data QueriesResults

    b) streaming data

    Data QueriesResultsData QueriesResults

    b) streaming data

    Managing and analyzing Internet-scale volumes

    Structured or unstructured data

    Based on Googles MapReduce technology

    Inspired by Apache Hadoop; compatible with its ecosystem and distribution

    Well-suited to batch-oriented, read-intensive applications

    Integrated Text Analytics

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    9/29

    2011 IBM Corporation

    Telecom Solutions Lab

    9

    9

    -Distributed File System (HDFS)

    -Map/Reduce

    How to deal with Big Data: Split a big job into smaller piecesHighly Massive Parallel Processing (MPP)

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    10/29

    2011 IBM Corporation

    Telecom Solutions Lab

    10

    InfoSphere Streams enables highly scalable stream processing

    InfoSphere Streams provides a programming model for defining data flow graphs consisting of data sources

    (inputs), operators, and sinks (outputs)

    controls for fusing operators into processing elements (PEs) infrastructure to support the composition of scalable stream processing

    applications from these components deployment and operation of these applications across distributed x86

    processing nodes,

    when scaled-up processing is required

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    11/29

    2011 IBM Corporation

    Telecom Solutions Lab

    11

    Big Data complements traditional warehouse & analyticsinfrastructure

    Streams

    Internet

    Scale

    TraditionalWarehouse

    In-Motion

    Analytics

    Data Analytics,

    Data Operations

    & Model

    Building

    Results

    Internet Scale

    Database &

    Warehouse

    At-Rest Data

    Analytics

    Results

    Ultra Low

    Latency Results

    InfoSphereBig

    Insights

    InfoSphereBig

    Insights

    Moving Beyond the Traditional Warehouse

    Traditional /

    Relational

    Data Sources

    Traditional /

    Relational

    Data Sources

    Non-Traditional /

    Non-Relational

    Data Sources

    Non-Traditional /

    Non-Relational

    Data Sources

    Non-Traditional/

    Non-RelationalData Sources

    Non-Traditional/

    Non-RelationalData Sources

    Traditional/Relatio

    nal Data Sources

    Traditional/Relatio

    nal Data Sources

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    12/29

    2011 IBM Corporation

    Telecom Solutions Lab

    Agenda

    12

    BAO & Big Data - Overview

    Customer use-cases

    Live Prototypes:

    Streams for Real-time Campaign

    Big Insights for Web Log Analysis

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    13/29

    2011 IBM Corporation

    Telecom Solutions Lab

    13

    Examples of Streams use-casesStock market Impact of weather on

    securities prices Analyze market data at

    ultra-low latencies

    Fraud prevention Detecting multi-party fraud Real time fraud prevention

    e-Science Space weather prediction Detection of transient events

    Synchrotron atomic research

    Health & Life Sciences Neonatal ICU monitoring Epidemic early warning system Remote healthcare monitoring

    Transportation

    Intelligent traffic management

    Law Enforcement, Defense &Cyber Security

    Real-time multimodal surveillance Situational awareness Cyber security detection

    Manufacturing Process control for

    microchip fabrication

    Natural Systems Wildfire management Water management

    Telephony CDR processing Social analysis Churn prediction Geomapping

    Other Smart Grid Text Analysis Whos Talking to Whom? ERP for Commodities FPGA Acceleration

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    14/29

    2011 IBM Corporation

    Telecom Solutions Lab

    Streams in Telecommunications

    14

    Data in motion

    CDRs

    Billing

    CRM

    Location

    Account Mgt

    Internet

    Network

    Millions of

    events per

    second

    Microsecond

    Latency

    Dropped Calls

    Outgoing InternationalCallsCall Duration

    Extra Call

    Contract Expiration

    Entered new cell

    New Top-Up

    5 minutes left on pre-paid

    MDM

    EDW

    Invoice Issued

    Campaign MgtReal-time

    Promo

    Fraud Detection

    ServiceAssurance

    NetworkMonitoring

    Real-timeMonitoring(Voucher

    Recharge &Service Usage)

    3Drop

    pedCallsi

    nthela

    st

    hour

    Locatio

    n-base

    dPromo

    5Outg

    oinginte

    rnation

    alCallsinth

    elast

    day

    100 SMSs sent in 1hour500$ top-up on pre-paid

    Nogamedownloadinlast30minutes

    500FailedSMSDeliverieslast10minutes

    AggregatedDataRecordsatService

    Level

    AggregatedDataRecordsatCell

    Level

    AggregatedVoucherRecharge

    Data Aggregated at

    Single Customer

    Level

    Data Aggregated at

    Cell or Service

    Level

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    15/29

    2011 IBM Corporation

    Telecom Solutions Lab

    CDR cleansing and pre-processing

    The Pain Point:

    A CSP has 6 Billions CDRs per day!

    As much as 500,000 per second peak rate

    Kept up, but re-processing meant waiting fornights/weekends to catch up

    The Solution:

    InfoSphere Streams to clean and pre-processCDRs

    De-duplication of CDR against 15 days of data(90B CDRs)

    Offloading CDRs processing to Streams platformincreased the performance of their warehouse forother analytics

    Single platform for mediation and real timeanalytics reduced IT complexity

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    16/29

    2011 IBM Corporation

    Telecom Solutions Lab

    Real-time marketing campaign

    InsightInsight

    InformationInformation

    pre

    scriptive

    pre

    scriptive

    DataData activ

    e

    active

    Business

    flexibility&r

    esponsiveness

    Business value

    The Pain Point

    100M CDRs per day from SMS from25M subscribers, only used to sendbills to customers

    The Solution

    InfoSphere Streams to create real-time marketing promotions andmonetize the CDRs

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    17/29

    2011 IBM Corporation

    Telecom Solutions Lab

    Typical Telco Use-Cases with InfoSphere Streams

    ETL-like for massive data to off-load traditional ETL and Warehouse

    Mediations

    Prediction and prevention of customer churn In-motion analytics can help identify group leaders

    Real Time context sensitive promotions Location based promotions

    Customer minutes usage based promotions Customer smart phone browsing pattern based promotions Network equipment utilization based promotions

    Real Time & pro-active network monitoring

    Real Time call traffic & revenue monitoring

    SMS spam filtering

    Fraud detection17

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    18/29

    2011 IBM Corporation

    Telecom Solutions Lab

    Typical Telco Use-Cases with InfoSphere Big Insights

    CDR Use Cases:

    User behavior analysis

    Prediction on the basis of Customer churn

    Service association analysis

    Social Media Analytics: Get insights, improve campaign, influence opinions

    Fraud Detection: Monitoring/Mining transaction logs to detect fraud activities

    Recommendation Engines: Improving user experience and likelihood of purchase

    Network Traffic Logging: Network optimization & fault detection

    Advertising Optimization: in support of advertising based models

    Server Logs: Fault detection / Performance related analysis

    Email and Email Logs:

    Consumer email analysis & decision system

    18

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    19/29

    2011 IBM Corporation

    Telecom Solutions Lab

    Product Capabilities

    Big Insights at the core of Cognos Consumer Insight

    BLOGSBLOGS

    DISCUSSION FORUMSDISCUSSION FORUMS

    TWITTERTWITTER

    NEWSGROUPSNEWSGROUPS

    FACEBOOKFACEBOOK

    Source Areas

    Dimensional Analysis

    Filtering

    Voice

    Keyword Search

    Dimensional Navigation

    Drill Through to Content

    Relevant Topics Associated Themes

    Ranking and Volume

    Relationship Tables Relationship Matrix

    Relationship Graph

    COMPREHENSIVEANALYSIS

    SENTIMENT

    EVOLVING TOPICSAFFINITY ANALYTICS

    Business Drivers

    Customer CareCustomer CareCorporate ReputationCorporate Reputation

    Campaign EffectivenessCampaign Effectiveness

    Competitive Analysis

    Product Insight

    MULTILINGUALMULTILINGUAL

    19

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    20/29

    2011 IBM Corporation

    Telecom Solutions Lab

    Agenda

    20

    BAO & Big Data - Overview

    Customer use-cases

    Live Prototypes:

    Streams for Real-time Campaign

    Big Insights for Web Log Analysis

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    21/29

    2011 IBM Corporation

    Telecom Solutions Lab

    , &

    Real-time Campaign Solution Architecture

    InfoSphere Streams

    Unica Suite

    SPSS

    Cognos Suite

    Netezza

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    22/29

    2011 IBM Corporation

    Telecom Solutions Lab

    Outbound campaign, enhanced with Real-time Monitoring andadvanced predictive analytics

    Events Collaboration Content MonitoringAnalytics Rules

    ProductManager

    MarketingManager

    VP ofMarketin

    g

    Customer

    DefineMarketingStrategy

    CreateCampaig

    n

    IdentifyTargetCustomers

    andOffering

    ApproveCampaig

    n

    Monitor &EvaluateCampaig

    n

    ExecuteCampaig

    n

    DeliveryChannel

    Real-time Monitoring

    Detect situations andpatterns in service usage.Determine the need of acampaign

    Predictive Analytics

    Segment Customers basedon Churn Prediction,likelihood of campaignresponse, LTCV, etc

    Presence & Rules based channel

    selection

    Send campaign message on appropriatechannel (including social networks),

    based on customer preferences andavailability/presence

    Monitor key KPIs

    Sales/Usage increase % of campaign response

    22

    Collectand

    aggregatexDRs

    Network

    Real-time xDRs

    Processing

    Collect, correlate andaggregate data records inreal-time

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    23/29

    2011 IBM Corporation

    Telecom Solutions Lab

    23

    Real-time campaign, based on network events

    Manage

    CampaignDelivery

    MonitorCampaignExecution

    Event Based

    CampaignProcessing

    Evaluate

    CampaignEffectiveness

    Real-time Event Detection

    InfoSphere Streams correlatemultiple data records in real-time down to the customerlevel.Here: detects a number ofdropped calls in a certaintime

    Event-based Campaign

    The event-based campaignprocessing is triggered: acampaign is executed in Unica

    Campaign.

    Message Delivery

    The campaign message isdelivered to the customeron the appropriate channel

    Campaign monitoring

    Monitor campaign executionand analyze effectiveness

    CDRs

    CDR

    Feed

    Dropped CallsFiltering

    Dropped CallsAggregation

    Per Subscriber

    InfoSphere StreamsReal-time event detection

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    24/29

    2011 IBM Corporation

    Telecom Solutions Lab

    Agenda

    24

    BAO & Big Data - Overview

    Customer use-cases

    Live Prototypes:

    Streams for Real-time Campaign

    Big Insights for Web Log Analysis

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    25/29

    2011 IBM Corporation

    Telecom Solutions Lab

    The Sample Outdoors Companyhas many visitors a year to their

    website and only a subsetresulted in purchases. Theywould like to know what is

    happening to the other visits?

    25

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    26/29

    2011 IBM Corporation

    Telecom Solutions Lab

    Every time the user browses a page, the click-stream data is logged

    User goes to

    web site

    Additem

    to cart

    Searchfor

    item

    Userchecks

    out

    26

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    27/29

    2011 IBM Corporation

    Telecom Solutions Lab

    Web logs capture click streams of user

    Basic analysis of web logs can provide valuable insights to web site usage

    and user behaviors How many users going to my web site?

    What pages and products are people looking at?

    Additional sessionization and aggregation analysis help to detect valuableclick patterns and provide insights for:

    Improving customer service

    Ads promotion and sales improvement

    Detection of incidents and errors

    Click Stream Data Analysis for Customer Insights

    27

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    28/29

    2011 IBM Corporation

    Telecom Solutions Lab

    Big Insights Web Log Analysis Solution Overview

    Web ServerLogs

    BigInsights Analytics

    28

  • 7/23/2019 Big Data & Real-time Campaign_GSE - External

    29/29

    2011 IBM Corporation

    Telecom Solutions Lab

    Solution Architecture for Web Log Analysis

    (, )

    ()

    29