predicting churn in telco industry: machine learning approach - marko mitić

Post on 06-Jan-2017

24 Views

Category:

Data & Analytics

2 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Dr. Marko MitićBusiness Data Analyst at Telenor Serbia

Predicting churn in telco industry: machine learning approach

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Contents• Introduction to machine learning

• Churn definition & telco data

• Algorithm description

• Data exploration

• Modelling in R language

• Conclusion

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Introduction to machine learningSupervised learningEach training example is a pair consisting of an input object and a desired output value

• Regression (real values)• Classification (discrete labels)

Unsupervised learningDraw inferences from datasets data without labeled responses 

• Clustering• Dimensionality reduction

Reinforcement learningAgents ought to take actions in an environment so as to maximize cumulative reward

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Introduction to machine learning

Regression Classification

Clustering ReinforcementLearning

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Introduction to machine learning

Training set (observed)

Universal set(unobserved)

Testing set(unobserved)

Data acquisition

Practical usage

Classification

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Churn definitionChurn rate (sometimes called attrition rate), is a measure of the number of individuals or items moving out of a collective group over a specific period of time

= Customer leaving

Pay TVE-mail/website subscribersLegal sectorRecreation Newspaper subscribers

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Telco dataReal telco data available in latest C50 library in R language

Feature engineering: 3/6 months average usage, average total charge,...

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Algorithms1. Logistic Regression• In logistic regression the outcome variable is binary, and the

purpose of the analysis is to assess the effects of multiple explanatory variables

Odds of success = P / 1-P = = e α + β1X1 + β2X2 + …+βpXp

The joint effects of all explanatory variables put together on the odds isLogit P = α+β1X1+β2X2+..+βpXp

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Algorithms2. Support Vector Machines• SVMs maximize the margin around the

separating hyperplane.• The decision function is fully specified by a

subset of training samples, the support vectors.wTxi + b ≥ 1, if yi = 1wTxi + b ≤ −1, if yi = −1

w2ρ• Margin

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Algorithms3. Neural Network

• A neuron network (NN) is a computational model based on the structure and functions of biological neural networks.

•  A neural network usually involves a large number of processing units with the aim of successfully mapping input to output space through iterative process

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Algorithms4. Boosting

• Adaboost

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Evaluation metricsConfusion matrix

• Accuracy, Precision, Recall

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Evaluation metricsROC curve and AUC

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Data exploration

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Modelling in R (1)Logistic Regression

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Modelling in R (1.1)ROC and AUC

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Modelling in R (2)Support Vector Machines

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Modelling in R (4)BP Neural Networks

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Conclusions• 3 machine algorithms for churn prediction are presented

• Logistic Regression and BP Neural Net with boosting gave best results

• Good base for successfull broadcast campaign towards potential churners

Works even better• Implementation of more complex ML algorithms (Random Forest,

Gradient Boosting Machines, Deep NNs)

• Generate hybrid ensemble models

yana-ckwivjw2n5n1lxfv@guest.airbnb.com

Thank you!

top related