We’ve uploaded some sample data sets in the IBM Watson Analytics community for you to work with as you learn more about Watson Analytics. This expert blog uses the Telco Customer Churn data set. WA_Fn-UseC_-Telco-Customer-Churn
What’s in the Telco Customer Churn data set?
This data set provides info to help you predict behavior to retain customers. You can analyze all relevant customer data and develop focused customer retention programs.
A telecommunications company is concerned about the number of customers leaving their landline business for cable competitors. They need to understand who is leaving. Imagine that you’re an analyst at this company and you have to find out who is leaving and why.
The data set includes information about:
- Customers who left within the last month – the column is called Churn
- Services that each customer has signed up for – phone, multiple lines, internet, online security, online backup, device protection, tech support, and streaming TV and movies
- Customer account information – how long they’ve been a customer, contract, payment method, paperless billing, monthly charges, and total charges
- Demographic info about customers – gender, age range, and if they have partners and dependents
If you don’t have the data set…
- Go to https://community.watsonanalytics.com/resources/
- Download the Telco Customer Churn sample data file.
- In Watson Analytics, tap Add and upload Telco Customer Churn.
The filename is a bit longer: WA_Fn-UseC_-Telco-Customer-Churn.csv.
The data set appears as a tile in the Welcome page and you’re ready to get to work.
Which customers are likely to leave?
- To find the answer to this question, tap the WA_Fn-UseC_-Telco-Customer-Churn tile and tap Prediction.
You want to learn more about customers who’ve left the company in the past month – this is the target that you want to investigate. The data is in the column called Churn, which is the column we’ve already picked as the target for the prediction. Let’s find out which variables influence customers who leave.
- Name the prediction and tap Create Prediction.
Watson Analytics analyzes the data and generates visualizations to provide insights into this issue.
The spiral shows you the top predictors, or key drivers, of churn in color; other drivers appear in gray. The closer the driver is to the center of the spiral, the stronger the predictive strength of the driver is.
The key drivers are tenure, contract, and online security. The visualizations to the right of the spiral show how one driver at a time drives churn. The blue or green dots in the upper right of the visualizations identify which driver is being shown.
- Tap tenure drives Churn.
- Close this visualization by tapping the X in its upper right corner.
You can look at the visualizations for the other drivers on your own. Let’s move on and explore churn in more depth.
To the left of the spiral are options for creating visualizations that show more than one driver at a time.
- Let’s go straight to the deeper and more predictive analysis of the data. Tap Combination.
You get a new set of visualizations on the right, including a decision tree, that show the combination of variables that influence your target.
- Let’s look at the combination of key drivers that influence whether customers leave. Tap the decision tree.
- Let’s look at a word cloud about the key factors that influence churn. Tap Predictor Importance.
Contract, Internet Service, Tenure, and Total Charges are the most important factors.
- Let’s get some more details on who is leaving so we can predict who is likely to leave in the future. Tap Top Decision Rules.
The rules are specific and detailed, and are sorted by accuracy. They currently focus on customers who do not leave. We need to change that.
- Change the No to Yes.
You can now predict which customers are at risk to churn. Use the decision rules to identify customers who fit the churn profile so you can proactively offer them an incentive to stay.