applying data science to sales pipelines - for fun and profit

18
Applying data science to sales pipelines – for fun and profit Andy Twigg CTO

Upload: c9-inc

Post on 15-Feb-2017

105 views

Category:

Data & Analytics


1 download

TRANSCRIPT

Page 1: Applying data science to sales pipelines - for fun and profit

Applying data science to sales pipelines – for fun and profit!

!Andy Twigg!

CTO!

Page 2: Applying data science to sales pipelines - for fun and profit

Data!science!

Domain!expertise!

Machine!learning! Data! •  62B sales pipeline records!

•  Structured, unstructured!•  3rd party public data!•  Fine-grained temporal data!

Deep expertise:!•  sales!•  forecasting!•  revenue models!

•  Automated ML infrastructure!•  ML models tuned for specific

problems!

Page 3: Applying data science to sales pipelines - for fun and profit

CUSTOMERS!

Page 4: Applying data science to sales pipelines - for fun and profit

DATA SCIENCE @ C9!

•  Opportunity Scoring!•  What is Pr(win) for this deal?!•  What is Pr(win in quarter) for this deal?!•  How does this compare to sales team commits?!•  Which deals can we influence most?!

•  Forecasting!•  How much will we close this quarter?!

Page 5: Applying data science to sales pipelines - for fun and profit

SALES PIPELINES & OPPORTUNITIES!

•  Opportunities are temporal creatures; while ‘open’ they proceed through a number of observations and terminate in one of a discrete set of ‘closed’ states – typically won or lost!

•  Usually they proceed through ‘stages’, except:!•  An opportunity can be entered into the CRM system as closed (no open observations)!•  Stages are only a partial order - can skip / revisit stages!•  Can be re-opened after closed!

•  As the opportunity evolves, we get more and more data about the opportunity!•  A pipeline is a set of open opportunities!

Lead created!

Stage: Qualifying!

Email sent!

Email sent: response!

Amount= $1000! Call!

Stage: demo!

Meeting! Demo!

Push close date!

Stage: negotiation!

Closed/won!

ReopenedAmount=

$2000!

Closed/won!

Page 6: Applying data science to sales pipelines - for fun and profit

ANATOMY OF AN OPPTY!

Page 7: Applying data science to sales pipelines - for fun and profit

ANATOMY OF AN OPPTY!

Pushed out Pulled back

in

Final outcome: won

Committed here (by the sales rep)

Page 8: Applying data science to sales pipelines - for fun and profit

ANATOMY OF AN OPPTY!

Pushed out Pulled back

in

Final outcome: won

Committed here (by the sales rep)

Predicted won from the start

Predicted won in the correct

quarter

Page 9: Applying data science to sales pipelines - for fun and profit

PREDICTIVE ENGINE!

Build a fine-grained history of closed

opportunities

•  Cleaning!•  Preprocessing!•  Featurizing!•  ~10 GB/customer!•  ~ 1M training rows!!

1,000s of raw signals per opportunity

•  Structured (CRM, ERP)!•  Unstructured (NLP)!•  Firmographic!•  Gov sources!•  SEC filings!•  Crunchbase!•  …!!

Identify historic deals with similar behavior

Continuously re-score opportunities as they

evolve

Update model as opportunities close

•  Fully-automated model rebuilding and scoring platform!•  Model input features:!

•  Historic observations of opportunity!•  Sales-specific features e.g. momentum!•  Temporal features e.g. std(amount over last 30 days)!•  Industry-wide features e.g. avg_sales_cycle(target)!

•  Continuously cross-validated model tuning!•  Extensible, scalable platform using Hadoop (HDFS), Python!

Page 10: Applying data science to sales pipelines - for fun and profit

Win/Loss Model (Random Forest) Estimate Pr(win)

Duration Model (Poisson Regression) Estimate Pr(win in quarter)

Influencer Model (Linear) Positive/Negative Drivers

•  Standard Features •  Temporal Features •  Derived Features

BEHIND THE SCORES!

Page 11: Applying data science to sales pipelines - for fun and profit

©2014 All Rights Reserved

Page 12: Applying data science to sales pipelines - for fun and profit

©2014 All Rights Reserved

Page 13: Applying data science to sales pipelines - for fun and profit
Page 14: Applying data science to sales pipelines - for fun and profit

sales team: good precision (~70-80%) but poor recall (~10-40%)!C9 won precision ~ sales team won precision!C9 won recall ~ 3 x sales team won recall!

First observation Last observation

precision recall f1 precision recall f1

C9 scoring 0.65 0.86 0.74 0.75 0.93 0.83

Commit 0.70 0.07 0.13 0.87 0.45 0.59

Page 15: Applying data science to sales pipelines - for fun and profit
Page 16: Applying data science to sales pipelines - for fun and profit

FORECASTING: TOP-DOWN VS BOTTOM-UP!

Top-down: Predict current quarter based on previous quarters!!

Accounts for seasonality and trending!!

Ignores state of current pipeline!

0.0e+00

2.5e+08

observed

5.0e+07

2.5e+08

trend

−5e+06

5e+06

seasonal

−1e+07

5e+06

2013.0 2013.2 2013.4 2013.6 2013.8 2014.0 2014.2 2014.4

random

Time

Decomposition of additive time series

Bottom-up: Predict current quarter based on currently open pipeline!!

Considers quality of deals in pipe!!

Ignores trends, deals not in pipe!

$265,410!

$157,000 77%

$200,000 37%

$82,000 86%

+!-!

+!-!

Page 17: Applying data science to sales pipelines - for fun and profit

HYBRID FORECASTING!top down + bottom up!

20

40

60

11 10 9 8 7 6 5 4 3 2 1Weeks to EOQ

Amou

nt ($

M)

C9Final AmountActual Amount

Amount Forecast

•  Augment time-series model with side information from bottom-up model, e.g.:!

•  Amount predicted to close in current quarter!

•  Average score of currently open opportunities!

•  Average predicted days to close!

!•  Sometimes known as ARIMAX!

log(yt) ∼12!

i=1

log(yt−i) + log(x(1)t−12) + log(x(2)

t−12) + log(x(3)t−12)

Page 18: Applying data science to sales pipelines - for fun and profit