Measuring users is hard. Lifetimes makes it easy.
Read me first: Latest on the lifetimes project
A project has emerged as a successor to lifetimes, PyMC-Lab/PyMC-Marketing, please check it out!
Introduction
Lifetimes can be used to analyze your users based on a few assumption:
- Users interact with you when they are "alive".
- Users under study may "die" after some period of time.
I've quoted "alive" and "die" as these are the most abstract terms: feel free to use your own definition of "alive" and "die" (they are used similarly to "birth" and "death" in survival analysis). Whenever we have individuals repeating occurrences, we can use Lifetimes to help understand user behaviour.
Applications
If this is too abstract, consider these applications:
- Predicting how often a visitor will return to your website. (Alive = visiting. Die = decided the website wasn't for them)
- Understanding how frequently a patient may return to a hospital. (Alive = visiting. Die = maybe the patient moved to a new city, or became deceased.)
- Predicting individuals who have churned from an app using only their usage history. (Alive = logins. Die = removed the app)
- Predicting repeat purchases from a customer. (Alive = actively purchasing. Die = became disinterested with your product)
- Predicting the lifetime value of your customers
Specific Application: Customer Lifetime Value
As emphasized by P. Fader and B. Hardie, understanding and acting on customer lifetime value (CLV) is the most important part of your business's sales efforts. And (apparently) everyone is doing it wrong (Prof. Fader's Video Lecture). Lifetimes is a Python library to calculate CLV for you.
Installation
pip install lifetimes
Contributing
Please refer to the Contributing Guide before creating any Pull Requests. It will make life easier for everyone.
Documentation and tutorials
Questions? Comments? Requests?
Please create an issue in the lifetimes repository.
Main Articles
- Probably, the seminal article of Non-Contractual CLV is Counting Your Customers: Who Are They and What Will They Do Next?, by David C. Schmittlein, Donald G. Morrison and Richard Colombo. Despite it being paid, it is worth the read. The relevant information will eventually end up in this library's documentation though.
- The other (more recent) paper is “Counting Your Customers” the Easy Way: An Alternative to the Pareto/NBD Model, by Peter Fader, Bruce Hardie and Ka Lok Lee.
More Information
- Roberto Medri did a nice presentation on CLV at Etsy.
- Papers, lots of papers.
- R implementation is called BTYD (Buy 'Til You Die).
- Bruce Hardie's Website, especially his notes, is full of useful and essential explanations, many of which are featured in this library.