Cohort analysis is a type of data analysis that groups users into segments based on common characteristics shared over time. By grouping people into cohorts, you can analyze and optimize campaigns based on a wide range of factors such as conversion trends, buying patterns, or product popularity — so as you can imagine, it’s a really powerful tool for marketing teams!
In our previous blog explaining what cohort analysis is, we mentioned that getting set up to run cohort analysis can be tricky and intimidating. So, we’ve put together a guide that takes you through how to perform a cohort analysis step-by-step. But there are a few things you’ll need before you can get started!
The first thing you’ll need is a database you can trust. Data sources specialized in eCommerce like Shopify, Magento or WooCommerce are off-the-shelf solutions which fit the requirements. However, if you have an in-house solution such as your own CRM or database, make sure it contains the following information:
The above are all must-have requirements. If one of them is missing, you won’t be able to finish your cohort analysis.
As an aside, while it’s technically possible to perform a cohort analysis with aggregated data as well, this provides much less flexibility than user-level data and is, therefore, less valuable. We recommend avoiding data that’s already been aggregated like CPC (Cost Per Click) where possible when performing a cohort analysis, and instead using the more granular user-level data. To find out more about why already aggregated data can be so tricky to work with, you can check out this blog.
Cohort analyses on user-level data helps build a clearer picture of customer behaviour
Once all of the above are available, you’ll need to identify:
Cohort identifiers are used for grouping your cohorts based on a certain time window. In other words, you are grouping your users into different groups depending on the specific dates of a particular activity — in this case, the customer acquisition date. This can be different time periods depending on what you are trying to analyze, for example:
Time elapsed calculates the time since the acquisition of the customer (or whatever activity you have chosen as your cohort identifier). It is expressed by the formula ((current day) - (acquisition date)) and subsequently:
Before jumping into the visuals we still need to make sure our data pipeline is set up for regular processing with the lowest level of manual maintenance possible. Make sure to:
You can streamline your cohort analysis by automating your updates
All set and done? Great, then you’re finally ready to create your cohort analysis table.
You’ve prepared the ground for your cohort analysis, now it’s time to build a cohort analysis chart in your BI tool of choice.
You’ll need to add your cohort identifier — in this case, customer acquisition date, down the Y-axis of the table, and add the time elapsed along the row at the top. From here, you can use the table to aggregate the metric of choice — in this case, order value.
Each cell in this table represents the collective amount that a particular cohort of customers spent in a given month. So, if we take a look at the row for Jan-20, we can see that this cohort of customers who were acquired in January 2020 collectively spent $256 in the month they were acquired, and 7 months later, the same cohort of customers collectively spent $25.
Congratulations, you’ve created your first cohort analysis table!
Now your cohort analysis table is complete, and you have a blank canvas from which you can perform further analysis and derive more insights. As food for thought, you can try to:
It’s important to mention, these are just a few hints of what you can bring to the table you already have at hand. However, you can also integrate other kinds of data, for instance, socials and advertising spending to calculate the Customer Acquisition Cost (CAC), break-even analysis, or even the customer lifetime value. The sky’s the limit here.
A cohort analysis chart gives you excellent insights into how different cohorts behave over time. Once you have built a cohort analysis table, you can read left to right, top to bottom, or even diagonally to reveal trends in customer behavior over time. And things get even more interesting when you start comparing two different cohorts against each other.
Single cohort charts can help you identify and dig into the behavioral patterns of certain groups. But, comparing two charts allows you to track the impact of specific variables on the same audiences, or even assess the effectiveness of different approaches on varied audiences.
To learn more about how to read a cohort analysis chart, check out our blog “How To Read a Cohort Analysis Chart: Best Practice”.