In this article, we’ll walk you through everything you need to know about the new dbt™ pricing plan and what are your options.
Over the last 6 months, dbt Cloud™ pricing has changed drastically. On top of the 100–700% increase from Dec-2022, we are now looking at Shiny New Pricing that’s going to cut even deeper holes into already stretched analytics budgets.
In this article, we’ll walk you through everything you need to know about the new dbt™ pricing plan. We’ll help you answer questions like:
Let’s jump right into it by answering the first question:
Here’s a simple illustration of the old price plan vs the new one. Don’t worry, we’ll summarize it for you below.
Developer Tier:
Team Tier:
($100 x number of developer seats) + ((models built — 15,000) x $0.01)
Let’s say your organization has 12 seats and you run 95k models per month, how much will you be paying each month?
Enterprise Tier:
Let’s read dbt™’s definition and then unpack it.
So, anytime you make a dbt™ run outside the dbt Cloud™ IDE environment (ex. a table and/or view is updated in your data warehouse), your dbt Cloud™ bill goes up by $.010 per successful model built. This price hike occurs whenever you execute any of the following:
It does not matter if the whole run fails; all successfully built models during even a failed run are charged.
All of these dbt™ runs were free before the price hike.
Let’s look at the impact of model build-based pricing. Financially, it comes down to how many models do you have and how many times do you run them during the day. But there is a deeper operational impact as well to consider.
To understand, how problematic this is, let’s say you have 500 models in the dbt™ repo.
If we assume:
These are pretty reasonable assumptions. We have users, who are running 100+ schedules.
With the above assumptions, we have:
In the team tier, with 8 seats, you would now be paying 225% more than before. Where your annual bill was $9.6k, it will now be around $31k!
More than the financial impact, this situation presents a massive operational impact. I can immediately think of questions that would probably be top of mind for every data leader:
These are real operational questions that data leaders need to ask. And whether the operational overhead for all of this is really worth it.
Depending on your team size, repo size, and scheduling frequency, you could be looking at your cost to increase between 160% and 1700%+.
From our estimates, if you have 200 models, then you can run ONLY 1 schedule and 1 CI job per day to keep your budget unaffected. That for all practical purposes is impossible to achieve.
At the beginning of this year, Christof Blefari wrote a pretty insightful post on how to manage and schedule dbt™. At the end of the post he had a comparison chart, which I have re-produced below on all the various options available.
All the non-SaaS options, require both data engineering time and data engineering teams of varying sizes. Now, data engineering time is precious and it’s expensive. I would argue, its best spent solving hard data engineering problems than maintaining analytics platforms.
But in a non-SaaS world, you don’t have that option. So we created a little comparison to show what’s the actual cost of various options outside dbt Cloud™.
We assumed:
Based on the above, there would be 195,000 successful model-builds per month, and the alternatives and their associated costs would be:
We can see that:
If we plot the timeline between Dec-2022 and Dec-2023, for a lot of teams, Jan-2024 will be the start of the new financial year. The time to get those budget approvals is between now and October. Otherwise, you run the risk of scrambling for a budget once again, just like what happened last year.
Making sure data budgets are flat from last year is top of mind for every CFO or VP Finance I have spoken to. It’s important that they are onboard with your plans.
As a thought exercise, how are you thinking of protecting / hedging yourself from another price increase in December? Paradime aside, I would love to know and help in any way I can.
I am also wondering, what if dbt™ Labs says, you will have to pay for model execution time as well after Coalesce?
In conclusion, if you are a data leader, and you are using dbt™ or considering dbt™ and reading this, you have three choices: