Identify and Automate Data Checks on Critical dbt Models
Do you know which are the critical models in your data project?
I’m sure the answer is yes. Even if you don’t rank models, you can definitely point to which models you should tread carefully around.
Do you check these critical models for data impact with every pull request?
Maybe some, but it’s probably on a more ad-hoc basis. If they really are critical models, you need to be aware of unintended impact. The last thing you want to do is mistakenly change historical metrics, or lose data.
Identifying critical models
Knowing the critical models in your project comes from your domain knowledge. You know these models have: