DuckDB is my new Excel
How to replace Excel with DuckDB for ad-hoc data exploration: query remote CSVs, ingest multiple files with wildcards, handle schema differences with union-by-name, unnest lists, aggregate and count, and flatten JSON API responses. If you can work with Excel formulas, you can do this.
Optimizing Snowflake Costs with dbt Query Tags
Set Snowflake query tags with dbt to monitor which models are burning through your Snowflake credits. Track usage over time, identify expensive models, and optimize your biggest cost drivers with three ingredients: a custom macro, usage tables as dbt sources, and comprehensive cost calculations.
Test Driven Development (TDD) with dbt: Test First, SQL Later
Stop building dbt models and praying they're correct. Start defining what "good" looks like first. This guide shows you how to apply TDD to analytics engineering, from unit tests to model contracts, so your data is trustworthy instead of just hope-it-works.
Unit Testing dbt Macros: A workaround for dbt's unit testing limitations
Ever wished you could catch that broken SQL logic before it wrecks your dashboards? With dbt 1.8's new unit testing capabilities, you can finally sleep at night! However, support for testing macros is still limited. Let's explore how to test both models and macros with a workaround.
Data Ingestion Pipelines Without Headaches: 8 simple steps
Data, like wine and cheese, becomes more valuable when combined. However, to combine data, you must first retrieve it in a reliable and scalable manner. This post covers the 8 steps of a data ingestion pipeline and 3 overarching topics to ensure reliability and quality over time.