GCP - Tags - Jaehyeon Kim

Data Build Tool (Dbt) Pizza Shop Demo - Part 4 ETL on BigQuery via Airflow

February 22, 2024 | 9 min read | Data Engineering, DBT Pizza Shop Demo, Apache Airflow, BigQuery, Data Build Tool (DBT), Docker, Docker Compose, GCP, Python

In Part 3, we developed a dbt project that targets Google BigQuery with fictional pizza shop data. Two dimension tables that keep product and user records are created as Type 2 slowly changing dimension (SCD Type 2) tables, and one transactional fact table is built to keep pizza orders. The fact table is denormalized using nested and repeated fields to improve query performance. In this post, we discuss how to set up an ETL process on the project using Apache Airflow.

Data Build Tool (Dbt) Pizza Shop Demo - Part 3 Modelling on BigQuery

February 8, 2024 | 16 min read | Data Engineering, DBT Pizza Shop Demo, BigQuery, Data Build Tool (DBT), GCP, Python

In this series, we discuss practical examples of data warehouse and lakehouse development where data transformation is performed by the data build tool (dbt) and ETL is managed by Apache Airflow. In Part 1, we developed a dbt project on PostgreSQL using fictional pizza shop data. At the end, the data sets were modelled as two SCD Type 2 dimension tables and one transactional fact table. In this post, we create a new dbt project that targets Google BigQuery. While the dimension tables keep the same SCD Type 2 approach, the fact table is denormalized using nested and repeated fields, which can potentially improve query performance by pre-joining the corresponding dimension records.