This project demonstrates a CI/CD approach for handling daily SQL schema changes part of the development process. Two fundamental questions are getting adressed by this: How do we define the SQL ...
Short description (≤350 chars): PySpark analytics pipeline: ingest orders/products/customers/returns, transform with broadcast joins and windowing, data quality ...