This repo is an open-source data engineering and AI/ML knowledge base.
It is a collection of design principles, strategies and frameworks to help you build data infrastructure and set up data teams.
It covers data infrastructure, product analytics, analytics engineering, product and engineering management principles and more.
A glossary of terms and concepts that are commonly used in the data world.
Coding standards, best practices and design principles for building warehouses, pipelines and data products that are easy to understand and maintain.
A step-by-step guide to building a Kimball-style data warehouse for an e-commerce store using dbt.
A fictional startup called “Jaffle Shop” is used to illustrate how data can be used to make decisions in a business.