This is a repo with links to everything you'd ever want to learn about data engineering
-
Updated
Dec 18, 2023
This is a repo with links to everything you'd ever want to learn about data engineering
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
Compare tables within or across databases
SQLMesh is a data transformation framework that brings the benefits of DevOps to data teams. It enables data scientists, analysts, and engineers to efficiently run and deploy data transformations written in SQL or Python.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
An open source development framework to help you build data workflows and modern data architecture on AWS.
This repository provides various demos/examples of using Snowpark for Python.
Code and data for the Modern Polars book
A Data Platform built for AWS, powered by Kubernetes.
Roadmap for Data Engineering
Recohut - Learn data engineering, data science
Index for online reading materials in order to learn Python and backend development/engineering concepts from scratch and develop a mastery sufficient for Senior/Principal Backend Engineers and Data Engineers
Все, о чем меня когда-либо спрашивали на собеседованиях, и другие полезные знания в кратком формате
Predict stock price based on financial news feeds
Resources about data science, machine learning, deep learning, data engineering, and SQL.
Data engineering interviews Q&A for data community by data community
Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of closed and open source data tools and methods / techniques in the Philippines. Data Engineering Pilipinas is a PyData group.
Build, test, deploy, iterate - Dev and prod tool for data science pipelines
Add a description, image, and links to the dataengineering topic page so that developers can more easily learn about it.
To associate your repository with the dataengineering topic, visit your repo's landing page and select "manage topics."