42 results found Sort:
- Filter by Primary Language:
- Python (17)
- Jupyter Notebook (5)
- Java (3)
- HTML (2)
- TypeScript (2)
- MDX (1)
- Rust (1)
- Scala (1)
- Shell (1)
- VBA (1)
- PLpgSQL (1)
- +
This is a repo with links to everything you'd ever want to learn about data engineering
Created
2023-11-19
407 commits to main branch, last one 12 days ago
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Created
2021-08-01
12,416 commits to main branch, last one 8 hours ago
Compare tables within or across databases
This repository has been archived
(exclude archived)
Created
2022-03-07
1,932 commits to master branch, last one 11 months ago
Scalable and efficient data transformation framework - backwards compatible with dbt.
Created
2022-09-23
3,571 commits to main branch, last one 8 hours ago
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Created
2021-08-25
2,622 commits to main branch, last one 2 days ago
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
Created
2019-09-27
4,057 commits to master branch, last one about a month ago
This repository has no description...
Created
2024-04-05
54 commits to main branch, last one 4 months ago
This repository has no description...
Created
2022-06-09
1,085 commits to master branch, last one 22 days ago
This repository provides various demos/examples of using Snowpark for Python.
Created
2022-05-26
226 commits to main branch, last one about a year ago
An open source development framework to help you build data workflows and modern data architecture on AWS.
Created
2022-02-16
566 commits to main branch, last one about a month ago
Roadmap for Data Engineering
Created
2022-09-30
41 commits to main branch, last one 11 months ago
Code and data for the Modern Polars book
Created
2022-12-21
164 commits to master branch, last one 4 months ago
end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence
Created
2024-01-29
132 commits to main branch, last one 20 days ago
Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of closed and open source data tools and methods / techniques in ...
Created
2023-09-05
189 commits to main branch, last one 12 days ago
Все, о чем меня когда-либо спрашивали на собеседованиях, и другие полезные знания в кратком формате
Created
2021-06-19
212 commits to main branch, last one 10 months ago
A Data Platform built for AWS, powered by Kubernetes.
This repository has been archived
(exclude archived)
Created
2020-10-08
894 commits to main branch, last one about a year ago
The developer framework for your data & analytics stack
Created
2023-07-20
1,287 commits to main branch, last one a day ago
Simple stream processing pipeline
Created
2020-10-03
25 commits to main branch, last one 10 months ago
Index for online reading materials in order to learn Python and backend development/engineering concepts from scratch and develop a mastery sufficient for Senior/Principal Backend Engineers and Data E...
Created
2023-02-05
46 commits to main branch, last one about a year ago
Recohut - Learn data engineering, data science
Created
2022-12-30
190 commits to main branch, last one about a year ago
Resources about data science, machine learning, deep learning, data engineering, and SQL.
Created
2022-10-14
119 commits to main branch, last one about a year ago
Duke MIDS: Data Engineering and DataOps Course
Created
2021-07-04
137 commits to main branch, last one 3 months ago
Found a data engineering challenge or participated in a selection process ? Share with us!
github
beginner
beginners
help-wanted
first-timers
data-pipeline
github-events
hacktoberfest
markdown-only
data-pipelines
code-challenges
dataengineering
beginner-project
data-engineering
beginner-friendly
hacktoberfest2022
hacktoberfest-2022
interview-practice
hacktoberfest-accepted
code-challenge-practice
Created
2022-10-08
41 commits to main branch, last one 2 years ago
Data Engineering/Scraping Project. Creating a detailed Sports Relational Database for the Top European Soccer Leagues.
Created
2021-03-25
106 commits to master branch, last one 3 years ago
A guide for leading a data (engineering) team
Created
2024-05-07
6 commits to main branch, last one 11 months ago
Build, test, deploy, iterate - Dev and prod tool for data science pipelines
Created
2019-03-17
165 commits to master branch, last one 5 years ago
Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market
Created
2022-06-30
45 commits to master branch, last one 2 years ago
Build & Learn Data Engineering,Machine Learning over Kubernetes. No Shortcut approach.
Created
2022-05-02
55 commits to main branch, last one 2 years ago
Apply for a job at Olist's Data Team: https://olist.gupy.io/
Created
2018-09-21
52 commits to master branch, last one 3 years ago
Companion repository that goes along with Snowflake's "Introduction to Modern Data Engineering with Snowflake" course on Coursera
Created
2024-06-14
13 commits to main branch, last one about a month ago