Statistics for topic data-engineering

RepositoryStats tracks 633,144 Github repositories, of these 333 are tagged with the data-engineering topic. The most common primary language for repositories using this topic is Python (144). Other languages include: Jupyter Notebook (39),  Go (18),  TypeScript (15),  JavaScript (12),  Scala (12),  Rust (11)

Stargazers over time for topic data-engineering

500500450450400400350350300300250250200200150150100100505000202020202021202120222022202320232024202420252025

Most starred repositories for topic data-engineering (view more)

14.7k
65.2k
apache-2.0
1.5k
Apache Superset is a Data Visualization and Data Exploration Platform
Created 2015-07-21
16,826 commits to master branch, last one a day ago
14.8k
39.4k
apache-2.0
764
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Created 2015-04-13
29,163 commits to main branch, last one 20 hours ago
6.1k
38.4k
mit
1.2k
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Created 2018-11-05
18 commits to main branch, last one about a year ago
Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.
Created 2021-10-21
1,033 commits to main branch, last one a day ago
3.7k
27.9k
mit
951
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Created 2020-07-04
485 commits to main branch, last one 11 months ago
1.8k
18.8k
apache-2.0
162
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Created 2018-06-29
18,898 commits to main branch, last one a day ago

Trending repositories for topic data-engineering (view more)