Statistics for topic data-engineering

RepositoryStats tracks 650,729 Github repositories, of these 347 are tagged with the data-engineering topic. The most common primary language for repositories using this topic is Python (151). Other languages include: Jupyter Notebook (41),  Go (18),  TypeScript (15),  JavaScript (13),  Scala (13),  Rust (12)

Stargazers over time for topic data-engineering

Most starred repositories for topic data-engineering (view more)

15.0k
66.2k
apache-2.0
1.5k
Apache Superset is a Data Visualization and Data Exploration Platform
Created 2015-07-21
16,992 commits to master branch, last one 10 hours ago
15.0k
40.0k
apache-2.0
769
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Created 2015-04-13
30,384 commits to main branch, last one 17 hours ago
6.1k
38.6k
mit
1.2k
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Created 2018-11-05
18 commits to main branch, last one about a year ago
Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.
Created 2021-10-21
1,036 commits to main branch, last one 26 days ago
3.8k
28.0k
mit
954
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Created 2020-07-04
485 commits to main branch, last one about a year ago
1.8k
19.3k
apache-2.0
163
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Created 2018-06-29
19,120 commits to main branch, last one 17 hours ago

Trending repositories for topic data-engineering (view more)