Statistics for topic data-engineering

RepositoryStats tracks 633,144 GitHub repositories, of which 333 are tagged with the data-engineering topic. The most common primary language for repositories using this topic is Python (144). Other languages include Jupyter Notebook (39), Go (18), TypeScript (15), JavaScript (12), Scala (12), and Rust (11).
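To put the counts above in proportion, here is a minimal sketch that turns them into shares. The numbers are copied from the summary; the "Other/unlisted" bucket is an assumption derived by subtraction (it is not reported by the site and may include repositories with no detected language), and the function name `language_shares` is purely illustrative.

```python
# Counts copied from the summary above.
TOTAL_TRACKED = 633_144   # all repositories tracked by the site
TOPIC_TOTAL = 333         # repositories tagged data-engineering

language_counts = {
    "Python": 144,
    "Jupyter Notebook": 39,
    "Go": 18,
    "TypeScript": 15,
    "JavaScript": 12,
    "Scala": 12,
    "Rust": 11,
}


def language_shares(counts, topic_total):
    """Return each language's fraction of the topic's repositories.

    The remainder not covered by the listed languages is reported under
    the hypothetical key "Other/unlisted" (derived, not from the source).
    """
    shares = {lang: n / topic_total for lang, n in counts.items()}
    shares["Other/unlisted"] = (topic_total - sum(counts.values())) / topic_total
    return shares


shares = language_shares(language_counts, TOPIC_TOTAL)
topic_fraction = TOPIC_TOTAL / TOTAL_TRACKED

if __name__ == "__main__":
    print(f"Topic share of all tracked repositories: {topic_fraction:.4%}")
    for lang, share in sorted(shares.items(), key=lambda kv: -kv[1]):
        print(f"{lang:<18} {share:6.1%}")
```

On these figures, Python accounts for roughly 43% of the tagged repositories, and the seven listed languages together cover 251 of the 333.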

Stargazers over time for topic data-engineering

Most starred repositories for topic data-engineering

superset apache

14.7k

65.2k

apache-2.0

1.5k

Apache Superset is a Data Visualization and Data Exploration Platform

bi asf flask react apache python data-viz superset analytics sql-editor data-science data-analysis data-analytics apache-superset data-engineering business-analytics data-visualization business-intelligence

Created 2015-07-21

16,826 commits to master branch, last one a day ago

airflow apache

14.8k

39.4k

apache-2.0

764

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Created 2015-04-13

29,163 commits to main branch, last one 20 hours ago

Made-With-ML GokuMohandas

6.1k

38.4k

mit

1.2k

Learn how to design, develop, deploy and iterate on production-grade ML applications.

ray llms mlops python pytorch data-quality data-science deep-learning distributed-ml data-engineering machine-learning distributed-training natural-language-processing

Created 2018-11-05

18 commits to main branch, last one about a year ago

data-engineering-zoomcamp DataTalksClub

6.3k

29.7k

unknown

493

Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.

dbt kafka spark docker kestra data-engineering

Created 2021-10-21

1,033 commits to main branch, last one a day ago

applied-ml eugeneyan

3.7k

27.9k

mit

951

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

recsys search production data-quality data-science deep-learning data-discovery computer-vision data-engineering machine-learning applied-data-science reinforcement-learning applied-machine-learning natural-language-processing

Created 2020-07-04

485 commits to main branch, last one 11 months ago

prefect PrefectHQ

1.8k

18.8k

apache-2.0

162

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

data ml-ops python prefect data-ops pipeline workflow automation data-science observability orchestration infrastructure workflow-engine data-engineering

Created 2018-06-29

18,898 commits to main branch, last one a day ago
