Statistics for topic data-pipelines

RepositoryStats tracks 635,088 GitHub repositories, of which 60 are tagged with the data-pipelines topic. The most common primary language for repositories using this topic is Python (25 repositories).

Stargazers over time for topic data-pipelines

[Chart: y-axis from 0 to 60, x-axis from 2020 to 2025]

Most starred repositories for topic data-pipelines

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Stars: 39.5k, Forks: 14.8k, Watchers: 764, License: apache-2.0
Created 2015-04-13
29,332 commits to main branch, last one 7 hours ago

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Stars: 23.7k, Forks: 348, Watchers: 47, License: other
Created 2022-11-27
1,227 commits to main branch, last one 9 hours ago

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Stars: 13.4k, Forks: 4.7k, Watchers: 325, License: apache-2.0
Created 2019-03-01
8,628 commits to dev branch, last one a day ago

An orchestration platform for the development, production, and observation of data assets.
Stars: 12.9k, Forks: 1.6k, Watchers: 123, License: apache-2.0
Created 2018-04-30
23,206 commits to master branch, last one 7 hours ago

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Stars: 10.8k, Forks: 894, Watchers: 68, License: apache-2.0
Created 2022-09-26
1,712 commits to main branch, last one 23 hours ago

🧙 Build, run, and manage data pipelines for integrating and transforming data.
Stars: 8.2k, Forks: 828, Watchers: 62, License: apache-2.0
Created 2022-05-16
5,573 commits to master branch, last one 2 days ago

Trending repositories for topic data-pipelines