Statistics for topic data-pipelines

RepositoryStats tracks 596,208 Github repositories, of these 54 are tagged with the data-pipelines topic. The most common primary language for repositories using this topic is Python (22).

Stargazers over time for topic data-pipelines

Most starred repositories for topic data-pipelines (view more)

14.4k
38.0k
apache-2.0
764
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Created 2015-04-13
27,387 commits to main branch, last one 3 hours ago
2.5k
25.9k
apache-2.0
143
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Created 2023-12-12
1,951 commits to main branch, last one 2 days ago
4.7k
13.2k
apache-2.0
328
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Created 2019-03-01
8,579 commits to dev branch, last one a day ago
1.5k
12.1k
apache-2.0
124
An orchestration platform for the development, production, and observation of data assets.
Created 2018-04-30
21,587 commits to master branch, last one 10 hours ago
802
9.5k
apache-2.0
62
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Created 2022-09-26
1,656 commits to main branch, last one 2 days ago
778
8.0k
apache-2.0
62
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Created 2022-05-16
5,511 commits to master branch, last one 8 hours ago

Trending repositories for topic data-pipelines (view more)