Statistics for topic data-pipelines
RepositoryStats tracks 596,208 Github repositories, of these 54 are tagged with the data-pipelines topic. The most common primary language for repositories using this topic is Python (22).
Stargazers over time for topic data-pipelines
Most starred repositories for topic data-pipelines (view more)
Trending repositories for topic data-pipelines (view more)
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
Visual Data Transformation with Python Code Generation. Low-Code Python-based ETL.
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
An orchestration platform for the development, production, and observation of data assets.
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.