Statistics for topic etl
RepositoryStats tracks 579,129 GitHub repositories; of these, 255 are tagged with the etl topic. The most common primary language for repositories using this topic is Python (88). Other languages include Go (38), Java (29), TypeScript (12), Rust (11), and Scala (11).
Stargazers over time for topic etl
Most starred repositories for topic etl
Trending repositories for topic etl
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
An orchestration platform for the development, production, and observation of data assets.
A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal vector embeddings.
Ylem is an open-source platform for real-time data streaming orchestration
The open-source, model-agnostic alternative to OpenAI's structured outputs for your own documents or the web.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
An orchestration platform for the development, production, and observation of data assets.
Ylem is an open-source platform for real-time data streaming orchestration
The open-source, model-agnostic alternative to OpenAI's structured outputs for your own documents or the web.
Visual Data Transformation with Python Code Generation. Low-Code Python-based ETL.
A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal vector embeddings.
Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.
An orchestration platform for the development, production, and observation of data assets.
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
The open-source, model-agnostic alternative to OpenAI's structured outputs for your own documents or the web.