Statistics for topic etl
RepositoryStats tracks 633,534 GitHub repositories; of these, 285 are tagged with the etl topic. The most common primary language for repositories using this topic is Python (102). Other languages include Go (39), Java (33), TypeScript (14), Rust (12), Scala (12), and JavaScript (11).
Stargazers over time for topic etl
Most starred repositories for topic etl
Trending repositories for topic etl
ETL framework to index data for AI, such as RAG; with realtime incremental updates and support custom logic like lego.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
An open source, standard data file format for graph data storage and retrieval.
Ape Data Transfer Suite, written in Rust. Provides ultra-fast data replication between MySQL, PostgreSQL, Redis, MongoDB, Kafka and ClickHouse, ideal for disaster recovery (DR) and migration scenarios...
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
A curated list of awesome system integration software and resources.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
An orchestration platform for the development, production, and observation of data assets.
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Open Source Data Security Platform for Developers to Monitor and Detect PII, Anonymize Production Data and Sync it across environments.
Visual Data Transformation and Data Preparation. Low-Code Python-based ETL.
A hub for various industry-specific schemas to be used with VLMs.
Structured Data Extractor for AI Agents. Search your documents or the web for specific data and get it back in JSON or Markdown in a single tool call.