Statistics for topic data-integration
RepositoryStats tracks 569,918 Github repositories, of these 56 are tagged with the data-integration topic. The most common primary language for repositories using this topic is Python (19).
Stargazers over time for topic data-integration
Most starred repositories for topic data-integration (view more)
Trending repositories for topic data-integration (view more)
Turns Data and AI algorithms into production-ready web applications in no time.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
An orchestration platform for the development, production, and observation of data assets.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Turns Data and AI algorithms into production-ready web applications in no time.
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources
NicheNet: predict active ligand-target links between interacting cells
Turns Data and AI algorithms into production-ready web applications in no time.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
An orchestration platform for the development, production, and observation of data assets.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Turns Data and AI algorithms into production-ready web applications in no time.
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
A curated list of open source tools used in analytical stacks and data engineering ecosystem
The Common Core Ontology Repository holds the current released version of the Common Core Ontology suite.
Turns Data and AI algorithms into production-ready web applications in no time.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
An orchestration platform for the development, production, and observation of data assets.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Turns Data and AI algorithms into production-ready web applications in no time.
Perform historical snapshots without database locks and read change data capture logs from databases. Artie Reader is compatible with Debezium and is written in Go.
The Common Core Ontology Repository holds the current released version of the Common Core Ontology suite.
A high-performance, extremely flexible, and easily extensible universal workflow engine.
A curated list of open source tools used in analytical stacks and data engineering ecosystem
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
A curated list of open source tools used in analytical stacks and data engineering ecosystem
A high-performance, extremely flexible, and easily extensible universal workflow engine.
Turns Data and AI algorithms into production-ready web applications in no time.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
An orchestration platform for the development, production, and observation of data assets.
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
Self-contained distributed software platform for building stateful, massively real-time streaming applications in Rust.
Turns Data and AI algorithms into production-ready web applications in no time.
A tool for semi-automatic cell type harmonization and integration
A curated list of open source tools used in analytical stacks and data engineering ecosystem
Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.