Statistics for topic pipeline
RepositoryStats tracks 595,858 Github repositories, of these 442 are tagged with the pipeline topic. The most common primary language for repositories using this topic is Python (145). Other languages include: Go (44), Nextflow (37), Java (23), Jupyter Notebook (22), TypeScript (19), R (15), Rust (14), C++ (13), Shell (13)
Stargazers over time for topic pipeline
Most starred repositories for topic pipeline (view more)
Trending repositories for topic pipeline (view more)
Turns Data and AI algorithms into production-ready web applications in no time.
:zap: Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Notebooks, code samples, sample apps, and other resources that demonstrate how to use, develop and manage machine learning and generative AI workflows using Google Cloud Vertex AI.
Turns Data and AI algorithms into production-ready web applications in no time.
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Turns Data and AI algorithms into production-ready web applications in no time.
:zap: Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
CDK Express Pipelines is a library built on the AWS CDK, allowing you to define pipelines in a CDK-native method. It leverages the CDK CLI to compute and deploy the correct dependency graph between Wa...
Notebooks, code samples, sample apps, and other resources that demonstrate how to use, develop and manage machine learning and generative AI workflows using Google Cloud Vertex AI.
This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spark (calculation) and dbt (transformation)
Turns Data and AI algorithms into production-ready web applications in no time.
:zap: Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Kubernetes-native platform to run massively parallel data/streaming jobs
Notebooks, code samples, sample apps, and other resources that demonstrate how to use, develop and manage machine learning and generative AI workflows using Google Cloud Vertex AI.
Kubernetes-native platform to run massively parallel data/streaming jobs
CDK Express Pipelines is a library built on the AWS CDK, allowing you to define pipelines in a CDK-native method. It leverages the CDK CLI to compute and deploy the correct dependency graph between Wa...
Tiny automation pipelines. Bring CI/CD to the smallest projects. Self-hosted, Lightweight, CLI only.
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
Turns Data and AI algorithms into production-ready web applications in no time.
:zap: Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.