Trending repositories for topic airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualizati...
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualizati...
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualizati...
Construct Apache Airflow DAGs Declaratively via YAML configuration files
More than 2000+ Data engineer interview questions.
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
A series of DAGs/Workflows to help maintain the operation of Airflow
End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)
基于 Apache Airflow 的微信智能应用编排框架,通过可视化工作流驱动 AI 与数据自动化任务。支持 智能客服(多轮对话/知识库)、AI 图文/短视频生成、智能提醒等应用,灵活扩展多模态交互与大模型能力。
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)
基于 Apache Airflow 的微信智能应用编排框架,通过可视化工作流驱动 AI 与数据自动化任务。支持 智能客服(多轮对话/知识库)、AI 图文/短视频生成、智能提醒等应用,灵活扩展多模态交互与大模型能力。
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Construct Apache Airflow DAGs Declaratively via YAML configuration files
More than 2000+ Data engineer interview questions.
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
A series of DAGs/Workflows to help maintain the operation of Airflow
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualizati...
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
More than 2000+ Data engineer interview questions.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Construct Apache Airflow DAGs Declaratively via YAML configuration files
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualizati...
基于 Apache Airflow 的微信智能应用编排框架,通过可视化工作流驱动 AI 与数据自动化任务。支持 智能客服(多轮对话/知识库)、AI 图文/短视频生成、智能提醒等应用,灵活扩展多模态交互与大模型能力。
🌀 𝗧𝗵𝗲 𝗙𝘂𝗹𝗹 𝗦𝘁𝗮𝗰𝗸 𝟳-𝗦𝘁𝗲𝗽𝘀 𝗠𝗟𝗢𝗽𝘀 𝗙𝗿𝗮𝗺𝗲𝘄𝗼𝗿𝗸 | 𝗟𝗲𝗮𝗿𝗻 𝗠𝗟𝗘 & 𝗠𝗟𝗢𝗽𝘀 for free by designing, building and deploying an end-to-end ML batch system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤...
Data Foundation - Google Cloud Cortex Framework
📈 A scalable, production-ready data pipeline for real-time streaming & batch processing, integrating Kafka, Spark, Airflow, AWS, Kubernetes, and MLflow. Supports end-to-end data ingestion, transforma...
基于 Apache Airflow 的微信智能应用编排框架,通过可视化工作流驱动 AI 与数据自动化任务。支持 智能客服(多轮对话/知识库)、AI 图文/短视频生成、智能提醒等应用,灵活扩展多模态交互与大模型能力。
Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize and recommend app.
A data and analytics engineering platform designed for real-time sports betting analytics.
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)
Data Foundation - Google Cloud Cortex Framework
A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset
🐳 Проектная деятельность. Здесь хранятся лекции, практические задания и проекты с karpov_courses. Ссылка: https://karpov.courses/
This repository contains code snippets, steps and other artifacts used in the youtube videos in the demo. You can use this to get access to the code or artifacts.
Airflow plugins for implementing data pipelines. | Plugins do Airflow para implementação de pipelines de dados.
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
More than 2000+ Data engineer interview questions.
A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
Arquitetura CRM de Baixo Custo com Gen AI, projetada para startups que precisam processar e analisar dados de vendas de forma eficiente.
基于 Apache Airflow 的微信智能应用编排框架,通过可视化工作流驱动 AI 与数据自动化任务。支持 智能客服(多轮对话/知识库)、AI 图文/短视频生成、智能提醒等应用,灵活扩展多模态交互与大模型能力。
Detailed notes and homeworks from 2025 Data Engineering Zoomcamp by Datatalks.Club
End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)
📈 A scalable, production-ready data pipeline for real-time streaming & batch processing, integrating Kafka, Spark, Airflow, AWS, Kubernetes, and MLflow. Supports end-to-end data ingestion, transforma...
A data and analytics engineering platform designed for real-time sports betting analytics.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
More than 2000+ Data engineer interview questions.
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
Construct Apache Airflow DAGs Declaratively via YAML configuration files
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualizati...
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
A series of DAGs/Workflows to help maintain the operation of Airflow
🌀 𝗧𝗵𝗲 𝗙𝘂𝗹𝗹 𝗦𝘁𝗮𝗰𝗸 𝟳-𝗦𝘁𝗲𝗽𝘀 𝗠𝗟𝗢𝗽𝘀 𝗙𝗿𝗮𝗺𝗲𝘄𝗼𝗿𝗸 | 𝗟𝗲𝗮𝗿𝗻 𝗠𝗟𝗘 & 𝗠𝗟𝗢𝗽𝘀 for free by designing, building and deploying an end-to-end ML batch system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤...
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
This repository contains code snippets, steps and other artifacts used in the youtube videos in the demo. You can use this to get access to the code or artifacts.
Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize and recommend app.
A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset
Dockerized monitoring stack for Apache Airflow
Projects done in the Data Engineer Nanodegree Program by Udacity.com
End-to-end data platform leveraging the Modern data stack
📈 A scalable, production-ready data pipeline for real-time streaming & batch processing, integrating Kafka, Spark, Airflow, AWS, Kubernetes, and MLflow. Supports end-to-end data ingestion, transforma...
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
This repository serves as a comprehensive guide to effective data modeling and robust data quality assurance using popular open-source tools
Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)
Arquitetura CRM de Baixo Custo com Gen AI, projetada para startups que precisam processar e analisar dados de vendas de forma eficiente.
A Python package that creates fine-grained dbt tasks on Apache Airflow
Full-stack Highly Scalable Cloud-native Machine Learning system for demand forecasting with realtime data streaming, inference, retraining loop, and more
User friendly and open source platform for workflow creation and monitoring
HashiQube - The Ultimate Hands on DevOps Lab running All the HashiCorp Products in a Github Codespace or a Docker Container using Vagrant or Docker Compose