Trending repositories for topic airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
User friendly and open source platform for workflow creation and monitoring
A CLI tool to streamline getting started with Apache Airflowโข and managing multiple Airflow projects
Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
User friendly and open source platform for workflow creation and monitoring
A CLI tool to streamline getting started with Apache Airflowโข and managing multiple Airflow projects
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
A series of DAGs/Workflows to help maintain the operation of Airflow
Source code of the Apache Airflow Tutorial for Beginners on YouTube Channel Coder2j (https://www.youtube.com/c/coder2j)
Data Foundation - Google Cloud Cortex Framework
A CLI tool to streamline getting started with Apache Airflowโข and managing multiple Airflow projects
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
ๅบไบ Apache Airflow ็ๅพฎไฟกๆบ่ฝๅบ็จ็ผๆๆกๆถ๏ผ้่ฟๅฏ่งๅๅทฅไฝๆต้ฉฑๅจ AI ไธๆฐๆฎ่ชๅจๅไปปๅกใๆฏๆ ๆบ่ฝๅฎขๆ๏ผๅค่ฝฎๅฏน่ฏ/็ฅ่ฏๅบ๏ผใAI ๅพๆ/็ญ่ง้ข็ๆใๆบ่ฝๆ้็ญๅบ็จ๏ผ็ตๆดปๆฉๅฑๅคๆจกๆไบคไบไธๅคงๆจกๅ่ฝๅใ
์ฌ์ฉ์๊ฐ ์ฑํ ์น์ ํตํด ์์ ์ด ์ฒํ ๋ฒ๋ฅ ์ ์ํฉ์ ์ ์ํ๋ฉด, ์ ๋ ฅ์ ๋ํ ๋ฌธ๋งฅ์ ๋ชจ๋ธ์ด ์ดํดํ์ฌ ๊ฐ์ด๋๋ผ์ธ์ ์ ์ํ๊ณ , ์ ์ฌํ ์ํฉ์ ํ๋ก๋ฅผ ์ ๊ณตํ๋ ์น ์๋น์ค์ ๋๋ค. (2023.08.18 ์๋น์ค ์ข ๋ฃ)
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
Projects done in the Data Engineer Nanodegree Program by Udacity.com
User friendly and open source platform for workflow creation and monitoring
Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
ๅบไบ Apache Airflow ็ๅพฎไฟกๆบ่ฝๅบ็จ็ผๆๆกๆถ๏ผ้่ฟๅฏ่งๅๅทฅไฝๆต้ฉฑๅจ AI ไธๆฐๆฎ่ชๅจๅไปปๅกใๆฏๆ ๆบ่ฝๅฎขๆ๏ผๅค่ฝฎๅฏน่ฏ/็ฅ่ฏๅบ๏ผใAI ๅพๆ/็ญ่ง้ข็ๆใๆบ่ฝๆ้็ญๅบ็จ๏ผ็ตๆดปๆฉๅฑๅคๆจกๆไบคไบไธๅคงๆจกๅ่ฝๅใ
์ฌ์ฉ์๊ฐ ์ฑํ ์น์ ํตํด ์์ ์ด ์ฒํ ๋ฒ๋ฅ ์ ์ํฉ์ ์ ์ํ๋ฉด, ์ ๋ ฅ์ ๋ํ ๋ฌธ๋งฅ์ ๋ชจ๋ธ์ด ์ดํดํ์ฌ ๊ฐ์ด๋๋ผ์ธ์ ์ ์ํ๊ณ , ์ ์ฌํ ์ํฉ์ ํ๋ก๋ฅผ ์ ๊ณตํ๋ ์น ์๋น์ค์ ๋๋ค. (2023.08.18 ์๋น์ค ์ข ๋ฃ)
Data Foundation - Google Cloud Cortex Framework
Source code of the Apache Airflow Tutorial for Beginners on YouTube Channel Coder2j (https://www.youtube.com/c/coder2j)
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
A CLI tool to streamline getting started with Apache Airflowโข and managing multiple Airflow projects
Projects done in the Data Engineer Nanodegree Program by Udacity.com
User friendly and open source platform for workflow creation and monitoring
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
A series of DAGs/Workflows to help maintain the operation of Airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
More than 2000+ Data engineer interview questions.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualizati...
ๅบไบ Apache Airflow ็ๅพฎไฟกๆบ่ฝๅบ็จ็ผๆๆกๆถ๏ผ้่ฟๅฏ่งๅๅทฅไฝๆต้ฉฑๅจ AI ไธๆฐๆฎ่ชๅจๅไปปๅกใๆฏๆ ๆบ่ฝๅฎขๆ๏ผๅค่ฝฎๅฏน่ฏ/็ฅ่ฏๅบ๏ผใAI ๅพๆ/็ญ่ง้ข็ๆใๆบ่ฝๆ้็ญๅบ็จ๏ผ็ตๆดปๆฉๅฑๅคๆจกๆไบคไบไธๅคงๆจกๅ่ฝๅใ
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
Projects done in the Data Engineer Nanodegree Program by Udacity.com
Dynamically generate Apache Airflow DAGs from YAML configuration files
Source code of the Apache Airflow Tutorial for Beginners on YouTube Channel Coder2j (https://www.youtube.com/c/coder2j)
A series of DAGs/Workflows to help maintain the operation of Airflow
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
User friendly and open source platform for workflow creation and monitoring
A CLI tool to streamline getting started with Apache Airflowโข and managing multiple Airflow projects
ๅบไบ Apache Airflow ็ๅพฎไฟกๆบ่ฝๅบ็จ็ผๆๆกๆถ๏ผ้่ฟๅฏ่งๅๅทฅไฝๆต้ฉฑๅจ AI ไธๆฐๆฎ่ชๅจๅไปปๅกใๆฏๆ ๆบ่ฝๅฎขๆ๏ผๅค่ฝฎๅฏน่ฏ/็ฅ่ฏๅบ๏ผใAI ๅพๆ/็ญ่ง้ข็ๆใๆบ่ฝๆ้็ญๅบ็จ๏ผ็ตๆดปๆฉๅฑๅคๆจกๆไบคไบไธๅคงๆจกๅ่ฝๅใ
๐ A scalable, production-ready data pipeline for real-time streaming & batch processing, integrating Kafka, Spark, Airflow, AWS, Kubernetes, and MLflow. Supports end-to-end data ingestion, transforma...
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
Detailed notes and homeworks from 2025 Data Engineering Zoomcamp by Datatalks.Club
Produce Kafka messages, consume them and upload into Cassandra, MongoDB.
Projects done in the Data Engineer Nanodegree Program by Udacity.com
This repository contains code snippets, steps and other artifacts used in the youtube videos in the demo. You can use this to get access to the code or artifacts.
User friendly and open source platform for workflow creation and monitoring
End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Data Foundation - Google Cloud Cortex Framework
A CLI tool to streamline getting started with Apache Airflowโข and managing multiple Airflow projects
Source code of the Apache Airflow Tutorial for Beginners on YouTube Channel Coder2j (https://www.youtube.com/c/coder2j)
An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer
More than 2000+ Data engineer interview questions.
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
์ฌ์ฉ์๊ฐ ์ฑํ ์น์ ํตํด ์์ ์ด ์ฒํ ๋ฒ๋ฅ ์ ์ํฉ์ ์ ์ํ๋ฉด, ์ ๋ ฅ์ ๋ํ ๋ฌธ๋งฅ์ ๋ชจ๋ธ์ด ์ดํดํ์ฌ ๊ฐ์ด๋๋ผ์ธ์ ์ ์ํ๊ณ , ์ ์ฌํ ์ํฉ์ ํ๋ก๋ฅผ ์ ๊ณตํ๋ ์น ์๋น์ค์ ๋๋ค. (2023.08.18 ์๋น์ค ์ข ๋ฃ)
Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)
Arquitetura CRM de Baixo Custo com Gen AI, projetada para startups que precisam processar e analisar dados de vendas de forma eficiente.
Detailed notes and homeworks from 2025 Data Engineering Zoomcamp by Datatalks.Club
ๅบไบ Apache Airflow ็ๅพฎไฟกๆบ่ฝๅบ็จ็ผๆๆกๆถ๏ผ้่ฟๅฏ่งๅๅทฅไฝๆต้ฉฑๅจ AI ไธๆฐๆฎ่ชๅจๅไปปๅกใๆฏๆ ๆบ่ฝๅฎขๆ๏ผๅค่ฝฎๅฏน่ฏ/็ฅ่ฏๅบ๏ผใAI ๅพๆ/็ญ่ง้ข็ๆใๆบ่ฝๆ้็ญๅบ็จ๏ผ็ตๆดปๆฉๅฑๅคๆจกๆไบคไบไธๅคงๆจกๅ่ฝๅใ
End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)
Integrating Airbyte, Kafka, Airflow and MLflow on Azure Linux VMs within private network to continuously retrain LSTM Attention model with 1-minute stock prices and redeploy it on Azure ML AKS real-ti...
๐ A scalable, production-ready data pipeline for real-time streaming & batch processing, integrating Kafka, Spark, Airflow, AWS, Kubernetes, and MLflow. Supports end-to-end data ingestion, transforma...
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
More than 2000+ Data engineer interview questions.
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualizati...
Dynamically generate Apache Airflow DAGs from YAML configuration files
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
A series of DAGs/Workflows to help maintain the operation of Airflow
๐ ๐ง๐ต๐ฒ ๐๐๐น๐น ๐ฆ๐๐ฎ๐ฐ๐ธ ๐ณ-๐ฆ๐๐ฒ๐ฝ๐ ๐ ๐๐ข๐ฝ๐ ๐๐ฟ๐ฎ๐บ๐ฒ๐๐ผ๐ฟ๐ธ | ๐๐ฒ๐ฎ๐ฟ๐ป ๐ ๐๐ & ๐ ๐๐ข๐ฝ๐ for free by designing, building and deploying an end-to-end ML batch system ~ ๐ด๐ฐ๐ถ๐ณ๐ค๐ฆ ๐ค...
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset
A Python package that creates fine-grained dbt tasks on Apache Airflow
Full-stack Highly Scalable Cloud-native Machine Learning system for demand forecasting with realtime data streaming, inference, retraining loop, and more
Dockerized monitoring stack for Apache Airflow
This repository serves as a comprehensive guide to effective data modeling and robust data quality assurance using popular open-source tools
End-to-end data platform leveraging the Modern data stack
User friendly and open source platform for workflow creation and monitoring
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)
Projects done in the Data Engineer Nanodegree Program by Udacity.com
Arquitetura CRM de Baixo Custo com Gen AI, projetada para startups que precisam processar e analisar dados de vendas de forma eficiente.
HashiQube - The Ultimate Hands on DevOps Lab running All the HashiCorp Products in a Github Codespace or a Docker Container using Vagrant or Docker Compose
Built a real-time streaming pipeline to extract stock data, using Apache Nifi, Debezium, Kafka, and Spark Streaming. Loaded the transformed data into Glue database and created real-time dashboards usi...
๐ A scalable, production-ready data pipeline for real-time streaming & batch processing, integrating Kafka, Spark, Airflow, AWS, Kubernetes, and MLflow. Supports end-to-end data ingestion, transforma...