9 results found Sort:

Personal Data Engineering Projects
Created 2020-04-20
65 commits to master branch, last one 2 years ago
93
274
apache-2.0
13
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
Created 2019-03-21
280 commits to master branch, last one 11 months ago
Redshift Python Connector. It supports Python Database API Specification v2.0.
Created 2020-07-29
338 commits to master branch, last one about a month ago
The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such as Apache Airflow, AWS Redshift and Power BI.
Created 2021-04-15
247 commits to main branch, last one 2 years ago
Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation, validation and loading of data from S3 -> Redshift -> S3
Created 2019-09-30
27 commits to master branch, last one 2 years ago
Build clickstream analytics on AWS for your mobile and web applications
Created 2023-05-26
1,627 commits to main branch, last one 29 days ago
Udacity Data Engineering Nanodegree Program
Created 2021-01-19
47 commits to main branch, last one 3 years ago
:arrows_counterclockwise: :running: EtLT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow
Created 2022-06-17
69 commits to master branch, last one 2 years ago
Project was based on an interest in Data Engineering, ETL pipeline. It also provided a good opportunity to develop skills and experience in a range of tools. As such, project is more complex than req...
Created 2023-09-12
2 commits to main branch, last one about a year ago