7 results found
🌀 The Full Stack 7-Steps MLOps Framework | Learn MLE & MLOps for free by designing, building and deploying an end-to-end ML batch system ~ source c...
Created
2023-02-04
391 commits to main branch, last one 7 months ago
The Lakehouse Engine is a configuration-driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Prod...
Created
2022-11-11
17 commits to master branch, last one 23 days ago
Sample project to demonstrate data engineering best practices
Created
2023-08-04
16 commits to main branch, last one 9 months ago
Nyc_Taxi_Data_Pipeline - DE Project
Created
2024-06-21
33 commits to main branch, last one about a month ago
Learn how to create reliable ML systems by testing code, data and models.
Created
2022-08-01
5 commits to main branch, last one 2 years ago
Data Quality Gate based on AWS
Created
2022-05-31
343 commits to main branch, last one 4 months ago
Tutorial for implementing data validation in data science pipelines
Created
2022-05-14
39 commits to main branch, last one 2 years ago