6 results found Sort:

Implementing best practices for PySpark ETL jobs and applications.
Created 2017-12-28
36 commits to master branch, last one 3 years ago
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Created 2020-02-13
50 commits to master branch, last one 5 years ago
102
726
mit
20
Mass processing data with a complete ETL for .net developers
Created 2018-06-23
925 commits to master branch, last one 10 days ago
Provides guidance for fast ETL jobs, an IDataReader implementation for SqlBulkCopy (or the MySql or Oracle equivalents) that wraps an IEnumerable, and libraries for mapping entites to table columns.
Created 2014-03-02
245 commits to master branch, last one 9 months ago
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
Created 2024-12-30
65 commits to main branch, last one about a month ago
13
45
bsd-3-clause
7
This repository has no description...
Created 2020-07-03
86 commits to master branch, last one 4 months ago