45 results found

1.7k
10.6k
apache-2.0
143
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Created 2016-03-10
7,014 commits to main branch, last one 2 days ago
55
2.3k
apache-2.0
13
Open-source BI for engineers
Created 2024-02-20
415 commits to main branch, last one 4 months ago
The best place to learn data engineering. Built and maintained by the data engineering community.
Created 2021-05-04
289 commits to main branch, last one 9 days ago
A few projects related to Data Engineering, including Data Modeling, Infrastructure setup on cloud, Data Warehousing, and Data Lake development.
Created 2020-01-20
80 commits to master branch, last one 5 years ago
100
1.2k
other
20
MetricFlow allows you to define, build, and maintain metrics in code.
Created 2022-04-04
2,702 commits to main branch, last one a day ago
Personal Data Engineering Projects
Created 2020-04-20
65 commits to master branch, last one 2 years ago
33
915
apache-2.0
8
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Created 2023-08-03
2,592 commits to main branch, last one 3 days ago
55
866
mit
6
Data modeling and relation library for testing JavaScript applications.
Created 2020-12-08
297 commits to main branch, last one 6 months ago
76
853
apache-2.0
21
Do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies, and build machine learning models.
This repository has been archived
Created 2021-01-24
860 commits to main branch, last one 11 months ago
159
794
gpl-3.0
52
Structr is an integrated low-code development and runtime environment that uses a graph database.
Created 2011-02-01
15,912 commits to main branch, last one 5 days ago
92
664
bsd-3-clause
18
LLM-based ontological extraction tools, including SPIRES
Created 2023-01-03
1,945 commits to main branch, last one 2 days ago
882
624
mit
21
Python library and web service for Open Source Software Health and Sustainability metrics & data collection. You can find our documentation and new contributor information easily here: https://oss-aug...
Created 2017-01-05
11,894 commits to main branch, last one 10 days ago
A hackable data integration & analysis tool that enables non-technical users to edit data processing jobs and visualise data on demand.
Created 2019-04-26
332 commits to master branch, last one 5 years ago
⚡ A collection of resources and tutorials to design a better database schema.
Created 2020-07-05
50 commits to master branch, last one 8 months ago
39
539
apache-2.0
9
Framework that joins data models, schemas, code generation, and a task engine. Language and technology agnostic.
Created 2020-02-06
1,013 commits to _dev branch, last one 2 months ago
Sample databases for postgres
Created 2016-01-24
27 commits to master branch, last one about a year ago
20
449
gpl-3.0
10
Create diagrams and plan your code with TypeScript.
Created 2023-12-03
130 commits to master branch, last one 5 days ago
Typed struct and value objects
Created 2016-06-30
798 commits to main branch, last one 22 days ago
Repository for the ActivitySchema spec and supporting materials
Created 2021-03-05
22 commits to main branch, last one 2 years ago
113
359
other
20
Linked Open Data Modeling Language
Created 2021-03-16
2,761 commits to main branch, last one 5 days ago
26
355
apache-2.0
2
Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Created 2021-06-17
1,183 commits to main branch, last one 2 years ago
Projects done in the Data Engineer Nanodegree Program by Udacity.com
Created 2020-04-01
60 commits to master branch, last one 4 years ago
16
123
apache-2.0
5
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Created 2020-04-22
195 commits to main branch, last one about a year ago
The goal of this project is to track Uber Rides and Uber Eats expenses through data engineering processes, using technologies such as Apache Airflow, AWS Redshift, and Power BI.
Created 2021-04-15
247 commits to main branch, last one 2 years ago
Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principles on Adventure Works. Features programmatic model generation, e...
Created 2025-02-01
280 commits to main branch, last one 12 hours ago
Type annotations for specifying, validating, and serializing arrays with arbitrary backends in Pydantic (and beyond)
Created 2024-02-02
244 commits to main branch, last one 20 days ago
117
93
apache-2.0
13
Legend Studio
Created 2020-08-12
3,070 commits to master branch, last one 7 hours ago
4
83
apache-2.0
5
Define, govern, and model event data for warehouse-first product analytics.
Created 2022-04-13
267 commits to main branch, last one about a year ago
GraphQL Blueprint: a software developer tool for engineers that want to quickly generate React/Express, Apollo and GraphQL boilerplate code using a data modeling interface. Watch your queries, mutatio...
Created 2021-05-10
128 commits to main branch, last one 3 years ago
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
Created 2020-05-26
30 commits to main branch, last one 2 years ago