42 results found

1.6k
10.1k
apache-2.0
141
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Created 2016-03-10
6,961 commits to main branch, last one 21 hours ago
51
2.2k
apache-2.0
13
Open-source BI for engineers
Created 2024-02-20
415 commits to main branch, last one 22 days ago
A few projects related to data engineering, including data modeling, infrastructure setup on the cloud, data warehousing, and data lake development.
Created 2020-01-20
80 commits to master branch, last one 4 years ago
The best place to learn data engineering. Built and maintained by the data engineering community.
Created 2021-05-04
272 commits to main branch, last one about a month ago
96
1.2k
other
20
MetricFlow allows you to define, build, and maintain metrics in code.
Created 2022-04-04
2,629 commits to main branch, last one 15 hours ago
Personal Data Engineering Projects
Created 2020-04-20
65 commits to master branch, last one 2 years ago
75
854
apache-2.0
22
do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
This repository has been archived
Created 2021-01-24
860 commits to main branch, last one 8 months ago
52
840
mit
6
Data modeling and relation library for testing JavaScript applications.
Created 2020-12-08
297 commits to main branch, last one 3 months ago
156
786
gpl-3.0
54
Structr is an integrated low-code development and runtime environment that uses a graph database.
Created 2011-02-01
15,828 commits to main branch, last one a day ago
83
628
bsd-3-clause
19
LLM-based ontological extraction tools, including SPIRES
Created 2023-01-03
1,905 commits to main branch, last one 23 hours ago
19
616
apache-2.0
6
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Created 2023-08-03
1,687 commits to main branch, last one 18 hours ago
847
594
mit
22
Python library and web service for Open Source Software Health and Sustainability metrics & data collection. You can find our documentation and new contributor information easily here: https://oss-aug...
Created 2017-01-05
11,749 commits to main branch, last one 16 days ago
A hackable data integration & analysis tool to enable non-technical users to edit data processing jobs and visualise data on demand.
Created 2019-04-26
332 commits to master branch, last one 5 years ago
37
526
apache-2.0
11
Framework that joins data models, schemas, code generation, and a task engine. Language and technology agnostic.
Created 2020-02-06
1,009 commits to _dev branch, last one 3 days ago
:zap: A collection of resources and tutorials to design a better database schema.
Created 2020-07-05
50 commits to master branch, last one 5 months ago
Sample databases for postgres
Created 2016-01-24
27 commits to master branch, last one about a year ago
16
434
gpl-3.0
10
Create diagrams and plan your code with TypeScript.
Created 2023-12-03
127 commits to master branch, last one 3 months ago
Typed struct and value objects
Created 2016-06-30
748 commits to main branch, last one 5 months ago
Repository for the ActivitySchema spec and supporting materials
Created 2021-03-05
22 commits to main branch, last one 2 years ago
26
355
apache-2.0
2
Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Created 2021-06-17
1,183 commits to main branch, last one 2 years ago
104
332
other
15
Linked Open Data Modeling Language
Created 2021-03-16
2,654 commits to main branch, last one 17 hours ago
15
121
apache-2.0
7
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Created 2020-04-22
195 commits to main branch, last one 9 months ago
The goal of this project is to track Uber Rides and Uber Eats expenses through data engineering processes, using technologies such as Apache Airflow, AWS Redshift, and Power BI.
Created 2021-04-15
247 commits to main branch, last one 2 years ago
Projects done in the Data Engineer Nanodegree Program by Udacity.com
Created 2020-04-01
60 commits to master branch, last one 4 years ago
113
91
apache-2.0
13
Legend Studio
Created 2020-08-12
2,886 commits to master branch, last one 18 hours ago
4
82
apache-2.0
6
Define, govern, and model event data for warehouse-first product analytics.
Created 2022-04-13
267 commits to main branch, last one 9 months ago
Type annotations for specifying, validating, and serializing arrays with arbitrary backends in Pydantic (and beyond)
Created 2024-02-02
234 commits to main branch, last one 7 days ago
GraphQL Blueprint: a software developer tool for engineers that want to quickly generate React/Express, Apollo and GraphQL boilerplate code using a data modeling interface. Watch your queries, mutatio...
Created 2021-05-10
128 commits to main branch, last one 3 years ago
:book: Data Analysis in Action Using R (work in progress)
Created 2021-10-30
702 commits to main branch, last one 5 months ago
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
Created 2020-05-26
30 commits to main branch, last one about a year ago