40 results found

1.6k
10.0k
apache-2.0
141
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Created 2016-03-10
6,927 commits to main branch, last one a day ago
51
2.2k
apache-2.0
12
Open-source BI for engineers
Created 2024-02-20
411 commits to main branch, last one 8 days ago
A few projects related to data engineering, including data modeling, infrastructure setup on the cloud, data warehousing, and data lake development.
Created 2020-01-20
80 commits to master branch, last one 4 years ago
The best place to learn data engineering. Built and maintained by the data engineering community.
Created 2021-05-04
272 commits to main branch, last one 2 days ago
95
1.1k
other
20
MetricFlow allows you to define, build, and maintain metrics in code.
Created 2022-04-04
2,598 commits to main branch, last one 6 days ago
Personal Data Engineering Projects
Created 2020-04-20
65 commits to master branch, last one 2 years ago
74
853
apache-2.0
22
Do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies, and build machine learning models.
This repository has been archived
Created 2021-01-24
860 commits to main branch, last one 7 months ago
52
822
mit
6
Data modeling and relation library for testing JavaScript applications.
Created 2020-12-08
297 commits to main branch, last one 2 months ago
156
785
gpl-3.0
55
Structr is an integrated low-code development and runtime environment that uses a graph database.
Created 2011-02-01
15,785 commits to main branch, last one 3 days ago
77
611
bsd-3-clause
20
LLM-based ontological extraction tools, including SPIRES
Created 2023-01-03
1,765 commits to main branch, last one 8 days ago
845
589
mit
22
Python library and web service for Open Source Software Health and Sustainability metrics & data collection. You can find our documentation and new contributor information easily here: https://oss-aug...
Created 2017-01-05
11,747 commits to main branch, last one 13 days ago
A hackable data integration & analysis tool that enables non-technical users to edit data processing jobs and visualise data on demand.
Created 2019-04-26
332 commits to master branch, last one 5 years ago
37
522
apache-2.0
11
Framework that joins data models, schemas, code generation, and a task engine. Language and technology agnostic.
Created 2020-02-06
1,007 commits to _dev branch, last one 3 months ago
:zap: A collection of resources and tutorials to design a better database schema.
Created 2020-07-05
50 commits to master branch, last one 4 months ago
Sample databases for postgres
Created 2016-01-24
27 commits to master branch, last one about a year ago
16
430
gpl-3.0
10
Create diagrams and plan your code with TypeScript.
Created 2023-12-03
127 commits to master branch, last one 2 months ago
Typed struct and value objects
Created 2016-06-30
748 commits to main branch, last one 4 months ago
Repository for the ActivitySchema spec and supporting materials
Created 2021-03-05
22 commits to main branch, last one about a year ago
26
354
apache-2.0
2
Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Created 2021-06-17
1,183 commits to main branch, last one 2 years ago
101
323
other
15
Linked Open Data Modeling Language
Created 2021-03-16
2,607 commits to main branch, last one a day ago
15
121
apache-2.0
7
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Created 2020-04-22
195 commits to main branch, last one 8 months ago
The goal of this project is to track Uber Rides and Uber Eats expenses through data engineering processes, using technologies such as Apache Airflow, AWS Redshift, and Power BI.
Created 2021-04-15
247 commits to main branch, last one 2 years ago
Projects done in the Data Engineer Nanodegree Program by Udacity.com
Created 2020-04-01
60 commits to master branch, last one 4 years ago
114
89
apache-2.0
13
Legend Studio
Created 2020-08-12
2,806 commits to master branch, last one 11 hours ago
4
82
apache-2.0
6
Define, govern, and model event data for warehouse-first product analytics.
Created 2022-04-13
267 commits to main branch, last one 8 months ago
GraphQL Blueprint: a software developer tool for engineers who want to quickly generate React/Express, Apollo, and GraphQL boilerplate code using a data modeling interface. Watch your queries, mutatio...
Created 2021-05-10
128 commits to main branch, last one 3 years ago
6
72
apache-2.0
4
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Created 2023-08-03
1,134 commits to main branch, last one a day ago
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
Created 2020-05-26
30 commits to main branch, last one about a year ago
:book: Data Analysis in Action Using R (work in progress)
Created 2021-10-30
702 commits to main branch, last one 4 months ago
Type annotations for specifying, validating, and serializing arrays with arbitrary backends in Pydantic (and beyond)
Created 2024-02-02
218 commits to main branch, last one about a month ago