36 results found

1.5k
9.1k
apache-2.0
137
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Created 2016-03-10
6,728 commits to main branch, last one 12 hours ago
34
2.0k
apache-2.0
12
Open-source BI for engineers
Created 2024-02-20
241 commits to main branch, last one 4 days ago
A few projects related to data engineering, including data modeling, cloud infrastructure setup, data warehousing, and data lake development.
Created 2020-01-20
80 commits to master branch, last one 4 years ago
The best place to learn data engineering. Built and maintained by the data engineering community.
Created 2021-05-04
257 commits to main branch, last one 8 days ago
88
1.1k
other
19
MetricFlow allows you to define, build, and maintain metrics in code.
Created 2022-04-04
2,383 commits to main branch, last one a day ago
72
852
apache-2.0
22
Do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies, and build machine learning models.
This repository has been archived
Created 2021-01-24
860 commits to main branch, last one about a month ago
156
769
gpl-3.0
56
Structr is an integrated low-code development and runtime environment that uses a graph database.
Created 2011-02-01
15,451 commits to main branch, last one 2 days ago
50
756
mit
6
Data modeling and relation library for testing JavaScript applications.
Created 2020-12-08
295 commits to main branch, last one 6 months ago
Personal Data Engineering Projects
Created 2020-04-20
65 commits to master branch, last one about a year ago
842
575
mit
23
Python library and web service for Open Source Software Health and Sustainability metrics & data collection. You can find our documentation and new contributor information easily here: https://oss-aug...
Created 2017-01-05
11,161 commits to main branch, last one 9 days ago
A hackable data integration and analysis tool that lets non-technical users edit data processing jobs and visualise data on demand.
Created 2019-04-26
332 commits to master branch, last one 4 years ago
66
531
bsd-3-clause
18
LLM-based ontological extraction tools, including SPIRES
Created 2023-01-03
1,364 commits to main branch, last one a day ago
36
488
apache-2.0
11
Framework that joins data models, schemas, code generation, and a task engine. Language and technology agnostic.
Created 2020-02-06
994 commits to _dev branch, last one 12 days ago
Sample databases for postgres
Created 2016-01-24
27 commits to master branch, last one 8 months ago
Typed struct and value objects
Created 2016-06-30
742 commits to main branch, last one 4 months ago
12
387
gpl-3.0
10
Create diagrams and plan your code with TypeScript.
Created 2023-12-03
122 commits to master branch, last one about a month ago
Repository for the ActivitySchema spec and supporting materials
Created 2021-03-05
22 commits to main branch, last one about a year ago
:zap: A collection of resources and tutorials to design a better database schema.
Created 2020-07-05
49 commits to master branch, last one about a month ago
26
352
apache-2.0
2
Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Created 2021-06-17
1,183 commits to main branch, last one about a year ago
89
291
other
14
Linked Open Data Modeling Language
Created 2021-03-16
2,387 commits to main branch, last one 2 days ago
14
117
apache-2.0
7
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Created 2020-04-22
195 commits to main branch, last one 2 months ago
The goal of this project is to track the expenses of Uber Rides and Uber Eats through data engineering processes, using technologies such as Apache Airflow, AWS Redshift, and Power BI.
Created 2021-04-15
247 commits to main branch, last one about a year ago
111
80
apache-2.0
11
Legend Studio
Created 2020-08-12
2,392 commits to master branch, last one 16 hours ago
3
76
apache-2.0
6
Define, govern, and model event data for warehouse-first product analytics.
Created 2022-04-13
267 commits to main branch, last one 2 months ago
Projects done in the Data Engineer Nanodegree Program by Udacity.com
Created 2020-04-01
60 commits to master branch, last one 4 years ago
GraphQL Blueprint: a software developer tool for engineers that want to quickly generate React/Express, Apollo and GraphQL boilerplate code using a data modeling interface. Watch your queries, mutatio...
Created 2021-05-10
128 commits to main branch, last one 2 years ago
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
Created 2020-05-26
30 commits to main branch, last one about a year ago
:book: Data Analysis in Action Using R (work in progress)
Created 2021-10-30
698 commits to main branch, last one 2 months ago
1
52
apache-2.0
5
Bruin is a data pipeline tool that is designed to be easy-to-use. It allows building data pipelines using SQL and Python, and has built-in data quality checks.
Created 2023-08-03
425 commits to main branch, last one 22 hours ago
Malloy Composer is a simple application to build dashboards or run ad-hoc queries using an existing Malloy model
Created 2022-11-01
461 commits to main branch, last one 2 months ago