63 results found Sort:

2.5k
12.9k
mit
846
A curated list of awesome big data frameworks, ressources and other awesomeness.
Created 2014-07-04
577 commits to master branch, last one about a year ago
1.1k
8.7k
apache-2.0
68
Open-Source Web UI for Apache Kafka Management
Created 2019-11-26
1,918 commits to master branch, last one about a month ago
202
8.6k
other
68
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Created 2015-05-03
8,836 commits to main branch, last one 19 hours ago
763
7.8k
unknown
109
Fancy stream processing made operationally mundane
Created 2016-03-22
4,977 commits to main branch, last one a day ago
The data warehouse for operational workloads.
Created 2019-02-22
35,184 commits to main branch, last one 17 hours ago
525
4.8k
bsd-3-clause
85
🌊 Online machine learning in Python
Created 2019-01-24
3,875 commits to main branch, last one 12 days ago
110
3.9k
other
21
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the ...
Created 2022-05-24
9,836 commits to main branch, last one 7 days ago
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Created 2015-01-02
1,078 commits to develop branch, last one 23 days ago
197
2.7k
apache-2.0
35
Lean and mean distributed stream processing system written in rust and web assembly.
Created 2019-08-31
2,254 commits to master branch, last one 14 hours ago
98
2.1k
other
20
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
Created 2020-09-21
3,871 commits to master branch, last one 3 days ago
404
2.0k
apache-2.0
107
Pravega - Streaming as a new software defined storage primitive
Created 2016-07-11
3,294 commits to master branch, last one 2 months ago
146
1.8k
mit
26
A lightweight stream processing library for Go
Created 2019-04-30
180 commits to master branch, last one 18 days ago
56
1.3k
apache-2.0
14
Python Stream Processing
Created 2022-02-04
2,345 commits to main branch, last one a day ago
132
1.2k
mit
63
Trill is a single-node query processor for temporal or streaming data.
Created 2018-09-26
232 commits to master branch, last one 4 months ago
144
1.2k
bsd-3-clause
35
Real-time stream processing for python
Created 2017-04-04
805 commits to master branch, last one about a year ago
39
963
other
22
📐 Pushing the boundaries of simplicity
Created 2017-07-10
1,406 commits to master branch, last one 9 months ago
⚡ Single-pass algorithms for statistics
Created 2015-02-04
2,525 commits to master branch, last one 10 days ago
A machine learning package for streaming data in Python. The other ancestor of River.
Created 2017-11-14
1,088 commits to master branch, last one 3 years ago
56
693
bsd-3-clause
23
HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.
Created 2020-08-31
1,718 commits to main branch, last one 23 hours ago
160
566
unknown
31
A list about Apache Kafka
Created 2016-04-29
82 commits to master branch, last one 3 months ago
13
540
apache-2.0
6
Code-Native Data Pipelines
Created 2023-08-04
4,360 commits to main branch, last one a day ago
111
485
mit
20
🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams
Created 2018-10-20
266 commits to master branch, last one 9 months ago
41
473
apache-2.0
31
Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs
Created 2019-02-19
617 commits to main branch, last one 6 months ago
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
Created 2019-12-31
894 commits to master branch, last one 4 months ago
26
355
apache-2.0
8
Open-Source Web UI for managing Apache Kafka clusters
Created 2024-01-22
2,051 commits to main branch, last one 4 days ago
Downloading images from the web is as easy as right clicking them and selecting "Save image as..", right? Well, not anymore xD
Created 2021-05-04
14 commits to main branch, last one 3 months ago
92
323
apache-2.0
16
Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
Created 2019-11-08
1,203 commits to main branch, last one 10 months ago
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...
Created 2019-03-14
942 commits to master branch, last one 21 days ago
Source code for the Kafka Streams in Action Book
Created 2018-08-28
119 commits to master branch, last one 2 years ago