68 results found

2.6k
13.3k
mit
847
A curated list of awesome big data frameworks, resources and other awesomeness.
Created 2014-07-04
577 commits to master branch, last one about a year ago
1.2k
9.8k
apache-2.0
68
Open-Source Web UI for Apache Kafka Management
Created 2019-11-26
1,918 commits to master branch, last one 7 months ago
217
9.0k
other
71
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Created 2015-05-03
8,946 commits to main branch, last one a day ago
840
8.1k
unknown
120
Fancy stream processing made operationally mundane
Created 2016-03-22
5,651 commits to main branch, last one 23 hours ago
The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.
Created 2019-02-22
40,526 commits to main branch, last one 18 hours ago
552
5.1k
bsd-3-clause
83
🌊 Online machine learning in Python
Created 2019-01-24
3,917 commits to main branch, last one 2 days ago
125
4.5k
other
25
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the ...
Created 2022-05-24
10,246 commits to main branch, last one 16 hours ago
489
3.9k
apache-2.0
42
Lean and mean distributed stream processing system written in Rust and WebAssembly. Alternative to Kafka + Flink in one.
Created 2019-08-31
2,371 commits to master branch, last one a day ago
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Created 2015-01-02
1,091 commits to develop branch, last one 21 days ago
121
2.4k
other
21
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
Created 2020-09-21
4,081 commits to master branch, last one a day ago
407
2.0k
apache-2.0
106
Pravega - Streaming as a new software defined storage primitive
Created 2016-07-11
3,295 commits to master branch, last one 3 months ago
157
1.9k
mit
28
A lightweight stream processing library for Go
Created 2019-04-30
191 commits to master branch, last one 11 days ago
64
1.6k
apache-2.0
18
Python Stream Processing
Created 2022-02-04
2,548 commits to main branch, last one 5 days ago
132
1.2k
mit
63
Trill is a single-node query processor for temporal or streaming data.
Created 2018-09-26
232 commits to master branch, last one 10 months ago
148
1.2k
bsd-3-clause
35
Real-time stream processing for Python
Created 2017-04-04
805 commits to master branch, last one about a year ago
41
993
other
21
📐 Pushing the boundaries of simplicity
Created 2017-07-10
1,409 commits to master branch, last one 3 months ago
⚡ Single-pass algorithms for statistics
Created 2015-02-04
2,527 commits to master branch, last one about a month ago
A machine learning package for streaming data in Python. The other ancestor of River.
Created 2017-11-14
1,088 commits to master branch, last one 4 years ago
55
708
bsd-3-clause
25
HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.
Created 2020-08-31
1,725 commits to main branch, last one about a month ago
5
697
unknown
3
Superdiff provides a complete and readable diff for both arrays and objects. Plus, it supports stream and file inputs for handling large datasets efficiently, is battle-tested, has zero dependencies, ...
Created 2022-12-23
59 commits to master branch, last one 17 days ago
80
622
apache-2.0
8
Open-Source Web UI for managing Apache Kafka clusters
Created 2024-01-22
2,087 commits to main branch, last one 3 days ago
16
584
apache-2.0
6
Code-Native Data Privacy
Created 2023-08-04
4,559 commits to main branch, last one 5 days ago
163
579
unknown
31
A list about Apache Kafka
Created 2016-04-29
82 commits to master branch, last one 9 months ago
112
497
mit
19
🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams
Created 2018-10-20
266 commits to master branch, last one about a year ago
39
489
apache-2.0
30
Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs
Created 2019-02-19
618 commits to main branch, last one 5 months ago
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
Created 2019-12-31
917 commits to master branch, last one 23 days ago
Downloading images from the web is as easy as right-clicking them and selecting "Save image as...", right? Well, not anymore xD
Created 2021-05-04
14 commits to main branch, last one 9 months ago
90
321
apache-2.0
15
Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
Created 2019-11-08
1,204 commits to main branch, last one 3 months ago
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy-to-use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...
Created 2019-03-14
960 commits to master branch, last one about a month ago