72 results found Sort:

2.6k
13.5k
mit
846
A curated list of awesome big data frameworks, ressources and other awesomeness.
Created 2014-07-04
586 commits to master branch, last one about a month ago
1.3k
10.5k
apache-2.0
70
Open-Source Web UI for Apache Kafka Management
Created 2019-11-26
1,918 commits to master branch, last one 12 months ago
222
9.2k
other
69
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Created 2015-05-03
8,991 commits to main branch, last one 5 days ago
859
8.3k
unknown
119
Fancy stream processing made operationally mundane
Created 2016-03-22
6,269 commits to main branch, last one 7 hours ago
Real-time Data Integration and Transformation: use SQL to transform, deliver, and act on fast-changing data.
Created 2019-02-22
43,001 commits to main branch, last one 2 hours ago
562
5.3k
bsd-3-clause
84
🌊 Online machine learning in Python
Created 2019-01-24
3,949 commits to main branch, last one about a month ago
138
4.9k
other
29
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the ...
Created 2022-05-24
10,600 commits to main branch, last one a day ago
503
4.4k
apache-2.0
46
Lean and mean distributed stream processing system written in rust and web assembly. Alternative to Kafka + Flink in one.
Created 2019-08-31
2,486 commits to master branch, last one a day ago
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Created 2015-01-02
1,105 commits to develop branch, last one 8 days ago
140
2.7k
other
21
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
Created 2020-09-21
4,247 commits to master branch, last one a day ago
165
2.0k
mit
28
A lightweight stream processing library for Go
Created 2019-04-30
202 commits to master branch, last one 14 days ago
408
2.0k
apache-2.0
106
Pravega - Streaming as a new software defined storage primitive
Created 2016-07-11
3,297 commits to master branch, last one about a month ago
77
1.7k
apache-2.0
19
Python Stream Processing
Created 2022-02-04
2,575 commits to main branch, last one 14 days ago
150
1.3k
bsd-3-clause
37
Real-time stream processing for python
Created 2017-04-04
811 commits to master branch, last one 4 months ago
131
1.3k
mit
62
Trill is a single-node query processor for temporal or streaming data.
Created 2018-09-26
232 commits to master branch, last one about a year ago
46
1.0k
other
20
📐 Pushing the boundaries of simplicity
Created 2017-07-10
1,409 commits to master branch, last one 7 months ago
125
962
apache-2.0
13
Open-Source Web UI for managing Apache Kafka clusters
Created 2024-01-22
2,192 commits to main branch, last one 3 days ago
8
913
unknown
3
Superdiff provides a complete and readable diff for both arrays and objects. Plus, it supports stream and file inputs for handling large datasets efficiently, is battle-tested, has zero dependencies, ...
Created 2022-12-23
60 commits to master branch, last one about a month ago
⚡ Single-pass algorithms for statistics
Created 2015-02-04
2,527 commits to master branch, last one 6 months ago
A machine learning package for streaming data in Python. The other ancestor of River.
Created 2017-11-14
1,088 commits to master branch, last one 4 years ago
55
722
bsd-3-clause
23
HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.
Created 2020-08-31
1,726 commits to main branch, last one 3 months ago
15
593
apache-2.0
6
Code-Native Data Privacy
Created 2023-08-04
4,559 commits to main branch, last one 4 months ago
163
580
unknown
30
A list about Apache Kafka
Created 2016-04-29
85 commits to master branch, last one 23 days ago
113
507
mit
18
🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams
Created 2018-10-20
266 commits to master branch, last one about a year ago
41
492
apache-2.0
30
Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs
Created 2019-02-19
618 commits to main branch, last one 9 months ago
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
Created 2019-12-31
921 commits to master branch, last one about a month ago
Downloading images from the web is as easy as right clicking them and selecting "Save image as..", right? Well, not anymore xD
Created 2021-05-04
14 commits to main branch, last one about a year ago
90
319
apache-2.0
13
Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
This repository has been archived (exclude archived)
Created 2019-11-08
1,204 commits to main branch, last one 7 months ago
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...
Created 2019-03-14
961 commits to master branch, last one 2 months ago