69 results found Sort:

2.6k
13.3k
mit
845
A curated list of awesome big data frameworks, ressources and other awesomeness.
Created 2014-07-04
577 commits to master branch, last one about a year ago
1.2k
10.0k
apache-2.0
71
Open-Source Web UI for Apache Kafka Management
Created 2019-11-26
1,918 commits to master branch, last one 8 months ago
219
9.1k
other
71
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Created 2015-05-03
8,959 commits to main branch, last one 3 days ago
846
8.2k
unknown
120
Fancy stream processing made operationally mundane
Created 2016-03-22
5,800 commits to main branch, last one a day ago
The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.
Created 2019-02-22
41,057 commits to main branch, last one 15 hours ago
554
5.1k
bsd-3-clause
85
🌊 Online machine learning in Python
Created 2019-01-24
3,944 commits to main branch, last one 15 days ago
130
4.6k
other
25
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the ...
Created 2022-05-24
10,305 commits to main branch, last one a day ago
487
3.9k
apache-2.0
44
Lean and mean distributed stream processing system written in rust and web assembly. Alternative to Kafka + Flink in one.
Created 2019-08-31
2,396 commits to master branch, last one 2 days ago
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Created 2015-01-02
1,097 commits to develop branch, last one 4 days ago
126
2.5k
other
22
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
Created 2020-09-21
4,108 commits to master branch, last one 2 days ago
408
2.0k
apache-2.0
106
Pravega - Streaming as a new software defined storage primitive
Created 2016-07-11
3,295 commits to master branch, last one 4 months ago
160
1.9k
mit
29
A lightweight stream processing library for Go
Created 2019-04-30
191 commits to master branch, last one about a month ago
64
1.6k
apache-2.0
19
Python Stream Processing
Created 2022-02-04
2,554 commits to main branch, last one 18 days ago
131
1.2k
mit
63
Trill is a single-node query processor for temporal or streaming data.
Created 2018-09-26
232 commits to master branch, last one 11 months ago
148
1.2k
bsd-3-clause
35
Real-time stream processing for python
Created 2017-04-04
811 commits to master branch, last one 29 days ago
41
996
other
21
📐 Pushing the boundaries of simplicity
Created 2017-07-10
1,409 commits to master branch, last one 4 months ago
⚡ Single-pass algorithms for statistics
Created 2015-02-04
2,527 commits to master branch, last one 2 months ago
A machine learning package for streaming data in Python. The other ancestor of River.
Created 2017-11-14
1,088 commits to master branch, last one 4 years ago
55
713
bsd-3-clause
25
HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.
Created 2020-08-31
1,725 commits to main branch, last one 2 months ago
5
701
unknown
3
Superdiff provides a complete and readable diff for both arrays and objects. Plus, it supports stream and file inputs for handling large datasets efficiently, is battle-tested, has zero dependencies, ...
Created 2022-12-23
59 commits to master branch, last one about a month ago
88
664
apache-2.0
11
Open-Source Web UI for managing Apache Kafka clusters
Created 2024-01-22
2,103 commits to main branch, last one a day ago
16
584
apache-2.0
6
Code-Native Data Privacy
Created 2023-08-04
4,559 commits to main branch, last one about a month ago
163
579
unknown
31
A list about Apache Kafka
Created 2016-04-29
82 commits to master branch, last one 10 months ago
112
501
mit
19
🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams
Created 2018-10-20
266 commits to master branch, last one about a year ago
39
490
apache-2.0
31
Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs
Created 2019-02-19
618 commits to main branch, last one 6 months ago
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
Created 2019-12-31
917 commits to master branch, last one about a month ago
Downloading images from the web is as easy as right clicking them and selecting "Save image as..", right? Well, not anymore xD
Created 2021-05-04
14 commits to main branch, last one 10 months ago
90
321
apache-2.0
15
Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
This repository has been archived (exclude archived)
Created 2019-11-08
1,204 commits to main branch, last one 4 months ago
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...
Created 2019-03-14
960 commits to master branch, last one 2 months ago