69 results found
A curated list of awesome big data frameworks, resources and other awesomeness.
Created
2014-07-04
577 commits to master branch, last one about a year ago
Open-Source Web UI for Apache Kafka Management
Created
2019-11-26
1,918 commits to master branch, last one 8 months ago
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Created
2015-05-03
8,959 commits to main branch, last one 3 days ago
Fancy stream processing made operationally mundane
Created
2016-03-22
5,800 commits to main branch, last one a day ago
The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.
Created
2019-02-22
41,057 commits to main branch, last one 15 hours ago
🌊 Online machine learning in Python
Created
2019-01-24
3,944 commits to main branch, last one 15 days ago
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the ...
Created
2022-05-24
10,305 commits to main branch, last one a day ago
Lean and mean distributed stream processing system written in Rust and WebAssembly. Alternative to Kafka + Flink in one.
Topics: rust, stateful, data-flow, real-time, streaming, serverless, webassembly, cloud-native, data-analytics, data-pipelines, streaming-data, data-integration, stream-processing, distributed-systems, streaming-analytics, stream-processing-engine, streaming-data-pipelines, event-driven-architecture, streaming-data-processing
Created
2019-08-31
2,396 commits to master branch, last one 2 days ago
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Created
2015-01-02
1,097 commits to develop branch, last one 4 days ago
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
Created
2020-09-21
4,108 commits to master branch, last one 2 days ago
Pravega - Streaming as a new software defined storage primitive
Created
2016-07-11
3,295 commits to master branch, last one 4 months ago
A lightweight stream processing library for Go
Created
2019-04-30
191 commits to master branch, last one about a month ago
Python Stream Processing
Created
2022-02-04
2,554 commits to main branch, last one 18 days ago
Trill is a single-node query processor for temporal or streaming data.
Created
2018-09-26
232 commits to master branch, last one 11 months ago
Real-time stream processing for Python
Created
2017-04-04
811 commits to master branch, last one 29 days ago
Python stream processing for Kafka
Created
2022-11-17
521 commits to main branch, last one a day ago
📐 Pushing the boundaries of simplicity
Created
2017-07-10
1,409 commits to master branch, last one 4 months ago
⚡ Single-pass algorithms for statistics
Created
2015-02-04
2,527 commits to master branch, last one 2 months ago
A machine learning package for streaming data in Python. The other ancestor of River.
Created
2017-11-14
1,088 commits to master branch, last one 4 years ago
HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.
Created
2020-08-31
1,725 commits to main branch, last one 2 months ago
Superdiff provides a complete and readable diff for both arrays and objects. Plus, it supports stream and file inputs for handling large datasets efficiently, is battle-tested, has zero dependencies, ...
Created
2022-12-23
59 commits to master branch, last one about a month ago
Open-Source Web UI for managing Apache Kafka clusters
Created
2024-01-22
2,103 commits to main branch, last one a day ago
Code-Native Data Privacy
Created
2023-08-04
4,559 commits to main branch, last one about a month ago
A list about Apache Kafka
Created
2016-04-29
82 commits to master branch, last one 10 months ago
🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams
Created
2018-10-20
266 commits to master branch, last one about a year ago
Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs
Created
2019-02-19
618 commits to main branch, last one 6 months ago
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
Created
2019-12-31
917 commits to master branch, last one about a month ago
Downloading images from the web is as easy as right clicking them and selecting "Save image as..", right? Well, not anymore xD
Created
2021-05-04
14 commits to main branch, last one 10 months ago
Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
This repository has been archived
Created
2019-11-08
1,204 commits to main branch, last one 4 months ago
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...
Created
2019-03-14
960 commits to master branch, last one 2 months ago