63 results found Sort:
- Filter by Primary Language:
- Python (18)
- Java (10)
- Go (6)
- C# (5)
- C++ (4)
- TypeScript (3)
- Rust (3)
- JavaScript (2)
- Jupyter Notebook (2)
- Scala (2)
- Dockerfile (1)
- Haskell (1)
- Ruby (1)
- Elixir (1)
- C (1)
- Julia (1)
- +
A curated list of awesome big data frameworks, ressources and other awesomeness.
Created
2014-07-04
577 commits to master branch, last one about a year ago
Open-Source Web UI for Apache Kafka Management
Created
2019-11-26
1,918 commits to master branch, last one about a month ago
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Created
2015-05-03
8,836 commits to main branch, last one 19 hours ago
Fancy stream processing made operationally mundane
Created
2016-03-22
4,977 commits to main branch, last one a day ago
The data warehouse for operational workloads.
Created
2019-02-22
35,184 commits to main branch, last one 17 hours ago
🌊 Online machine learning in Python
Created
2019-01-24
3,875 commits to main branch, last one 12 days ago
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the ...
Created
2022-05-24
9,836 commits to main branch, last one 7 days ago
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Created
2015-01-02
1,078 commits to develop branch, last one 23 days ago
Lean and mean distributed stream processing system written in rust and web assembly.
Created
2019-08-31
2,254 commits to master branch, last one 14 hours ago
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
Created
2020-09-21
3,871 commits to master branch, last one 3 days ago
Pravega - Streaming as a new software defined storage primitive
Created
2016-07-11
3,294 commits to master branch, last one 2 months ago
A lightweight stream processing library for Go
Created
2019-04-30
180 commits to master branch, last one 18 days ago
Python Stream Processing
Created
2022-02-04
2,345 commits to main branch, last one a day ago
Trill is a single-node query processor for temporal or streaming data.
Created
2018-09-26
232 commits to master branch, last one 4 months ago
Real-time stream processing for python
Created
2017-04-04
805 commits to master branch, last one about a year ago
📐 Pushing the boundaries of simplicity
Created
2017-07-10
1,406 commits to master branch, last one 9 months ago
100% Python stream processing with Streaming DataFrames
Created
2022-11-17
290 commits to main branch, last one 22 hours ago
⚡ Single-pass algorithms for statistics
Created
2015-02-04
2,525 commits to master branch, last one 10 days ago
A machine learning package for streaming data in Python. The other ancestor of River.
Created
2017-11-14
1,088 commits to master branch, last one 3 years ago
HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.
Created
2020-08-31
1,718 commits to main branch, last one 23 hours ago
A list about Apache Kafka
Created
2016-04-29
82 commits to master branch, last one 3 months ago
Code-Native Data Pipelines
Created
2023-08-04
4,360 commits to main branch, last one a day ago
🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams
Created
2018-10-20
266 commits to master branch, last one 9 months ago
Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs
Created
2019-02-19
617 commits to main branch, last one 6 months ago
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
Created
2019-12-31
894 commits to master branch, last one 4 months ago
Open-Source Web UI for managing Apache Kafka clusters
Created
2024-01-22
2,051 commits to main branch, last one 4 days ago
Downloading images from the web is as easy as right clicking them and selecting "Save image as..", right? Well, not anymore xD
Created
2021-05-04
14 commits to main branch, last one 3 months ago
Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
Created
2019-11-08
1,203 commits to main branch, last one 10 months ago
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...
Created
2019-03-14
942 commits to master branch, last one 21 days ago
Source code for the Kafka Streams in Action Book
Created
2018-08-28
119 commits to master branch, last one 2 years ago