72 results found Sort:
- Filter by Primary Language:
- Python (19)
- Java (10)
- Go (6)
- TypeScript (5)
- C# (5)
- C++ (4)
- Rust (4)
- JavaScript (4)
- Jupyter Notebook (2)
- Scala (2)
- Vue (1)
- Dockerfile (1)
- Elixir (1)
- Haskell (1)
- Julia (1)
- Ruby (1)
- C (1)
- +
A curated list of awesome big data frameworks, ressources and other awesomeness.
Created
2014-07-04
586 commits to master branch, last one about a month ago
Open-Source Web UI for Apache Kafka Management
Created
2019-11-26
1,918 commits to master branch, last one 12 months ago
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Created
2015-05-03
8,991 commits to main branch, last one 5 days ago
Fancy stream processing made operationally mundane
Created
2016-03-22
6,269 commits to main branch, last one 7 hours ago
Real-time Data Integration and Transformation: use SQL to transform, deliver, and act on fast-changing data.
Created
2019-02-22
43,001 commits to main branch, last one 2 hours ago
🌊 Online machine learning in Python
Created
2019-01-24
3,949 commits to main branch, last one about a month ago
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the ...
Created
2022-05-24
10,600 commits to main branch, last one a day ago
Lean and mean distributed stream processing system written in rust and web assembly. Alternative to Kafka + Flink in one.
rust
stateful
data-flow
real-time
streaming
serverless
webassembly
cloud-native
data-analytics
data-pipelines
streaming-data
data-integration
stream-processing
distributed-systems
streaming-analytics
stream-processing-engine
streaming-data-pipelines
event-driven-architecture
streaming-data-processing
Created
2019-08-31
2,486 commits to master branch, last one a day ago
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Created
2015-01-02
1,105 commits to develop branch, last one 8 days ago
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
Created
2020-09-21
4,247 commits to master branch, last one a day ago
A lightweight stream processing library for Go
Created
2019-04-30
202 commits to master branch, last one 14 days ago
Pravega - Streaming as a new software defined storage primitive
Created
2016-07-11
3,297 commits to master branch, last one about a month ago
Python Stream Processing
Created
2022-02-04
2,575 commits to main branch, last one 14 days ago
Python Streaming DataFrames for Kafka
Created
2022-11-17
625 commits to main branch, last one 23 hours ago
Real-time stream processing for python
Created
2017-04-04
811 commits to master branch, last one 4 months ago
Trill is a single-node query processor for temporal or streaming data.
Created
2018-09-26
232 commits to master branch, last one about a year ago
📐 Pushing the boundaries of simplicity
Created
2017-07-10
1,409 commits to master branch, last one 7 months ago
Open-Source Web UI for managing Apache Kafka clusters
Created
2024-01-22
2,192 commits to main branch, last one 3 days ago
Superdiff provides a complete and readable diff for both arrays and objects. Plus, it supports stream and file inputs for handling large datasets efficiently, is battle-tested, has zero dependencies, ...
Created
2022-12-23
60 commits to master branch, last one about a month ago
⚡ Single-pass algorithms for statistics
Created
2015-02-04
2,527 commits to master branch, last one 6 months ago
A machine learning package for streaming data in Python. The other ancestor of River.
Created
2017-11-14
1,088 commits to master branch, last one 4 years ago
HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.
Created
2020-08-31
1,726 commits to main branch, last one 3 months ago
Code-Native Data Privacy
Created
2023-08-04
4,559 commits to main branch, last one 4 months ago
A list about Apache Kafka
Created
2016-04-29
85 commits to master branch, last one 23 days ago
🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams
Created
2018-10-20
266 commits to master branch, last one about a year ago
Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs
Created
2019-02-19
618 commits to main branch, last one 9 months ago
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
Created
2019-12-31
921 commits to master branch, last one about a month ago
Downloading images from the web is as easy as right clicking them and selecting "Save image as..", right? Well, not anymore xD
Created
2021-05-04
14 commits to main branch, last one about a year ago
Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
This repository has been archived
(exclude archived)
Created
2019-11-08
1,204 commits to main branch, last one 7 months ago
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...
Created
2019-03-14
961 commits to master branch, last one 2 months ago