8 results found Sort:
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Created
2015-05-03
8,946 commits to main branch, last one a day ago
MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection ...
Created
2014-05-02
1,330 commits to master branch, last one 2 months ago
c++ LINQ -like library of higher-order functions for data manipulation
Created
2019-06-29
190 commits to master branch, last one 4 years ago
Dynatrace hash library for Java
Created
2022-01-26
773 commits to main branch, last one 5 days ago
Performant implementations of various streaming algorithms, including Count–min sketch, Top k, HyperLogLog, Reservoir sampling.
Created
2018-09-18
47 commits to master branch, last one 4 years ago
Estimating k-mer coverage histogram of genomics data
Created
2016-05-30
130 commits to master branch, last one 2 years ago
Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, bitwise clustering
Created
2013-08-15
130 commits to master branch, last one 9 years ago
t-digest module for Redis
Created
2016-05-16
28 commits to master branch, last one 3 years ago