8 results found Sort:

217
9.0k
other
71
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Created 2015-05-03
8,946 commits to main branch, last one a day ago
356
615
gpl-3.0
54
MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection ...
Created 2014-05-02
1,330 commits to master branch, last one 2 months ago
6
194
other
16
c++ LINQ -like library of higher-order functions for data manipulation
Created 2019-06-29
190 commits to master branch, last one 4 years ago
Performant implementations of various streaming algorithms, including Count–min sketch, Top k, HyperLogLog, Reservoir sampling.
Created 2018-09-18
47 commits to master branch, last one 4 years ago
9
76
mit
19
Estimating k-mer coverage histogram of genomics data
Created 2016-05-30
130 commits to master branch, last one 2 years ago
20
75
bsd-3-clause
9
Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, bitwise clustering
Created 2013-08-15
130 commits to master branch, last one 9 years ago
t-digest module for Redis
Created 2016-05-16
28 commits to master branch, last one 3 years ago