79 results found
Filter by primary language:
- Python (19)
- Rust (13)
- Java (8)
- Scala (8)
- Go (7)
- Jupyter Notebook (4)
- C# (3)
- JavaScript (3)
- TypeScript (2)
- C++ (2)
- PLpgSQL (1)
- PHP (1)
- R (1)
- Kotlin (1)
- Julia (1)
- Shell (1)
- Svelte (1)
- Thrift (1)
- Common Lisp (1)
Command-line tool for running SQL queries against JSON, CSV, Excel, Parquet, and more.
Created
2022-01-10
102 commits to main branch, last one 8 months ago
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
Created
2020-12-11
258 commits to main branch, last one 17 days ago
Apache Parquet
Created
2014-06-10
2,639 commits to master branch, last one 18 hours ago
CSVs sliced, diced & analyzed.
Created
2020-12-11
9,201 commits to master branch, last one 7 hours ago
Apache Drill is a distributed MPP query layer for self-describing data
Created
2012-09-05
4,502 commits to master branch, last one a day ago
The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, a...
Created
2018-06-15
691 commits to master branch, last one 6 months ago
A large-scale entity and relation database supporting aggregation of properties
Created
2015-12-14
7,213 commits to develop branch, last one a day ago
Apache Parquet
Created
2014-06-10
366 commits to master branch, last one 13 hours ago
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
Created
2021-12-09
3,215 commits to main branch, last one a day ago
Quilt is a data mesh for connecting people with actionable data
Created
2017-02-10
4,677 commits to master branch, last one 11 hours ago
cryo is the easiest way to extract blockchain data to parquet, csv, json, or python dataframes
Created
2023-06-27
417 commits to main branch, last one 21 days ago
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
Created
2013-11-19
2,028 commits to master branch, last one 5 months ago
ETL framework for .NET (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Created
2016-10-19
1,017 commits to master branch, last one 6 days ago
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML...
Created
2015-10-27
5,335 commits to master branch, last one 9 days ago
Simple Windows desktop application for viewing & querying Apache Parquet files
Created
2018-05-31
338 commits to master branch, last one 5 days ago
Graph Data Science: an abstraction layer in Python for building knowledge graphs, integrated with popular graph libraries – atop Pandas, NetworkX, RAPIDS, RDFlib, pySHACL, PyVis, morph-kgc, pslpython,...
Created
2020-10-25
724 commits to main branch, last one 22 days ago
Fast data store for Pandas time-series data
Created
2018-05-26
203 commits to main branch, last one 2 years ago
Data Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
Created
2018-12-12
696 commits to master branch, last one about a year ago
Rust-based WebAssembly bindings to read and write Apache Parquet data
Created
2022-02-27
290 commits to main branch, last one 24 days ago
A Python library for fast, interactive geospatial vector data visualization in Jupyter.
Created
2023-08-31
306 commits to main branch, last one a day ago
Iceberg is a table format for large, slow-moving tabular data
Created
2017-12-13
278 commits to master branch, last one 5 years ago
Apache Parquet
This repository has been archived
Created
2014-06-10
503 commits to master branch, last one about a month ago
A tool for batch loading data files (json, parquet, csv, tsv) into ElasticSearch
Created
2016-09-17
166 commits to master branch, last one 2 years ago
Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow
Created
2021-03-27
272 commits to main branch, last one 9 months ago
Fully asynchronous, pure JavaScript implementation of the Parquet file format
Created
2017-04-30
292 commits to master branch, last one 24 days ago
A tool for data sampling, data generation, and data diffing
Created
2016-08-01
696 commits to master branch, last one 6 days ago
Go library to read/write Parquet files
This repository has been archived
Created
2020-10-02
308 commits to main branch, last one 11 months ago
Kotlin Bigdata Toolkit
Created
2013-10-16
432 commits to master branch, last one about a month ago
A cross-platform (Windows, macOS, Linux) desktop application for viewing common big-data binary formats such as Parquet, ORC, and Avro. Supports the local file system, HDFS, AWS S3, Azure Blob Storage, etc.
Created
2020-02-05
119 commits to master branch, last one 2 months ago
Go package to read and write parquet files. parquet is a file format to store nested data structures in a flat columnar data format. It can be used in the Hadoop ecosystem and with tools such as Prest...
Created
2020-04-21
523 commits to master branch, last one about a year ago