11 results found

284
1.8k
apache-2.0
40
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, a...
Created 2018-06-15
691 commits to master branch, last one about a year ago
135
804
mit
49
ETL framework for .NET (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Created 2016-10-19
1,025 commits to master branch, last one 4 days ago
49
296
apache-2.0
6
High-performance Go package to read and write Parquet files
Created 2023-07-12
529 commits to main branch, last one 2 days ago
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Created 2018-08-26
489 commits to master branch, last one 21 days ago
7
126
apache-2.0
4
Query and transform data with PRQL
This repository has been archived
Created 2022-10-11
140 commits to main branch, last one about a year ago
13
126
other
14
💂 Tools to Transform and Query Data with 'Apache' 'Drill'
Created 2016-06-03
152 commits to master branch, last one 3 years ago
14
93
apache-2.0
29
MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.
Created 2021-01-19
230 commits to main branch, last one 13 days ago
A converter for the OSM PBFs to Parquet files
Created 2016-04-03
34 commits to master branch, last one 4 years ago
OSM planet dump high performance data loader. Transform OpenStreetMap World/Region PBF dump into partitioned by H3 regions PostGIS pgsnapshot (lossless) OSM schema representation and/or into ArrowIPC/...
Created 2023-01-29
70 commits to master branch, last one 3 months ago
A lightweight Java library that facilitates reading and writing Apache Parquet files without Hadoop dependencies
Created 2020-09-30
135 commits to master branch, last one 21 days ago