11 results found

284
1.8k
apache-2.0
40
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, a...
Created 2018-06-15
691 commits to master branch, last one about a year ago
136
810
mit
49
ETL framework for .NET (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Created 2016-10-19
1,025 commits to master branch, last one about a month ago
51
328
apache-2.0
6
High-performance Go package to read and write Parquet files
Created 2023-07-12
548 commits to main branch, last one a day ago
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Created 2018-08-26
489 commits to master branch, last one about a month ago
7
127
apache-2.0
4
Query and transform data with PRQL
This repository has been archived
Created 2022-10-11
140 commits to main branch, last one about a year ago
13
126
other
14
Tools to Transform and Query Data with 'Apache' 'Drill'
Created 2016-06-03
152 commits to master branch, last one 3 years ago
13
93
apache-2.0
29
MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.
Created 2021-01-19
236 commits to main branch, last one 17 days ago
OSM planet dump high performance data loader. Transform OpenStreetMap World/Region PBF dump into partitioned by H3 regions PostGIS pgsnapshot (lossless) OSM schema representation and/or into ArrowIPC/...
Created 2023-01-29
70 commits to master branch, last one 4 months ago
A converter for the OSM PBFs to Parquet files
Created 2016-04-03
34 commits to master branch, last one 4 years ago
A lightweight Java library that facilitates reading and writing Apache Parquet files without Hadoop dependencies
Created 2020-09-30
137 commits to master branch, last one about a month ago