11 results found Sort:
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, a...
Created
2018-06-15
691 commits to master branch, last one about a year ago
ETL framework for .NET (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Created
2016-10-19
1,025 commits to master branch, last one 4 days ago
High-performance Go package to read and write Parquet files
Created
2023-07-12
529 commits to main branch, last one 2 days ago
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Created
2018-08-26
489 commits to master branch, last one 21 days ago
Query and transform data with PRQL
This repository has been archived
(exclude archived)
Created
2022-10-11
140 commits to main branch, last one about a year ago
:guardsman: Tools to Transform and Query Data with 'Apache' 'Drill'
Created
2016-06-03
152 commits to master branch, last one 3 years ago
MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.
Created
2021-01-19
230 commits to main branch, last one 13 days ago
A converter for the OSM PBFs to Parquet files
Created
2016-04-03
34 commits to master branch, last one 4 years ago
OSM planet dump high performance data loader. Transform OpenStreetMap World/Region PBF dump into partitioned by H3 regions PostGIS pgsnapshot (lossless) OSM schema representation and/or into ArrowIPC/...
Created
2023-01-29
70 commits to master branch, last one 3 months ago
A lightweight Java library that facilitates reading and writing Apache Parquet files without Hadoop dependencies
Created
2020-09-30
135 commits to master branch, last one 21 days ago
Threat Detection and Visualization
Created
2023-07-28
57 commits to main branch, last one about a year ago