11 results found
The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, a...
Created
2018-06-15
691 commits to master branch, last one about a year ago
ETL framework for .NET (parser/writer for CSV, flat, XML, JSON, key-value, Parquet, YAML, and Avro formatted files)
Created
2016-10-19
1,025 commits to master branch, last one about a month ago
High-performance Go package to read and write Parquet files
Created
2023-07-12
548 commits to main branch, last one a day ago
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Created
2018-08-26
489 commits to master branch, last one about a month ago
Query and transform data with PRQL
This repository has been archived
Created
2022-10-11
140 commits to main branch, last one about a year ago
Tools to Transform and Query Data with Apache Drill
Created
2016-06-03
152 commits to master branch, last one 3 years ago
MongoDB integrations for Apache Arrow. Export MongoDB documents to NumPy arrays, Parquet files, and pandas DataFrames in one line of code.
Created
2021-01-19
236 commits to main branch, last one 17 days ago
High-performance data loader for OSM planet dumps. Transforms an OpenStreetMap world/region PBF dump into a lossless PostGIS pgsnapshot OSM schema representation partitioned by H3 regions and/or into ArrowIPC/...
Created
2023-01-29
70 commits to master branch, last one 4 months ago
A converter from OSM PBF files to Parquet files
Created
2016-04-03
34 commits to master branch, last one 4 years ago
A lightweight Java library that facilitates reading and writing Apache Parquet files without Hadoop dependencies
Created
2020-09-30
137 commits to master branch, last one about a month ago
Threat Detection and Visualization
Created
2023-07-28
57 commits to main branch, last one about a year ago