80 results found Sort:

1.6k
2.8k
apache-2.0
107
Apache Avro is a data serialization system.
Created 2009-05-21
4,344 commits to main branch, last one 24 hours ago
57
2.3k
apache-2.0
21
Record Query - A tool for doing record analysis and transformation
Created 2016-02-20
564 commits to master branch, last one about a year ago
Confluent Schema Registry for Kafka
Created 2014-12-09
13,781 commits to master branch, last one 12 hours ago
1.1k
1.9k
apache-2.0
231
Apache Kafka and Confluent Platform examples and demos
Created 2018-04-17
10,209 commits to 7.5.0-post branch, last one 2 days ago
157
1.4k
apache-2.0
21
What's in your data? Extract schema, statistics and entities from datasets
Created 2020-11-09
601 commits to main branch, last one 14 days ago
145
1.3k
mit
30
Avro for JavaScript :zap:
Created 2015-09-12
945 commits to master branch, last one 3 months ago
263
1.0k
other
63
pmacct is a small set of multi-purpose passive network monitoring tools [NetFlow IPFIX sFlow libpcap BGP BMP RPKI IGP Streaming Telemetry].
Created 2015-12-13
5,191 commits to master branch, last one 4 days ago
305
966
apache-2.0
100
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
Created 2013-11-19
2,028 commits to master branch, last one 5 months ago
More than 2000+ Data engineer interview questions.
Created 2021-08-08
16 commits to master branch, last one about a month ago
73
779
apache-2.0
24
Command Line Tool for managing Apache Kafka
Created 2018-12-10
535 commits to main branch, last one 2 months ago
134
753
mit
49
ETL framework for .NET (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Created 2016-10-19
1,017 commits to master branch, last one 21 days ago
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML...
Created 2015-10-27
5,350 commits to master branch, last one 2 days ago
236
715
apache-2.0
26
Avro schema generation and serialization / deserialization for Scala
Created 2015-09-16
1,562 commits to master branch, last one about a month ago
Data Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
Created 2018-12-12
696 commits to master branch, last one about a year ago
59
468
apache-2.0
348
Iceberg is a table format for large, slow-moving tabular data
Created 2017-12-13
278 commits to master branch, last one 5 years ago
75
450
apache-2.0
16
Lightweight message bus interface for .NET (pub/sub and request-response) with transport plugins for popular message brokers.
Created 2016-05-19
606 commits to master branch, last one 2 days ago
Web tool for Avro Schema Registry |
Created 2016-06-12
291 commits to master branch, last one 4 years ago
Flexible, Fast & Compact Serialization with RPC
Created 2020-01-08
224 commits to master branch, last one 2 years ago
81
351
mit
6
A fast Go Avro codec
Created 2019-02-27
250 commits to main branch, last one 4 days ago
55
337
apache-2.0
29
A tool for data sampling, data generation, and data diffing
Created 2016-08-01
698 commits to master branch, last one 8 days ago
238
336
apache-2.0
40
MongoDB Kafka Connector
Created 2019-04-26
539 commits to master branch, last one about a month ago
StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Created 2017-05-07
250 commits to master branch, last one 2 years ago
19
329
apache-2.0
13
Mu (μ) is a purely functional framework for building micro services.
Created 2019-09-27
313 commits to master branch, last one about a year ago
A Gradle plugin to allow easily performing Java code generation for Apache Avro. It supports JSON schema declaration files, JSON protocol declaration files, and Avro IDL files.
This repository has been archived (exclude archived)
Created 2013-10-22
533 commits to master branch, last one 6 months ago
🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka
Created 2019-02-15
768 commits to master branch, last one about a month ago
Uber-project for standard Jackson binary format backends: avro, cbor, ion, protobuf, smile
Created 2016-05-03
1,375 commits to 2.18 branch, last one a day ago
A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Created 2020-02-05
119 commits to master branch, last one 2 months ago
70
225
apache-2.0
8
Golang Client for Schema Registry
Created 2019-07-25
154 commits to master branch, last one 6 days ago
74
223
apache-2.0
18
Avro SerDe for Apache Spark structured APIs.
Created 2018-04-30
439 commits to master branch, last one 2 months ago
80
209
other
27
⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to d...
Created 2017-09-04
6,426 commits to master branch, last one 4 months ago