85 results found Sort:
- Filter by Primary Language:
- Java (23)
- Scala (14)
- Go (8)
- Python (7)
- C# (5)
- Rust (4)
- Ruby (4)
- JavaScript (4)
- TypeScript (3)
- Elixir (3)
- Kotlin (3)
- PHP (2)
- Haskell (1)
- C++ (1)
- Shell (1)
- C (1)
- +
Apache Avro is a data serialization system.
Created
2009-05-21
4,614 commits to main branch, last one a day ago
Record Query - A tool for doing record analysis and transformation
Created
2016-02-20
564 commits to master branch, last one about a year ago
Confluent Schema Registry for Kafka
Created
2014-12-09
15,839 commits to master branch, last one a day ago
Apache Kafka, Apache Flink and Confluent Platform examples and demos
Created
2018-04-17
11,329 commits to 7.8.0-post branch, last one 2 months ago
What's in your data? Extract schema, statistics and entities from datasets
Created
2020-11-09
602 commits to main branch, last one 18 days ago
Avro for JavaScript :zap:
Created
2015-09-12
958 commits to master branch, last one 22 days ago
More than 2000+ Data engineer interview questions.
Created
2021-08-08
19 commits to master branch, last one 5 days ago
pmacct is a small set of multi-purpose passive network monitoring tools [NetFlow IPFIX sFlow libpcap BGP BMP RPKI IGP Streaming Telemetry].
Created
2015-12-13
5,336 commits to master branch, last one 10 days ago
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
Created
2013-11-19
2,034 commits to master branch, last one about a month ago
Goavro is a library that encodes and decodes Avro data.
Created
2015-02-23
378 commits to master branch, last one 11 days ago
Command Line Tool for managing Apache Kafka
Created
2018-12-10
588 commits to main branch, last one about a month ago
ETL framework for .NET (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Created
2016-10-19
1,025 commits to master branch, last one 2 months ago
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML...
Created
2015-10-27
5,410 commits to master branch, last one 2 months ago
Avro schema generation and serialization / deserialization for Scala
Created
2015-09-16
1,576 commits to master branch, last one 2 months ago
Data Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
Created
2018-12-12
696 commits to master branch, last one about a year ago
Lightweight message bus interface for .NET (pub/sub and request-response) with transport plugins for popular message brokers.
Created
2016-05-19
681 commits to master branch, last one a day ago
Iceberg is a table format for large, slow-moving tabular data
Created
2017-12-13
278 commits to master branch, last one 6 years ago
Web tool for Avro Schema Registry |
Created
2016-06-12
291 commits to master branch, last one 5 years ago
A fast Go Avro codec
Created
2019-02-27
291 commits to main branch, last one a day ago
Flexible, Fast & Compact Serialization with RPC
Created
2020-01-08
224 commits to master branch, last one 2 years ago
MongoDB Kafka Connector
Created
2019-04-26
559 commits to master branch, last one 14 days ago
StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Created
2017-05-07
250 commits to master branch, last one 3 years ago
A tool for data sampling, data generation, and data diffing
Created
2016-08-01
735 commits to master branch, last one 28 days ago
Mu (μ) is a purely functional framework for building micro services.
This repository has been archived
(exclude archived)
Created
2019-09-27
313 commits to master branch, last one 2 years ago
🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka
Created
2019-02-15
792 commits to master branch, last one 7 days ago
A Gradle plugin to allow easily performing Java code generation for Apache Avro. It supports JSON schema declaration files, JSON protocol declaration files, and Avro IDL files.
This repository has been archived
(exclude archived)
Created
2013-10-22
533 commits to master branch, last one about a year ago
Uber-project for standard Jackson binary format backends: avro, cbor, ion, protobuf, smile
Created
2016-05-03
1,471 commits to 2.19 branch, last one 20 hours ago
A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Created
2020-02-05
121 commits to master branch, last one a day ago
Replicate data from MySQL, Postgres and MongoDB to ClickHouse®
Created
2022-03-21
2,731 commits to develop branch, last one 8 days ago