19 results found
- Filter by Primary Language:
- Python (10)
- Java (5)
- Clojure (1)
- Go (1)
- Kotlin (1)
TFX is an end-to-end platform for deploying production ML pipelines
Created
2019-02-04
5,969 commits to master branch, last one about a month ago
Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
Created
2018-02-10
4,679 commits to main branch, last one 9 hours ago
Yet Another UserAgent Analyzer
Created
2016-07-04
4,702 commits to main branch, last one a day ago
ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
Created
2018-09-13
217 commits to master branch, last one about a month ago
Tools to make weather data accessible and useful.
Created
2021-11-22
423 commits to main branch, last one 9 days ago
Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
Created
2021-02-05
847 commits to master branch, last one 21 days ago
TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSV files containing images or structured data.
This repository has been archived
Created
2020-07-24
128 commits to main branch, last one 3 years ago
A collection of tools for extracting FHIR resources and analytics services on top of that data.
Created
2020-08-07
687 commits to master branch, last one 2 days ago
Clojure API for a more dynamic Google Dataflow
Created
2015-04-18
628 commits to master branch, last one 8 days ago
Collection of transforms for the Apache Beam Python SDK.
Created
2018-11-25
144 commits to master branch, last one 3 years ago
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Created
2018-09-17
87 commits to master branch, last one 3 years ago
Asgarde simplifies error handling with Apache Beam Java, using less code that is more concise and expressive.
Created
2021-03-27
49 commits to main branch, last one 9 months ago
Mercari Dataflow Template
Created
2020-12-04
452 commits to master branch, last one 6 months ago
Microservices in Post-Kubernetes Era. A polyglot monorepo
Created
2020-01-25
569 commits to main branch, last one about a year ago
Some class materials for a data processing course using PySpark
Created
2016-12-08
150 commits to master branch, last one 2 years ago
Blockchain ETL Architecture
Created
2020-04-09
9 commits to master branch, last one 2 years ago
Asgarde simplifies error handling with Apache Beam Python, using less code that is more concise and expressive.
Created
2022-01-03
3 commits to main branch, last one 2 years ago
Efficient streaming data ingestion, transformation & activation
Created
2023-01-05
180 commits to main branch, last one 2 years ago
Libraries for efficient and scalable group-structured dataset pipelines.
Created
2023-05-26
38 commits to main branch, last one 4 months ago