18 results found
- Filter by Primary Language:
- Python (9)
- Java (5)
- Clojure (1)
- Go (1)
- Kotlin (1)
TFX is an end-to-end platform for deploying production ML pipelines
Created
2019-02-04
5,965 commits to master branch, last one 9 days ago
Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
Created
2018-02-10
4,515 commits to main branch, last one 19 hours ago
Yet Another UserAgent Analyzer
Created
2016-07-04
4,539 commits to main branch, last one a day ago
ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
Created
2018-09-13
215 commits to master branch, last one 2 years ago
Tools to make weather data accessible and useful.
Created
2021-11-22
414 commits to main branch, last one 18 days ago
Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
Created
2021-02-05
818 commits to master branch, last one 19 hours ago
TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSV files containing images or structured data.
This repository has been archived
Created
2020-07-24
128 commits to main branch, last one 3 years ago
A collection of tools for extracting FHIR resources and analytics services on top of that data.
Created
2020-08-07
624 commits to master branch, last one 19 days ago
Clojure API for a more dynamic Google Dataflow
Created
2015-04-18
614 commits to master branch, last one 14 days ago
Collection of transforms for the Apache Beam Python SDK.
Created
2018-11-25
144 commits to master branch, last one 3 years ago
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Created
2018-09-17
87 commits to master branch, last one 3 years ago
Asgarde simplifies error handling with Apache Beam Java, with less code that is more concise and expressive.
Created
2021-03-27
49 commits to main branch, last one 5 months ago
Mercari Dataflow Template
Created
2020-12-04
452 commits to master branch, last one 2 months ago
Microservices in Post-Kubernetes Era. A polyglot monorepo
Created
2020-01-25
569 commits to main branch, last one about a year ago
Some class materials for a data processing course using PySpark
Created
2016-12-08
150 commits to master branch, last one 2 years ago
Blockchain ETL Architecture
Created
2020-04-09
9 commits to master branch, last one 2 years ago
Asgarde simplifies error handling with Apache Beam Python, with less code that is more concise and expressive.
Created
2022-01-03
3 commits to main branch, last one 2 years ago
Efficient streaming data ingestion, transformation & activation
Created
2023-01-05
180 commits to main branch, last one about a year ago