19 results found

723
2.1k
apache-2.0
87
TFX is an end-to-end platform for deploying production ML pipelines
Created 2019-02-04
5,969 commits to master branch, last one about a month ago
Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
Created 2018-02-10
4,679 commits to main branch, last one 9 hours ago
ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, and Bitcoin Cash. Available in Google BigQuery: https://goo.gl/oY5BCQ
Created 2018-09-13
217 commits to master branch, last one about a month ago
43
226
apache-2.0
14
Tools to make weather data accessible and useful.
Created 2021-11-22
423 commits to main branch, last one 9 days ago
Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
Created 2021-02-05
847 commits to master branch, last one 21 days ago
TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSV files containing images or structured data.
This repository has been archived
Created 2020-07-24
128 commits to main branch, last one 3 years ago
97
176
apache-2.0
24
A collection of tools for extracting FHIR resources and analytics services on top of that data.
Created 2020-08-07
687 commits to master branch, last one 2 days ago
32
131
epl-1.0
18
Clojure API for a more dynamic Google Dataflow
Created 2015-04-18
628 commits to master branch, last one 8 days ago
Collection of transforms for the Apache Beam Python SDK.
Created 2018-11-25
144 commits to master branch, last one 3 years ago
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Created 2018-09-17
87 commits to master branch, last one 3 years ago
Asgarde simplifies error handling with Apache Beam Java, using less code that is more concise and expressive.
Created 2021-03-27
49 commits to main branch, last one 9 months ago
Mercari Dataflow Template
Created 2020-12-04
452 commits to master branch, last one 6 months ago
Microservices in Post-Kubernetes Era. A polyglot monorepo
Created 2020-01-25
569 commits to main branch, last one about a year ago
Some class materials for a data processing course using PySpark
Created 2016-12-08
150 commits to master branch, last one 2 years ago
Asgarde simplifies error handling with Apache Beam Python, using less code that is more concise and expressive.
Created 2022-01-03
3 commits to main branch, last one 2 years ago
Efficient streaming data ingestion, transformation & activation
Created 2023-01-05
180 commits to main branch, last one 2 years ago
Libraries for efficient and scalable group-structured dataset pipelines.
Created 2023-05-26
38 commits to main branch, last one 4 months ago