34 results found Sort:
- Filter by Primary Language:
- Python (8)
- Scala (7)
- Jupyter Notebook (6)
- Java (5)
- C# (3)
- TypeScript (1)
- JavaScript (1)
- Rust (1)
- C (1)
- +
A Flexible and Powerful Parameter Server for large-scale machine learning
Created
2017-04-25
2,963 commits to master branch, last one about a year ago
SQL stream processing, analytics, and management. We decouple storage and compute to offer instant failover, dynamic scaling, speedy bootstrapping, and efficient joins.
Created
2022-01-28
10,598 commits to main branch, last one 16 hours ago
酷玩 Spark: Spark 源代码解析、Spark 类库等
Created
2015-12-03
138 commits to master branch, last one 2 years ago
基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Created
2018-04-18
27 commits to master branch, last one 5 years ago
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Created
2019-04-22
372 commits to main branch, last one about a year ago
scala、spark使用过程中,各种测试用例以及相关资料整理
Created
2015-09-24
226 commits to master branch, last one 5 years ago
Wormhole is a SPaaS (Stream Processing as a Service) Platform
Created
2017-09-05
6,922 commits to master branch, last one 3 years ago
C# and F# language binding and extensions to Apache Spark
Created
2015-10-27
1,083 commits to master branch, last one 4 months ago
An open source framework for building data analytic applications.
Created
2014-08-02
52,744 commits to develop branch, last one 16 days ago
Streaming System 相关的论文读物
Created
2017-01-24
3 commits to master branch, last one 7 years ago
Stream computing platform for bigdata
Created
2018-06-19
625 commits to main branch, last one 2 years ago
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...
Created
2019-03-14
942 commits to master branch, last one 21 days ago
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...
Created
2019-07-23
290 commits to master branch, last one 9 hours ago
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Created
2018-04-04
167 commits to master branch, last one about a year ago
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Created
2015-09-04
377 commits to master branch, last one about a year ago
StreamLine - Streaming Analytics
Created
2015-05-14
2,413 commits to master branch, last one 4 years ago
Updated repository
Created
2017-03-15
23 commits to master branch, last one 5 years ago
Kinesis Connector for Structured Streaming
Created
2018-03-01
90 commits to master branch, last one about a year ago
:star2: :sparkles: Analyze and visualize Twitter Sentiment on a world map using Spark MLlib
Created
2016-08-18
14 commits to master branch, last one 7 years ago
Apache Spark 3 - Structured Streaming Course Material
Created
2020-07-21
29 commits to master branch, last one 3 years ago
This project walks through how you can create recommendations using Apache Spark machine learning. There are a number of jupyter notebooks that you can run on IBM Data Science Experience, and there a ...
Created
2017-01-13
245 commits to master branch, last one about a year ago
Code examples on Apache Spark using python
Created
2017-08-09
138 commits to master branch, last one about a year ago
Custom state store providers for Apache Spark
Created
2018-08-13
43 commits to master branch, last one 2 years ago
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We wil...
Created
2022-05-10
45 commits to master branch, last one about a year ago
A data engineering project (Twitter monitor app)
Created
2022-04-14
83 commits to main branch, last one about a year ago
For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR
Created
2021-08-10
88 commits to main branch, last one 2 years ago
Repository used for Spark Trainings
Created
2015-12-28
358 commits to master branch, last one 2 years ago
秒杀,音乐商店项目实战,Redis源码,推荐系统
Created
2019-05-22
583 commits to master branch, last one 2 years ago
Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO
Created
2022-11-12
27 commits to main branch, last one 10 months ago
This repository contains the code base for the Open Stream Processing Benchmark.
Created
2019-07-26
27 commits to master branch, last one 2 years ago