36 results found Sort:

599
7.3k
apache-2.0
80
Streaming database. Unified experience for real-time data ingestion, stream processing, and low-latency serving. Best-in-class performance and cost-efficiency. Supports SQL (Postgres-style) and Python...
Created 2022-01-28
12,179 commits to main branch, last one 18 hours ago
1.6k
6.8k
other
447
A Flexible and Powerful Parameter Server for large-scale machine learning
Created 2017-04-25
2,963 commits to master branch, last one 2 years ago
1.4k
3.5k
unknown
441
酷玩 Spark: Spark 源代码解析、Spark 类库等
Created 2015-12-03
138 commits to master branch, last one 2 years ago
基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Created 2018-04-18
27 commits to master branch, last one 5 years ago
319
2.0k
mit
87
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Created 2019-04-22
380 commits to main branch, last one 14 days ago
434
1.1k
unknown
129
scala、spark使用过程中,各种测试用例以及相关资料整理
Created 2015-09-24
226 commits to master branch, last one 5 years ago
480
979
apache-2.0
103
Wormhole is a SPaaS (Stream Processing as a Service) Platform
Created 2017-09-05
6,922 commits to master branch, last one 4 years ago
212
938
mit
143
C# and F# language binding and extensions to Apache Spark
Created 2015-10-27
1,083 commits to master branch, last one 12 months ago
344
763
other
96
An open source framework for building data analytic applications.
Created 2014-08-02
52,928 commits to develop branch, last one 3 days ago
Streaming System 相关的论文读物
Created 2017-01-24
3 commits to master branch, last one 7 years ago
173
402
apache-2.0
46
Stream computing platform for bigdata
Created 2018-06-19
625 commits to main branch, last one 3 years ago
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...
Created 2019-07-23
300 commits to master branch, last one about a month ago
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...
Created 2019-03-14
961 commits to master branch, last one 18 days ago
81
244
apache-2.0
36
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Created 2018-04-04
171 commits to master branch, last one about a month ago
175
235
apache-2.0
47
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Created 2015-09-04
377 commits to master branch, last one 2 years ago
23
165
other
8
Databricks framework to validate Data Quality of pySpark DataFrames
Created 2024-04-23
64 commits to main branch, last one 3 days ago
96
164
apache-2.0
685
StreamLine - Streaming Analytics
Created 2015-05-14
2,413 commits to master branch, last one 5 years ago
126
157
unknown
18
Updated repository
Created 2017-03-15
23 commits to master branch, last one 6 years ago
:star2: :sparkles: Analyze and visualize Twitter Sentiment on a world map using Spark MLlib
Created 2016-08-18
14 commits to master branch, last one 7 years ago
80
136
apache-2.0
13
Kinesis Connector for Structured Streaming
Created 2018-03-01
90 commits to master branch, last one 2 years ago
Apache Spark 3 - Structured Streaming Course Material
Created 2020-07-21
29 commits to master branch, last one 4 years ago
Code examples on Apache Spark using python
Created 2017-08-09
138 commits to master branch, last one 2 years ago
This project walks through how you can create recommendations using Apache Spark machine learning. There are a number of jupyter notebooks that you can run on IBM Data Science Experience, and there a ...
Created 2017-01-13
245 commits to master branch, last one 2 years ago
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We wil...
Created 2022-05-10
45 commits to master branch, last one 2 years ago
26
92
apache-2.0
8
Custom state store providers for Apache Spark
Created 2018-08-13
43 commits to master branch, last one 2 years ago
A data engineering project (Twitter monitor app)
Created 2022-04-14
83 commits to main branch, last one 2 years ago
For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR
Created 2021-08-10
88 commits to main branch, last one 3 years ago
Benchmarks for data processing systems: Pathway, Spark, Flink, Kafka Streams
Created 2023-04-17
102 commits to main branch, last one 7 months ago
Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO
Created 2022-11-12
27 commits to main branch, last one about a year ago
Repository used for Spark Trainings
Created 2015-12-28
358 commits to master branch, last one 3 years ago