37 results found Sort:

5.4k
31.8k
apache-2.0
472
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Created 2016-10-25
21,944 commits to master branch, last one a day ago
2.2k
6.1k
apache-2.0
240
A flexible, high-performance serving system for machine learning models
Created 2016-01-26
8,577 commits to master branch, last one 5 days ago
579
5.4k
apache-2.0
159
AI + Data, online. https://vespa.ai
Created 2016-06-03
83,820 commits to master branch, last one 5 hours ago
827
4.3k
other
83
An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
Created 2017-12-20
7,327 commits to master branch, last one 17 days ago
In this repository, I will share some useful notes and references about deploying deep learning-based models in production.
Created 2018-05-03
219 commits to master branch, last one about a month ago
819
4.0k
apache-2.0
58
Serve, optimize and scale PyTorch models in production
Created 2019-10-03
3,792 commits to master branch, last one 21 hours ago
438
2.8k
apache-2.0
56
⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end ...
Created 2022-06-27
2,422 commits to develop branch, last one 2 months ago
261
2.6k
apache-2.0
27
Database system for AI-powered apps
Created 2018-09-10
2,415 commits to staging branch, last one 6 months ago
TensorFlow template application for deep learning
Created 2016-07-18
205 commits to master branch, last one 3 years ago
292
1.6k
apache-2.0
66
DELTA is a deep learning based natural language and speech processing platform.
Created 2019-05-29
932 commits to master branch, last one 3 years ago
201
1.6k
cc-by-4.0
16
A comprehensive guide to building RAG-based LLM applications for production.
Created 2023-08-16
85 commits to main branch, last one about a month ago
85
1.2k
apache-2.0
20
RayLLM - LLMs on Ray
This repository has been archived (exclude archived)
Created 2023-05-31
198 commits to master branch, last one 22 days ago
245
884
apache-2.0
97
A flexible, high-performance carrier for machine learning models(『飞桨』服务化部署框架)
Created 2019-03-31
8,762 commits to v0.9.0 branch, last one 4 months ago
Generic and easy-to-use serving service for machine learning models
Created 2018-01-23
269 commits to master branch, last one 3 years ago
167
745
apache-2.0
72
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ...
Created 2021-10-14
1,061 commits to develop branch, last one a day ago
197
641
apache-2.0
30
A scalable inference server for models optimized with OpenVINO™
Created 2018-09-26
2,368 commits to main branch, last one a day ago
116
630
apache-2.0
70
A unified end-to-end machine intelligence platform
Created 2022-03-24
910 commits to main branch, last one 2 months ago
85
559
apache-2.0
41
Python + Inference - Model Deployment library in Python. Simplest model inference server ever.
Created 2022-04-04
55 commits to main branch, last one about a year ago
213
453
agpl-3.0
37
Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon
Created 2016-05-15
449 commits to master branch, last one 8 days ago
ML pipeline orchestration and model deployments on Kubernetes.
This repository has been archived (exclude archived)
Created 2020-11-17
946 commits to master branch, last one about a year ago
부스트캠프 AI Tech - Product Serving 자료
Created 2021-08-29
150 commits to main branch, last one 4 months ago
23
308
apache-2.0
15
A high-performance inference system for large language models, designed for production environments.
Created 2023-07-24
536 commits to main branch, last one 13 hours ago
MLOps Platform
Created 2017-05-22
1,275 commits to master branch, last one 2 years ago
33
188
apache-2.0
18
MLModelCI is a complete MLOps platform for managing, converting, profiling, and deploying MLaaS (Machine Learning-as-a-Service), bridging the gap between current ML training and serving systems.
Created 2020-04-22
843 commits to master branch, last one 3 years ago
58
179
apache-2.0
12
A universal scalable machine learning model deployment solution
Created 2021-08-16
1,769 commits to master branch, last one 18 hours ago
27
153
apache-2.0
16
bring keras-models to production with tensorflow-serving and nodejs + docker :pizza:
Created 2017-05-19
20 commits to master branch, last one 5 years ago
12
135
apache-2.0
6
Boosting DL Service Throughput 1.5-4x by Ensemble Pipeline Serving with Concurrent CUDA Streams for PyTorch/LibTorch Frontend and TensorRT/CVCUDA, etc., Backends
Created 2023-10-24
147 commits to main branch, last one 15 days ago
39
131
apache-2.0
11
ClearML - Model-Serving Orchestration and Repository Solution
Created 2021-04-12
138 commits to main branch, last one 3 months ago
12
119
apache-2.0
15
Deploy AI models at scale. High-throughput serving engine for AI/ML models that uses the latest state-of-the-art model deployment techniques.
Created 2023-12-12
171 commits to main branch, last one 6 hours ago
TensorFlow Serving ARM - A project for cross-compiling TensorFlow Serving targeting popular ARM cores
Created 2018-09-26
49 commits to master branch, last one 2 years ago