264 results found

4.3k
38.1k
apache-2.0
379
Making large AI models cheaper, faster and more accessible
Created 2021-10-28
3,357 commits to main branch, last one 22 hours ago
3.9k
33.2k
apache-2.0
335
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Created 2020-01-23
2,316 commits to master branch, last one 13 hours ago
3.2k
32.2k
mit
293
Port of OpenAI's Whisper model in C/C++
Created 2022-09-25
1,298 commits to master branch, last one a day ago
5.0k
25.8k
apache-2.0
491
Cross-platform, customizable ML solutions for live and streaming media.
Created 2019-06-13
3,796 commits to master branch, last one a day ago
2.8k
20.3k
apache-2.0
194
A high-throughput and memory-efficient inference and serving engine for LLMs
Created 2023-02-09
1,444 commits to main branch, last one 12 hours ago
4.1k
19.5k
other
572
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Created 2017-06-30
3,233 commits to master branch, last one 23 hours ago
114
11.1k
mit
31
🎨 The exhaustive Pattern Matching library for TypeScript, with smart type inference.
Created 2020-05-24
922 commits to main branch, last one 8 days ago
6.6k
9.6k
apache-2.0
266
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
Created 2017-10-23
3,099 commits to main branch, last one a day ago
Faster Whisper transcription with CTranslate2
Created 2023-02-11
205 commits to master branch, last one 12 days ago
2.0k
9.3k
apache-2.0
149
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Created 2019-05-02
544 commits to release/10.0 branch, last one 3 days ago
Large Language Model Text Generation Inference
Created 2022-10-08
732 commits to main branch, last one 17 hours ago
1.4k
7.5k
bsd-3-clause
138
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Created 2018-10-04
3,374 commits to main branch, last one a day ago
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
Created 2016-07-30
2,225 commits to master branch, last one 2 months ago
💎 1MB lightweight face detection model
Created 2019-10-10
189 commits to master branch, last one 2 years ago
330
6.6k
mit
54
Runtime type system for IO decoding/encoding
Created 2017-01-28
635 commits to master branch, last one 6 months ago
2.0k
6.1k
apache-2.0
186
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Created 2018-10-15
15,582 commits to master branch, last one 21 hours ago
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
Created 2018-03-15
12,226 commits to main branch, last one 25 days ago
434
4.4k
apache-2.0
39
🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalab...
Created 2022-08-30
1,761 commits to main branch, last one a day ago
An easy to use PyTorch to TensorRT converter
Created 2019-04-27
1,203 commits to master branch, last one 28 days ago
756
4.3k
other
92
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop and server. TNN is distinguished by several outstanding features, including its cr...
Created 2020-05-29
2,511 commits to master branch, last one 8 months ago
1.4k
4.0k
apache-2.0
122
Pre-trained Deep Learning models and demos (high quality and extremely fast)
Created 2018-10-15
8,708 commits to master branch, last one 14 days ago
398
3.9k
mit
34
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
Created 2023-04-13
744 commits to main branch, last one about a month ago
336
3.7k
mpl-2.0
118
TypeDB: the polymorphic database powered by types
Created 2016-07-11
6,590 commits to development branch, last one 19 hours ago
325
3.1k
other
59
LightSeq: A High Performance Library for Sequence Processing and Generation
Created 2019-12-06
269 commits to master branch, last one about a year ago
246
3.0k
apache-2.0
34
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
Created 2023-06-14
712 commits to main branch, last one a day ago
259
2.9k
mit
56
Fast inference engine for Transformer models
Created 2019-09-23
2,162 commits to master branch, last one 2 days ago
169
2.9k
other
55
Sparsity-aware deep learning inference runtime for CPUs
Created 2020-12-14
1,050 commits to main branch, last one 25 days ago
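cube studio: an open-source, cloud-native, one-stop machine learning / deep learning / large-model AI platform. Supports SSO login, multi-tenancy, big-data platform integration, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, vGPU inference serving, edge computing, serverless, an annotation platform, automated annotation, dataset management, large-model fine-tuning, vLLM large-model inference, LLMOps, private knowledge bases, an AI model app store, one-click model development/inference/fine-tuning, supports...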
Created 2021-08-17
922 commits to master branch, last one about a month ago
692
2.6k
mit
74
Python Library for learning (Structure and Parameter), inference (Probabilistic and Causal), and simulations in Bayesian Networks.
Created 2013-09-20
2,953 commits to dev branch, last one 4 days ago
Swift native on-device speech recognition with Whisper for Apple Silicon
Created 2024-01-26
122 commits to main branch, last one a day ago