264 results found Sort:
Making large AI models cheaper, faster and more accessible
Created
2021-10-28
3,357 commits to main branch, last one 22 hours ago
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Created
2020-01-23
2,316 commits to master branch, last one 13 hours ago
Port of OpenAI's Whisper model in C/C++
Created
2022-09-25
1,298 commits to master branch, last one a day ago
Cross-platform, customizable ML solutions for live and streaming media.
Created
2019-06-13
3,796 commits to master branch, last one a day ago
A high-throughput and memory-efficient inference and serving engine for LLMs
Created
2023-02-09
1,444 commits to main branch, last one 12 hours ago
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Created
2017-06-30
3,233 commits to master branch, last one 23 hours ago
🎨 The exhaustive Pattern Matching library for TypeScript, with smart type inference.
Created
2020-05-24
922 commits to main branch, last one 8 days ago
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
Created
2017-10-23
3,099 commits to main branch, last one a day ago
Faster Whisper transcription with CTranslate2
Created
2023-02-11
205 commits to master branch, last one 12 days ago
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Created
2019-05-02
544 commits to release/10.0 branch, last one 3 days ago
Large Language Model Text Generation Inference
Created
2022-10-08
732 commits to main branch, last one 17 hours ago
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Created
2018-10-04
3,374 commits to main branch, last one a day ago
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
Created
2016-07-30
2,225 commits to master branch, last one 2 months ago
💎1MB lightweight face detection model (1MB轻量级人脸检测模型)
Created
2019-10-10
189 commits to master branch, last one 2 years ago
Runtime type system for IO decoding/encoding
Created
2017-01-28
635 commits to master branch, last one 6 months ago
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Created
2018-10-15
15,582 commits to master branch, last one 21 hours ago
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
Created
2018-03-15
12,226 commits to main branch, last one 25 days ago
🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalab...
Created
2022-08-30
1,761 commits to main branch, last one a day ago
An easy to use PyTorch to TensorRT converter
Created
2019-04-27
1,203 commits to master branch, last one 28 days ago
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cr...
Created
2020-05-29
2,511 commits to master branch, last one 8 months ago
Pre-trained Deep Learning models and demos (high quality and extremely fast)
Created
2018-10-15
8,708 commits to master branch, last one 14 days ago
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Created
2023-04-13
744 commits to main branch, last one about a month ago
TypeDB: the polymorphic database powered by types
Created
2016-07-11
6,590 commits to development branch, last one 19 hours ago
LightSeq: A High Performance Library for Sequence Processing and Generation
Created
2019-12-06
269 commits to master branch, last one about a year ago
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
Created
2023-06-14
712 commits to main branch, last one a day ago
Fast inference engine for Transformer models
Created
2019-09-23
2,162 commits to master branch, last one 2 days ago
Sparsity-aware deep learning inference runtime for CPUs
Created
2020-12-14
1,050 commits to main branch, last one 25 days ago
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国...
Created
2021-08-17
922 commits to master branch, last one about a month ago
Python Library for learning (Structure and Parameter), inference (Probabilistic and Causal), and simulations in Bayesian Networks.
Created
2013-09-20
2,953 commits to dev branch, last one 4 days ago
Swift native on-device speech recognition with Whisper for Apple Silicon
Created
2024-01-26
122 commits to main branch, last one a day ago