53 results found Sort:

743
3.8k
apache-2.0
94
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on a...
Created 2020-07-21
12,120 commits to master branch, last one 9 months ago
校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step
Created 2022-11-21
793 commits to main branch, last one 3 months ago
Rule engine implementation in Golang
Created 2019-12-13
236 commits to master branch, last one about a year ago
122
1.8k
apache-2.0
41
OneDiff: An out-of-the-box acceleration library for diffusion models.
Created 2022-09-21
601 commits to main branch, last one about a month ago
Large-scale LLM inference engine
Created 2023-06-23
1,158 commits to main branch, last one 9 hours ago
283
1.2k
unknown
100
FeatherCNN is a high performance inference engine for convolutional neural networks.
Created 2018-04-27
19 commits to master branch, last one 5 years ago
140
1.0k
apache-2.0
91
Paddle.js is a web project for Baidu PaddlePaddle, which is an open source deep learning framework running in the browser. Paddle.js can either load a pre-trained model, or transforming a model from p...
Created 2020-03-26
694 commits to release/v2.2.5 branch, last one 2 years ago
102
852
apache-2.0
51
A highly optimized LLM inference acceleration engine for Llama and its variants.
Created 2024-12-06
45 commits to main branch, last one 6 days ago
82
797
apache-2.0
28
Adlik: Toolkit for Accelerating Deep Learning Inference
Created 2019-09-23
406 commits to master branch, last one about a year ago
147
742
mit
25
🔥 (yolov3 yolov4 yolov5 unet ...)A mini pytorch inference framework which inspired from darknet.
Created 2020-07-05
611 commits to master branch, last one about a year ago
53
615
apache-2.0
10
Python Computer Vision & Video Analytics Framework With Batteries Included
Created 2022-05-07
824 commits to develop branch, last one 2 days ago
93
596
bsd-3-clause
18
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
Created 2023-12-20
63 commits to main branch, last one 2 days ago
67
552
other
22
A library for high performance deep learning inference on NVIDIA GPUs.
Created 2021-03-11
103 commits to master branch, last one 3 years ago
279
539
lgpl-2.1
24
A common base representation of python source code for pylint and other projects
Created 2015-12-08
4,912 commits to main branch, last one 13 days ago
134
532
apache-2.0
57
High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.
This repository has been archived (exclude archived)
Created 2018-05-18
788 commits to master branch, last one 5 years ago
129
519
apache-2.0
30
A Machine Learning System for Data Enrichment.
Created 2018-11-09
266 commits to master branch, last one 5 years ago
A rule engine written in Ruby.
Created 2012-07-13
428 commits to master branch, last one 8 months ago
Julia package for automated Bayesian inference on a factor graph with reactive message passing
Created 2022-06-10
1,434 commits to main branch, last one 2 days ago
校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。
Created 2024-04-25
190 commits to main branch, last one about a month ago
21
241
unknown
7
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O
Created 2024-10-02
114 commits to main branch, last one about a month ago
26
212
apache-2.0
12
docs for search system and ai infra
Created 2023-04-02
138 commits to master branch, last one 7 months ago
75
190
mit
25
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimize...
Created 2018-12-20
1,004 commits to develop branch, last one 3 days ago
This is a repository for an object detection inference API using the Tensorflow framework.
Created 2019-12-11
415 commits to master branch, last one 2 years ago
Context parallel attention that accelerates DiT model inference with dynamic caching
Created 2024-10-28
183 commits to main branch, last one a day ago
A quick view of high-performance convolution neural networks (CNNs) inference engines on mobile devices.
Created 2018-08-06
59 commits to master branch, last one 2 years ago
35
149
apache-2.0
13
Ai edge toolbox,专门面向边端设备尤其是嵌入式RTOS平台,AI模型部署工具链,包括模型推理引擎和模型压缩工具
Created 2023-01-30
6 commits to master branch, last one about a year ago
TinyTensor is a tool for running already trained NN (Neural Network) models to be able to use them for inference of various tasks such as image classification, semantic segmentation, etc.
Created 2023-03-02
157 commits to main branch, last one about a year ago
9
136
apache-2.0
3
A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).
Created 2024-05-11
40 commits to master branch, last one 7 months ago
28
135
bsd-3-clause
6
The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
Created 2024-03-21
11 commits to main branch, last one 25 days ago
11
129
apache-2.0
5
PyTorch library for cost-effective, fast and easy serving of MoE models.
Created 2024-01-22
19 commits to main branch, last one a day ago