18 results found
- Filter by Primary Language:
- Python (9)
- C++ (5)
- Jupyter Notebook (3)
- Rust (1)
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
Created
2023-10-19
121 commits to main branch, last one 7 hours ago
Add bisenetv2. My implementation of BiSeNet
Created
2018-11-29
61 commits to master branch, last one about a month ago
This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server
Created
2020-08-10
100 commits to master branch, last one 2 years ago
OpenAI compatible API for TensorRT LLM triton backend
Created
2023-11-06
33 commits to main branch, last one 3 months ago
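[Deep learning model deployment framework] Supports tf/torch/trt/trtllm/vllm and other NN frameworks, supports dynamic batching and streaming mode, and supports both Python and C++. Limitable, extensible, and high-performance. Helps users quickly deploy models to production and serve them through HTTP/RPC interfaces.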
Created
2024-07-04
56 commits to master branch, last one 9 days ago
Serving Inside Pytorch
Created
2023-10-24
247 commits to main branch, last one 12 days ago
ClearML - Model-Serving Orchestration and Repository Solution
Created
2021-04-12
140 commits to main branch, last one 4 months ago
The Triton backend for the ONNX Runtime.
Created
2020-08-26
130 commits to main branch, last one 5 days ago
Deploy a Stable Diffusion model with ONNX/TensorRT + tritonserver
Created
2022-08-31
14 commits to master branch, last one about a year ago
NVIDIA-accelerated DNN model inference ROS 2 packages using NVIDIA Triton/TensorRT for both Jetson and x86_64 with CUDA-capable GPU
Created
2021-10-13
40 commits to main branch, last one about a month ago
Deploy DL/ ML inference pipelines with minimal extra code.
Created
2020-04-09
493 commits to master branch, last one 3 days ago
Traffic analysis at a roundabout using computer vision
Created
2024-02-16
76 commits to main branch, last one 20 days ago
Compare multiple optimization methods on Triton to improve model service performance
Created
2022-10-30
8 commits to main branch, last one 10 months ago
Set up CI in DL/ cuda/ cudnn/ TensorRT/ onnx2trt/ onnxruntime/ onnxsim/ Pytorch/ Triton-Inference-Server/ Bazel/ Tesseract/ PaddleOCR/ NVIDIA-docker/ minIO/ Supervisord on AGX or PC from scratch.
Created
2020-02-27
79 commits to master branch, last one about a year ago
Tiny configuration for Triton Inference Server
Created
2022-05-26
55 commits to main branch, last one 3 months ago
Build Recommender System with PyTorch + Redis + Elasticsearch + Feast + Triton + Flask. Vector Recall, DeepFM Ranking and Web Application.
Created
2023-08-05
7 commits to master branch, last one about a year ago
Diffusion Model for Voice Conversion
Created
2023-03-10
53 commits to main branch, last one 7 months ago
Provides an ensemble model to deploy a YoloV8 ONNX model to Triton
Created
2023-03-08
6 commits to main branch, last one about a year ago