19 results found

717
3.0k
apache-2.0
68
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
Created 2023-10-19
147 commits to main branch, last one 5 days ago
322
1.5k
mit
16
Add bisenetv2. My implementation of BiSeNet
Created 2018-11-29
63 commits to master branch, last one 4 months ago
This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server
Created 2020-08-10
100 commits to master branch, last one 3 years ago
OpenAI compatible API for TensorRT LLM triton backend
Created 2023-11-06
33 commits to main branch, last one 8 months ago
13
160
apache-2.0
5
Serving Inside Pytorch
Created 2023-10-24
348 commits to v0 branch, last one 2 months ago
13
157
apache-2.0
9
Deep Learning Deployment Framework: Supports tf/torch/trt/trtllm/vllm and other NN frameworks. Supports dynamic batching and streaming modes. It is dual-language compatible with Python and C++, offeri...
Created 2024-07-04
62 commits to master branch, last one 28 days ago
42
149
apache-2.0
10
ClearML - Model-Serving Orchestration and Repository Solution
Created 2021-04-12
143 commits to main branch, last one 3 months ago
The Triton backend for the ONNX Runtime.
Created 2020-08-26
139 commits to main branch, last one 20 days ago
Deploy Stable Diffusion model with onnx/tensorrt + tritonserver
Created 2022-08-31
14 commits to master branch, last one 2 years ago
NVIDIA-accelerated DNN model inference ROS 2 packages using NVIDIA Triton/TensorRT for both Jetson and x86_64 with CUDA-capable GPU
Created 2021-10-13
46 commits to main branch, last one about a month ago
Traffic analysis at a roundabout using computer vision
Created 2024-02-16
127 commits to feature/influx branch, last one about a month ago
Diffusion Model for Voice Conversion
Created 2023-03-10
53 commits to main branch, last one about a year ago
Compare multiple optimization methods on Triton to improve model serving performance
Created 2022-10-30
8 commits to main branch, last one about a year ago
Build Recommender System with PyTorch + Redis + Elasticsearch + Feast + Triton + Flask. Vector Recall, DeepFM Ranking and Web Application.
Created 2023-08-05
7 commits to master branch, last one about a year ago
1
45
bsd-3-clause
1
Tiny configuration for Triton Inference Server
Created 2022-05-26
59 commits to main branch, last one 3 months ago
Set up CI in DL/ cuda/ cudnn/ TensorRT/ onnx2trt/ onnxruntime/ onnxsim/ Pytorch/ Triton-Inference-Server/ Bazel/ Tesseract/ PaddleOCR/ NVIDIA-docker/ minIO/ Supervisord on AGX or PC from scratch.
Created 2020-02-27
79 commits to master branch, last one about a year ago
Provides an ensemble model to deploy a YoloV8 ONNX model to Triton
Created 2023-03-08
6 commits to main branch, last one about a year ago
🧠🛡️ Web service for detecting network attacks in PCAP using ML.
Created 2024-10-22
35 commits to master branch, last one 2 months ago