6 results found Sort:

701
11.0k
apache-2.0
57
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
Created 2023-04-19
1,881 commits to main branch, last one a day ago
Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"
Created 2023-01-19
125 commits to main branch, last one 2 months ago
CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search
Created 2023-04-18
26 commits to main branch, last one about a year ago
2
38
apache-2.0
1
Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & Heterogeneous Hardware Support
Created 2025-02-03
239 commits to main branch, last one 10 hours ago
EmbeddedLLM: API server for Embedded Device Deployment. Currently support CUDA/OpenVINO/IpexLLM/DirectML/CPU
Created 2024-06-17
41 commits to main branch, last one 5 months ago
Streamlining the process for seamless execution of PyCoral in running TensorFlow Lite models on an Edge TPU USB.
Created 2024-02-16
42 commits to main branch, last one 11 months ago