3 results found Sort:

186
521
apache-2.0
23
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
Created 2023-07-05
1,103 commits to develop branch, last one 22 hours ago
Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GPRS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call, AI agents, d...
Created 2024-08-21
144 commits to master branch, last one 21 hours ago
14
85
bsd-3-clause
5
Explore LLM model deployment based on AXera's AI chips
Created 2024-03-14
66 commits to prefill branch, last one 26 days ago