3 results found
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
Created 2023-07-05
1,103 commits to develop branch, last one 22 hours ago
Higher-performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GRPS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call, AI agents, d...
Created 2024-08-21
144 commits to master branch, last one 21 hours ago
Explore LLM model deployment based on AXera's AI chips
Created 2024-03-14
66 commits to prefill branch, last one 26 days ago