4 results found
Famous Vision Language Models and Their Architectures
Created
2024-02-15
231 commits to main branch, last one about a month ago
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
Created
2023-07-05
905 commits to develop branch, last one 23 hours ago
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, qwen-vl, qwen2-vl, phi3-v, etc.
Created
2024-07-20
95 commits to main branch, last one 15 days ago
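[grps with trtllm] A high-performance LLM service implemented in pure C++ by integrating TensorRT-LLM and Tokenizers.cpp. It is compatible with the OpenAI API protocol and supports chat and function call modes, AI agents, multi-GPU inference, multimodal input, and a Gradio chat interface.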
Created
2024-08-21
101 commits to master branch, last one 4 days ago