5 results found

235
2.6k
bsd-3-clause
31
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Created 2023-05-06
145 commits to main branch, last one 25 days ago
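ChatGPT's explosive popularity marks a key step toward AGI. This project aims to collect open-source alternatives to ChatGPT, including large text models and large multimodal models, as a convenient reference.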
Created 2023-04-07
65 commits to main branch, last one 10 months ago
Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)
Created 2023-07-15
15 commits to master branch, last one 10 months ago
99
235
apache-2.0
21
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
Created 2023-07-05
675 commits to develop branch, last one 15 days ago
3
56
unknown
10
[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning
Created 2023-08-24
67 commits to main branch, last one about a month ago