7 results found

426
6.3k
apache-2.0
69
A state-of-the-art open visual language model | Multimodal pre-trained model
Created 2023-09-18
184 commits to main branch, last one 8 months ago
29
210
unknown
1
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/
Created 2024-05-11
54 commits to main branch, last one 3 months ago
Commanding robots using only Language Models' prompts
Created 2023-08-20
59 commits to main branch, last one 5 months ago
Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"
Created 2024-05-23
24 commits to master branch, last one 6 months ago
2
30
cc-by-sa-4.0
1
Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.
Created 2024-06-06
86 commits to main branch, last one 15 days ago
Build a simple, basic multimodal large model from scratch 🤖
Created 2024-06-05
44 commits to main branch, last one 7 months ago