7 results found Sort:
a state-of-the-art-level open visual language model | 多模态预训练模型
Created
2023-09-18
184 commits to main branch, last one 8 months ago
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/
Created
2024-05-11
54 commits to main branch, last one 3 months ago
Commanding robots using only Language Models' prompts
Created
2023-08-20
59 commits to main branch, last one 5 months ago
https://arxiv.org/abs/2312.10807
Created
2024-01-03
22 commits to main branch, last one 2 months ago
Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"
Created
2024-05-23
24 commits to master branch, last one 6 months ago
Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.
Created
2024-06-06
86 commits to main branch, last one 15 days ago
Build a simple basic multimodal large model from scratch. 从零搭建一个简单的基础多模态大模型🤖
Created
2024-06-05
44 commits to main branch, last one 7 months ago