6 results found Sort:
a state-of-the-art-level open visual language model | 多模态预训练模型
Created
2023-09-18
184 commits to main branch, last one 6 months ago
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/
Created
2024-05-11
54 commits to main branch, last one about a month ago
Commanding robots using only Language Models' prompts
Created
2023-08-20
59 commits to main branch, last one 4 months ago
https://arxiv.org/abs/2312.10807
Created
2024-01-03
22 commits to main branch, last one 20 days ago
Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"
Created
2024-05-23
24 commits to master branch, last one 5 months ago
Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.
Created
2024-06-06
82 commits to main branch, last one 3 days ago