5 results found
Official implementation of the paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
Created 2023-10-02
21 commits to main branch, last one 3 months ago
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recogn...
Created 2025-01-24
7 commits to main branch, last one 20 days ago
Research Code for Multimodal-Cognition Team in Ant Group
Created 2023-08-21
142 commits to main branch, last one 8 months ago
[NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control"
Created 2024-03-15
34 commits to main branch, last one 2 months ago
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
Created 2023-11-23
7 commits to main branch, last one about a year ago