5 results found

52
864
apache-2.0
13
Official implementation of the paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
Created 2023-10-02
21 commits to main branch, last one 3 months ago
59
810
apache-2.0
16
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recogn...
Created 2025-01-24
7 commits to main branch, last one 20 days ago
Research Code for Multimodal-Cognition Team in Ant Group
Created 2023-08-21
142 commits to main branch, last one 8 months ago
6
84
apache-2.0
4
[NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control"
Created 2024-03-15
34 commits to main branch, last one 2 months ago
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
Created 2023-11-23
7 commits to main branch, last one about a year ago