4 results found Sort:

64
980
apache-2.0
14
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Created 2023-05-18
136 commits to main branch, last one about a month ago
17
245
mit
18
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
Created 2023-05-29
13 commits to master branch, last one 8 months ago
8
152
unknown
10
Audio Large Language Models
Created 2024-06-15
53 commits to main branch, last one 7 days ago
9
81
apache-2.0
6
Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
Created 2024-06-15
26 commits to main branch, last one 6 days ago