4 results found Sort:
- Filter by Primary Language:
- Python (2)
- Jupyter Notebook (1)
- +
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Created
2023-05-18
136 commits to main branch, last one about a month ago
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
Created
2023-05-29
13 commits to master branch, last one 8 months ago
Audio Large Language Models
Created
2024-06-15
53 commits to main branch, last one 7 days ago
Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
Created
2024-06-15
26 commits to main branch, last one 6 days ago