3 results found

🔥🔥🔥 A curated list of papers on LLM-based multimodal generation (image, video, 3D, and audio).
Created 2023-11-17
356 commits to main branch, last one 3 days ago
11
189
unknown
7
AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Created 2024-05-26
33 commits to main branch, last one 5 days ago
Create a movie animation with audio and subtitles from a text file
Created 2023-03-22
9 commits to main branch, last one 2 years ago