3 results found

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Created 2023-11-17
338 commits to main branch, last one a day ago
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Created 2024-05-26
28 commits to main branch, last one about a month ago
Create a movie animation with audio and subtitles from a text file
Created 2023-03-22
9 commits to main branch, last one about a year ago