3 results found Sort:

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Created 2023-11-17
346 commits to main branch, last one 6 days ago
5
104
unknown
8
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Created 2024-05-26
28 commits to main branch, last one about a month ago
Create a Movie animation plus Audio plus Subtitle from a text file
Created 2023-03-22
9 commits to main branch, last one about a year ago