8 results found Sort:
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Created
2024-12-07
100 commits to main branch, last one 17 hours ago
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
Created
2023-06-17
19 commits to main branch, last one 9 months ago
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
Created
2024-06-25
8 commits to main branch, last one 6 months ago
A stable and Fast telegram video convertor bot which can encode into different libs and resolution, compress videos, convert video into audio and other video formats, rename with thumbnail support, ge...
Created
2021-12-13
569 commits to public branch, last one about a year ago
Text and image to video generation: Kandinsky 4.0 (2024)
Created
2024-12-08
120 commits to main branch, last one 2 months ago
Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation
Created
2022-10-29
3 commits to main branch, last one 2 years ago
Extract, timestamp, and analyze specific content from video collections using LLM-powered audio/video processing.
Created
2025-01-10
18 commits to main branch, last one about a month ago
Generate subtitles for all the videos in a folder with OpenAI's Whisper privately in your computer.
Created
2023-07-16
11 commits to main branch, last one about a year ago