13 results found Sort:
- Filter by Primary Language:
- Python (11)
- Swift (1)
- +
Lumina-T2X is a unified framework for Text to Any Modality Generation
Created
2024-03-28
365 commits to main branch, last one 4 months ago
[NeurIPS 2024🔥] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Created
2024-10-24
13 commits to main branch, last one 6 days ago
OpenMusic: SOTA Text-to-music (TTM) Generation
Created
2024-05-24
113 commits to main branch, last one 8 days ago
Implementation of F5-TTS in MLX
Created
2024-10-13
71 commits to main branch, last one 4 days ago
Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)
Created
2024-11-05
61 commits to main branch, last one 2 days ago
🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"
Created
2023-07-06
26 commits to main branch, last one 7 months ago
[ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy
Created
2023-03-22
17 commits to main branch, last one 8 days ago
Adaptive Caching for Faster Video Generation with Diffusion Transformers
Created
2024-10-31
8 commits to main branch, last one about a month ago
The official implementation of "CAME: Confidence-guided Adaptive Memory Optimization"
Created
2023-07-20
34 commits to master branch, last one 5 months ago
ArXiv paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151
Created
2024-10-10
2 commits to main branch, last one 2 months ago
Implementation of F5-TTS in Swift using MLX
Created
2024-10-19
16 commits to main branch, last one 6 days ago
Implementation of Diffusion Transformer Model in Pytorch
Created
2023-12-14
22 commits to main branch, last one 12 days ago
FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.
Created
2024-06-26
26 commits to main branch, last one 5 months ago