Trending repositories for topic diffusion-models
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
[ICLR 2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models (a conceptual sketch of the low-rank + 4-bit split appears after this list)
High-Resolution 3D Asset Generation with Large-Scale Hunyuan3D Diffusion Models.
HunyuanVideo: A Systematic Framework for Large Video Generation Models
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultr...
《Pytorch实用教程》 (PyTorch Practical Tutorial, 2nd Edition): whether you are starting from zero, working on CV, NLP, or LLM projects, or moving on to production engineering and deployment, it is all covered here. With the book's help, readers should be able to master PyTorch with ease and become excellent deep learning engineers.
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
A collection of resources and papers on Diffusion Models
[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
A curated list of recent diffusion models for video generation, editing, and various other applications.
[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"
A reading list for large model safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
A collection of diffusion model papers categorized by their subareas
Official implementation of the paper “MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control”
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
The collection of awesome papers on alignment of diffusion models.
ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control (e.g., audio, expression).
(CVPR 2025) Adversarial Diffusion Compression for Real-World Image Super-Resolution [PyTorch]
The official implementation of "Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models"
A curated list of recent style transfer methods with diffusion models
Collection of tutorials on diffusion models, step-by-step implementation guide, scripts for generating images with AI, prompt engineering guide, and resources for further learning.
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.
(TPAMI 2025) Invertible Diffusion Models for Compressed Sensing [PyTorch]
[CVPR 2025] FaithDiff for Classic Film Rejuvenation, Old Photo Revival, Social Media Restoration, Image Enhancement and AIGC Enhancement.
Unofficial implementation of "Simplifying, Stabilizing & Scaling Continuous-Time Consistency Models" for MNIST
IDDM (industrial, landscape, animate, spectrogram, ...): supports DDPM, DDIM, PLMS, a WebUI, and distributed training. A PyTorch implementation of diffusion models and generative models with distributed training.
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
📦A portable package for running Hunyuan3D-2 on Windows. | Hunyuan 3D 2.0 all-in-one package
Official Implementation Code of Our Paper "LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation"
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
[ECAI 2024] Official code for "TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models".
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
Official implementation of ICLR 2025 paper: "Unify ML4TSP: Drawing Methodological Principles for TSP and Beyond from Streamlined Design Space of Learning and Search".
[CVPR 2025] h-Edit: Effective and Flexible Diffusion-Based Editing via Doob’s h-Transform
A PyTorch implementation of diffusion models built from scratch (see the minimal DDPM sketch after this list)
[ICLR 2025] The official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video Editing"
[CVPR 2025] Official implementation of StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements
[ICLR 2025] The code for Z-Sampling, proposed in our paper "Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflection".
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and...
[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high f...
[CVPR 2025] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
A curated list of 3D Vision papers relating to the Robotics domain in the era of large models (i.e., LLMs/VLMs), inspired by awesome-computer-vision, including papers, code, and related websites
Lumina-T2X is a unified framework for Text to Any Modality Generation
A general fine-tuning kit geared toward diffusion models.
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
[ICLR 2025] Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation (ECCV 2024)
High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
Official code repository of "CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph"
📚 Collection of awesome generation acceleration resources.
Official repo for VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset.
[Single/Sparse View-to-Scene on a 4090(24G)] VistaDream: Sampling multiview consistent images for single-view scene reconstruction
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation
Official Code Release for [SIGGRAPH 2024] DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
Live2Diff: A pipeline that processes live video streams with a uni-directional video diffusion model.
[arXiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
Official codebase for the paper "EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidance"
Official implementation of the paper “MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes”
[NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
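For readers who want to dig into the from-scratch implementations listed above (the "PyTorch implementation of diffusion models built from scratch" and IDDM entries), here is a minimal, hedged sketch of the DDPM forward-noising process and the standard epsilon-prediction loss. The schedule values, tensor shapes, and the `model(x_t, t)` interface are illustrative assumptions, not any particular repository's API.

```python
# Minimal DDPM sketch (illustrative assumptions, not any listed repo's code).
import torch
import torch.nn.functional as F

T = 1000
betas = torch.linspace(1e-4, 0.02, T)           # linear noise schedule (assumed)
alphas_bar = torch.cumprod(1.0 - betas, dim=0)  # cumulative product \bar{alpha}_t

def q_sample(x0, t, noise):
    # Forward process: q(x_t | x_0) = sqrt(abar_t) * x_0 + sqrt(1 - abar_t) * eps
    abar = alphas_bar.to(x0.device)[t].view(-1, 1, 1, 1)
    return abar.sqrt() * x0 + (1.0 - abar).sqrt() * noise

def ddpm_loss(model, x0):
    # Epsilon-prediction objective: MSE between true noise and the model's prediction.
    t = torch.randint(0, T, (x0.size(0),), device=x0.device)
    noise = torch.randn_like(x0)
    x_t = q_sample(x0, t, noise)
    return F.mse_loss(model(x_t, t), noise)
```

Sampling then runs the reverse chain, repeatedly denoising x_t with the predicted noise; DDIM-style samplers (as supported by IDDM) reuse the same trained model with a deterministic, fewer-step update.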
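The SVDQuant entry states its mechanism in the title: absorb weight outliers with a low-rank component so that the residual quantizes well to 4 bits. Below is a conceptual sketch of that split using a plain SVD and a simple symmetric int4 quantizer; it illustrates the idea only, is not the official SVDQuant code, and the rank and quantizer choices are arbitrary assumptions.

```python
# Conceptual low-rank + int4 split (not the official SVDQuant implementation).
import torch

def lowrank_plus_int4(W: torch.Tensor, rank: int = 32):
    # Low-rank branch: the top-`rank` singular components stay in high precision
    # and absorb the large-magnitude structure of W.
    U, S, Vh = torch.linalg.svd(W, full_matrices=False)
    L = U[:, :rank] @ torch.diag(S[:rank]) @ Vh[:rank, :]

    # The residual is then handed to a crude symmetric int4 quantizer
    # (integer range [-8, 7]); the rationale is that it has less extreme
    # values left to represent than W itself.
    R = W - L
    scale = R.abs().max() / 7.0
    Rq = torch.clamp(torch.round(R / scale), -8, 7)
    return L, Rq, scale

def approx_matmul(x, L, Rq, scale):
    # Inference-time view: high-precision low-rank branch plus dequantized residual.
    return x @ (L + Rq * scale).T

W = torch.randn(512, 512)
L, Rq, scale = lowrank_plus_int4(W)
rel_err = (W - (L + Rq * scale)).norm() / W.norm()  # relative reconstruction error of the split
```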