Trending repositories for topic diffusion

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

bytedance/UNO

🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

679 (+197)

apache-2.0

AUTOMATIC1111/stable-diffusion-webui

Stable Diffusion web UI

151,228 (+193)

agpl-3.0

Fantasy-AMAP/fantasy-talking

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

376 (+96)

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,931 (+53)

apache-2.0

huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

18,112 (+28)

apache-2.0

pollinations/pollinations

Free Open-Source Image and Text Generation

1,601 (+27)

mit

huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

28,567 (+23)

apache-2.0

a-r-r-o-w/finetrainers

Memory-optimized training library for diffusion models

1,044 (+17)

apache-2.0

NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

3,979 (+15)

apache-2.0

datawhalechina/leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍，苹果书🍎），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

14,941 (+15)

datawhalechina/tiny-universe

《大模型白盒子构建指南》：一个全手搓的Tiny-Universe

2,697 (+9)

ChenHsing/Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

2,064 (+7)

easydiffusion/easydiffusion

An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and ...

9,889 (+6)

riffusion/riffusion-hobby

Stable diffusion for real-time music generation

3,641 (+5)

mit

VectorSpaceLab/OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

3,931 (+4)

mit

yuanchenyang/smalldiffusion

Simple and readable code for training and sampling from diffusion models

473 (+4)

mit

ChocoWu/Any2Caption

This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation

30 (+3)

Lakonik/GMFlow

Gaussian Mixture Flow Matching Models (GMFlow)

70 (+3)

mit

bansky-cl/diffusion-nlp-paper-arxiv

Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".

110 (+2)

mihirp1998/VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various rewa...

265 (+2)

Last 3 days (relative gain)

bytedance/UNO

🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

679 (+41%)

apache-2.0

Fantasy-AMAP/fantasy-talking

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

376 (+34%)

ChocoWu/Any2Caption

This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation

30 (+11%)

Lakonik/GMFlow

Gaussian Mixture Flow Matching Models (GMFlow)

70 (+4%)

mit

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,931 (+3%)

apache-2.0

byliutao/1Prompt1Story

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

241 (+3%)

mit

bansky-cl/diffusion-nlp-paper-arxiv

Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".

110 (+2%)

pollinations/pollinations

Free Open-Source Image and Text Generation

1,601 (+2%)

mit

a-r-r-o-w/finetrainers

Memory-optimized training library for diffusion models

1,044 (+2%)

apache-2.0

yuanchenyang/smalldiffusion

Simple and readable code for training and sampling from diffusion models

473 (+0.9%)

mit

LeCAR-Lab/dial-mpc

Official implementation for the paper "Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing". DIAL-MPC is a novel sampling-based MPC framework for legged rob...

640 (+0.8%)

apache-2.0

mihirp1998/VADER

265 (+0.8%)

csslc/PiSA-SR

[CVPR 2025] Official code repository for "Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach"

140 (+0.7%)

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

671 (+0.6%)

mpl-2.0

sanweiliti/RoHM

The official PyTorch code for RoHM: Robust Human Motion Reconstruction via Diffusion.

360 (+0.6%)

NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

3,979 (+0.4%)

apache-2.0

ChenHsing/Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

2,064 (+0.3%)

datawhalechina/tiny-universe

《大模型白盒子构建指南》：一个全手搓的Tiny-Universe

2,697 (+0.3%)

prs-eth/RollingDepth

[CVPR 2025] Video Depth without Video Models

480 (+0.2%)

apache-2.0

PKU-YuanGroup/ConsisID

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

665 (+0.2%)

apache-2.0

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

bytedance/UNO

🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

679 (+649)

apache-2.0

AUTOMATIC1111/stable-diffusion-webui

Stable Diffusion web UI

151,228 (+394)

agpl-3.0

Fantasy-AMAP/fantasy-talking

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

376 (+359)

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,931 (+200)

apache-2.0

huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

28,567 (+85)

apache-2.0

pollinations/pollinations

Free Open-Source Image and Text Generation

1,601 (+76)

mit

huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

18,112 (+67)

apache-2.0

NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

3,979 (+52)

apache-2.0

datawhalechina/leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍，苹果书🍎），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

14,941 (+39)

Lakonik/GMFlow

Gaussian Mixture Flow Matching Models (GMFlow)

70 (+37)

mit

datawhalechina/tiny-universe

《大模型白盒子构建指南》：一个全手搓的Tiny-Universe

2,697 (+34)

VectorSpaceLab/OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

3,931 (+25)

mit

leejet/stable-diffusion.cpp

Stable Diffusion and Flux in pure C/C++

4,011 (+23)

mit

FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

1,696 (+20)

mit

a-r-r-o-w/finetrainers

Memory-optimized training library for diffusion models

1,044 (+20)

apache-2.0

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

503 (+17)

riffusion/riffusion-hobby

Stable diffusion for real-time music generation

3,641 (+14)

mit

cloneofsimo/lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

7,306 (+13)

apache-2.0

easydiffusion/easydiffusion

9,889 (+12)

NVIDIA/Cosmos-Tokenizer

A suite of image and video neural tokenizers

1,610 (+12)

apache-2.0

Last week (relative gain)

bytedance/UNO

🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

679 (+2,163%)

apache-2.0

Fantasy-AMAP/fantasy-talking

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

376 (+2,112%)

Lakonik/GMFlow

Gaussian Mixture Flow Matching Models (GMFlow)

70 (+112%)

mit

ChocoWu/Any2Caption

This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation

30 (+25%)

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,931 (+12%)

apache-2.0

csslc/PiSA-SR

[CVPR 2025] Official code repository for "Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach"

140 (+5%)

pollinations/pollinations

Free Open-Source Image and Text Generation

1,601 (+5%)

mit

A-suozhang/Awesome-Efficient-Diffusion

Curated list of methods that focuses on improving the efficiency of diffusion models

44 (+5%)

THUDM/VisionReward

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

216 (+4%)

apache-2.0

byliutao/1Prompt1Story

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

241 (+4%)

mit

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

503 (+3%)

JyChen9811/FaithDiff

[CVPR 2025] FaithDiff for Classic Film Rejuvenation, Old Photo Revival, Social Media Restoration, Image Enhancement and AIGC Enhancement.

93 (+3%)

Haochen-Wang409/ross

[ICLR'25] Reconstructive Visual Instruction Tuning

78 (+3%)

apache-2.0

xandergos/sCM-mnist

Unofficial implementation of "Simplifying, Stabilizing & Scaling Continuous-Time Consistency Models" for MNIST

47 (+2%)

mit

mbodiai/embodied-agents

Seamlessly integrate state-of-the-art transformer models into robotics stacks

200 (+2%)

apache-2.0

gpustack/llama-box

LM inference server implementation based on *.cpp.

165 (+2%)

mit

bansky-cl/diffusion-nlp-paper-arxiv

Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".

110 (+2%)

mihirp1998/VADER

265 (+2%)

ximinng/SVGDreamer

[CVPR 2024] Official implementation for "SVGDreamer: Text Guided SVG Generation with Diffusion Model" https://arxiv.org/abs/2312.16476

350 (+1%)

mit

NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

3,979 (+1%)

apache-2.0

Last month (new repositories)

bytedance/UNO

🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

679

apache-2.0

Fantasy-AMAP/fantasy-talking

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

376

Lakonik/GMFlow

Gaussian Mixture Flow Matching Models (GMFlow)

mit

Last month (absolute gain)

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,931 (+1,924)

apache-2.0

AUTOMATIC1111/stable-diffusion-webui

Stable Diffusion web UI

151,228 (+1,726)

agpl-3.0

bytedance/UNO

🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

679 (+678)

apache-2.0

huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

28,567 (+510)

apache-2.0

pollinations/pollinations

Free Open-Source Image and Text Generation

1,601 (+403)

mit

Fantasy-AMAP/fantasy-talking

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

376 (+359)

huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

18,112 (+344)

apache-2.0

NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

3,979 (+312)

apache-2.0

datawhalechina/tiny-universe

《大模型白盒子构建指南》：一个全手搓的Tiny-Universe

2,697 (+225)

datawhalechina/leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍，苹果书🍎），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

14,941 (+210)

VectorSpaceLab/OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

3,931 (+170)

mit

leejet/stable-diffusion.cpp

Stable Diffusion and Flux in pure C/C++

4,011 (+95)

mit

a-r-r-o-w/finetrainers

Memory-optimized training library for diffusion models

1,044 (+87)

apache-2.0

FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

1,696 (+79)

mit

wangkai930418/awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

1,654 (+72)

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

503 (+69)

cloneofsimo/lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

7,306 (+57)

apache-2.0

riffusion/riffusion-hobby

Stable diffusion for real-time music generation

3,641 (+55)

mit

XianfengWu01/LightGen

An Efficient Text-to-Image Generation Pretrain Pipeline

99 (+53)

mit

easydiffusion/easydiffusion

9,889 (+53)

Last month (relative gain)

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,931 (+27,486%)

apache-2.0

Fantasy-AMAP/fantasy-talking

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

376 (+2,112%)

ChocoWu/Any2Caption

This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation

30 (+329%)

XianfengWu01/LightGen

An Efficient Text-to-Image Generation Pretrain Pipeline

99 (+115%)

mit

Lakonik/GMFlow

Gaussian Mixture Flow Matching Models (GMFlow)

70 (+112%)

mit

JyChen9811/FaithDiff

[CVPR 2025] FaithDiff for Classic Film Rejuvenation, Old Photo Revival, Social Media Restoration, Image Enhancement and AIGC Enhancement.

93 (+98%)

csslc/PiSA-SR

[CVPR 2025] Official code repository for "Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach"

140 (+39%)

pollinations/pollinations

Free Open-Source Image and Text Generation

1,601 (+34%)

mit

xandergos/sCM-mnist

Unofficial implementation of "Simplifying, Stabilizing & Scaling Continuous-Time Consistency Models" for MNIST

47 (+27%)

mit

THUDM/VisionReward

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

216 (+21%)

apache-2.0

gpustack/llama-box

LM inference server implementation based on *.cpp.

165 (+20%)

mit

blepping/comfyui_jankdiffusehigh

Janky implementation of DiffuseHigh for ComfyUI

32 (+19%)

apache-2.0

intuitive-robots/mdt_policy

[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre-trained weights

129 (+16%)

mit

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

503 (+16%)

A-suozhang/Awesome-Efficient-Diffusion

Curated list of methods that focuses on improving the efficiency of diffusion models

44 (+13%)

bansky-cl/diffusion-nlp-paper-arxiv

Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".

110 (+12%)

HKUNLP/diffusion-vs-ar

[ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"

47 (+12%)

apache-2.0

blepping/ComfyUI-bleh

ComfyUI nodes collection: better TAESD previews (including batch previews), improved HyperTile and Deep Shrink nodes

90 (+10%)

apache-2.0

william-murray1204/stable-diffusion-cpp-python

stable-diffusion.cpp bindings for python

47 (+9%)

mit

OliBomby/Mapperatorinator

A multi-model framework that generates fully featured osu! beatmaps for all gamemodes from spectrogram inputs.

119 (+9%)

mit

Last 12-months (new repositories)

NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

3,979

apache-2.0

VectorSpaceLab/OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

3,931

mit

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,931

apache-2.0

FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

1,696

mit

NVIDIA/Cosmos-Tokenizer

A suite of image and video neural tokenizers

1,610

apache-2.0

a-r-r-o-w/finetrainers

Memory-optimized training library for diffusion models

1,044

apache-2.0

bytedance/UNO

🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

679

apache-2.0

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

671

mpl-2.0

PKU-YuanGroup/ConsisID

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

665

apache-2.0

LeCAR-Lab/dial-mpc

640

apache-2.0

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

503

prs-eth/RollingDepth

[CVPR 2025] Video Depth without Video Models

480

apache-2.0

VisualComputingInstitute/diffusion-e2e-ft

[WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

412

Fantasy-AMAP/fantasy-talking

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

376

zibojia/COCOCO

Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.

287

baaivision/DIVA

[ICLR 2025] Diffusion Feedback Helps CLIP See Better

273

mit

mihirp1998/VADER

265

aredden/flux-fp8-api

Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.

259

apache-2.0

byliutao/1Prompt1Story

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

241

mit

LeCAR-Lab/model-based-diffusion

Official implementation for the paper "Model-based Diffusion for Trajectory Optimization". Model-based diffusion (MBD) is a novel diffusion-based trajectory optimization framework that employs a dynam...

232

apache-2.0

Last 12-months (absolute gain)

AUTOMATIC1111/stable-diffusion-webui

Stable Diffusion web UI

151,228 (+22,605)

agpl-3.0

huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

28,567 (+6,227)

apache-2.0

datawhalechina/leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍，苹果书🍎），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

14,941 (+5,420)

huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

18,112 (+4,480)

apache-2.0

VectorSpaceLab/OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

3,931 (+3,930)

mit

NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

3,979 (+3,920)

apache-2.0

datawhalechina/tiny-universe

《大模型白盒子构建指南》：一个全手搓的Tiny-Universe

2,697 (+2,692)

Alpha-VLLM/Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

2,181 (+2,178)

mit

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,931 (+1,924)

apache-2.0

FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

1,696 (+1,695)

mit

NVIDIA/Cosmos-Tokenizer

A suite of image and video neural tokenizers

1,610 (+1,608)

apache-2.0

leejet/stable-diffusion.cpp

Stable Diffusion and Flux in pure C/C++

4,011 (+1,487)

mit

pollinations/pollinations

Free Open-Source Image and Text Generation

1,601 (+1,409)

mit

TMElyralab/MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

2,678 (+1,397)

a-r-r-o-w/finetrainers

Memory-optimized training library for diffusion models

1,044 (+1,043)

apache-2.0

prs-eth/Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

2,641 (+1,038)

apache-2.0

wangkai930418/awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

1,654 (+904)

easydiffusion/easydiffusion

9,889 (+829)

ChenHsing/Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

2,064 (+829)

rupeshs/fastsdcpu

Fast stable diffusion on CPU

1,654 (+796)

mit

Last 12-months (relative gain)

datawhalechina/tiny-universe

《大模型白盒子构建指南》：一个全手搓的Tiny-Universe

2,697 (+53,840%)

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,931 (+27,486%)

apache-2.0

LeCAR-Lab/dial-mpc

640 (+10,567%)

apache-2.0

NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

3,979 (+6,644%)

apache-2.0

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

671 (+4,373%)

mpl-2.0

xlite-dev/Awesome-Diffusion-Inference

📖A curated list of Awesome Diffusion Inference Papers with codes: Sampling, Caching, Multi-GPUs, etc. 🎉🎉

208 (+4,060%)

gpl-3.0

aredden/flux-fp8-api

Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.

259 (+3,600%)

apache-2.0

iamNCJ/DiLightNet

Official Code Release for [SIGGRAPH 2024] DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation

169 (+3,280%)

mit

ali-vilab/ACE

All-round Creator and Editor

212 (+2,550%)

apache-2.0

byliutao/1Prompt1Story

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

241 (+2,310%)

mit

Fantasy-AMAP/fantasy-talking

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

376 (+2,112%)

THUDM/VisionReward

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

216 (+2,060%)

apache-2.0

intuitive-robots/mdt_policy

[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre-trained weights

129 (+2,050%)

mit

ali-vilab/FreeScale

Code for FreeScale, a tuning-free method for higher-resolution visual generation

121 (+1,917%)

leffff/FlowModels

The aim of this repository is to test and implement Flow-Matching-based models

85 (+1,600%)

mit

Weixiang-Sun/Bora

Biomedical Generalist Video Generation Model

179 (+1,392%)

bsd-3-clause

blepping/comfyui_jankhidiffusion

Janky implementation of HiDiffusion for ComfyUI

119 (+1,222%)

apache-2.0

line/open-universe

Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.

91 (+1,200%)

apache-2.0

JettHu/ComfyUI-TCD

ComfyUI TCD implementation

126 (+1,160%)

apache-2.0

nicolas-dufour/plonk

Code for "Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation"

67 (+1,017%)

mit