Trending repositories for topic text-to-image

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,227 (+420)

apache-2.0

FoundationVision/Liquid

Liquid: Language Models are Scalable and Unified Multi-modal Generators

308 (+20)

mit

SamurAIGPT/AI-Youtube-Shorts-Generator

A Python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

2,038 (+16)

mit

Capsize-Games/airunner

Stable Diffusion and LLMs offline on your own hardware

300 (+13)

apache-2.0

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

460 (+13)

PaddlePaddle/PaddleMIX

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...

602 (+12)

apache-2.0

FurkanGozukara/Stable-Diffusion

FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI,...

2,361 (+8)

gpl-3.0

atfortes/BokehDiffusion

The official implementation of "Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models"

74 (+7)

THUDM/CogView4

CogView4, CogView3-Plus and CogView3(ECCV 2024)

952 (+7)

apache-2.0

promptslab/Awesome-Prompt-Engineering

This repository contains hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.

4,276 (+5)

apache-2.0

lucidrains/DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch

11,243 (+5)

mit

FoundationVision/UniTok

A Unified Tokenizer for Visual Generation and Understanding

216 (+4)

mit

Lightricks/ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

926 (+4)

apache-2.0

nupurkmr9/syncd

SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization

127 (+3)

mit

filipecalegario/awesome-generative-ai

A curated list of Generative AI tools, works, models, and references

2,749 (+3)

cc0-1.0

XiaomingX/awesome-qwen-prompt-insight

🧠 The most comprehensive collection of excellent Qwen prompts in the world. Feel free to contribute your own prompts!

205 (+2)

mit

wooyeolbaek/attention-map-diffusers

🚀 Cross attention map tools for huggingface/diffusers

239 (+2)

mit

YingqingHe/Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

446 (+2)

SamurAIGPT/Text-To-Video-AI

Generate video from text using AI

470 (+2)

mit

Last 3 days (relative gain)

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,227 (+52%)

apache-2.0

atfortes/BokehDiffusion

The official implementation of "Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models"

74 (+10%)

FoundationVision/Liquid

Liquid: Language Models are Scalable and Unified Multi-modal Generators

308 (+7%)

mit

Capsize-Games/airunner

Stable Diffusion and LLMs offline on your own hardware

300 (+5%)

apache-2.0

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

460 (+3%)

nupurkmr9/syncd

SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization

127 (+2%)

mit

Westlake-AGI-Lab/Awesome-Style-Transfer-with-Diffusion-Models

A curated list of recent style transfer methods with diffusion models

45 (+2%)

PaddlePaddle/PaddleMIX

602 (+2%)

apache-2.0

FoundationVision/UniTok

A Unified Tokenizer for Visual Generation and Understanding

216 (+2%)

mit

aigem/cf-flux-remix

Generate images for free with FLUX.1 on Workers AI (frontend + API); extremely simple to deploy on CF Workers

81 (+1%)

TrustGen/TrustEval-toolkit

TrustEval: A modular and extensible toolkit for comprehensive trust evaluation of generative foundation models (GenFMs)

90 (+1%)

mit

XiaomingX/awesome-qwen-prompt-insight

205 (+1.0%)

mit

wooyeolbaek/attention-map-diffusers

🚀 Cross attention map tools for huggingface/diffusers

239 (+0.8%)

mit

SamurAIGPT/AI-Youtube-Shorts-Generator

A Python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

2,038 (+0.8%)

mit

LetterLiGo/SafeGen_CCS2024

[CCS'24] SafeGen: Mitigating Unsafe Content Generation in Text-to-Image Models

129 (+0.8%)

apache-2.0

THUDM/CogView4

CogView4, CogView3-Plus and CogView3(ECCV 2024)

952 (+0.7%)

apache-2.0

YingqingHe/Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

446 (+0.5%)

SamurAIGPT/Text-To-Video-AI

Generate video from text using AI

470 (+0.4%)

mit

AlonzoLeeeooo/awesome-text-to-image-studies

A collection of awesome text-to-image generation studies.

568 (+0.4%)

mit

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,227 (+1,017)

apache-2.0

FoundationVision/Liquid

Liquid: Language Models are Scalable and Unified Multi-modal Generators

308 (+45)

mit

atfortes/BokehDiffusion

The official implementation of "Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models"

74 (+37)

SamurAIGPT/AI-Youtube-Shorts-Generator

A Python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

2,038 (+34)

mit

PaddlePaddle/PaddleMIX

602 (+23)

apache-2.0

FoundationVision/Infinity

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

1,036 (+16)

mit

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

460 (+16)

FurkanGozukara/Stable-Diffusion

2,361 (+15)

gpl-3.0

promptslab/Awesome-Prompt-Engineering

This repository contains hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.

4,276 (+15)

apache-2.0

Capsize-Games/airunner

Stable Diffusion and LLMs offline on your own hardware

300 (+14)

apache-2.0

THUDM/CogView4

CogView4, CogView3-Plus and CogView3(ECCV 2024)

952 (+12)

apache-2.0

filipecalegario/awesome-generative-ai

A curated list of Generative AI tools, works, models, and references

2,749 (+11)

cc0-1.0

SamurAIGPT/Text-To-Video-AI

Generate video from text using AI

470 (+9)

mit

AlonzoLeeeooo/awesome-text-to-image-studies

A collection of awesome text-to-image generation studies.

568 (+9)

mit

Lightricks/ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

926 (+9)

apache-2.0

lucidrains/DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch

11,243 (+8)

mit

TencentARC/BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

1,533 (+8)

SamurAIGPT/AI-Influencer-Generator

Create and customize your AI influencer open-source

73 (+7)

mit

FoundationVision/UniTok

A Unified Tokenizer for Visual Generation and Understanding

216 (+6)

mit

Last week (relative gain)

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,227 (+484%)

apache-2.0

atfortes/BokehDiffusion

The official implementation of "Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models"

74 (+100%)

FoundationVision/Liquid

Liquid: Language Models are Scalable and Unified Multi-modal Generators

308 (+17%)

mit

Westlake-AGI-Lab/Awesome-Style-Transfer-with-Diffusion-Models

A curated list of recent style transfer methods with diffusion models

45 (+13%)

SamurAIGPT/AI-Influencer-Generator

Create and customize your AI influencer open-source

73 (+11%)

mit

Capsize-Games/airunner

Stable Diffusion and LLMs offline on your own hardware

300 (+5%)

apache-2.0

Westlake-AGI-Lab/StyleStudio

[CVPR 2025] Official implementation of StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements

74 (+4%)

nupurkmr9/syncd

SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization

127 (+4%)

mit

PaddlePaddle/PaddleMIX

602 (+4%)

apache-2.0

aigem/cf-flux-remix

Generate images for free with FLUX.1 on Workers AI (frontend + API); extremely simple to deploy on CF Workers

81 (+4%)

xiefan-guo/initno

[CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization

55 (+4%)

apache-2.0

TrustGen/TrustEval-toolkit

TrustEval: A modular and extensible toolkit for comprehensive trust evaluation of generative foundation models (GenFMs)

90 (+3%)

mit

FoundationVision/UniTok

A Unified Tokenizer for Visual Generation and Understanding

216 (+3%)

mit

wooyeolbaek/attention-map-diffusers

🚀 Cross attention map tools for huggingface/diffusers

239 (+2%)

mit

doanbactam/awesome-stable-diffusion

A curated list of awesome stable diffusion resources 🌟

51 (+2%)

mit

XiaomingX/awesome-qwen-prompt-insight

205 (+2%)

mit

ashbuilds/payload-ai

AI Plugin is a powerful extension for the Payload CMS, integrating advanced AI capabilities to enhance content creation and management.

154 (+2%)

SamurAIGPT/Text-To-Video-AI

Generate video from text using AI

470 (+2%)

mit

SamurAIGPT/AI-Faceless-Video-Generator

Generate a video script, voice and a talking face completely with AI

267 (+2%)

mit

Last month (new repositories)

FoundationVision/UniTok

A Unified Tokenizer for Visual Generation and Understanding

216

mit

atfortes/BokehDiffusion

The official implementation of "Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models"

74

zer0int/CLIP-fine-tune-registers-gated

Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!

36

mit

Last month (absolute gain)

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,227 (+1,220)

apache-2.0

THUDM/CogView4

CogView4, CogView3-Plus and CogView3(ECCV 2024)

952 (+667)

apache-2.0

FoundationVision/Liquid

Liquid: Language Models are Scalable and Unified Multi-modal Generators

308 (+241)

mit

FoundationVision/UniTok

A Unified Tokenizer for Visual Generation and Understanding

216 (+204)

mit

Lightricks/ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

926 (+184)

apache-2.0

SamurAIGPT/AI-Youtube-Shorts-Generator

A Python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

2,038 (+126)

mit

PaddlePaddle/PaddleMIX

602 (+106)

apache-2.0

nupurkmr9/syncd

SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization

127 (+80)

mit

promptslab/Awesome-Prompt-Engineering

This repository contains hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.

4,276 (+65)

apache-2.0

atfortes/BokehDiffusion

The official implementation of "Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models"

74 (+59)

FurkanGozukara/Stable-Diffusion

2,361 (+58)

gpl-3.0

FoundationVision/Infinity

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

1,036 (+56)

mit

byliutao/1Prompt1Story

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

226 (+43)

mit

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

460 (+42)

filipecalegario/awesome-generative-ai

A curated list of Generative AI tools, works, models, and references

2,749 (+40)

cc0-1.0

snap-research/stable-flow

Official implementation for "Stable Flow: Vital Layers for Training-Free Image Editing" [CVPR 2025]

326 (+40)

AlonzoLeeeooo/awesome-text-to-image-studies

A collection of awesome text-to-image generation studies.

568 (+39)

mit

TencentARC/BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

1,533 (+38)

YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy

Diffusion model papers, survey, and taxonomy

3,144 (+37)

xuyang-liu16/Awesome-Generation-Acceleration

📚 Collection of awesome generation acceleration resources.

180 (+31)

Last month (relative gain)

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,227 (+17,429%)

apache-2.0

FoundationVision/UniTok

A Unified Tokenizer for Visual Generation and Understanding

216 (+1,700%)

mit

atfortes/BokehDiffusion

The official implementation of "Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models"

74 (+393%)

FoundationVision/Liquid

Liquid: Language Models are Scalable and Unified Multi-modal Generators

308 (+360%)

mit

THUDM/CogView4

CogView4, CogView3-Plus and CogView3(ECCV 2024)

952 (+234%)

apache-2.0

zer0int/CLIP-fine-tune-registers-gated

Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!

36 (+227%)

mit

nupurkmr9/syncd

SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization

127 (+170%)

mit

Westlake-AGI-Lab/Awesome-Style-Transfer-with-Diffusion-Models

A curated list of recent style transfer methods with diffusion models

45 (+125%)

SamurAIGPT/AI-Influencer-Generator

Create and customize your AI influencer open-source

73 (+70%)

mit

Westlake-AGI-Lab/StyleStudio

[CVPR 2025] Official implementation of StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements

74 (+61%)

Lightricks/ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

926 (+25%)

apache-2.0

byliutao/1Prompt1Story

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

226 (+23%)

mit

TrustGen/TrustEval-toolkit

TrustEval: A modular and extensible toolkit for comprehensive trust evaluation of generative foundation models (GenFMs)

90 (+22%)

mit

PaddlePaddle/PaddleMIX

602 (+21%)

apache-2.0

aigem/cf-flux-remix

Generate images for free with FLUX.1 on Workers AI (frontend + API); extremely simple to deploy on CF Workers

81 (+21%)

xuyang-liu16/Awesome-Generation-Acceleration

📚 Collection of awesome generation acceleration resources.

180 (+21%)

snap-research/stable-flow

Official implementation for "Stable Flow: Vital Layers for Training-Free Image Editing" [CVPR 2025]

326 (+14%)

ashbuilds/payload-ai

AI Plugin is a powerful extension for the Payload CMS, integrating advanced AI capabilities to enhance content creation and management.

154 (+11%)

xiefan-guo/initno

[CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization

55 (+10%)

apache-2.0

Last 12 months (new repositories)

SamurAIGPT/AI-Youtube-Shorts-Generator

A Python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

2,038

mit

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,227

apache-2.0

FoundationVision/Infinity

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

1,036

mit

THUDM/CogView4

CogView4, CogView3-Plus and CogView3(ECCV 2024)

952

apache-2.0

Lightricks/ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

926

apache-2.0

lehduong/OneDiffusion

Official implementation of OneDiffusion paper (CVPR 2025)

618

gojasper/flash-diffusion

⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)

569

SamurAIGPT/Text-To-Video-AI

Generate video from text using AI

470

mit

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

460

open-mmlab/StyleShot

StyleShot: A SnapShot on Any Style. A model that transfers any style to any content and generates high-quality, personalized stylized images without per-image fine-tuning!

366

mit

sayakpaul/diffusers-torchao

End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).

335

apache-2.0

snap-research/stable-flow

Official implementation for "Stable Flow: Vital Layers for Training-Free Image Editing" [CVPR 2025]

326

FoundationVision/Liquid

Liquid: Language Models are Scalable and Unified Multi-modal Generators

308

mit

viiika/Meissonic

[ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

296

apache-2.0

SamurAIGPT/AI-Faceless-Video-Generator

Generate a video script, voice and a talking face completely with AI

267

mit

HolmesShuan/FireFlow-Fast-Inversion-of-Rectified-Flow-for-Image-Semantic-Editing

An 8-step inversion and 8-step editing process works effectively with the FLUX-dev model. (3x speedup with results that are comparable or even superior to baseline methods)

249

apache-2.0

byliutao/1Prompt1Story

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

226

mit

KwokKwok/Silo

Multi-model simultaneous chat and text-to-image generation, all done through pure front-end (API mode, no server-side needed).

220

mit

FoundationVision/UniTok

A Unified Tokenizer for Visual Generation and Understanding

216

mit

XiaomingX/awesome-qwen-prompt-insight

205

mit

Last 12 months (absolute gain)

SamurAIGPT/AI-Youtube-Shorts-Generator

A Python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

2,038 (+2,034)

mit

TencentARC/BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

1,533 (+1,330)

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,227 (+1,220)

apache-2.0

promptslab/Awesome-Prompt-Engineering

This repository contains hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.

4,276 (+1,185)

apache-2.0

FoundationVision/Infinity

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

1,036 (+1,035)

mit

THUDM/CogView4

CogView4, CogView3-Plus and CogView3(ECCV 2024)

952 (+897)

apache-2.0

Lightricks/ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

926 (+880)

apache-2.0

filipecalegario/awesome-generative-ai

A curated list of Generative AI tools, works, models, and references

2,749 (+872)

cc0-1.0

FurkanGozukara/Stable-Diffusion

2,361 (+690)

gpl-3.0

finegrain-ai/refiners

A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation

815 (+600)

mit

lehduong/OneDiffusion

Official implementation of OneDiffusion paper (CVPR 2025)

618 (+580)

gojasper/flash-diffusion

⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)

569 (+567)

YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy

Diffusion model papers, survey, and taxonomy

3,144 (+541)

lucidrains/DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch

11,243 (+499)

mit

lucidrains/imagen-pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch

8,216 (+494)

mit

AlonzoLeeeooo/awesome-text-to-image-studies

A collection of awesome text-to-image generation studies.

568 (+486)

mit

Yutong-Zhou-cv/Awesome-Text-to-Image

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

2,300 (+470)

mit

SamurAIGPT/Text-To-Video-AI

Generate video from text using AI

470 (+469)

mit

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

460 (+459)

PRIV-Creation/Awesome-Controllable-T2I-Diffusion-Models

A collection of resources on controllable generation with text-to-image diffusion models.

1,022 (+454)

mit

Last 12 months (relative gain)

SamurAIGPT/AI-Youtube-Shorts-Generator

A Python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

2,038 (+50,850%)

mit

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

1,227 (+17,429%)

apache-2.0

open-mmlab/StyleShot

StyleShot: A SnapShot on Any Style. A model that transfers any style to any content and generates high-quality, personalized stylized images without per-image fine-tuning!

366 (+7,220%)

mit

SamurAIGPT/AI-Faceless-Video-Generator

Generate a video script, voice and a talking face completely with AI

267 (+6,575%)

mit

xuyang-liu16/Awesome-Generation-Acceleration

📚 Collection of awesome generation acceleration resources.

180 (+4,400%)

byliutao/1Prompt1Story

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

226 (+2,160%)

mit

Lightricks/ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

926 (+1,913%)

apache-2.0

SamurAIGPT/AI-Influencer-Generator

Create and customize your AI influencer open-source

73 (+1,725%)

mit

FoundationVision/UniTok

A Unified Tokenizer for Visual Generation and Understanding

216 (+1,700%)

mit

THUDM/CogView4

CogView4, CogView3-Plus and CogView3(ECCV 2024)

952 (+1,631%)

apache-2.0

lehduong/OneDiffusion

Official implementation of OneDiffusion paper (CVPR 2025)

618 (+1,526%)

YingqingHe/Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

446 (+1,493%)

volume988/flux-ai-image-webui

flux ai image generator web ui

105 (+1,400%)

mit

XiaomingX/awesome-qwen-prompt-insight

205 (+1,039%)

mit

ktutak1337/Stellar-Chat

A versatile multi-modal chat application that enables users to develop custom agents, create images, leverage visual recognition, and engage in voice interactions. It integrates seamlessly with local ...

124 (+1,027%)

agpl-3.0

sayakpaul/diffusers-torchao

End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).

335 (+915%)

apache-2.0

LetterLiGo/SafeGen_CCS2024

[CCS'24] SafeGen: Mitigating Unsafe Content Generation in Text-to-Image Models

129 (+892%)

apache-2.0

CFGpp-diffusion/CFGpp

Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models" (ICLR2025)

193 (+777%)

TencentARC/BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

1,533 (+655%)

customdiffusion360/custom-diffusion360

CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control

165 (+650%)
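
Note on how the two gain figures relate: the listing does not state its formula, but the relative-gain percentages appear consistent with dividing the absolute gain by the star count at the start of the period (current stars minus gain). The sketch below assumes that formula; the function name `relative_gain_percent` is hypothetical and only illustrates the arithmetic, using figures from the 12-month sections above.

```python
def relative_gain_percent(current_stars: int, absolute_gain: int) -> float:
    """Percentage gain relative to the star count at the start of the period.

    Assumption: starting stars = current_stars - absolute_gain. The formula is
    inferred from the listed figures, not documented by the listing itself.
    """
    starting_stars = current_stars - absolute_gain
    if starting_stars <= 0:
        # A repository with no stars at the start has no defined relative gain.
        raise ValueError("relative gain is undefined for a zero starting count")
    return 100.0 * absolute_gain / starting_stars


# Figures taken from the 12-month absolute-gain entries above.
print(round(relative_gain_percent(2038, 2034)))  # AI-Youtube-Shorts-Generator
print(round(relative_gain_percent(1227, 1220)))  # InfiniteYou
print(round(relative_gain_percent(952, 897)))    # CogView4
```

Under this assumption, repositories that started the period with only a handful of stars show very large percentages, which is why the relative-gain rankings favor small or new projects.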