Trending repositories for topic text-to-image
This repository contains hand-curated resources for Prompt Engineering, with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.
A Python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
🔥🔥🔥 A curated list of papers on LLM-based multimodal generation (image, video, 3D and audio).
I'm back! Community-developed implementations of Meissonic. If you find it helpful, please consider giving a star ❤️
FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI,...
A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation
[ECCV 2024] The official implementation of the paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
A curated list of Generative AI tools, works, models, and references
A collection of resources on controllable generation with text-to-image diffusion models.
Generate a video script, voice and a talking face completely with AI
Text-to-image generation: CogView3-Plus and CogView3 (ECCV 2024)
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch
Layout preserving realistic interior design using text and image prompts
[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in PyTorch
Implementation of DreamBooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
[CVPR 2024] "MACE: Mass Concept Erasure in Diffusion Models" (Official Implementation)
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
A collection of awesome text-to-image generation studies.
Diffusion model papers, survey, and taxonomy
📚 Collection of awesome generation acceleration resources.
Multi-model simultaneous chat and text-to-image generation, all done in a pure front end (API mode, no server side needed).
[NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models"
StyleShot: A SnapShot on Any Style. A model that can transfer any style onto any content, generating high-quality, personalized stylized images without per-image fine-tuning.
AI Plugin is a powerful extension for the Payload CMS, integrating advanced AI capabilities to enhance content creation and management.
Official implementation of "ReNoise: Real Image Inversion Through Iterative Noising"
Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
[NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
[CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
Turn any face into a video game character, pixel art, claymation, 3D or toy
Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."
Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
An SDK/Python library for Automatic1111 to run state-of-the-art diffusion models
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models
[NeurIPS 2024] Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis
CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control