Trending repositories for topic image-generation
Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g gpt4) or private, local LLMs (e.g llama3). Self-host locally or use our clou...
:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transfor...
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The s...
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly y...
Official implementations for paper: Anydoor: zero-shot object-level image customization
Desktop AI Assistant powered by GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, autonomous agents, code and command executio...
Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0
A powerful tool that translates ComfyUI workflows into executable Python code.
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generati...
A simple standalone viewer for reading prompts from Stable Diffusion generated image outside the webui.
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g gpt4) or private, local LLMs (e.g llama3). Self-host locally or use our clou...
Desktop AI Assistant powered by GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, autonomous agents, code and command executio...
[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"
[Arxiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
A powerful tool that translates ComfyUI workflows into executable Python code.
A simple standalone viewer for reading prompts from Stable Diffusion generated image outside the webui.
Using Copilot, Bing Image Creator and DALLE-3 on Discord bot.
A Survey on Text-to-Video Generation/Synthesis.
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
DiffusionFastForward: a free course and experimental framework for diffusion-based generative models
Tiled Diffusion, MultiDiffusion, Mixture of Diffusers, and optimized VAE
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g gpt4) or private, local LLMs (e.g llama3). Self-host locally or use our clou...
:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transfor...
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The s...
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly y...
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
Official implementations for paper: Anydoor: zero-shot object-level image customization
A powerful tool that translates ComfyUI workflows into executable Python code.
A simple standalone viewer for reading prompts from Stable Diffusion generated image outside the webui.
Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0
Desktop AI Assistant powered by GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, autonomous agents, code and command executio...
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generati...
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g gpt4) or private, local LLMs (e.g llama3). Self-host locally or use our clou...
Implement a MNIST(also minimal) version of denoising diffusion probabilistic model from scratch.The model only has 4.55MB.
[Arxiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"
Desktop AI Assistant powered by GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, autonomous agents, code and command executio...
Implementation of diffusion models in pytorch for custom training.
A powerful tool that translates ComfyUI workflows into executable Python code.
Tiled Diffusion, MultiDiffusion, Mixture of Diffusers, and optimized VAE
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
Idempotent Generative Network's unofficial pytorch implementation
A simple standalone viewer for reading prompts from Stable Diffusion generated image outside the webui.
[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
DALL·E 3 Playground (Unofficial) is used to play with OpenAI Image generation API - DALL·E 3
Image Signal Processing (ISP) Guide. Learn all about the process of converting an image/video into digital form by performing tasks like noise reduction, filtering, auto exposure, autofocus, HDR corre...
Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g gpt4) or private, local LLMs (e.g llama3). Self-host locally or use our clou...
:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transfor...
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The s...
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly y...
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
[Arxiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
A powerful tool that translates ComfyUI workflows into executable Python code.
Official implementations for paper: Anydoor: zero-shot object-level image customization
Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."
Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generati...
Desktop AI Assistant powered by GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, autonomous agents, code and command executio...
[Arxiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g gpt4) or private, local LLMs (e.g llama3). Self-host locally or use our clou...
Implement a MNIST(also minimal) version of denoising diffusion probabilistic model from scratch.The model only has 4.55MB.
Desktop AI Assistant powered by GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, autonomous agents, code and command executio...
Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."
[CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
This is a pytorch implementation of Denoising Diffusion Implicit Models
A minimal implementation of a denoising diffusion model in PyTorch.
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly y...
(CVPR 2024) 🧩 TokenCompose: Grounding Diffusion with Token-level Supervision
Tiled Diffusion, MultiDiffusion, Mixture of Diffusers, and optimized VAE
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey
Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
Official implementations for paper: Anydoor: zero-shot object-level image customization
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly y...
A powerful tool that translates ComfyUI workflows into executable Python code.
This repository contains a pure C++ ONNX implementation of multiple offline AI models, such as StableDiffusion (1.5 and XL), ControlNet, Midas, HED and OpenPose.
Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD)
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
Python library for designing and training your own Diffusion Models with PyTorch.
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processi...
a CLI utility/library for AnimateDiff stable diffusion generation
The official code of our ICCV2023 work: Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation
:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transfor...
Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g gpt4) or private, local LLMs (e.g llama3). Self-host locally or use our clou...
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The s...
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
Official implementations for paper: Anydoor: zero-shot object-level image customization
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly y...
StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generati...
Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux)
An extensible, easy-to-use, and portable diffusion web UI 👨🎨
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
Tracking and collecting papers/projects/others related to Segment Anything.
Official implementations for paper: Anydoor: zero-shot object-level image customization
A simple Windows / Xbox app for generating AI images with Stable Diffusion.
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processi...
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
The official code of our ICCV2023 work: Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation
Avatar Generation For Characters and Game Assets Using Deep Fakes
Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g gpt4) or private, local LLMs (e.g llama3). Self-host locally or use our clou...
A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)
[Arxiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
Desktop AI Assistant powered by GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, autonomous agents, code and command executio...
Implementation of Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models
A Survey on Text-to-Video Generation/Synthesis.
DreamCreature: Crafting Photorealistic Virtual Creatures from Imagination
A Unified Conditional Framework for Diffusion-based Image Restoration
A Discord chatbot / selfbot that allows users to talk to AI powered by OpenGPT or BARD. The AI runs on a genuine Discord account, not a bot account and has image detection alongside image generation.
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."