Trending repositories for topic image-generation
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations. Turn any online or local LLM into your personal, autonomous AI (e.g gpt, claude, ...
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The sol...
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simp...
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
A repository for organizing papers, codes and other resources related to Virtual Try-on Models
"Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances" (Official Implementation)
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
Unofficial implementation of Image Super-Resolution via Iterative Refinement by Pytorch
A repository for organizing papers, codes and other resources related to Virtual Try-on Models
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
"Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances" (Official Implementation)
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
📚 Collection of awesome generation acceleration resources.
Official Code Release for [SIGGRAPH 2024] DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"
Image Signal Processing (ISP) Guide. Learn all about the process of converting an image/video into digital form by performing tasks like noise reduction, filtering, auto exposure, autofocus, HDR corre...
[CVPR 2023] The official implementation of CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes"
[ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
Tiled Diffusion, MultiDiffusion, Mixture of Diffusers, and optimized VAE
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simp...
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations. Turn any online or local LLM into your personal, autonomous AI (e.g gpt, claude, ...
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The sol...
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simp...
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
A repository for organizing papers, codes and other resources related to Virtual Try-on Models
StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
"Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances" (Official Implementation)
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
Unofficial implementation of Image Super-Resolution via Iterative Refinement by Pytorch
A repository for organizing papers, codes and other resources related to Virtual Try-on Models
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
dgenerate is a scriptable command line tool (and library) for generating images and animation sequences using stable diffusion and related techniques, with an accompanying GUI scripting environment.
[ECAI 2024] Official code for "TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models".
Stable Diffusion WebUI Forge docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.
"Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances" (Official Implementation)
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
📚 Collection of awesome generation acceleration resources.
Official Code Release for [SIGGRAPH 2024] DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"
[CVPR 2023] The official implementation of CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes"
[ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
Tiled Diffusion, MultiDiffusion, Mixture of Diffusers, and optimized VAE
"Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances" (Official Implementation)
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
A repository for organizing papers, codes and other resources related to Virtual Try-on Models
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations. Turn any online or local LLM into your personal, autonomous AI (e.g gpt, claude, ...
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The sol...
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simp...
text to image to generation: CogView3-Plus and CogView3(ECCV 2024)
A powerful tool that translates ComfyUI workflows into executable Python code.
"Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances" (Official Implementation)
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
"Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances" (Official Implementation)
A Training-free Iterative Framework for Long Story Visualization
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
text to image to generation: CogView3-Plus and CogView3(ECCV 2024)
[MICCAI 2024, Oral] Official implementation of BrLP method from "Enhancing Spatiotemporal Disease Progression Models via Latent Diffusion and Prior Knowledge"
📚 Collection of awesome generation acceleration resources.
Stable Diffusion WebUI Forge docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.
🔴VERY LARGE AI TOOL LIST! 🔴 extensive collection of tools and resources covering a broad range of applications in the world of artificial intelligence (AI) and machine learning (ML). This list encom...
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
Official Implementation of KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024 main)
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simp...
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models
Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."
This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral
A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
Tiled Diffusion, MultiDiffusion, Mixture of Diffusers, and optimized VAE
[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
OmniTokenizer: one model and one weight for image-video joint tokenization.
To support and further the research in the field of portrait animation , we are excited to launch PhotoPoster, an open project for pose-driven image generation.
[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations. Turn any online or local LLM into your personal, autonomous AI (e.g gpt, claude, ...
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The sol...
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simp...
Official implementations for paper: Anydoor: zero-shot object-level image customization
StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generati...
Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0
A powerful tool that translates ComfyUI workflows into executable Python code.
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
To support and further the research in the field of portrait animation , we are excited to launch PhotoPoster, an open project for pose-driven image generation.
[Arxiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
Official Code Release for [SIGGRAPH 2024] DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
This is a documentation for Free AI API with Stable Diffusion XL, Playground-v2, Flux, PixArt, LLM, Text2GIF, Upscale and many other models.
Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"
Janky implementation of HiDiffusion for ComfyUI
Official implementations for paper: Anydoor: zero-shot object-level image customization
📚 Collection of awesome generation acceleration resources.
This is a pytorch implementation of Denoising Diffusion Implicit Models
A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."
AI Discord bot with Image Recognition/OCR Capabilities + Image Generation with SDXL For Free!!