Trending repositories for topic aigc
Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
Compoder is an open-source AI-driven component code generation engine that integrates modern frontend tech stacks with various AI model capabilities. You can customize Compoder to create AI-powered co...
Official code of DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
🧙AutoDev: The AI-powered coding wizard(AI 驱动编程助手)with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing...
Official Implementation of Video-T1: Test-Time Scaling for Video Generation
Official implementation of DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
【三年面试五年模拟】AI算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、强化学习、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
It's not AI that takes away your job, but the people who master the use of AI tools. The most deadly attack is a dimension-reducing strike: destroying you has nothing to do with you - from "The Three-...
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
✨ Yao is an all-in-one application engine that enables developers to create web apps, REST APIs, business applications, and more, with AI as a development partner.
Official Implementation of Video-T1: Test-Time Scaling for Video Generation
Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds
Official implementation of DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
Official code of DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
A curated list of recent style transfer methods with diffusion models
这个仓库主要是收集CoW(chatgpt-on-wechat)与DoW(dify-on-wechat)的插件,欢迎补充加入看到、用过或新开发的插件。
Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Generative Models" and Awesome-Text2X-Resources. Watch this reposito...
[Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
Comfyui多功能聚合插件(提示词翻译,提示词润色,贴图编辑器,模型引用修复等),让comfyui任意长文本输入框支持中文输入并自动翻译/同时加入报错翻译功能(调用百度翻译),实现翻译自由!同时接入AI大模型实现提示词润色功能, 其它插件功能,请看插件介绍
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
🧙AutoDev: The AI-powered coding wizard(AI 驱动编程助手)with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing...
【三年面试五年模拟】AI算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、强化学习、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds
Official code of DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
🧙AutoDev: The AI-powered coding wizard(AI 驱动编程助手)with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing...
Compoder is an open-source AI-driven component code generation engine that integrates modern frontend tech stacks with various AI model capabilities. You can customize Compoder to create AI-powered co...
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Official Implementation of Video-T1: Test-Time Scaling for Video Generation
Official implementation of DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
【三年面试五年模拟】AI算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、强化学习、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation
Official code of DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Official Implementation of Video-T1: Test-Time Scaling for Video Generation
Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds
Official implementation of DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
[Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
A curated list of recent style transfer methods with diffusion models
[ICLR2025] The code of Z-Sampling, proposed in our paper "Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflection".
这个仓库主要是收集CoW(chatgpt-on-wechat)与DoW(dify-on-wechat)的插件,欢迎补充加入看到、用过或新开发的插件。
Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation
《动手学SpringAI》包含SSE流/Agent智能体/知识图谱RAG/FunctionCall/历史消息/图片生成/图片理解/Embedding/VectorDatabase/RAG
Comfyui多功能聚合插件(提示词翻译,提示词润色,贴图编辑器,模型引用修复等),让comfyui任意长文本输入框支持中文输入并自动翻译/同时加入报错翻译功能(调用百度翻译),实现翻译自由!同时接入AI大模型实现提示词润色功能, 其它插件功能,请看插件介绍
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
【AI应用开发平台+AI知识库+零代码平台】敲敲云是一款免费的AI应用开发平台与零代码平台结合的新一代零码产品,帮助企业快速搭建个性化业务应用!用户无需任何代码,即可搭建出符合业务需求的个性化应用。敲敲云拥有完善的应用搭建能力、表单引擎、流程引擎、仪表盘引擎,可满足企业的日常需求
Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
Official code of DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation
[Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
Official implementation of DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
Wan: Open and Advanced Large-Scale Video Generative Models
Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
Official code of DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
🧙AutoDev: The AI-powered coding wizard(AI 驱动编程助手)with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing...
Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation
【三年面试五年模拟】AI算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、强化学习、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
It's not AI that takes away your job, but the people who master the use of AI tools. The most deadly attack is a dimension-reducing strike: destroying you has nothing to do with you - from "The Three-...
Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
Wan: Open and Advanced Large-Scale Video Generative Models
Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation
Official code of DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Official Implementation of Video-T1: Test-Time Scaling for Video Generation
[Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
A curated list of recent style transfer methods with diffusion models
[ICLR2025] The code of Z-Sampling, proposed in our paper "Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflection".
这个仓库主要是收集CoW(chatgpt-on-wechat)与DoW(dify-on-wechat)的插件,欢迎补充加入看到、用过或新开发的插件。
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
The official code of "Weak-to-Strong Diffusion with Reflection".
Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Generative Models" and Awesome-Text2X-Resources. Watch this reposito...
[NeurIPS 2024] IMAGPose: A Unified Conditional Framework for Pose-Guided Person Generation. IMAGPose enables versatile pose-guided image generation with high detail fidelity, pose alignment, and cross...
《动手学SpringAI》包含SSE流/Agent智能体/知识图谱RAG/FunctionCall/历史消息/图片生成/图片理解/Embedding/VectorDatabase/RAG
【AI应用开发平台+AI知识库+零代码平台】敲敲云是一款免费的AI应用开发平台与零代码平台结合的新一代零码产品,帮助企业快速搭建个性化业务应用!用户无需任何代码,即可搭建出符合业务需求的个性化应用。敲敲云拥有完善的应用搭建能力、表单引擎、流程引擎、仪表盘引擎,可满足企业的日常需求
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Lumina-T2X is a unified framework for Text to Any Modality Generation
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSee...
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
Code of [CVPR2025] AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction
Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).
Official code of DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
[NeurIPS 2024] MeshXL: Neural Coordinate Field for Generative 3D Foundation Models, a 3D fundamental model for mesh generation
Comfyui多功能聚合插件(提示词翻译,提示词润色,贴图编辑器,模型引用修复等),让comfyui任意长文本输入框支持中文输入并自动翻译/同时加入报错翻译功能(调用百度翻译),实现翻译自由!同时接入AI大模型实现提示词润色功能, 其它插件功能,请看插件介绍
StyleLLM文风大模型:基于大语言模型的文本风格迁移项目。Text style transfer base on Large Language Model. #文字修饰 # 润色 #风格模仿
Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation
Wan: Open and Advanced Large-Scale Video Generative Models
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Lumina-T2X is a unified framework for Text to Any Modality Generation
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
🧙AutoDev: The AI-powered coding wizard(AI 驱动编程助手)with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing...
【三年面试五年模拟】AI算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、强化学习、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Collection of Open Source Projects Related to GPT,GPT相关开源项目合集🚀、精选🔥🔥
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds
Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
Comfyui多功能聚合插件(提示词翻译,提示词润色,贴图编辑器,模型引用修复等),让comfyui任意长文本输入框支持中文输入并自动翻译/同时加入报错翻译功能(调用百度翻译),实现翻译自由!同时接入AI大模型实现提示词润色功能, 其它插件功能,请看插件介绍
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
To support and further the research in the field of portrait animation , we are excited to launch PhotoPoster, an open project for pose-driven image generation.
Official code base for paper EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidance
【三年面试五年模拟】AI算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、强化学习、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.
AnimationGPT:An AIGC tool for generating game combat motion assets
这个仓库主要是收集CoW(chatgpt-on-wechat)与DoW(dify-on-wechat)的插件,欢迎补充加入看到、用过或新开发的插件。
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Wan: Open and Advanced Large-Scale Video Generative Models
Official Inplementation of 《WsiCaption: Multiple Instance Generation of Pathology Reports for Gigapixel Whole Slide Images》(MICCAI 2024 Oral/ Best Paper Candidate)