Trending repositories for topic aigc
Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds
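AigcPanel is a simple, easy-to-use, all-in-one AI digital human system. It supports video synthesis, voice synthesis, and voice cloning, and simplifies local model management with one-click import and use of AI models.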
Official code of DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
🧙AutoDev: The AI-powered coding wizard (AI-driven programming assistant) with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing...
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
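AIGC-interview/CV-interview/LLMs-interview: a collection of interview questions and answers, also including new ideas, new problems, new resources, and new projects from work and research.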
Official Implementation of Video-T1: Test-Time Scaling for Video Generation
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
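[🔞🔞🔞 Contains images unsuitable for minors] AI explorations and summaries built on my strengths in programming, painting, and writing: StableDiffusion is a powerful image generation model that can produce new images by evolving an existing one. ChatGPT is a Transformer-based language generation model that can automatically generate a suitable article for a given topic. GitHub Copilot is an intelligent programming assistant that speeds up everyday programming.
[Three Years of Interviews, Five Years of Mock Exams] An interview handbook for AI algorithm engineers. Covers interview and written-test experience and practical knowledge across the AI industry, including AIGC, traditional deep learning, autonomous driving, machine learning, computer vision, natural language processing, reinforcement learning, embodied intelligence, the metaverse, and AGI.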
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio app...
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一...
Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation
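[Zero-code platform + AI application platform & knowledge base] 敲敲云 is a free, new-generation zero-code product that combines an AI application development platform with a zero-code platform, helping enterprises quickly build customized business applications. Users can build applications that fit their business needs without writing any code. 敲敲云 provides complete application-building capabilities, a form engine, a workflow engine, and a dashboard engine, covering an enterprise's day-to-day needs.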
Official implementation of DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
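This repository mainly collects plugins for CoW (chatgpt-on-wechat) and DoW (dify-on-wechat). Additions of plugins you have seen, used, or newly developed are welcome.
"Hands-On SpringAI": covers SSE streaming/Agents/knowledge graph RAG/FunctionCall/message history/image generation/image understanding/Embedding/VectorDatabase/RAG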
🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.
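A multi-function ComfyUI plugin bundle (prompt translation, prompt polishing, texture editor, model reference repair, and more). It lets any long-text input box in ComfyUI accept Chinese input with automatic translation, and adds error-message translation (via Baidu Translate), giving you translation freedom. It also connects to large AI models for prompt polishing. For the other plugin features, see the plugin introduction.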
A Collection of Papers and Codes for CVPR2025/CVPR2024/ECCV2024 AIGC
AnimationGPT: An AIGC tool for generating game combat motion assets
It's not AI that takes away your job, but the people who master the use of AI tools. The most deadly attack is a dimension-reducing strike: destroying you has nothing to do with you - from "The Three-...
A curated list of recent style transfer methods with diffusion models
Compoder is an open-source AI-powered component code generation engine that integrates modern frontend tech stacks with various AI model capabilities. You can customize Compoder to create AI-powered c...
[BMVC 2024] Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion
ComfyUI custom nodes and web utilities for real-time AI generation and interaction
[Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
Wan: Open and Advanced Large-Scale Video Generative Models
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
[ICLR2025] The code of Z-Sampling, proposed in our paper "Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflection".
Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Generative Models" and Awesome-Text2X-Resources. Watch this reposito...
The official code of "Weak-to-Strong Diffusion with Reflection".
[NeurIPS 2024] IMAGPose: A Unified Conditional Framework for Pose-Guided Person Generation. IMAGPose enables versatile pose-guided image generation with high detail fidelity, pose alignment, and cross...
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSee...
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝
Code of [CVPR2025] AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction
Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).
[NeurIPS 2024] MeshXL: Neural Coordinate Field for Generative 3D Foundation Models, a 3D fundamental model for mesh generation
Lumina-T2X is a unified framework for Text to Any Modality Generation
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Collection of Open Source Projects Related to GPT 🚀, curated 🔥🔥
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
To support and further research in the field of portrait animation, we are excited to launch PhotoPoster, an open project for pose-driven image generation.
Official code base for paper EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidance
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).