Trending repositories for topic aigc
【三年面试五年模拟】算法工程师秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、SLAM、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一...
【🔞🔞🔞 内含不适合未成年人阅读的图片】基于我擅长的编程、绘画、写作展开的 AI 探索和总结:StableDiffusion 是一种强大的图像生成模型,能够通过对一张图片进行演化来生成新的图片。ChatGPT 是一个基于 Transformer 的语言生成模型,它能够自动为输入的主题生成合适的文章。而 Github Copilot 是一个智能编程助手,能够加速日常编程活动。
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
[ECCV 2024] Official Implementation of 《WSI-VQA: Interpreting Whole Slide Image by Generative Question Answering》
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
[NeurIPS 2024] MeshXL: Neural Coordinate Field for Generative 3D Foundation Models, a 3D fundamental model for mesh generation
AnimationGPT:An AIGC tool for generating game combat motion assets
[ECCV 2024] Official Implementation of 《WSI-VQA: Interpreting Whole Slide Image by Generative Question Answering》
Official Inplementation of 《WsiCaption: Multiple Instance Generation of Pathology Reports for Gigapixel Whole Slide Images》(MICCAI 2024 Oral/ Best Paper Candidate)
📚 Collection of awesome generation acceleration resources.
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
【三年面试五年模拟】算法工程师秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、SLAM、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
IDDM (Industrial, landscape, animate, spectrogram...), support DDPM, DDIM, PLMS, webui and multi-GPU distributed training. Pytorch实现,生成模型,扩散模型,分布式训练
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
[NeurIPS 2024] MeshXL: Neural Coordinate Field for Generative 3D Foundation Models, a 3D fundamental model for mesh generation
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
Comfyui万能翻译插件(不再需要添加各种节点,直接在原有节点上加入翻译功能),让comfyui任意长文本输入框支持中文输入并自动翻译/同时加入报错翻译功能(调用百度翻译),实现翻译自由!同时接入AI大模型实现提示词润色功能, 其它插件功能,请看插件介绍
This is an open collection of state-of-the-art (SOTA), novel Text to X (X can be everything) methods (papers, codes and datasets).
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
AnimationGPT:An AIGC tool for generating game combat motion assets
【三年面试五年模拟】算法工程师秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、SLAM、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一...
【🔞🔞🔞 内含不适合未成年人阅读的图片】基于我擅长的编程、绘画、写作展开的 AI 探索和总结:StableDiffusion 是一种强大的图像生成模型,能够通过对一张图片进行演化来生成新的图片。ChatGPT 是一个基于 Transformer 的语言生成模型,它能够自动为输入的主题生成合适的文章。而 Github Copilot 是一个智能编程助手,能够加速日常编程活动。
[ECCV 2024] Official Implementation of 《WSI-VQA: Interpreting Whole Slide Image by Generative Question Answering》
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
[NeurIPS 2024] MeshXL: Neural Coordinate Field for Generative 3D Foundation Models, a 3D fundamental model for mesh generation
AnimationGPT:An AIGC tool for generating game combat motion assets
[ECCV 2024] Official Implementation of 《WSI-VQA: Interpreting Whole Slide Image by Generative Question Answering》
Official Inplementation of 《WsiCaption: Multiple Instance Generation of Pathology Reports for Gigapixel Whole Slide Images》(MICCAI 2024 Oral/ Best Paper Candidate)
📚 Collection of awesome generation acceleration resources.
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
【三年面试五年模拟】算法工程师秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、SLAM、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
IDDM (Industrial, landscape, animate, spectrogram...), support DDPM, DDIM, PLMS, webui and multi-GPU distributed training. Pytorch实现,生成模型,扩散模型,分布式训练
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
[NeurIPS 2024] MeshXL: Neural Coordinate Field for Generative 3D Foundation Models, a 3D fundamental model for mesh generation
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
Comfyui万能翻译插件(不再需要添加各种节点,直接在原有节点上加入翻译功能),让comfyui任意长文本输入框支持中文输入并自动翻译/同时加入报错翻译功能(调用百度翻译),实现翻译自由!同时接入AI大模型实现提示词润色功能, 其它插件功能,请看插件介绍
This is an open collection of state-of-the-art (SOTA), novel Text to X (X can be everything) methods (papers, codes and datasets).
AnimationGPT:An AIGC tool for generating game combat motion assets
【三年面试五年模拟】算法工程师秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、SLAM、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
【🔞🔞🔞 内含不适合未成年人阅读的图片】基于我擅长的编程、绘画、写作展开的 AI 探索和总结:StableDiffusion 是一种强大的图像生成模型,能够通过对一张图片进行演化来生成新的图片。ChatGPT 是一个基于 Transformer 的语言生成模型,它能够自动为输入的主题生成合适的文章。而 Github Copilot 是一个智能编程助手,能够加速日常编程活动。
Collection of Open Source Projects Related to GPT,GPT相关开源项目合集🚀、精选🔥🔥
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一...
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generati...
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
Official code base for paper EZIGen: Enhancing zero-shot subject-driven image generation with precise subject encoding and decoupled guidance
📚 Collection of awesome generation acceleration resources.
Official Inplementation of 《WsiCaption: Multiple Instance Generation of Pathology Reports for Gigapixel Whole Slide Images》(MICCAI 2024 Oral/ Best Paper Candidate)
[NeurIPS 2024] Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
[ECCV 2024] Official Implementation of 《WSI-VQA: Interpreting Whole Slide Image by Generative Question Answering》
《动手学SpringAI》包含SSE流/Agent智能体/知识图谱RAG/FunctionCall/历史消息/图片生成/图片理解/Embedding/VectorDatabase/RAG
【三年面试五年模拟】算法工程师秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、SLAM、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
Comfyui万能翻译插件(不再需要添加各种节点,直接在原有节点上加入翻译功能),让comfyui任意长文本输入框支持中文输入并自动翻译/同时加入报错翻译功能(调用百度翻译),实现翻译自由!同时接入AI大模型实现提示词润色功能, 其它插件功能,请看插件介绍
🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
[NeurIPS 2024] MeshXL: Neural Coordinate Field for Generative 3D Foundation Models, a 3D fundamental model for mesh generation
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Lumina-T2X is a unified framework for Text to Any Modality Generation
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一...
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
[ECCV 2024] Single Image to 3D Textured Mesh in 10 seconds with Convolutional Reconstruction Model.
[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)
AI Productivity Tool - Free and open-source, enhancing user productivity while ensuring privacy and data security. It provides efficient and convenient AI solutions, including but not limited to: buil...
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
🧙AutoDev: The AI-powered coding wizard(AI 驱动编程助手)with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing...
Lumina-T2X is a unified framework for Text to Any Modality Generation
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Collection of Open Source Projects Related to GPT,GPT相关开源项目合集🚀、精选🔥🔥
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generati...
🍭 Lobe UI - an open-source UI component library for building AIGC web apps
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一...
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
[ECCV 2024] Single Image to 3D Textured Mesh in 10 seconds with Convolutional Reconstruction Model.
Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).
AnimationGPT:An AIGC tool for generating game combat motion assets
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
[CVPR 2024 Highlight] SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors
To support and further the research in the field of portrait animation , we are excited to launch PhotoPoster, an open project for pose-driven image generation.
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
📚 AIGC 求职面经、必备基础知识、提示词工程、ChatGPT、Stable Diffusion、Prompt、Embedding、Fintune 等 AIGC 求职你所需要知道的一切~
Comfyui万能翻译插件(不再需要添加各种节点,直接在原有节点上加入翻译功能),让comfyui任意长文本输入框支持中文输入并自动翻译/同时加入报错翻译功能(调用百度翻译),实现翻译自由!同时接入AI大模型实现提示词润色功能, 其它插件功能,请看插件介绍
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画
This is an open collection of state-of-the-art (SOTA), novel Text to X (X can be everything) methods (papers, codes and datasets).
🍭 Lobe UI - an open-source UI component library for building AIGC web apps