Trending repositories for topic video-generation
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
A collection of awesome video generation studies.
Adaptive Caching for Faster Video Generation with Diffusion Transformers
You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".
[NeurIPS 2024] Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
[NeurIPS 2024] A Generalizable World Model for Autonomous Driving
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation
Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".
Adaptive Caching for Faster Video Generation with Diffusion Transformers
[NeurIPS 2024] Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
A collection of awesome video generation studies.
You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
Pytorch Implementation of FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing (ICLR 2024)
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
[NeurIPS 2024] A Generalizable World Model for Autonomous Driving
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
Adaptive Caching for Faster Video Generation with Diffusion Transformers
Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.
A collection of awesome video generation studies.
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System
Adaptive Caching for Faster Video Generation with Diffusion Transformers
Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation
Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
A repository for organizing papers, codes and other resources related to Virtual Try-on Models
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
A collection of awesome video generation studies.
JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
[CVPR 2024] On the Content Bias in Fréchet Video Distance
You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation
[NeurIPS 2024] Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Adaptive Caching for Faster Video Generation with Diffusion Transformers
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Adaptive Caching for Faster Video Generation with Diffusion Transformers
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.
Official implementation of 'Motion Inversion For Video Customization'
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
A repository for organizing papers, codes and other resources related to Virtual Try-on Models
Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".
Adaptive Caching for Faster Video Generation with Diffusion Transformers
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
Official implementation of 'Motion Inversion For Video Customization'
📚 Collection of awesome generation acceleration resources.
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!
[NeurIPS 2024] Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation (ECCV 2024)
Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
A collection of awesome video generation studies.
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
[SIGGRAPH ASIA 2024 TCS] AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
Fine-Grained Open Domain Image Animation with Motion Guidance
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
[SIGGRAPH ASIA 2024 TCS] AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
Sora AI Awesome List – Your go-to resource hub for all things Sora AI, OpenAI's groundbreaking model for crafting realistic scenes from text. Explore a curated collection of articles, videos, podcasts...
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation (ECCV 2024)
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)
To support and further the research in the field of portrait animation , we are excited to launch PhotoPoster, an open project for pose-driven image generation.
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
KandinskyVideo — multilingual end-to-end text2video latent diffusion model
[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation
[CVPR 2024] On the Content Bias in Fréchet Video Distance