Trending repositories for topic video-processing
Cross-platform, customizable ML solutions for live and streaming media.
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRN...
An open, scalable, online streaming setup. All-in-one toolkit from ingest to adaptive video playback. Built for developers in need of video tooling.
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
VisionFlow is a backend application for image recognition and notification systems, built with Flask, PostgreSQL, Redis, and Docker. It supports multi-camera setups, real-time image processing, and pr...
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
ESP32 Camera motion capture application to record JPEGs to SD card as AVI files and stream to browser as MJPEG. If a microphone is installed then a WAV file is also created. Files can be uploaded via ...
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )
FFmpeg libav tutorial - learn how media works from basic to transmuxing, transcoding and more. Translations: 🇺🇸 🇨🇳 🇰🇷 🇪🇸 🇻🇳 🇧🇷
A self-hosted platform to use HandBrake on your headless devices via a bespoke web interface. Harness the processing power of multiple devices to work on a single queue.
An editing tool that uses AI to transcribe, understand content and search for anything in your footage, integrated with ChatGPT and other AI models
A High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features :fire:
VisionFlow is a backend application for image recognition and notification systems, built with Flask, PostgreSQL, Redis, and Docker. It supports multi-camera setups, real-time image processing, and pr...
[ NeurIPS 2024 ] The official PyTorch implementation for Learning Truncated Causal History Model for Video Restoration.
Blind video watermarking with great invisibility and robustness.
WACV2024 - Scale-Adaptive Feature Aggregation for Efficient Space-Time Video Super-Resolution
A self-hosted platform to use HandBrake on your headless devices via a bespoke web interface. Harness the processing power of multiple devices to work on a single queue.
An open, scalable, online streaming setup. All-in-one toolkit from ingest to adaptive video playback. Built for developers in need of video tooling.
A cross-platform High-performance FFmpeg based Real-time Video Frames Decoder in Pure Python 🎞️⚡
ESP32 Camera motion capture application to record JPEGs to SD card as AVI files and stream to browser as MJPEG. If a microphone is installed then a WAV file is also created. Files can be uploaded via ...
Video to Text: Natural language description generator for some given video. [Video Captioning]
DrawingBotV3 is a software for converting images into vector art
An editing tool that uses AI to transcribe, understand content and search for anything in your footage, integrated with ChatGPT and other AI models
YoloDotNet - A C# .NET 8.0 project for Classification, Object Detection, OBB Detection, Segmentation and Pose Estimation in both images and videos.
The free and open source solution for automated video capture and distribution at scale.
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )
Cross-platform, customizable ML solutions for live and streaming media.
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRN...
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
An open, scalable, online streaming setup. All-in-one toolkit from ingest to adaptive video playback. Built for developers in need of video tooling.
A self-hosted platform to use HandBrake on your headless devices via a bespoke web interface. Harness the processing power of multiple devices to work on a single queue.
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
FFmpeg libav tutorial - learn how media works from basic to transmuxing, transcoding and more. Translations: 🇺🇸 🇨🇳 🇰🇷 🇪🇸 🇻🇳 🇧🇷
ESP32 Camera motion capture application to record JPEGs to SD card as AVI files and stream to browser as MJPEG. If a microphone is installed then a WAV file is also created. Files can be uploaded via ...
A High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features :fire:
AI Productivity Tool - Free and open-source, enhancing user productivity while ensuring privacy and data security. It provides efficient and convenient AI solutions, including but not limited to: buil...
VisionFlow is a backend application for image recognition and notification systems, built with Flask, PostgreSQL, Redis, and Docker. It supports multi-camera setups, real-time image processing, and pr...
[ NeurIPS 2024 ] The official PyTorch implementation for Learning Truncated Causal History Model for Video Restoration.
VisionFlow is a backend application for image recognition and notification systems, built with Flask, PostgreSQL, Redis, and Docker. It supports multi-camera setups, real-time image processing, and pr...
A self-hosted platform to use HandBrake on your headless devices via a bespoke web interface. Harness the processing power of multiple devices to work on a single queue.
High-performance, optimized pre-trained template AI application pipelines for systems using Hailo devices
Crispy is a machine-learning platform to make video-games montages efficiently. It uses a neural network to detect highlights in the video-game frames
Clips AI is an open-source Python library that automatically converts long videos into clips.
[ACM MM'24 Oral] RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining
Blind video watermarking with great invisibility and robustness.
Official PyTorch implementation of Scalable Neural Video Representations with Learnable Positional Features (NeurIPS 2022).
An open, scalable, online streaming setup. All-in-one toolkit from ingest to adaptive video playback. Built for developers in need of video tooling.
Advanced Swift library for media conversion and manipulation
A cross-platform High-performance FFmpeg based Real-time Video Frames Decoder in Pure Python 🎞️⚡
AI Productivity Tool - Free and open-source, enhancing user productivity while ensuring privacy and data security. It provides efficient and convenient AI solutions, including but not limited to: buil...
[EMNLP2024 Demo] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
Video to Text: Natural language description generator for some given video. [Video Captioning]
An open, scalable, online streaming setup. All-in-one toolkit from ingest to adaptive video playback. Built for developers in need of video tooling.
Cross-platform, customizable ML solutions for live and streaming media.
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRN...
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
FFmpeg libav tutorial - learn how media works from basic to transmuxing, transcoding and more. Translations: 🇺🇸 🇨🇳 🇰🇷 🇪🇸 🇻🇳 🇧🇷
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )
A self-hosted platform to use HandBrake on your headless devices via a bespoke web interface. Harness the processing power of multiple devices to work on a single queue.
A web-based video editing tool implemented with WebCodecs, similar to CapCut Web.使用webcodecs实现的Web端视频编辑工具,类似剪映Web版。
Compute over Data framework for public, transparent, and optionally verifiable computation
A High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features :fire:
An open, scalable, online streaming setup. All-in-one toolkit from ingest to adaptive video playback. Built for developers in need of video tooling.
[ NeurIPS 2024 ] The official PyTorch implementation for Learning Truncated Causal History Model for Video Restoration.
VisionFlow is a backend application for image recognition and notification systems, built with Flask, PostgreSQL, Redis, and Docker. It supports multi-camera setups, real-time image processing, and pr...
A self-hosted platform to use HandBrake on your headless devices via a bespoke web interface. Harness the processing power of multiple devices to work on a single queue.
[ACM MM'24 Oral] RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining
This repository contains code for deploying a Gradio application using the SAM2 model for video processing. The application allows users to interact with the model through a user-friendly web interfac...
High-performance, optimized pre-trained template AI application pipelines for systems using Hailo devices
Crispy is a machine-learning platform to make video-games montages efficiently. It uses a neural network to detect highlights in the video-game frames
[EMNLP2024 Demo] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
GUI application for Anime4K shaders which allows to save upscaled video to disk
Advanced Swift library for media conversion and manipulation
Unlock the potential of Google's Gemini AI models with this versatile toolkit. Offering seamless chat, text generation, and multimodal interactions, supporting various file types, including PDF's, ima...
Swap faces in images and videos. Create face embeddings. Enhance face image quality. Deploy as a web api.
A web-based video editing tool implemented with WebCodecs, similar to CapCut Web.使用webcodecs实现的Web端视频编辑工具,类似剪映Web版。
An open, scalable, online streaming setup. All-in-one toolkit from ingest to adaptive video playback. Built for developers in need of video tooling.
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
AI Productivity Tool - Free and open-source, enhancing user productivity while ensuring privacy and data security. It provides efficient and convenient AI solutions, including but not limited to: buil...
A web-based video editing tool implemented with WebCodecs, similar to CapCut Web.使用webcodecs实现的Web端视频编辑工具,类似剪映Web版。
A self-hosted platform to use HandBrake on your headless devices via a bespoke web interface. Harness the processing power of multiple devices to work on a single queue.
Clips AI is an open-source Python library that automatically converts long videos into clips.
[EMNLP2024 Demo] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
CVPR 2024-Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention
VisionFlow is a backend application for image recognition and notification systems, built with Flask, PostgreSQL, Redis, and Docker. It supports multi-camera setups, real-time image processing, and pr...
VideoAlchemy is a toolkit expanding video processing capabilities, emphasizing FFmpeg and broader video technology applications.
[ACM MM'24 Oral] RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining
Vikit.ai SDK let you develop easily video generators leveraging generative AI and other AI models.
Unlock the potential of Google's Gemini AI models with this versatile toolkit. Offering seamless chat, text generation, and multimodal interactions, supporting various file types, including PDF's, ima...
Swap faces in images and videos. Create face embeddings. Enhance face image quality. Deploy as a web api.
Cross-platform, customizable ML solutions for live and streaming media.
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRN...
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )
An open, scalable, online streaming setup. All-in-one toolkit from ingest to adaptive video playback. Built for developers in need of video tooling.
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
FFmpeg libav tutorial - learn how media works from basic to transmuxing, transcoding and more. Translations: 🇺🇸 🇨🇳 🇰🇷 🇪🇸 🇻🇳 🇧🇷
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
AI Productivity Tool - Free and open-source, enhancing user productivity while ensuring privacy and data security. It provides efficient and convenient AI solutions, including but not limited to: buil...
A web-based video editing tool implemented with WebCodecs, similar to CapCut Web.使用webcodecs实现的Web端视频编辑工具,类似剪映Web版。
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
A web-based video editing tool implemented with WebCodecs, similar to CapCut Web.使用webcodecs实现的Web端视频编辑工具,类似剪映Web版。
A self-hosted platform to use HandBrake on your headless devices via a bespoke web interface. Harness the processing power of multiple devices to work on a single queue.
GUI application for Anime4K shaders which allows to save upscaled video to disk
[ECCV 2024] Elysium: Exploring Object-level Perception in Videos via MLLM
[EMNLP2024 Demo] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
Media server for real-time, low latency, programmable video and audio mixing.
FFmpeg Xfade easing and extensions: custom expressions; CSS easings; GL Transitions
Rankings include: BetterDepth ChronoDepth Depth Any Video Depth Anything Depth Pro DepthCrafter DPT FutureDepth GBDMF GenPercept GeoWizard LeReS LightedDepth Marigold Metric3D MiDaS MonST3R NeWCRFs NV...
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )
dgenerate is a scriptable command line tool (and library) for generating images and animation sequences using stable diffusion and related techniques, with an accompanying GUI scripting environment.
Advanced Swift library for media conversion and manipulation
🎞️ Crop, flip and trim videos right from your web browser.
Help cut and shorten your video by removing all low audio parts
Multiple python based scripts used for video and image editing