22 results found Sort:

214
1.5k
apache-2.0
25
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )
Created 2022-08-26
272 commits to master branch, last one 3 months ago
137
1.4k
other
16
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Created 2022-03-23
64 commits to main branch, last one about a year ago
255
1.2k
mit
25
High-performance multiple object tracking based on YOLO, Deep SORT, and KLT 🚀
Created 2020-01-30
597 commits to master branch, last one 4 months ago
Papers, code and datasets about deep learning and multi-modal learning for video analysis
Created 2017-06-14
91 commits to master branch, last one 3 years ago
61
478
apache-2.0
19
SiamMOT: Siamese Multi-Object Tracking
Created 2021-05-20
19 commits to main branch, last one about a year ago
Code release for ActionFormer (ECCV 2022)
Created 2021-11-24
68 commits to main branch, last one 7 months ago
Official implementation of Paper Future Frame Prediction for Anomaly Detection -- A New Baseline, CVPR 2018
Created 2018-03-13
33 commits to master branch, last one 10 months ago
Library with dynamic audio/video composition and runtime control
This repository has been archived (exclude archived)
Created 2018-10-20
171 commits to master branch, last one 3 years ago
45
220
unknown
8
Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
Created 2020-03-08
20 commits to master branch, last one about a year ago
[NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection
Created 2021-11-18
8 commits to main branch, last one about a year ago
A curated list of awesome self-supervised learning methods in videos
Created 2023-04-04
100 commits to main branch, last one 14 days ago
⚡ 一款用于自动语音识别 (ASR)、翻译的高性能异步 API。不需要购买Whisper API,使用本地运行的Whisper模型进行推理,并支持多GPU并发,针对分布式部署进行设计。还内置了包括TikTok、抖音等社交媒体平台的爬虫,可实现来自多个社交平台的无缝媒体处理,为媒体内容数据自动化处理提供了强大且可扩展的解决方案。
Created 2024-10-23
67 commits to main branch, last one 12 days ago
An official TensorFlow implementation of "Neural Program Synthesis from Diverse Demonstration Videos" (ICML 2018) by Shao-Hua Sun, Hyeonwoo Noh, Sriram Somasundaram, and Joseph J. Lim
Created 2018-06-23
26 commits to master branch, last one 6 years ago
To keep updates with VRU Grand Challenge, please use https://github.com/NExTplusplus/VidVRD-helper
Created 2017-08-12
53 commits to master branch, last one 2 years ago
A short script showing how to build simple real-time video analytics apps using YOLOv8 and Supervision. Try it out, and most importantly have fun! 🤪
Created 2023-02-09
9 commits to master branch, last one about a year ago
[MM'21] Former-DFER: Dynamic Facial Expression Recognition Transformer
Created 2021-02-26
18 commits to main branch, last one 2 years ago
Research materials about multimedia network and system, including paper list, tools, etc.
Created 2021-04-15
44 commits to main branch, last one 3 years ago
9
60
unknown
3
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Prediction"
Created 2019-12-15
22 commits to master branch, last one 4 years ago
智能视频分析:视频目标检测,视频人群计数
Created 2020-06-07
9 commits to master branch, last one 8 months ago