Search Results - RepositoryStats

InternVideo OpenGVLab

106

1.8k

apache-2.0

27

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Created 2022-11-23

245 commits to main branch, last one 26 days ago

Awsome-Deep-Learning-for-Video-Analysis HuaizhengZhang

171

788

mit

33

Papers, code and datasets about deep learning and multi-modal learning for video analysis

paper deep-learning video-dataset video-analysis machine-learning multimodal-learning video-classification

Created 2017-06-14

91 commits to master branch, last one 3 years ago

DriveAGI OpenDriveLab

32

697

apache-2.0

32

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System

embodied-ai world-models large-dataset video-dataset policy-learning foundation-model video-generation autonomous-driving general-artificial-intelligence

Created 2023-04-24

125 commits to main branch, last one 2 months ago

Video-Dataset-Loading-Pytorch RaivoKoot

44

459

bsd-2-clause

5

Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.

videos pytorch dataloader deep-learning video-dataset machine-learning action-recognition

Created 2020-11-13

97 commits to main branch, last one 2 years ago

VideoPainter TencentARC

17

276

other

7

Any-length Video Inpainting and Editing with Plug-and-Play Context Control

video video-dataset video-editing video-inpainting

Created 2025-03-09

16 commits to main branch, last one a day ago

Awesome_Long_Form_Video_Understanding ttengwang

12

262

unknown

10

Awesome papers & datasets specifically focused on long-term videos.

video-llms video-dataset long-term-video video-grounding dense-video-captioning temporal-action-detection temporal-sentence-grounding video-large-language-models temporal-action-localization video-representation-learning audio-visual-event-localization

Created 2022-07-11

47 commits to main branch, last one 4 months ago

video_captioning_datasets jssprz

12

121

unknown

2

Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*

msvd vatex review msr-vtt trecvid charades tgif-dataset video-dataset video-to-text state-of-the-art video-captioning video-description vision-and-language activitynet-captions

Created 2021-03-12

24 commits to main branch, last one 2 years ago

SoccerAct10 ascuet

0

95

unknown

1

SoccerAct10 is a dataset which contains 10 different soccer actions. This dataset was developed using the videos from YouTube.

video-dataset sports-analysis football-dataset action-recognition video-classification sports-classification keras-action-recognition video-action-recognition action-recognition-dataset pytorch-action-recognition sports-recognition-dataset fine-grained-classification football-action-recognition soccer-video-classification sports-video-classification football-action-classification soccer-activity-classification

Created 2023-04-18

6 commits to main branch, last one about a year ago

pytorch-VideoDataset YuxinZhaozyx

19

69

mit

2

Tools for loading video dataset and transforms on video in pytorch. You can directly load video files without preprocessing.

video dataset pytorch transforms preprocessing video-dataset

Created 2019-08-03

3 commits to master branch, last one 5 years ago

Video-Quality-Assessment-A-Comprehensive-Survey taco-group

1

68

unknown

0

The Most Comprehensive Survey of Video Quality Assessment to Date.

iqa vqa vision visual video-dataset video-quality perceptual-models video-compression visual-perception video-understanding image-quality-assessment video-quality-assessment

Created 2024-12-09

2 commits to main branch, last one 3 months ago

BSCV-Dataset LIUTIGHE

16

35

unknown

2

Official repository for the paper titled "Bitstream-corrupted Video Recovery: A Novel Benchmark Dataset and Method", accepted by NeurIPS 2023 Dataset and Benchmark Track

video-dataset computer-vision image-processing low-level-vision video-inpainting video-processing video-restoration

Created 2023-06-07

183 commits to main branch, last one 8 months ago

VideoSwin innat

4

33

apache-2.0

2

Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling

keras torch tensorflow video-dataset video-classification

Created 2023-09-28

173 commits to main branch, last one 3 months ago

MMWorld eric-ai-lab

1

25

mit

2

Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"

evaluation world-model video-dataset multi-disciplinary video-understanding multimodal-large-language-models

Created 2024-06-11

17 commits to main branch, last one 6 months ago