6 results found Sort:

117
814
mit
12
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Created 2021-04-13
29 commits to master branch, last one 2 years ago
54
332
mit
10
An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
Created 2020-10-30
20 commits to main branch, last one about a year ago
15
119
mit
2
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
Created 2022-09-19
4 commits to main branch, last one about a year ago
Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Created 2022-05-20
17 commits to main branch, last one about a year ago
A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.
Created 2021-01-28
133 commits to main branch, last one 11 months ago
Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy
Created 2019-06-03
55 commits to master branch, last one 2 years ago