6 results found Sort:
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Created
2021-04-13
29 commits to master branch, last one 2 years ago
An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
Created
2020-10-30
20 commits to main branch, last one 2 years ago
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
Created
2022-09-19
4 commits to main branch, last one 2 years ago
Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Created
2022-05-20
17 commits to main branch, last one 2 years ago
A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.
Created
2021-01-28
133 commits to main branch, last one about a year ago
Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy
Created
2019-06-03
55 commits to master branch, last one 3 years ago