6 results found Sort:

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Created 2017-05-26
55 commits to master branch, last one 3 years ago
24
131
mit
4
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)
Created 2019-06-11
84 commits to master branch, last one 4 months ago
Real-time semantic image segmentation on mobile devices
Created 2018-11-12
151 commits to master branch, last one 4 years ago
Using LSTM or Transformer to solve Image Captioning in Pytorch
Created 2020-03-01
27 commits to master branch, last one 2 years ago
Adds SPICE metric to coco-caption evaluation server codes
Created 2016-06-19
114 commits to master branch, last one 6 years ago
Pytorch implementation of image captioning using transformer-based model.
Created 2021-09-26
43 commits to main branch, last one about a year ago