Search Results - RepositoryStats

bottom-up-attention peteanderson80

377

1.4k

mit

25

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

vqa caffe mscoco faster-rcnn mscoco-dataset image-captioning captioning-images visual-question-answering

Created 2017-05-26

55 commits to master branch, last one 4 years ago

pvse yalesong

23

134

mit

3

Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)

mrw-dataset tgif-dataset mscoco-dataset metric-learning cross-modal-retrieval

Created 2019-06-11

84 commits to master branch, last one about a year ago

mobile-segmentation sercant

20

111

apache-2.0

8

Real-time semantic image segmentation on mobile devices

android real-time mobilenetv2 shufflenet-v2 mscoco-dataset neural-network deeplab-v3-plus tensorflow-lite image-processing semantic-segmentation semantic-image-segmentation

Created 2018-11-12

151 commits to master branch, last one 5 years ago

Image-Caption RoyalSkye

26

76

mit

1

Using an LSTM or a Transformer to solve image captioning in PyTorch

pytorch cnn-lstm beam-search transformer mscoco-dataset encoder-decoder image-captioning attention-mechanism

Created 2020-03-01

27 commits to master branch, last one 3 years ago

image_captioning_with_transformers zarzouram

9

65

mit

1

PyTorch implementation of image captioning using a transformer-based model.

pytorch beam-search transformers mscoco-dataset encoder-decoder image-captioning transformer-pytorch transformers-models pytorch-implementation

Created 2021-09-26

43 commits to main branch, last one about a year ago

coco-caption peteanderson80

42

49

other

6

Adds the SPICE metric to the coco-caption evaluation server code

spice mscoco mscoco-dataset image-captioning captioning-images mscoco-image-dataset

Created 2016-06-19

114 commits to master branch, last one 6 years ago