2 results found Sort:

936
5.5k
other
114
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Created 2018-06-27
1,099 commits to main branch, last one about a month ago
Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.
Created 2020-10-05
19 commits to main branch, last one 3 years ago