2 results found Sort:

925
5.4k
other
115
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Created 2018-06-27
1,097 commits to main branch, last one about a month ago
Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.
Created 2020-10-05
19 commits to main branch, last one 2 years ago