2 results found
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Created 2018-06-27
1,099 commits to main branch, last one about a month ago
Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.
Created 2020-10-05
19 commits to main branch, last one 3 years ago