8 results found Sort:

974
9.9k
bsd-3-clause
98
LAVIS - A One-stop Library for Language-Vision Intelligence
Created 2022-08-24
492 commits to main branch, last one 2 days ago
This repository is build in association with our position paper on "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As a part of this release we share the information...
Created 2022-01-09
51 commits to main branch, last one 2 years ago
13
211
unknown
5
Compose multimodal datasets 🎹
Created 2024-02-17
133 commits to main branch, last one 2 days ago
19
186
unknown
5
Pytorch implementation of Multimodal Fusion Transformer for Remote Sensing Image Classification.
Created 2022-04-01
46 commits to main branch, last one 11 months ago
[NeurIPS 2023 Oral] Quilt-1M: One Million Image-Text Pairs for Histopathology.
Created 2023-06-07
9 commits to main branch, last one 10 months ago
This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have been cited and discussed in the survey just accepted https://dl....
Created 2022-03-13
66 commits to main branch, last one about a year ago
[Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics
Created 2022-09-28
12 commits to main branch, last one 2 years ago
Code and data to evaluate LLMs on the ENEM, the main standardized Brazilian university admission exams.
Created 2023-03-17
28 commits to main branch, last one 11 months ago