8 results found

890
9.0k
bsd-3-clause
95
LAVIS - A One-stop Library for Language-Vision Intelligence
Created 2022-08-24
490 commits to main branch, last one 5 months ago
This repository is built in association with our position paper on "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As a part of this release we share the information...
Created 2022-01-09
51 commits to main branch, last one 2 years ago
12
156
unknown
5
PyTorch implementation of Multimodal Fusion Transformer for Remote Sensing Image Classification.
Created 2022-04-01
46 commits to main branch, last one 5 months ago
[NeurIPS 2023 Oral] Quilt-1M: One Million Image-Text Pairs for Histopathology.
Created 2023-06-07
9 commits to main branch, last one 4 months ago
5
90
unknown
4
Compose multimodal datasets 🎹
Created 2024-02-17
63 commits to main branch, last one 2 months ago
This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have been cited and discussed in the survey just accepted https://dl....
Created 2022-03-13
66 commits to main branch, last one 7 months ago
[Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics
Created 2022-09-28
12 commits to main branch, last one about a year ago
Code and data to evaluate LLMs on ENEM, the main standardized Brazilian university admission exam.
Created 2023-03-17
28 commits to main branch, last one 6 months ago