3 results found Sort:

17
241
mit
18
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
Created 2023-05-29
13 commits to master branch, last one 8 months ago
Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"
Created 2024-06-11
29 commits to main branch, last one 2 days ago
MADELEINE: multi-stain slide representation learning (ECCV'24)
Created 2024-07-16
41 commits to main branch, last one about a month ago