3 results found Sort:

16
238
mit
18
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
Created 2023-05-29
13 commits to master branch, last one 7 months ago
Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"
Created 2024-06-11
25 commits to main branch, last one 3 months ago
MADELEINE: multi-stain slide representation learning (ECCV'24)
Created 2024-07-16
41 commits to main branch, last one 22 days ago