3 results found Sort:
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
Created
2023-05-29
13 commits to master branch, last one 9 months ago
Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"
Created
2024-06-11
29 commits to main branch, last one about a month ago
MADELEINE: multi-stain slide representation learning (ECCV'24)
Created
2024-07-16
41 commits to main branch, last one 2 months ago