2 results found Sort:

347
3.5k
bsd-3-clause
60
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Created 2023-08-30
249 commits to main branch, last one 4 months ago
270
3.0k
bsd-3-clause
33
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Created 2023-05-06
145 commits to main branch, last one 9 months ago