150 results found Sort:

4.3k
38.8k
apache-2.0
385
Making large AI models cheaper, faster and more accessible
Created 2021-10-28
3,765 commits to main branch, last one 21 hours ago
2.2k
20.3k
apache-2.0
158
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Created 2023-04-17
460 commits to main branch, last one 6 months ago
2.6k
20.2k
mit
308
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Created 2019-07-23
1,216 commits to master branch, last one 11 days ago
243
3.6k
mit
100
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Created 2023-04-01
626 commits to main branch, last one 8 months ago
330
3.3k
bsd-3-clause
59
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Created 2023-08-30
249 commits to main branch, last one 18 days ago
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Created 2023-04-19
204 commits to main branch, last one 2 months ago
97
3.0k
unknown
39
SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese
Created 2023-05-02
247 commits to main branch, last one 6 months ago
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting
Created 2024-02-23
49 commits to main branch, last one a day ago
167
2.3k
mit
30
EVA Series: Visual Representation Fantasies from BAAI
Created 2022-11-14
276 commits to master branch, last one 3 months ago
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Created 2024-03-07
11 commits to main branch, last one 7 months ago
158
2.0k
apache-2.0
21
Images to inference with no labeling (use foundation models to train supervised models).
Created 2023-06-06
341 commits to main branch, last one 2 months ago
200
1.8k
mit
16
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
Created 2021-09-01
66 commits to main branch, last one 2 years ago
86
1.7k
apache-2.0
22
Emu Series: Generative Multimodal Models from BAAI
Created 2023-07-11
41 commits to main branch, last one about a month ago
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
Created 2023-07-25
164 commits to main branch, last one 2 days ago
244
1.5k
apache-2.0
8
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Created 2023-05-25
589 commits to main branch, last one 10 days ago
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting
Created 2024-02-07
77 commits to main branch, last one 2 months ago
51
1.1k
mit
22
Janus-Series: Unified Multimodal Understanding and Generation Models
Created 2024-10-18
16 commits to main branch, last one 8 days ago
63
973
apache-2.0
14
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Created 2023-05-18
136 commits to main branch, last one about a month ago
218
932
apache-2.0
19
TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.
Created 2020-03-04
398 commits to master branch, last one 2 months ago
Must-read Papers on Knowledge Editing for Large Language Models.
Created 2022-12-06
241 commits to main branch, last one 20 hours ago
A professional list on Large (Language) Models and Foundation Models (LLM, LM, FM) for Time Series, Spatiotemporal, and Event Data.
Created 2023-06-10
42 commits to main branch, last one 5 months ago
A curated list of foundation models for vision and language tasks
Created 2023-04-04
274 commits to main branch, last one a day ago
43
829
apache-2.0
15
Creative interactive views of any dataset.
Created 2021-05-07
1,810 commits to main branch, last one 8 months ago
43
820
other
17
Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Created 2024-06-10
40 commits to main branch, last one about a month ago
62
789
other
17
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
Created 2023-05-19
275 commits to main branch, last one 5 months ago
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
Created 2023-11-02
42 commits to main branch, last one 5 months ago
[MICCAI 2019 Young Scientist Award] [MEDIA 2020 Best Paper Award] Models Genesis
Created 2019-07-24
271 commits to master branch, last one 9 months ago
32
651
unknown
14
[ECCV 2024 Best Paper Candidate] PointLLM: Empowering Large Language Models to Understand Point Clouds
Created 2023-08-17
51 commits to master branch, last one 22 days ago