194 results found
Making large AI models cheaper, faster and more accessible
Created
2021-10-28
3,807 commits to main branch, last one about a month ago
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Created
2023-04-17
460 commits to main branch, last one 11 months ago
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Created
2019-07-23
1,236 commits to master branch, last one about a month ago
Janus-Series: Unified Multimodal Understanding and Generation Models
Created
2024-10-18
21 commits to main branch, last one 2 months ago
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
Created
2025-01-23
126 commits to main branch, last one 13 days ago
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Created
2024-03-07
11 commits to main branch, last one 11 months ago
Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model
Created
2023-08-30
249 commits to main branch, last one 5 months ago
⚡ TabPFN: Foundation Model for Tabular Data ⚡
Created
2022-07-01
398 commits to main branch, last one 17 days ago
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Created
2023-04-01
626 commits to main branch, last one about a year ago
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Created
2023-04-19
207 commits to main branch, last one 2 months ago
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
Created
2024-02-23
75 commits to main branch, last one 2 days ago
SuperCLUE: A Comprehensive Benchmark for Chinese General-Purpose Foundation Models
Created
2023-05-02
247 commits to main branch, last one 10 months ago
EVA Series: Visual Representation Fantasies from BAAI
Created
2022-11-14
276 commits to master branch, last one 8 months ago
Images to inference with no labeling (use foundation models to train supervised models).
Created
2023-06-06
351 commits to main branch, last one 23 days ago
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
Created
2023-07-25
185 commits to main branch, last one 17 days ago
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
Created
2021-09-01
66 commits to main branch, last one 2 years ago
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Topics: benchmark, multimodal, video-clip, video-data, video-dataset, self-supervised, video-retrieval, foundation-models, action-recognition, instruction-tuning, masked-autoencoder, vision-transformer, video-understanding, zero-shot-retrieval, contrastive-learning, open-set-recognition, video-question-answering, zero-shot-classification, temporal-action-localization, spatio-temporal-action-localization
Created
2022-11-23
247 commits to main branch, last one 3 days ago
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Created
2023-05-25
595 commits to main branch, last one 3 months ago
Emu Series: Generative Multimodal Models from BAAI
Created
2023-07-11
41 commits to main branch, last one 6 months ago
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting
Created
2024-02-07
78 commits to main branch, last one 2 months ago
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Created
2024-06-10
72 commits to main branch, last one 14 days ago
Overview of Japanese LLMs
Created
2023-07-09
517 commits to main branch, last one 6 days ago
Must-read Papers on Knowledge Editing for Large Language Models.
Created
2022-12-06
245 commits to main branch, last one about a month ago
A professional list on Large (Language) Models and Foundation Models (LLM, LM, FM) for Time Series, Spatiotemporal, and Event Data.
Created
2023-06-10
44 commits to main branch, last one 3 months ago
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Created
2023-05-18
136 commits to main branch, last one 6 months ago
TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.
Created
2020-03-04
405 commits to master branch, last one 28 days ago
A curated list of foundation models for vision and language tasks
Created
2023-04-04
301 commits to main branch, last one 4 days ago
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
Created
2023-11-02
43 commits to main branch, last one 4 months ago
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
Created
2023-05-19
281 commits to main branch, last one 23 days ago
Creative interactive views of any dataset.
Created
2021-05-07
1,810 commits to main branch, last one about a year ago