147 results found Sort:

1.4k
19.2k
apache-2.0
139
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Created 2024-01-29
524 commits to main branch, last one about a month ago
656
8.5k
apache-2.0
93
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop....
Created 2019-08-09
9,213 commits to main branch, last one 16 days ago
794
7.7k
apache-2.0
80
ModelScope: bring the notion of Model-as-a-Service to life.
Created 2022-07-25
2,714 commits to master branch, last one 6 days ago
574
7.5k
mit
59
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Created 2023-11-22
237 commits to main branch, last one 3 days ago
403
7.0k
apache-2.0
37
Start building LLM-empowered multi-agent applications in an easier way.
Created 2024-01-12
320 commits to main branch, last one 21 hours ago
430
6.5k
apache-2.0
70
a state-of-the-art-level open visual language model | 多模态预训练模型
Created 2023-09-18
184 commits to main branch, last one 10 months ago
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
Created 2021-01-05
540 commits to main branch, last one about a year ago
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Created 2022-07-08
382 commits to master branch, last one 8 months ago
202
4.8k
apache-2.0
39
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Created 2022-08-01
1,546 commits to mainline branch, last one a day ago
724
4.8k
other
105
Open Source Routing Engine for OpenStreetMap
Created 2016-01-19
14,144 commits to master branch, last one a day ago
226
4.2k
apache-2.0
20
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Created 2023-08-01
349 commits to main branch, last one 3 days ago
424
4.2k
apache-2.0
42
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
Created 2023-04-23
95 commits to main branch, last one 7 months ago
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Created 2024-09-16
154 commits to main branch, last one about a month ago
715
3.9k
mit
45
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
Created 2018-08-01
1,696 commits to main branch, last one about a month ago
233
3.2k
apache-2.0
30
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Created 2023-10-23
154 commits to main branch, last one 4 months ago
413
3.1k
mit
58
A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.
Created 2023-05-09
1,936 commits to master branch, last one 4 days ago
233
3.0k
apache-2.0
45
Represent, send, store and search multimodal data
Created 2021-12-14
1,467 commits to main branch, last one 24 days ago
153
2.3k
apache-2.0
29
GPT4V-level open-source multi-modal model based on Llama3-8B
Created 2024-05-10
87 commits to main branch, last one about a month ago
325
2.2k
apache-2.0
11
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
Created 2023-12-01
1,285 commits to main branch, last one a day ago
150
2.1k
apache-2.0
12
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Created 2023-08-01
139 commits to main branch, last one 3 months ago
134
2.1k
apache-2.0
23
Mixture-of-Experts for Large Vision-Language Models
Created 2023-12-14
228 commits to main branch, last one 4 months ago
293
1.8k
mit
30
A robust, all-in-one GPT interface for Discord. ChatGPT-style conversations, image generation, AI-moderation, custom indexes/knowledgebase, youtube summarizer, and more!
Created 2022-12-08
1,380 commits to main branch, last one 11 months ago
240
1.7k
bsd-2-clause
65
推荐/广告/搜索领域工业界经典以及最前沿论文集合。A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.
Created 2022-08-16
187 commits to main branch, last one 21 hours ago
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
Created 2023-06-20
74 commits to main branch, last one about a year ago
139
1.5k
apache-2.0
15
Efficient Retrieval Augmentation and Generation Framework
Created 2023-01-23
73 commits to main branch, last one 4 months ago
Implementation of all RAG techniques in a simpler way
Created 2025-03-07
39 commits to main branch, last one 21 days ago
Recent Transformer-based CV and related works.
Created 2021-02-11
828 commits to main branch, last one about a year ago
89
1.3k
mit
12
The TypeScript library for building AI applications.
Created 2023-05-25
2,522 commits to main branch, last one 11 months ago
96
1.2k
apache-2.0
26
SALMONN: Speech Audio Language Music Open Neural Network
Created 2023-08-11
63 commits to main branch, last one about a month ago
177
1.2k
apache-2.0
14
[pip install medmnist] 18x Standardized Datasets for 2D and 3D Biomedical Image Classification
Created 2020-10-25
124 commits to main branch, last one 3 months ago