100 results found Sort:
- Filter by Primary Language:
- Python (78)
- Jupyter Notebook (5)
- Julia (2)
- C# (1)
- C++ (1)
- TypeScript (1)
- +
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
Created
2024-01-29
303 commits to main branch, last one 2 days ago
ModelScope: bring the notion of Model-as-a-Service to life.
Created
2022-07-25
2,387 commits to master branch, last one 14 days ago
a state-of-the-art-level open visual language model | 多模态预训练模型
Created
2023-09-18
184 commits to main branch, last one 22 days ago
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
Created
2021-01-05
540 commits to main branch, last one 9 months ago
Open Source Routing Engine for OpenStreetMap
Created
2016-01-19
14,007 commits to master branch, last one 6 days ago
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Created
2022-08-01
1,136 commits to mainline branch, last one 11 hours ago
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
Created
2023-04-23
92 commits to main branch, last one 6 months ago
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Created
2022-07-08
374 commits to master branch, last one 6 months ago
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
Created
2023-11-22
117 commits to main branch, last one a day ago
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
Created
2018-08-01
1,647 commits to main branch, last one 6 days ago
Represent, send, store and search multimodal data
Created
2021-12-14
1,452 commits to main branch, last one 9 days ago
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Created
2023-10-23
150 commits to main branch, last one about a month ago
Start building LLM-empowered multi-agent applications in an easier way.
Created
2024-01-12
185 commits to main branch, last one 5 hours ago
A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.
Created
2023-05-09
1,453 commits to master branch, last one 20 hours ago
A robust, all-in-one GPT interface for Discord. ChatGPT-style conversations, image generation, AI-moderation, custom indexes/knowledgebase, youtube summarizer, and more!
Created
2022-12-08
1,380 commits to main branch, last one about a month ago
Mixture-of-Experts for Large Vision-Language Models
Created
2023-12-14
226 commits to main branch, last one about a month ago
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
Created
2023-08-01
170 commits to main branch, last one 14 days ago
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Created
2023-08-01
136 commits to main branch, last one 2 months ago
🥂 Gracefully face hCaptcha challenge with MoE(ONNX) embedded solution.
Created
2022-02-15
885 commits to main branch, last one 6 months ago
GPT4V-level open-source multi-modal model based on Llama3-8B
Created
2024-05-10
56 commits to main branch, last one 8 days ago
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
Created
2023-06-20
74 commits to main branch, last one 3 months ago
Recent Transformer-based CV and related works.
Created
2021-02-11
828 commits to main branch, last one 10 months ago
推荐/广告/搜索领域工业界经典以及最前沿论文集合。A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.
Created
2022-08-16
155 commits to main branch, last one 5 months ago
Efficient Retrieval Augmentation and Generation Framework
Created
2023-01-23
51 commits to main branch, last one 16 days ago
[pip install medmnist] 18x Standardized Datasets for 2D and 3D Biomedical Image Classification
Created
2020-10-25
115 commits to main branch, last one about a month ago
The TypeScript library for building AI applications.
Created
2023-05-25
2,522 commits to main branch, last one about a month ago
SALMONN: Speech Audio Language Music Open Neural Network
Created
2023-08-11
24 commits to main branch, last one 23 days ago
A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.
Created
2019-03-03
36 commits to master branch, last one about a year ago
FarmVibes.AI: Multi-Modal GeoSpatial ML Models for Agriculture and Sustainability
Created
2022-09-06
36 commits to main branch, last one 22 days ago
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Created
2023-10-02
222 commits to main branch, last one 2 months ago