Statistics for topic multimodal

RepositoryStats tracks 638,229 GitHub repositories, of which 386 are tagged with the multimodal topic. The most common primary language for repositories using this topic is Python (260). Other languages include Jupyter Notebook (42) and TypeScript (11).

Stargazers over time for topic multimodal

Most starred repositories for topic multimodal

anything-llm Mintplex-Labs

4.1k

42.7k

mit

307

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

llm mcp rag webui crewai llama3 ollama localai deepseek lmstudio ai-agents local-llm multimodal deepseek-r1 mcp-servers vector-database custom-ai-agents agent-framework-javascript

Created 2023-06-04

1,300 commits to master branch, last one 2 days ago

LLaVA haotian-liu

2.4k

22.2k

apache-2.0

158

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

gpt-4 llama llava llama2 chatbot chatgpt llama-2 multimodal multi-modality foundation-models instruction-tuning vision-language-model visual-language-learning

Created 2023-04-17

460 commits to main branch, last one 11 months ago

serve jina-ai

2.2k

21.5k

apache-2.0

216

☁️ Build multimodal AI applications with cloud-native stack

Created 2020-02-13

8,644 commits to master branch, last one 19 days ago

unilm microsoft

2.6k

21.1k

mit

304

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Created 2019-07-23

1,236 commits to master branch, last one about a month ago

Janus deepseek-ai

2.2k

17.1k

mit

150

Janus-Series: Unified Multimodal Understanding and Generation Models

llm any-to-any multimodal unified-model foundation-models vision-language-pretraining

Created 2024-10-18

21 commits to main branch, last one 2 months ago

NeMo NVIDIA

2.8k

13.6k

apache-2.0

219

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

asr tts multimodal deeplearning generative-ai neural-networks speech-synthesis speech-translation machine-translation speaker-recognition speaker-diariazation large-language-models

Created 2019-08-05

8,330 commits to main branch, last one 19 hours ago
