63 results found Sort:
- Filter by Primary Language:
- Python (44)
- Jupyter Notebook (4)
- C++ (2)
- TypeScript (1)
- Markdown (1)
- Julia (1)
- Rust (1)
- +
SGLang is a fast serving framework for large language models and vision language models.
Created
2024-01-08
1,519 commits to main branch, last one 23 hours ago
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR...
Created
2024-08-16
1,110 commits to main branch, last one 4 days ago
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, ...
Created
2024-03-03
35 commits to main branch, last one about a month ago
An AI-powered file management tool that ensures privacy by organizing local texts, images. Using Llama3.2 3B and Llava v1.6 models with the Nexa SDK, it intuitively scans, restructures, and organizes ...
Created
2024-09-21
33 commits to main branch, last one 2 months ago
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Created
2023-10-16
941 commits to main branch, last one 9 days ago
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects.
Created
2022-02-19
394 commits to main branch, last one 2 days ago
LLM Agent Framework in ComfyUI includes Omost,GPT-sovits, ChatTTS,GOT-OCR2.0, and FLUX prompt nodes,access to Feishu,discord,and adapts to all llms with similar openai / aisuite interfaces, such as o1...
Created
2024-04-13
2,363 commits to main branch, last one 3 days ago
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
Created
2024-01-09
413 commits to main branch, last one a day ago
A family of lightweight multimodal models.
Created
2024-01-31
114 commits to main branch, last one about a month ago
Aircraft design optimization made fast through computational graph transformations (e.g., automatic differentiation). Composable analysis tools for aerodynamics, propulsion, structures, trajectory des...
Created
2019-05-15
4,276 commits to master branch, last one 7 days ago
A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites
Created
2024-08-12
41 commits to main branch, last one about a month ago
Famous Vision Language Models and Their Architectures
Created
2024-02-15
231 commits to main branch, last one 3 months ago
[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
Created
2023-11-23
72 commits to main branch, last one 20 days ago
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
Created
2024-01-24
271 commits to main branch, last one about a month ago
Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.
Created
2024-06-27
169 commits to main branch, last one 14 days ago
ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)
Created
2024-01-15
54 commits to main branch, last one 23 days ago
A curated list of awesome papers on Embodied AI and related research/industry-driven resources.
Created
2023-07-21
40 commits to main branch, last one 19 days ago
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
Created
2023-12-07
228 commits to main branch, last one a day ago
[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models
Created
2024-06-14
19 commits to main branch, last one 2 months ago
Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon
Created
2024-05-27
57 commits to main branch, last one 3 months ago
JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
Created
2024-10-12
4 commits to main branch, last one 18 days ago
Awesome LLM Papers and repos on very comprehensive topics.
Created
2024-01-13
171 commits to main branch, last one 3 months ago
Official code for Paper "Mantis: Multi-Image Instruction Tuning" (TMLR2024)
Created
2024-04-12
228 commits to main branch, last one 5 hours ago
Ptera Software is a fast, easy-to-use, and open-source software package for analyzing flapping-wing flight.
Created
2020-03-23
775 commits to develop branch, last one about a year ago
RAI is a multi-vendor agent framework for robotics, utilizing Langchain and ROS 2 tools to perform complex actions, defined scenarios, free interface execution, log summaries, voice interaction and mo...
Created
2024-06-04
242 commits to development branch, last one 7 days ago
Seamlessly integrate state-of-the-art transformer models into robotics stacks
Created
2024-05-22
83 commits to main branch, last one 28 days ago
llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2
Created
2023-04-01
731 commits to main branch, last one a day ago
LLaRA: Large Language and Robotics Assistant
Created
2024-06-07
20 commits to main branch, last one 2 months ago
PsyDI: Towards a Personalized and Progressively In-depth Chatbot for Psychological Measurements. (e.g. MBTI Measurement Agent)
Created
2024-04-12
199 commits to main branch, last one 2 days ago