157 results found Sort:

453
4.4k
other
42
开放源码的无App推送服务,iOS14+扫码即用。亦支持快应用/iOS和Mac客户端、Android客户端、自制设备
Created 2021-12-16
308 commits to main branch, last one 3 months ago
178
4.2k
apache-2.0
35
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Created 2022-08-01
1,121 commits to mainline branch, last one 3 days ago
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Created 2022-07-08
374 commits to master branch, last one 6 months ago
1.0k
3.2k
apache-2.0
30
OpenMMLab Pre-training Toolbox and Benchmark
Created 2020-07-09
973 commits to main branch, last one 4 months ago
314
2.8k
gpl-3.0
28
Effortless data labeling with AI support from Segment Anything and other awesome models.
Created 2023-05-23
355 commits to main branch, last one 2 days ago
中文nlp解决方案(大模型、数据、模型、训练、推理)
Created 2023-02-05
206 commits to main branch, last one 9 days ago
Image to prompt with BLIP and CLIP
Created 2022-08-09
98 commits to main branch, last one 8 months ago
Easily compute clip embeddings and build a clip retrieval system with them
Created 2021-06-07
332 commits to main branch, last one 4 months ago
167
1.8k
unknown
115
Collection of AWESOME vision-language models for vision tasks
Created 2023-03-30
76 commits to main branch, last one 5 days ago
Android UI 快速开发,专治原生控件各种不服
Created 2018-04-26
186 commits to master branch, last one about a year ago
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
Created 2023-11-07
40 commits to main branch, last one 6 months ago
🥂 Gracefully face hCaptcha challenge with MoE(ONNX) embedded solution.
Created 2022-02-15
885 commits to main branch, last one 5 months ago
54
1.0k
unknown
21
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
Created 2021-09-05
56 commits to main branch, last one about a year ago
88
996
cc-by-4.0
14
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for ...
Created 2023-05-18
40 commits to main branch, last one 11 days ago
93
944
bsd-3-clause
25
Stable Diffusion in NCNN with c++, supported txt2img and img2img
Created 2022-11-11
60 commits to main branch, last one 11 months ago
Search photos on Unsplash using natural language
Created 2021-01-16
65 commits to main branch, last one about a year ago
55
921
apache-2.0
13
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
Created 2023-02-21
286 commits to main branch, last one about a month ago
Search inside YouTube videos using natural language
Created 2021-02-01
20 commits to main branch, last one 2 years ago
82
874
mit
28
Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)
Created 2022-07-26
9 commits to main branch, last one about a year ago
116
794
mit
12
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Created 2021-04-13
29 commits to master branch, last one 2 years ago
105
771
mit
23
CLIP + FFT/DWT/RGB = text to image/video
Created 2021-02-28
176 commits to master branch, last one 7 months ago
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-bas...
Created 2021-03-23
77 commits to main branch, last one about a year ago
基于Stable Diffusion优化的AI绘画模型。支持输入中英文文本,可生成多种现代艺术风格的高质量图像。| An optimized text-to-image model based on Stable Diffusion. Both Chinese and English text inputs are available to generate images. The model c...
Created 2022-12-13
36 commits to main branch, last one about a year ago
31
609
unknown
19
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
Created 2021-10-09
34 commits to main branch, last one about a year ago
React component for truncating multi-line spans and adding an ellipsis.
Created 2016-05-11
194 commits to master branch, last one 3 years ago
Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenext,efficientformer,efficientnet,eva,fasternet,fastervit,fastvit,flexivit,gcvit,ghostnet,gpvit,hornet,hiera,iformer,inceptionnext,...
Created 2021-08-02
1,083 commits to main branch, last one 2 days ago
44
565
apache-2.0
19
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
Created 2023-03-18
15 commits to main branch, last one 7 months ago
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
Created 2023-10-08
31 commits to master branch, last one 3 months ago
59
504
apache-2.0
8
Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 50+ HF models, 20+ benchmarks
Created 2023-12-01
488 commits to main branch, last one 22 hours ago
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Created 2020-06-01
96 commits to master branch, last one 2 months ago