103 results found Sort:

362
2.4k
apache-2.0
19
MTEB: Massive Text Embedding Benchmark
Created 2022-04-05
2,825 commits to main branch, last one 2 days ago
647
2.3k
apache-2.0
167
Apache Lucene.NET
Created 2009-03-27
6,981 commits to master branch, last one 23 days ago
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
This repository has been archived (exclude archived)
Created 2022-11-11
2,126 commits to main branch, last one 6 months ago
128
1.9k
apache-2.0
18
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
Created 2023-07-14
544 commits to main branch, last one a day ago
Study guides for MIT's 15.003 Data Science Tools
Created 2020-08-10
23 commits to master branch, last one 4 years ago
204
1.8k
apache-2.0
20
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Created 2021-01-18
472 commits to main branch, last one about a month ago
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Created 2024-02-27
20 commits to master branch, last one 7 months ago
63
1.1k
mit
5
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
Created 2024-04-10
68 commits to main branch, last one 3 days ago
73
1.0k
apache-2.0
26
Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.
Created 2023-11-07
1,070 commits to main branch, last one a day ago
66
1.0k
apache-2.0
6
Profile-Based Long-Term Memory for AI Applications
Created 2024-09-03
244 commits to main branch, last one a day ago
125
987
apache-2.0
21
A realtime serving engine for Data-Intensive Generative AI Applications
Created 2023-04-18
3,585 commits to main branch, last one a day ago
127
933
mit
12
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Created 2021-04-13
29 commits to master branch, last one 2 years ago
SGPT: GPT Sentence Embeddings for Semantic Search
Created 2022-02-11
65 commits to main branch, last one about a year ago
107
861
apache-2.0
26
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
Created 2022-01-15
140 commits to main branch, last one about a year ago
47
854
apache-2.0
10
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
Created 2023-09-14
86 commits to main branch, last one about a year ago
Epsilla is a high performance Vector Database Management System
Created 2023-07-09
353 commits to main branch, last one 7 months ago
80
768
apache-2.0
19
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
Created 2024-09-04
73 commits to main branch, last one 4 months ago
Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
Created 2023-02-11
212 commits to master branch, last one 7 months ago
53
661
apache-2.0
12
Parsing-free RAG supported by VLMs
Created 2024-10-14
119 commits to master branch, last one about a month ago
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
Created 2022-03-21
118 commits to main branch, last one about a year ago
Generative Representational Instruction Tuning
Created 2024-02-15
67 commits to main branch, last one 27 days ago
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
Created 2023-02-10
17 commits to main branch, last one about a month ago
65
469
apache-2.0
5
Rust library for generating vector embeddings, reranking locally
Created 2023-10-01
177 commits to main branch, last one 3 days ago
The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
Created 2020-02-04
12 commits to master branch, last one 4 years ago
26
417
mit
24
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
Created 2024-01-22
28 commits to main branch, last one about a year ago
31
410
unknown
6
Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)
Created 2023-11-09
142 commits to main branch, last one 6 months ago
Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案
Created 2024-06-04
109 commits to master branch, last one 4 months ago
🔥 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation 🔥. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techni...
Created 2025-02-09
140 commits to main branch, last one a day ago
25
377
unknown
20
An easy to use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.
Created 2019-04-19
499 commits to main branch, last one about a year ago