73 results found Sort:

628
2.2k
apache-2.0
175
Apache Lucene.NET
Created 2009-03-27
6,822 commits to master branch, last one about a month ago
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
Created 2022-11-11
2,089 commits to main branch, last one a day ago
137
1.9k
mit
14
R2R is a RAG (Retrieval-Augmented Generation) engine with a RESTful API and prod features. Including hybrid search, knowledge graphs, and more.
Created 2024-02-12
662 commits to main branch, last one 7 hours ago
Study guides for MIT's 15.003 Data Science Tools
Created 2020-08-10
23 commits to master branch, last one 3 years ago
206
1.6k
apache-2.0
8
MTEB: Massive Text Embedding Benchmark
Created 2022-04-05
1,658 commits to main branch, last one 12 hours ago
174
1.5k
apache-2.0
23
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Created 2021-01-18
434 commits to main branch, last one 10 months ago
74
978
apache-2.0
9
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
Created 2023-07-14
440 commits to main branch, last one 9 days ago
Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
Created 2023-07-09
345 commits to main branch, last one 28 days ago
106
845
apache-2.0
27
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
Created 2022-01-15
140 commits to main branch, last one 8 months ago
SGPT: GPT Sentence Embeddings for Semantic Search
Created 2022-02-11
65 commits to main branch, last one 4 months ago
117
813
mit
12
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Created 2021-04-13
29 commits to master branch, last one 2 years ago
44
800
apache-2.0
9
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
Created 2023-09-14
86 commits to main branch, last one 5 months ago
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Created 2024-02-27
18 commits to master branch, last one 3 months ago
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
Created 2022-03-21
118 commits to main branch, last one 11 months ago
66
606
apache-2.0
19
A realtime and indexing and structured extraction engine for Unstructured Data to build Generative AI Applications
Created 2023-04-18
1,890 commits to main branch, last one 13 hours ago
Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
Created 2023-02-11
209 commits to master branch, last one about a year ago
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
Created 2023-02-10
15 commits to main branch, last one 7 months ago
14
495
mit
1
Fast lexical search library implementing BM25 in Python using Scipy (on average 2x faster than Elasticsearch in single-threaded setting)
Created 2024-04-10
19 commits to main branch, last one 2 days ago
Generative Representational Instruction Tuning
Created 2024-02-15
42 commits to main branch, last one 23 days ago
The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
Created 2020-02-04
12 commits to master branch, last one 3 years ago
25
376
unknown
21
An easy to use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.
Created 2019-04-19
499 commits to main branch, last one 4 months ago
17
327
mit
23
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
Created 2024-01-22
28 commits to main branch, last one 4 months ago
Deep Recommenders
Created 2020-03-24
659 commits to master branch, last one 2 years ago
13
300
apache-2.0
8
A compute framework for turning complex data into vectors. Build multimodal vectors with ease and define weights at query time so you don't need a custom reranking algorithm to optimise results. Go st...
Created 2023-11-07
246 commits to main branch, last one 2 days ago
33
276
apache-2.0
11
Domain Adapted Language Modeling Toolkit - E2E RAG
Created 2023-07-31
286 commits to main branch, last one 15 days ago
Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42256-023-00759-6)
Created 2022-12-21
7 commits to main branch, last one 2 months ago
21
184
apache-2.0
3
Library to generate vector embeddings, reranking. Based on Qdrant's FastEmbed.
Created 2023-10-01
113 commits to main branch, last one 2 days ago
libfmp - Python package for teaching and learning Fundamentals of Music Processing (FMP)
Created 2020-12-10
44 commits to master branch, last one 4 months ago
26
153
unknown
10
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
Created 2020-04-21
34 commits to master branch, last one about a year ago