95 results found Sort:

639
2.2k
apache-2.0
170
Apache Lucene.NET
Created 2009-03-27
6,928 commits to master branch, last one a day ago
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
This repository has been archived (exclude archived)
Created 2022-11-11
2,126 commits to main branch, last one about a month ago
275
2.0k
apache-2.0
15
MTEB: Massive Text Embedding Benchmark
Created 2022-04-05
2,028 commits to main branch, last one a day ago
Study guides for MIT's 15.003 Data Science Tools
Created 2020-08-10
23 commits to master branch, last one 4 years ago
192
1.6k
apache-2.0
24
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Created 2021-01-18
434 commits to main branch, last one about a year ago
111
1.5k
apache-2.0
13
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
Created 2023-07-14
492 commits to main branch, last one a day ago
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Created 2024-02-27
20 commits to master branch, last one 2 months ago
117
916
apache-2.0
22
A realtime serving engine for Data-Intensive Generative AI Applications
Created 2023-04-18
3,098 commits to main branch, last one 11 hours ago
Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
Created 2023-07-09
353 commits to main branch, last one 2 months ago
37
906
mit
4
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
Created 2024-04-10
54 commits to main branch, last one 7 days ago
125
881
mit
13
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Created 2021-04-13
29 commits to master branch, last one 2 years ago
SGPT: GPT Sentence Embeddings for Semantic Search
Created 2022-02-11
65 commits to main branch, last one 9 months ago
106
851
apache-2.0
26
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
Created 2022-01-15
140 commits to main branch, last one about a year ago
48
838
apache-2.0
10
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
Created 2023-09-14
86 commits to main branch, last one 10 months ago
Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
Created 2023-02-11
212 commits to master branch, last one 2 months ago
41
630
apache-2.0
20
A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal vector embeddings.
Created 2023-11-07
731 commits to main branch, last one 20 hours ago
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
Created 2022-03-21
118 commits to main branch, last one about a year ago
58
617
apache-2.0
17
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
Created 2024-09-04
73 commits to main branch, last one 8 days ago
Generative Representational Instruction Tuning
Created 2024-02-15
63 commits to main branch, last one 3 days ago
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
Created 2023-02-10
16 commits to main branch, last one 14 days ago
The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
Created 2020-02-04
12 commits to master branch, last one 3 years ago
30
403
apache-2.0
9
Parsing-free RAG supported by VLMs
Created 2024-10-14
66 commits to master branch, last one a day ago
24
385
mit
24
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
Created 2024-01-22
28 commits to main branch, last one 9 months ago
25
377
unknown
21
An easy to use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.
Created 2019-04-19
499 commits to main branch, last one 9 months ago
19
339
unknown
6
Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)
Created 2023-11-09
142 commits to main branch, last one about a month ago
Deep Recommenders
Created 2020-03-24
659 commits to master branch, last one 2 years ago
40
311
apache-2.0
11
Domain Adapted Language Modeling Toolkit - E2E RAG
Created 2023-07-31
290 commits to main branch, last one about a month ago
42
287
apache-2.0
3
Library for generating vector embeddings, reranking in Rust
Created 2023-10-01
152 commits to main branch, last one 6 days ago
Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42256-023-00759-6)
Created 2022-12-21
7 commits to main branch, last one 7 months ago