49 results found

606
9.5k
apache-2.0
95
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Created 2020-08-09
1,470 commits to master branch, last one 12 hours ago
Retrieval and Retrieval-augmented LLMs
Created 2023-08-02
1,248 commits to master branch, last one 17 hours ago
765
6.2k
mit
53
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Created 2020-09-22
198 commits to master branch, last one 18 days ago
401
4.5k
apache-2.0
31
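text2vec, text to vector. A text vector representation tool that converts text into vector matrices; it implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, ready to use out of the box.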
Created 2019-11-12
365 commits to master branch, last one 26 days ago
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
Created 2021-04-16
105 commits to main branch, last one about a month ago
188
1.2k
apache-2.0
29
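xmnlp: provides Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, text correction, text-to-pinyin conversion, text summarization, radical extraction, sentence representation, and text similarity computation.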
Created 2018-02-04
128 commits to master branch, last one 2 years ago
131
875
apache-2.0
22
1 line for thousands of state-of-the-art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.
Created 2020-08-22
1,539 commits to master branch, last one about a month ago
SGPT: GPT Sentence Embeddings for Semantic Search
Created 2022-02-11
65 commits to main branch, last one 9 months ago
65
833
apache-2.0
8
unified embedding model
Created 2023-05-09
109 commits to main branch, last one about a year ago
164
823
mit
37
Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
Created 2019-03-23
97 commits to master branch, last one 2 years ago
100
578
other
17
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Created 2018-09-19
30 commits to master branch, last one 2 years ago
43
564
apache-2.0
11
A Python vector database you just need - no more, no less.
Created 2023-05-02
99 commits to main branch, last one 8 months ago
33
488
mit
10
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
Created 2023-10-17
308 commits to main branch, last one 9 days ago
33
379
apache-2.0
12
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
Created 2019-11-26
663 commits to master branch, last one about a year ago
A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.
Created 2021-03-29
247 commits to main branch, last one 11 months ago
A curated list of research papers in Sentence Representation Learning and an STS leaderboard of sentence embeddings.
Created 2022-10-12
24 commits to main branch, last one about a year ago
27
291
mit
4
Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"
Created 2022-04-13
38 commits to master branch, last one 2 years ago
Extract knowledge from all information sources using gpt and other language models. Index and make Q&A session with information sources.
Created 2023-03-11
163 commits to master branch, last one about a year ago
Papers and Book to look at when starting AGI 📚
Created 2019-12-10
304 commits to master branch, last one 2 months ago
[NeurIPS 2019] Spherical Text Embedding
Created 2019-10-27
37 commits to master branch, last one 4 years ago
A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
Created 2019-04-26
29 commits to master branch, last one 4 years ago
Clustering sentence embeddings to extract message intent
Created 2021-07-13
66 commits to main branch, last one 3 years ago
16
136
unknown
3
Code for "Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning (EMNLP 2022)"
Created 2022-01-01
145 commits to master branch, last one about a year ago
12
134
mit
7
How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.
Created 2020-08-12
54 commits to master branch, last one 2 years ago
PyTorch implementation of unsupervised SimCSE for Chinese
Created 2021-06-11
8 commits to main branch, last one 3 years ago
12
106
apache-2.0
4
Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)
Created 2020-04-19
70 commits to master branch, last one 2 months ago
Exploring the simple sentence similarity measurements using word embeddings
Created 2018-11-07
177 commits to master branch, last one 3 months ago
Local-GenAI-Search is a generative search engine based on Llama 3, langchain and qdrant that answers questions based on your local files
Created 2024-05-26
22 commits to main branch, last one 3 months ago