45 results found

380
4.2k
apache-2.0
29
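text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.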
Created 2019-11-12
359 commits to master branch, last one 3 months ago
380
1.6k
apache-2.0
62
Documents, papers and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Tran...
Created 2019-04-22
36 commits to master branch, last one 6 months ago
322
1.4k
apache-2.0
40
similarity: Text similarity calculation toolkit for Java. Written in Java, it can be used for text similarity calculation, sentiment analysis, and similar tasks, and works out of the box.
Created 2016-11-09
108 commits to master branch, last one 4 months ago
73
1.0k
agpl-3.0
22
Image similarity comparison simulating human perception (multiscale SSIM in Rust)
Created 2011-02-20
149 commits to main branch, last one about a month ago
A library implementing different string similarity and distance measures using Python.
Created 2018-06-21
83 commits to master branch, last one 2 years ago
150
920
bsd-3-clause
32
A powerful and modular toolkit for record linkage and duplicate detection in Python
Created 2015-10-18
912 commits to master branch, last one 10 months ago
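A word similarity calculation method that combines the extended Tongyici Cilin synonym thesaurus with HowNet, giving broader vocabulary coverage and more accurate results.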
Created 2017-12-27
91 commits to master branch, last one 2 years ago
64
650
apache-2.0
8
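Similarities: a toolkit for similarity calculation and semantic search. It supports text-to-text, text-to-image, and image-to-image search over hundreds of millions of records, is developed in Python 3, and works out of the box.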
Created 2022-02-23
161 commits to main branch, last one 17 days ago
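Macropodus, a natural language processing tool built on an Albert+BiLSTM+CRF deep learning architecture: Chinese word segmentation, part-of-speech tagging, named entity recognition, new word discovery, keyword extraction, text summarization, text similarity, scientific calculator, conversion between Chinese and Arabic (Roman) numerals, Traditional/Simplified Chinese conversion, and pinyin conversion. Toolkit (tool) of NLP, CWS (Chinese word segmentation), POS (Part-Of-Speech Tagging), NER (n...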
Created 2019-12-31
17 commits to master branch, last one 3 years ago
80
530
gpl-3.0
17
pHash - the open source perceptual hash library
Created 2019-08-16
554 commits to master branch, last one about a year ago
Computing the similarity of two sentences with Google's BERT algorithm. Uses BERT to compute sentence similarity, semantic similarity, and text similarity.
Created 2019-02-19
14 commits to master branch, last one 3 years ago
144
469
other
63
CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, t...
Created 2015-10-18
2,621 commits to master branch, last one about a year ago
difPy - Python package for finding duplicate or similar images within folders
Created 2020-12-20
454 commits to main branch, last one 3 months ago
37
355
bsd-3-clause
15
set of functions and operators for executing similarity queries
Created 2011-03-09
72 commits to master branch, last one 3 years ago
72
353
apache-2.0
7
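A word similarity calculation method based on the extended Tongyici Cilin synonym thesaurus from Harbin Institute of Technology (HIT).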
Created 2017-10-31
8 commits to master branch, last one about a year ago
The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.
Created 2023-08-22
35 commits to main branch, last one 9 months ago
Chinese intelligent customer service chatbot demo, with two parts, chitchat and professional Q&A (FAQ), and support for custom components.
Created 2019-07-22
29 commits to master branch, last one 2 years ago
🦀📏 Rust library to compare strings (or any sequences). 25+ algorithms, pure Rust, common interface, Unicode support.
Created 2023-04-13
125 commits to master branch, last one about a year ago
85
231
mit
10
🌐 Geocoding technology that provides address standardization and similarity calculation.
Created 2018-05-10
61 commits to master branch, last one 7 months ago
50
212
unknown
31
Making sense embedding out of word embeddings using graph-based word sense induction
Created 2015-10-28
283 commits to master branch, last one 3 years ago
23
206
bsd-3-clause
5
Compare html similarity using structural and style metrics
Created 2017-10-26
74 commits to master branch, last one 3 years ago
Use RLHF directly on ChatGLM to raise or lower the probability of target outputs | Modify ChatGLM output with only RLHF
Created 2023-04-19
23 commits to main branch, last one about a year ago
7
182
mpl-2.0
5
Find similar functions and classes in your JavaScript/TypeScript code
Created 2014-07-24
158 commits to master branch, last one 5 months ago
Text similarity (matching) calculation, providing baselines, training, inference, metric analysis... The code includes both TensorFlow and PyTorch versions.
Created 2021-04-10
101 commits to main branch, last one 2 years ago
5
126
apache-2.0
6
This repository has no description...
Created 2021-06-14
198 commits to master branch, last one about a year ago
Machine learning based text classification in JavaScript using n-grams and cosine similarity
Created 2020-08-26
33 commits to master branch, last one about a year ago
23
103
mit
12
Record Linkage ToolKit (Find and link entities)
Created 2017-02-15
636 commits to master branch, last one 2 years ago
13
99
mpl-2.0
11
R package to Embed All the Things! using StarSpace
Created 2018-09-17
249 commits to master branch, last one 3 months ago
Chinese text similarity calculator
Created 2020-07-09
52 commits to master branch, last one 4 months ago