33 results found Sort:
- Filter by Primary Language:
- Python (12)
- TypeScript (4)
- Java (3)
- JavaScript (2)
- C++ (2)
- Go (2)
- Jupyter Notebook (2)
- C# (1)
- HTML (1)
- Rust (1)
- Scala (1)
- TSQL (1)
- C (1)
- +
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Created
2014-04-17
245 commits to master branch, last one 3 years ago
FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing)
Created
2015-12-03
352 commits to master branch, last one 6 years ago
TensorFlow Similarity is a python package focused on making similarity learning quick and easy.
Created
2020-06-15
850 commits to master branch, last one about a year ago
Blazing fast framework for fine-tuning similarity learning models
Created
2021-08-31
221 commits to master branch, last one about a year ago
Generating multiple choice questions from text using Machine Learning.
Created
2018-12-27
35 commits to master branch, last one 3 years ago
A .NET port of java-string-similarity
Created
2016-08-07
100 commits to main branch, last one 6 months ago
Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence.
Created
2023-12-24
355 commits to main branch, last one 5 months ago
Python package to accelerate the sparse matrix multiplication and top-n similarity selection
Created
2017-08-01
191 commits to master branch, last one 2 months ago
Score documents using embedding-vectors dot-product or cosine-similarity with ES Lucene engine
Created
2017-08-23
42 commits to master branch, last one 3 years ago
A python project for checking plagiarism of documents based on cosine similarity
Created
2020-05-02
18 commits to master branch, last one 2 years ago
Spark library for generalized K-Means clustering. Supports general Bregman divergences. Suitable for clustering probabilistic data, time series data, high dimensional data, and very large data.
Created
2014-09-03
824 commits to master branch, last one about a year ago
Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses OpenAI embeddings to convert documents into vectors and allows s...
Created
2023-05-02
107 commits to master branch, last one about a year ago
A blazing-fast semantic search React component. Match by meaning, not just by letters. Search as you type without waiting (no debounce needed). Rank by cosine similarity.
Created
2024-03-19
75 commits to main branch, last one 5 months ago
Coltt is a vector database that supports Multi-Vector Search, high-performance HNSW, FLAT and quantization, and enables fast searches through sophisticated internal data shard design.
Created
2024-10-09
139 commits to main branch, last one 13 hours ago
Real-Time Face Recognition use SCRFD, ArcFace, ByteTrack and Similarity Measure
Created
2022-03-27
81 commits to master branch, last one about a year ago
Information Retrieval algorithms developed in python. To follow the blog posts, click on the link:
Created
2019-01-27
11 commits to master branch, last one 5 years ago
Machine learning based text classification in JavaScript using n-grams and cosine similarity
Created
2020-08-26
33 commits to master branch, last one 2 years ago
Fast similarity search using DuckDB
Created
2024-01-09
28 commits to main branch, last one 4 months ago
Locality Sensitive Hashing in Rust with Python bindings
Created
2020-03-06
216 commits to master branch, last one about a year ago
Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
Created
2014-12-13
292 commits to master branch, last one about a year ago
Web Application for checking the similarity between query and document using the concept of Cosine Similarity.
Created
2017-08-19
11 commits to master branch, last one 2 years ago
[NeurIPS 2021] Official implementation of the paper "One Loss for All: Deep Hashing with a Single Cosine Similarity based Learning Objective"
Created
2021-09-29
49 commits to main branch, last one 9 months ago
SAX-VSM public release, visit our website for detail
Created
2014-09-11
267 commits to master branch, last one 3 years ago
Samples on how to use Azure SQL database with Azure OpenAI
Created
2023-05-23
70 commits to main branch, last one 3 months ago
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard si...
jaro
ngram
jaccard
soundex
levenshtein
jaro-winkler
hacktoberfest
jaro-distance
fuzzy-matching
dice-coefficient
hamming-distance
cosine-similarity
soundex-algorithm
string-similarity
jaccard-similarity
levenshtein-distance
jaro-winkler-distance
sorensen-dice-distance
cosine-similarity-scores
longest-common-subsequence
Created
2017-03-02
203 commits to master branch, last one 2 years ago
Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K most similar items for a large number of items by chunking the ...
Created
2023-03-06
145 commits to main branch, last one about a month ago
Fast Image Retrieval (FIRe) is an open source project to promote image retrieval research. It implements most of the major binary hashing methods to date, together with different popular backbone netw...
Created
2021-11-16
122 commits to main branch, last one about a year ago
A starting take on a fast and fully local NLP file organizer that organizes files based on their content.
Created
2024-09-24
51 commits to main branch, last one 2 months ago
Designed for recruiters, Our AI-powered platform can filter out top resumes of the stack
Created
2021-06-29
63 commits to master branch, last one 3 years ago
🤠A library implementing different string similarity using JavaScript.
Created
2019-10-14
73 commits to master branch, last one about a year ago