22 results found
- Filter by Primary Language:
- Python (8)
- C++ (3)
- C# (2)
- Go (2)
- TypeScript (2)
- Jupyter Notebook (1)
- PLpgSQL (1)
- Java (1)
- Rust (1)
- JavaScript (1)
Tiny and powerful JavaScript full-text search engine for browser and Node
Created
2018-09-17
633 commits to master branch, last one 4 months ago
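A high-performance sensitive-word (illegal word/profanity) detection and filtering component, with Traditional/Simplified Chinese conversion, full-width/half-width conversion, Chinese-character-to-pinyin conversion, fuzzy search, and other features.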
Created
2016-12-19
503 commits to master branch, last one about a month ago
Making Postgres and Elasticsearch work together like it's 2023
Created
2015-07-17
1,644 commits to master branch, last one 18 days ago
Top2Vec learns jointly embedded topic, document and word vectors.
Created
2020-03-20
349 commits to master branch, last one 6 days ago
Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
Created
2023-02-22
1,957 commits to main branch, last one 20 hours ago
a user-friendly pager for grep
Created
2015-03-27
417 commits to main branch, last one 9 days ago
A text search engine that supports mixed Chinese and English fuzzy search.
Created
2024-08-08
159 commits to master branch, last one about a month ago
Very fast SPARQL Engine, which can handle very large knowledge graphs like the complete Wikidata, offers context-sensitive autocompletion for SPARQL queries, and allows combination with text search. I...
Created
2014-12-05
2,166 commits to master branch, last one a day ago
Find parts of long text or data, allowing for some changes/typos.
Created
2013-11-01
316 commits to main branch, last one 3 months ago
Fuzzy Matching Library for Rust
Created
2019-03-01
65 commits to master branch, last one about a year ago
A lightweight full text indexer for .NET
Created
2019-07-29
351 commits to master branch, last one 4 months ago
hotpdf is a fast PDF parsing library to extract text and find text within PDF documents built on top of pdfminer.six
Created
2024-01-12
464 commits to main branch, last one 8 months ago
Wu-Manber text search/matching implementation in C#
Created
2018-01-10
16 commits to master branch, last one 2 years ago
⚡ A Telegram bot for searching all the stickers (just like @gif).
This repository has been archived
Created
2018-09-23
572 commits to main branch, last one 6 months ago
Expose a Top2Vec model with a REST API.
Created
2020-04-19
19 commits to master branch, last one 4 years ago
Efficient string matching in Golang via the Aho-Corasick algorithm.
Created
2021-04-23
33 commits to master branch, last one 7 months ago
SODA (Simple Oracle Document Access) for Java is an Oracle library for writing Java apps that work with JSON (and not only JSON!) in the Oracle Database. SODA allows your Java app to use the Oracle Da...
Created
2015-10-06
262 commits to master branch, last one 16 days ago
A static site generator for Zettelkasten notes
Created
2020-03-30
1,042 commits to master branch, last one about a year ago
Document Search Engine project with TF-IDF and the Google Universal Sentence Encoder model
tfidf
python
juypter
tensorflow
text-search
data-science
deep-learning
text-analysis
document-search
semantic-search
machine-learning
tfidf-vectorizer
tensorflow-models
document-similarity
tfidf-text-analysis
python-text-analysis
tensorflow-tutorials
semantic-search-engine
text-semantic-similarity
universal-sentence-encoder
Created
2020-01-26
13 commits to master branch, last one 4 years ago
Efficient and effective query auto-completion in C++.
Created
2019-09-23
236 commits to master branch, last one about a year ago
Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!
Created
2020-07-14
88 commits to master branch, last one 2 years ago
State-of-the-art embedding models fine-tuned for the ecommerce domain. +67% increase in evaluation metrics vs ViT-B-16-SigLIP.
Created
2024-09-25
42 commits to main branch, last one 8 days ago