56 results found
- Filter by Primary Language:
- Python (22)
- Rust (7)
- C (6)
- Go (5)
- Java (3)
- C++ (3)
- Shell (2)
- TypeScript (1)
- JavaScript (1)
- Jupyter Notebook (1)
- PLpgSQL (1)
- Scala (1)
Fast, secure, efficient backup program
Created
2014-04-27
8,032 commits to master branch, last one a day ago
Deduplicating archiver with compression and authenticated encryption.
Created
2015-05-12
8,433 commits to master branch, last one 19 hours ago
Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.
Created
2015-12-19
3,312 commits to master branch, last one 2 days ago
Prometheus Alertmanager
Created
2013-07-16
3,131 commits to main branch, last one 17 hours ago
Find duplicate files
Created
2013-06-22
2,079 commits to master branch, last one 21 days ago
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Created
2015-03-03
5,370 commits to master branch, last one 9 months ago
A fast high compression read-only file system for Linux, Windows and macOS
Created
2020-11-21
1,912 commits to main branch, last one 24 days ago
Extremely fast tool to remove duplicates and other lint from your filesystem
Created
2010-09-27
3,147 commits to master branch, last one 5 months ago
Simple, configuration-driven backup software for servers and workstations
Created
2014-11-19
2,239 commits to main branch, last one 10 days ago
rustic - fast, encrypted, and deduplicated backups powered by Rust
Created
2022-03-14
1,542 commits to main branch, last one 18 days ago
Config-driven, easy backup CLI for restic.
Created
2019-06-20
525 commits to master branch, last one 14 days ago
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Created
2019-11-22
6,739 commits to master branch, last one 21 days ago
A powerful and modular toolkit for record linkage and duplicate detection in Python
Created
2015-10-18
912 commits to master branch, last one 10 months ago
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Created
2021-08-25
1,818 commits to main branch, last one 18 days ago
Data deduplication engine, supporting optional compression and public key encryption.
Created
2016-03-25
595 commits to master branch, last one about a year ago
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Created
2016-03-25
556 commits to master branch, last one 3 years ago
A collection of ready-made SQL queries for PostgreSQL covering frequently occurring tasks (retrieving and modifying data, speeding up queries, database maintenance)
Created
2018-12-26
2,117 commits to master branch, last one 20 hours ago
A list of free data matching and record linkage software.
Created
2018-01-01
53 commits to master branch, last one about a year ago
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
Created
2014-09-03
63 commits to master branch, last one 11 months ago
Filter, Sort & Delete Duplicate Files Recursively
Created
2022-12-25
180 commits to main branch, last one 6 months ago
Productivity improvements for Rust ecosystem: warnings are skipped until errors are fixed, LSP-independent Neovim integration, etc.
Created
2020-09-13
364 commits to master branch, last one about a month ago
A kernel module which provides a pool of deduplicated and/or compressed block storage.
Created
2017-10-22
50 commits to master branch, last one 7 months ago
Deduplicating archiver with encryption and paranoid-level tests. Swiss army knife for the serious backup and disaster recovery manager. Ransomware neutralizer. Win/Linux/Unix
Created
2021-01-20
520 commits to main branch, last one 8 days ago
Idempotent, deduplicating message consumer for RocketMQ; supports using MySQL or Redis as the idempotency table, and works out of the box
Created
2020-05-15
17 commits to master branch, last one 3 years ago
Userspace tools for managing VDO volumes.
Created
2017-10-22
42 commits to master branch, last one about a month ago
Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources
Created
2012-07-15
1,095 commits to main branch, last one a day ago
Fast block-level out-of-band BTRFS deduplication tool.
Created
2020-06-16
110 commits to master branch, last one 6 months ago
📧 CLI to deduplicate mails from mail boxes.
Created
2013-03-25
1,420 commits to main branch, last one a day ago
CLI utility to find near duplicate images and remove all but the best copy.
Created
2018-05-02
520 commits to master branch, last one a day ago
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Created
2021-01-28
373 commits to main branch, last one 2 years ago