56 results found Sort:

1.5k
24.1k
bsd-2-clause
239
Fast, secure, efficient backup program
Created 2014-04-27
8,032 commits to master branch, last one a day ago
729
10.7k
other
150
Deduplicating archiver with compression and authenticated encryption.
Created 2015-05-12
8,433 commits to master branch, last one 19 hours ago
338
6.5k
apache-2.0
51
Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.
Created 2015-12-19
3,312 commits to master branch, last one 2 days ago
2.1k
6.4k
apache-2.0
183
Prometheus Alertmanager
Created 2013-07-16
3,131 commits to main branch, last one 17 hours ago
394
4.9k
gpl-3.0
100
Find duplicate files
Created 2013-06-22
2,079 commits to master branch, last one 21 days ago
411
4.0k
mit
110
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Created 2015-03-03
5,370 commits to master branch, last one 9 months ago
53
2.0k
gpl-3.0
19
A fast high compression read-only file system for Linux, Windows and macOS
Created 2020-11-21
1,912 commits to main branch, last one 24 days ago
128
1.8k
gpl-3.0
42
Extremely fast tool to remove duplicates and other lint from your filesystem
Created 2010-09-27
3,147 commits to master branch, last one 5 months ago
Simple, configuration-driven backup software for servers and workstations
Created 2014-11-19
2,239 commits to main branch, last one 10 days ago
56
1.6k
apache-2.0
18
rustic - fast, encrypted, and deduplicated backups powered by Rust
Created 2022-03-14
1,542 commits to main branch, last one 18 days ago
66
1.1k
apache-2.0
10
Config driven, easy backup cli for restic.
Created 2019-06-20
525 commits to master branch, last one 14 days ago
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Created 2019-11-22
6,739 commits to master branch, last one 21 days ago
150
916
bsd-3-clause
32
A powerful and modular toolkit for record linkage and duplicate detection in Python
Created 2015-10-18
912 commits to master branch, last one 10 months ago
109
903
agpl-3.0
18
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Created 2021-08-25
1,818 commits to main branch, last one 18 days ago
43
821
unknown
28
Data deduplication engine, supporting optional compression and public key encryption.
Created 2016-03-25
595 commits to master branch, last one about a year ago
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Created 2016-03-25
556 commits to master branch, last one 3 years ago
Коллекция готовых SQL запросов для PostgreSQL по часто возникающим задачам (получение и модификация данных, ускорение запросов, обслуживание БД)
Created 2018-12-26
2,117 commits to master branch, last one 20 hours ago
A list of free data matching and record linkage software.
Created 2018-01-01
53 commits to master branch, last one about a year ago
77
274
mit
10
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
Created 2014-09-03
63 commits to master branch, last one 11 months ago
Filter, Sort & Delete Duplicate Files Recursively
Created 2022-12-25
180 commits to main branch, last one 6 months ago
4
244
apache-2.0
3
Productivity improvements for Rust ecosystem: warnings are skipped until errors are fixed, LSP-independent Neovim integration, etc.
Created 2020-09-13
364 commits to master branch, last one about a month ago
45
237
gpl-2.0
37
A kernel module which provide a pool of deduplicated and/or compressed block storage.
Created 2017-10-22
50 commits to master branch, last one 7 months ago
Deduplicating archiver with encryption and paranoid-level tests. Swiss army knife for the serious backup and disaster recovery manager. Ransomware neutralizer. Win/Linux/Unix
Created 2021-01-20
520 commits to main branch, last one 8 days ago
RocketMQ消息幂等去重消费者,支持使用MySQL或者Redis做幂等表,开箱即用
Created 2020-05-15
17 commits to master branch, last one 3 years ago
30
190
gpl-2.0
33
Userspace tools for managing VDO volumes.
Created 2017-10-22
42 commits to master branch, last one about a month ago
Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources
Created 2012-07-15
1,095 commits to main branch, last one a day ago
18
163
gpl-2.0
11
Fast block-level out-of-band BTRFS deduplication tool.
Created 2020-06-16
110 commits to master branch, last one 6 months ago
📧 CLI to deduplicate mails from mail boxes.
Created 2013-03-25
1,420 commits to main branch, last one a day ago
CLI utility to find near duplicate images and remove all but the best copy.
Created 2018-05-02
520 commits to master branch, last one a day ago
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Created 2021-01-28
373 commits to main branch, last one 2 years ago