65 results found Sort:

1.6k
27.0k
bsd-2-clause
240
Fast, secure, efficient backup program
Created 2014-04-27
8,739 commits to master branch, last one 9 hours ago
752
11.3k
other
151
Deduplicating archiver with compression and authenticated encryption.
Created 2015-05-12
8,907 commits to master branch, last one 7 days ago
410
8.3k
apache-2.0
57
Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.
Created 2015-12-19
3,536 commits to master branch, last one 2 days ago
2.2k
6.7k
apache-2.0
178
Prometheus Alertmanager
Created 2013-07-16
3,351 commits to main branch, last one 4 days ago
418
5.5k
gpl-3.0
106
Find duplicate files
Created 2013-06-22
2,080 commits to master branch, last one 4 months ago
424
4.1k
mit
110
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Created 2015-03-03
5,370 commits to master branch, last one about a year ago
61
2.2k
gpl-3.0
22
A fast high compression read-only file system for Linux, Windows and macOS
Created 2020-11-21
2,290 commits to main branch, last one 21 days ago
76
2.1k
apache-2.0
20
rustic - fast, encrypted, and deduplicated backups powered by Rust
Created 2022-03-14
1,677 commits to main branch, last one 13 days ago
132
2.0k
gpl-3.0
45
Extremely fast tool to remove duplicates and other lint from your filesystem
Created 2010-09-27
3,161 commits to master branch, last one 14 days ago
Simple, configuration-driven backup software for servers and workstations
Created 2014-11-19
2,584 commits to main branch, last one 3 days ago
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Created 2019-11-22
9,133 commits to master branch, last one 2 days ago
75
1.4k
apache-2.0
12
Config driven, easy backup cli for restic.
Created 2019-06-20
541 commits to master branch, last one about a month ago
153
973
bsd-3-clause
32
A powerful and modular toolkit for record linkage and duplicate detection in Python
Created 2015-10-18
912 commits to master branch, last one about a year ago
121
968
agpl-3.0
16
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Created 2021-08-25
2,345 commits to main branch, last one 2 days ago
43
834
unknown
27
Data deduplication engine, supporting optional compression and public key encryption.
Created 2016-03-25
595 commits to master branch, last one 2 years ago
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Created 2016-03-25
556 commits to master branch, last one 3 years ago
Коллекция готовых SQL запросов для PostgreSQL по часто возникающим задачам (получение и модификация данных, ускорение запросов, обслуживание БД)
Created 2018-12-26
2,261 commits to master branch, last one a day ago
143
376
apache-2.0
18
Open source project for data preparation of LLM application builders
Created 2024-04-08
4,290 commits to dev branch, last one 6 hours ago
A list of free data matching and record linkage software.
Created 2018-01-01
53 commits to master branch, last one about a year ago
Filter, Sort & Delete Duplicate Files Recursively
Created 2022-12-25
182 commits to main branch, last one 5 months ago
80
283
mit
10
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
This repository has been archived (exclude archived)
Created 2014-09-03
63 commits to master branch, last one about a year ago
Deduplicating archiver with encryption and paranoid-level tests. Swiss army knife for the serious backup and disaster recovery manager. Ransomware neutralizer. Win/Linux/Unix
Created 2021-01-20
581 commits to main branch, last one 8 days ago
3
271
apache-2.0
3
Productivity improvements for Rust ecosystem: warnings are skipped until errors are fixed, LSP-independent Neovim integration, etc.
Created 2020-09-13
365 commits to master branch, last one 21 days ago
RocketMQ消息幂等去重消费者,支持使用MySQL或者Redis做幂等表,开箱即用
Created 2020-05-15
17 commits to master branch, last one 3 years ago
46
242
gpl-2.0
37
A kernel module which provide a pool of deduplicated and/or compressed block storage.
Created 2017-10-22
54 commits to master branch, last one about a month ago
Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources
Created 2012-07-15
1,233 commits to main branch, last one 27 days ago
32
193
gpl-2.0
32
Userspace tools for managing VDO volumes.
Created 2017-10-22
44 commits to master branch, last one 2 months ago
5
178
gpl-3.0
3
A secure and efficient file backup solution that fits both system administrators (CLI) and end users (GUI)
Created 2023-01-26
1,436 commits to main branch, last one 12 days ago
18
169
gpl-2.0
12
Fast block-level out-of-band BTRFS deduplication tool.
Created 2020-06-16
111 commits to master branch, last one about a month ago