70 results found Sort:

1.6k
28.1k
bsd-2-clause
240
Fast, secure, efficient backup program
Created 2014-04-27
8,945 commits to master branch, last one a day ago
761
11.7k
other
152
Deduplicating archiver with compression and authenticated encryption.
Created 2015-05-12
8,948 commits to master branch, last one 2 days ago
439
9.2k
apache-2.0
54
Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.
Created 2015-12-19
3,622 commits to master branch, last one a day ago
2.2k
6.9k
apache-2.0
177
Prometheus Alertmanager
Created 2013-07-16
3,443 commits to main branch, last one a day ago
429
5.9k
gpl-3.0
106
Find duplicate files
Created 2013-06-22
2,080 commits to master branch, last one 7 months ago
432
4.2k
mit
107
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Created 2015-03-03
5,472 commits to master branch, last one about a month ago
84
2.3k
apache-2.0
22
rustic - fast, encrypted, and deduplicated backups powered by Rust
Created 2022-03-14
1,685 commits to main branch, last one 15 days ago
70
2.3k
gpl-3.0
23
A fast high compression read-only file system for Linux, Windows and macOS
Created 2020-11-21
2,550 commits to main branch, last one 3 days ago
135
2.1k
gpl-3.0
41
Extremely fast tool to remove duplicates and other lint from your filesystem
Created 2010-09-27
3,255 commits to master branch, last one 9 days ago
Simple, configuration-driven backup software for servers and workstations
Created 2014-11-19
2,890 commits to main branch, last one a day ago
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Created 2019-11-22
9,269 commits to master branch, last one 8 days ago
81
1.5k
apache-2.0
13
Config driven, easy backup cli for restic.
Created 2019-06-20
544 commits to master branch, last one 12 days ago
125
1.0k
agpl-3.0
16
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Created 2021-08-25
2,573 commits to main branch, last one 3 days ago
156
994
bsd-3-clause
32
A powerful and modular toolkit for record linkage and duplicate detection in Python
Created 2015-10-18
912 commits to master branch, last one about a year ago
45
837
unknown
26
Data deduplication engine, supporting optional compression and public key encryption.
Created 2016-03-25
595 commits to master branch, last one 2 years ago
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Created 2016-03-25
556 commits to master branch, last one 3 years ago
Коллекция готовых SQL запросов для PostgreSQL по часто возникающим задачам (получение и модификация данных, ускорение запросов, обслуживание БД)
Created 2018-12-26
2,345 commits to master branch, last one 6 days ago
184
600
apache-2.0
19
Open source project for data preparation of LLM application builders
Created 2024-04-08
5,134 commits to dev branch, last one 18 hours ago
Fast Semantic Text Deduplication
Created 2024-10-13
42 commits to main branch, last one 10 days ago
A list of free data matching and record linkage software.
Created 2018-01-01
53 commits to master branch, last one about a year ago
Filter, Sort & Delete Duplicate Files Recursively
Created 2022-12-25
182 commits to main branch, last one 9 months ago
Deduplicating archiver with encryption and paranoid-level tests. Swiss army knife for the serious backup and disaster recovery manager. Ransomware neutralizer. Win/Linux/Unix
Created 2021-01-20
586 commits to main branch, last one a day ago
80
285
mit
9
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
This repository has been archived (exclude archived)
Created 2014-09-03
63 commits to master branch, last one about a year ago
4
275
apache-2.0
2
Productivity improvements for Rust ecosystem: warnings are skipped until errors are fixed, LSP-independent Neovim integration, etc.
Created 2020-09-13
365 commits to master branch, last one 4 months ago
RocketMQ消息幂等去重消费者,支持使用MySQL或者Redis做幂等表,开箱即用
Created 2020-05-15
17 commits to master branch, last one 4 years ago
46
243
gpl-2.0
35
A kernel module which provide a pool of deduplicated and/or compressed block storage.
Created 2017-10-22
55 commits to master branch, last one 2 months ago
Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources
Created 2012-07-15
1,345 commits to main branch, last one a day ago
32
196
gpl-2.0
31
Userspace tools for managing VDO volumes.
Created 2017-10-22
46 commits to master branch, last one 21 days ago
6
196
gpl-3.0
3
A secure and efficient file backup solution that fits both system administrators (CLI) and end users (GUI)
Created 2023-01-26
1,787 commits to main branch, last one 6 days ago