65 results found Sort:
- Filter by Primary Language:
- Python (25)
- Rust (8)
- Go (7)
- C (6)
- C++ (3)
- Java (3)
- Shell (2)
- Jupyter Notebook (2)
- TypeScript (1)
- JavaScript (1)
- PLpgSQL (1)
- Scala (1)
- +
Fast, secure, efficient backup program
Created
2014-04-27
8,739 commits to master branch, last one 9 hours ago
Deduplicating archiver with compression and authenticated encryption.
Created
2015-05-12
8,907 commits to master branch, last one 7 days ago
Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.
Created
2015-12-19
3,536 commits to master branch, last one 2 days ago
Prometheus Alertmanager
Created
2013-07-16
3,351 commits to main branch, last one 4 days ago
Find duplicate files
Created
2013-06-22
2,080 commits to master branch, last one 4 months ago
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Created
2015-03-03
5,370 commits to master branch, last one about a year ago
A fast high compression read-only file system for Linux, Windows and macOS
Created
2020-11-21
2,290 commits to main branch, last one 21 days ago
rustic - fast, encrypted, and deduplicated backups powered by Rust
Created
2022-03-14
1,677 commits to main branch, last one 13 days ago
Extremely fast tool to remove duplicates and other lint from your filesystem
Created
2010-09-27
3,161 commits to master branch, last one 14 days ago
Simple, configuration-driven backup software for servers and workstations
Created
2014-11-19
2,584 commits to main branch, last one 3 days ago
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Created
2019-11-22
9,133 commits to master branch, last one 2 days ago
Config driven, easy backup cli for restic.
Created
2019-06-20
541 commits to master branch, last one about a month ago
A powerful and modular toolkit for record linkage and duplicate detection in Python
Created
2015-10-18
912 commits to master branch, last one about a year ago
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Created
2021-08-25
2,345 commits to main branch, last one 2 days ago
Data deduplication engine, supporting optional compression and public key encryption.
Created
2016-03-25
595 commits to master branch, last one 2 years ago
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Created
2016-03-25
556 commits to master branch, last one 3 years ago
Scalable data pre processing and curation toolkit for LLMs
Created
2024-03-14
234 commits to main branch, last one 7 hours ago
Коллекция готовых SQL запросов для PostgreSQL по часто возникающим задачам (получение и модификация данных, ускорение запросов, обслуживание БД)
Created
2018-12-26
2,261 commits to master branch, last one a day ago
Open source project for data preparation of LLM application builders
Created
2024-04-08
4,290 commits to dev branch, last one 6 hours ago
A list of free data matching and record linkage software.
Created
2018-01-01
53 commits to master branch, last one about a year ago
Filter, Sort & Delete Duplicate Files Recursively
Created
2022-12-25
182 commits to main branch, last one 5 months ago
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
This repository has been archived
(exclude archived)
Created
2014-09-03
63 commits to master branch, last one about a year ago
Deduplicating archiver with encryption and paranoid-level tests. Swiss army knife for the serious backup and disaster recovery manager. Ransomware neutralizer. Win/Linux/Unix
Created
2021-01-20
581 commits to main branch, last one 8 days ago
Productivity improvements for Rust ecosystem: warnings are skipped until errors are fixed, LSP-independent Neovim integration, etc.
Created
2020-09-13
365 commits to master branch, last one 21 days ago
RocketMQ消息幂等去重消费者,支持使用MySQL或者Redis做幂等表,开箱即用
Created
2020-05-15
17 commits to master branch, last one 3 years ago
A kernel module which provide a pool of deduplicated and/or compressed block storage.
Created
2017-10-22
54 commits to master branch, last one about a month ago
Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources
Created
2012-07-15
1,233 commits to main branch, last one 27 days ago
Userspace tools for managing VDO volumes.
Created
2017-10-22
44 commits to master branch, last one 2 months ago
A secure and efficient file backup solution that fits both system administrators (CLI) and end users (GUI)
Created
2023-01-26
1,436 commits to main branch, last one 12 days ago
Fast block-level out-of-band BTRFS deduplication tool.
Created
2020-06-16
111 commits to master branch, last one about a month ago