49 results found Sort:

308
5.8k
apache-2.0
82
Transforms PDF, Documents and Images into Enriched Structured Data
Created 2019-08-05
1,987 commits to master branch, last one 12 months ago
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
Created 2018-03-15
12,397 commits to main branch, last one 3 days ago
378
3.8k
apache-2.0
72
extract internal monitoring data from application logs for collection in a timeseries database
Created 2014-05-27
4,310 commits to main branch, last one 10 days ago
376
3.3k
gpl-3.0
84
a library for audio and music analysis
Created 2009-12-04
4,161 commits to master branch, last one 8 months ago
Provides functions to read and write from/to an object or array using a simple string notation
Created 2013-01-13
1,084 commits to 7.1 branch, last one 2 days ago
775
2.4k
apache-2.0
97
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
Created 2009-05-21
8,959 commits to main branch, last one 18 hours ago
242
2.3k
mit
69
Visual Novels resource browser
Created 2014-07-21
3,043 commits to master branch, last one 11 months ago
80
2.2k
other
20
Extract files from any kind of container formats
Created 2021-06-08
1,945 commits to main branch, last one a day ago
185
1.6k
mit
44
node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
Created 2013-04-23
307 commits to master branch, last one 5 years ago
234
1.5k
apache-2.0
39
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Created 2014-06-26
475 commits to master branch, last one about a year ago
🦜⛏️ Did you say you like data?
Created 2024-02-29
80 commits to main branch, last one 3 months ago
Stanford Open Information Extraction made simple!
Created 2016-07-08
125 commits to master branch, last one 8 months ago
116
630
mpl-2.0
20
A C++ static library offering a clean and simple interface to the 7-zip shared libraries.
Created 2014-12-14
1,590 commits to master branch, last one about a month ago
76
605
gpl-3.0
15
A program to extract files from the RPA archive format.
Created 2011-12-25
37 commits to master branch, last one 4 years ago
69
454
unknown
13
北京航空航天大学大数据高精尖中心自然语言处理研究团队对信息抽取领域的调研。包括实体识别,关系抽取,属性抽取等子任务,每类子任务分别对学术界和工业界进行调研。
Created 2020-06-22
143 commits to master branch, last one 2 years ago
File Injector is a script that allows you to store any file in an image using steganography
Created 2022-10-22
41 commits to main branch, last one about a year ago
15
403
mit
10
PHP URI Template (RFC 6570) supports both URI expansion & extraction
Created 2014-03-31
68 commits to master branch, last one 18 days ago
40
361
mit
27
Toolchain that lets you interact with the Overwatch files and extract models and stuff.
Created 2017-04-28
2,440 commits to develop branch, last one 9 days ago
Extracts OTP tokens from rooted Android devices
Created 2017-10-21
94 commits to master branch, last one 3 years ago
An actual, updated, surviv.io cheat. Works great and we reply fast.
Created 2019-05-31
199 commits to master branch, last one 2 years ago
43
222
gpl-3.0
17
SEO Macroscope is a website scanning tool, to check your website for broken links; including some technical SEO functionality, site scraping, Excel reporting, and more.
Created 2016-12-15
674 commits to master branch, last one 6 months ago
Java library to extract links (URLs, email addresses) from plain text; fast, small and smart
Created 2015-06-04
161 commits to main branch, last one 7 months ago
36
198
apache-2.0
9
A simple archiving and compression library for Java
Created 2013-03-28
171 commits to master branch, last one 3 years ago
77
190
mit
4
Open source Emoticons and Emoji detection library: emot
Created 2017-06-18
83 commits to master branch, last one about a year ago
The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url
Created 2018-02-07
69 commits to master branch, last one about a year ago
Extract AutoIt scripts embedded in PE binaries
Created 2020-03-18
61 commits to master branch, last one 7 months ago
Extract tables from PDF files (port of tabula-java)
Created 2020-09-08
192 commits to master branch, last one 11 months ago
9
116
mit
12
DocILE: Document Information Localization and Extraction Benchmark
Created 2022-10-19
119 commits to main branch, last one 4 months ago
Detect hidden files and text in images
Created 2018-04-04
30 commits to master branch, last one about a year ago
A ground segmentation algorithm for 3D point clouds based on the work described in “Fast segmentation of 3D point clouds: a paradigm on LIDAR data for Autonomous Vehicle Applications”, D. Zermas, I. I...
Created 2020-12-09
4 commits to main branch, last one 2 years ago