7 results found Sort:

428
2.1k
apache-2.0
54
news-please - an integrated web crawler and information extractor for news that just works
Created 2016-12-18
802 commits to master branch, last one 23 days ago
⛓ Extract web links information: title, description, images, videos, etc. [via OpenGraph], runs on mobiles and node.
Created 2016-10-20
362 commits to main branch, last one 6 days ago
59
491
unknown
24
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Created 2018-11-22
383 commits to master branch, last one 2 years ago
python implementation of jordansissel's grok regular expression library
Created 2014-07-17
93 commits to master branch, last one 6 years ago
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelin...
Created 2015-05-30
486 commits to master branch, last one 2 years ago
From identity card image, this repo detect 4 corners, align by OpenCV, then detect word in image and recognize word by Transformer OCR.
Created 2020-09-04
64 commits to master branch, last one about a year ago
25
88
gpl-3.0
7
An open information extraction system that provides compact extractions
Created 2017-07-06
45 commits to master branch, last one 5 years ago