7 results found

412
2.0k
apache-2.0
52
news-please - an integrated web crawler and information extractor for news that just works
Created 2016-12-18
736 commits to master branch, last one 3 days ago
⛓ Extract information from web links: title, description, images, videos, etc. [via OpenGraph]; runs on mobile and Node.
Created 2016-10-20
339 commits to main branch, last one 2 months ago
58
484
unknown
24
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Created 2018-11-22
383 commits to master branch, last one about a year ago
Python implementation of jordansissel's grok regular expression library
Created 2014-07-17
93 commits to master branch, last one 5 years ago
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelin...
Created 2015-05-30
486 commits to master branch, last one about a year ago
From an identity card image, this repo detects the 4 corners, aligns the card with OpenCV, then detects the words in the image and recognizes them with Transformer OCR.
Created 2020-09-04
64 commits to master branch, last one about a year ago
25
88
gpl-3.0
7
An open information extraction system that provides compact extractions
Created 2017-07-06
45 commits to master branch, last one 4 years ago