7 results found Sort:
- Filter by Primary Language:
- Python (4)
- Java (1)
- TypeScript (1)
- +
news-please - an integrated web crawler and information extractor for news that just works
Created
2016-12-18
802 commits to master branch, last one 23 days ago
⛓ Extract web links information: title, description, images, videos, etc. [via OpenGraph], runs on mobiles and node.
Created
2016-10-20
362 commits to main branch, last one 6 days ago
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Created
2018-11-22
383 commits to master branch, last one 2 years ago
python implementation of jordansissel's grok regular expression library
Created
2014-07-17
93 commits to master branch, last one 6 years ago
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelin...
Created
2015-05-30
486 commits to master branch, last one 2 years ago
From identity card image, this repo detect 4 corners, align by OpenCV, then detect word in image and recognize word by Transformer OCR.
Created
2020-09-04
64 commits to master branch, last one about a year ago
An open information extraction system that provides compact extractions
Created
2017-07-06
45 commits to master branch, last one 5 years ago