opensemanticsearch / open-semantic-etl

Python-based open source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (entity extraction and named entity recognition), and data enrichment (annotation) pipelines, plus an ingestor to a Solr or Elasticsearch index and a linked data graph database

Date Created 2015-05-30 (9 years ago)
Commits 486 (last one about a year ago)
Stargazers 254 (0 this week)
Watchers 27 (0 this week)
Forks 68
License gpl-3.0
Ranking

RepositoryStats indexes 534,551 repositories. Of these, opensemanticsearch/open-semantic-etl is ranked #141,959 (73rd percentile) for total stargazers and #79,971 for total watchers. GitHub reports the primary language for this repository as Python; among repositories using this language, it is ranked #24,101/103,470.
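The percentile quoted above can be reproduced from the rank and the index size. The following is a minimal sketch, assuming the percentile is the share of indexed repositories ranked below this one, rounded down; the function name `percentile_from_rank` is hypothetical, and the formula is an assumption that happens to match the published figure, not a documented RepositoryStats method.

```python
def percentile_from_rank(rank: int, total: int) -> float:
    """Return the percentage of ranked items that sit below `rank`.

    Assumption: rank 1 is the best position, and the percentile is
    100 * (1 - rank / total). This is an illustrative formula only.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return 100.0 * (1 - rank / total)


# Figures taken from the ranking paragraph above.
stargazer_percentile = percentile_from_rank(141_959, 534_551)
python_percentile = percentile_from_rank(24_101, 103_470)

print(int(stargazer_percentile))  # 73, matching the stated 73rd percentile
print(round(python_percentile, 1))
```

The same function applies to any of the rank/total pairs on this page, such as the per-topic rankings.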

opensemanticsearch/open-semantic-etl is also tagged with popular topics. For these, it is ranked: python (#6,934/20,447), nlp (#819/2,260), pdf (#374/928), elasticsearch (#315/720), ocr (#192/533), etl (#99/237), named-entity-recognition (#99/214), annotation (#59/166)

Other Information

opensemanticsearch/open-semantic-etl has GitHub issues enabled. There are 41 open issues and 96 closed issues.

Homepage URL: https://opensemanticsearch.org/etl

Star History

GitHub stargazers over time

Watcher History

GitHub watchers over time, collection started in '23

Recent Commit History

25 commits on the default branch (master) since Jan '22

Yearly Commits

Commits to the default branch (master) per year

Issue History

Languages

The primary language is Python, but other languages are also present.

updated: 2024-06-26 @ 08:34am, id: 36568867 / R_kgDOAi3_Iw