opensemanticsearch / open-semantic-etl

Python-based open source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (entity extraction and named entity recognition), and data enrichment (annotation) pipelines, plus an ingestor to a Solr or Elasticsearch index and a linked data graph database

Date Created 2015-05-30 (9 years ago)
Commits 486 (last one 2 years ago)
Stargazers 266 (0 this week)
Watchers 27 (0 this week)
Forks 72
License gpl-3.0

Ranking

RepositoryStats indexes 617,630 repositories. Of these, opensemanticsearch/open-semantic-etl is ranked #150,514 (76th percentile) for total stargazers and #82,265 for total watchers. GitHub reports the primary language for this repository as Python; among repositories using this language, it is ranked #26,539/124,984.
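The page does not say how the percentile is derived. As a rough sketch, assuming it is the share of indexed repositories ranked below this one, rounded to a whole number, the stargazer figure above can be reproduced as follows. The function name and the formula are assumptions for illustration, not RepositoryStats' documented method.

```python
def percentile_from_rank(rank: int, total: int) -> int:
    """Assumed formula: percentage of repositories ranked below `rank`.

    `rank` is 1-based (1 = most stargazers); `total` is the number of
    indexed repositories. Both names are hypothetical.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return round(100 * (1 - rank / total))


# Stargazer rank and index size as reported on this page.
print(percentile_from_rank(150_514, 617_630))  # 76
```

Under this assumption, a lower rank number gives a higher percentile, which matches how the page pairs rank #150,514 with the 76th percentile.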

opensemanticsearch/open-semantic-etl is also tagged with popular topics. Its rank within each: python (#7,368/22,979), nlp (#859/2,485), pdf (#409/1,048), elasticsearch (#320/763), ocr (#222/628), etl (#106/278), named-entity-recognition (#101/225), annotation (#60/181)

Other Information

opensemanticsearch/open-semantic-etl has 1 open pull request on GitHub; 20 pull requests have been merged over the lifetime of the repository.

GitHub issues are enabled; there are 41 open issues and 96 closed issues.

Homepage URL: https://opensemanticsearch.org/etl

Star History

GitHub stargazers over time

[Chart: stargazer count, y-axis 0 to 300, x-axis 2016 to 2025]

Watcher History

GitHub watchers over time, collection started in '23

[Chart: watcher count, y-axis 26 to 27, x-axis Feb '23 to Feb '25]

Recent Commit History

25 commits on the default branch (master) since Jan '22

[Chart: cumulative commits, y-axis 0 to 25, x-axis Jul '22 to 2025]

Yearly Commits

Commits to the default branch (master) per year

[Chart: commits per year, y-axis 0 to 120, x-axis 2015 to 2024]

Issue History

[Chart: total, open, and closed issues over time, y-axis 0 to 140, x-axis 2017 to 2025]

Languages

The primary language is Python; the repository also contains Shell and Dockerfile code.

updated: 2025-02-13 @ 01:09am, id: 36568867 / R_kgDOAi3_Iw