joanrod / ocr-vqgan

OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGAN in CompVis/taming-transformers

Date Created 2022-11-07 (2 years ago)

Commits 10 (last one 2 years ago)

Stargazers 76 (0 this week)

Watchers 1 (0 this week)

Forks 1

License unknown

Ranking

RepositoryStats indexes 624,936 repositories, of these joanrod/ocr-vqgan is ranked #366,855 (41st percentile) for total stargazers, and #551,686 for total watchers. Github reports the primary language for this repository as Python, for repositories using this language it is ranked #70,167/126,986.

joanrod/ocr-vqgan is also tagged with popular topics, for these it's ranked: deep-learning (#5,936/8755), dataset (#767/1208), ocr (#432/635), image-generation (#272/381)

Other Information

joanrod/ocr-vqgan has Github issues enabled, there are 4 open issues and 8 closed issues.

Homepage URL: https://arxiv.org/abs/2210.11248

All Topics

ocr vqgan dataset ocr-vqgan paper2fig deep-learning paper2fig100k image-generation taming-transformers text-reconstruction image-reconstruction deep-generative-model

Star History

Github stargazers over time

Watcher History

Github watchers over time, collection started in '23

Recent Commit History

10 commits on the default branch (master) since jan '22

Yearly Commits

Commits to the default branch (master) per year

Issue History

Languages

The only known language in this repository is Python

updated: 2025-01-28 @ 08:42pm, id: 562925094 / R_kgDOIY2OJg