2 results found Sort:
A lightweight implementation of the Unicode Text Segmentation (UAX #29)
Created
2024-04-13
168 commits to main branch, last one 18 days ago
A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split words, sentences and graphemes.
Created
2020-04-15
264 commits to master branch, last one 2 months ago