3 results found Sort:
A lightweight implementation of the Unicode Text Segmentation (UAX #29)
Created
2024-04-13
210 commits to main branch, last one 23 days ago
A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split words, sentences and graphemes.
Created
2020-04-15
264 commits to master branch, last one 6 months ago
Unicode Extended grapheme clusters in nanoseconds
Created
2024-10-10
22 commits to main branch, last one 5 months ago