2 results found Sort:

A lightweight implementation of the Unicode Text Segmentation (UAX #29)
Created 2024-04-13
168 commits to main branch, last one a day ago
A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split words, sentences and graphemes.
Created 2020-04-15
264 commits to master branch, last one 2 months ago