1 result found
High-performance Chinese tokenizer supporting both GBK and UTF-8 charsets, based on the MMSEG algorithm and written in ANSI C. Fully modular implementation that can be easily embedded in other...
Created: 2014-03-31
148 commits to master branch, last one about a year ago