1 result found

92
485
apache-2.0
33
High-performance Chinese tokenizer based on the MMSEG algorithm and implemented in ANSI C, with support for both GBK and UTF-8 charsets. Fully modular implementation that can be easily embedded in other...
Created 2014-03-31
148 commits to master branch, last one about a year ago