An extremely fast implementation of Aho Corasick algorithm based on Double Array Trie.
Viimeinen Julkaisu on lokak. 07, 2016汉语言处理包
Viimeinen Julkaisu on nullA Lucene tokenizer plugin for both Simplified Chinese and Traditional Chinese, featured with Chinese Word Segmentation, custom dictionary etc.
Viimeinen Julkaisu on jouluk. 14, 2016HanLP: Han Language Processing
Viimeinen Julkaisu on jouluk. 27, 2020A Lucene tokenizer plugin for both Simplified Chinese and Traditional Chinese, featured with Chinese Word Segmentation, custom dictionary etc.
Viimeinen Julkaisu on jouluk. 14, 2016