Research and Implementation of Unlisted Word Discovery System

Shi-wei JIA, Yu-meng ZHANG

Abstract


Unlisted words, that is, words absent from the segmentation dictionary, are a persistent problem in Chinese word segmentation. This paper proposes an improved Apriori algorithm that identifies unlisted words quickly and accurately. The improved algorithm uses a compressed-database approach to reduce the number of transactions scanned. Compared with the traditional n-tuple algorithm and the NApriori algorithm, it is faster and more effective.
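
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that each sentence is one transaction, that candidates are contiguous character n-grams, and that support is counted as the number of occurrences. The function name `frequent_ngrams` and the parameters `min_support` and `max_len` are hypothetical. Transaction compression is modeled here by dropping any sentence that contains no frequent k-gram, since such a sentence cannot contain a frequent (k+1)-gram.

```python
from collections import Counter


def frequent_ngrams(sentences, min_support=2, max_len=4):
    """Return {ngram: count} for contiguous character n-grams of length >= 2
    that occur at least min_support times (hypothetical helper, a sketch)."""
    transactions = [s for s in sentences if s]

    # Level 1: frequent single characters.
    counts = Counter(ch for s in transactions for ch in s)
    frequent = {g for g, c in counts.items() if c >= min_support}

    result = {}
    k = 1
    while frequent and k < max_len:
        k += 1
        counts = Counter()
        for s in transactions:
            for i in range(len(s) - k + 1):
                gram = s[i:i + k]
                # Apriori property: a k-gram can only be frequent if both of
                # its (k-1)-length sub-grams were frequent at the last level.
                if gram[:-1] in frequent and gram[1:] in frequent:
                    counts[gram] += 1

        frequent = {g for g, c in counts.items() if c >= min_support}
        result.update({g: counts[g] for g in frequent})

        # Transaction compression (assumed form): discard sentences with no
        # frequent k-gram, so later levels scan a smaller database.
        transactions = [
            s for s in transactions
            if any(s[i:i + k] in frequent for i in range(len(s) - k + 1))
        ]
    return result


if __name__ == "__main__":
    corpus = ["区块链技术", "区块链应用", "新技术"]
    for gram, count in sorted(frequent_ngrams(corpus).items()):
        print(gram, count)
```

In this toy corpus the third sentence is dropped after the 2-gram pass survives only through "技术", and is removed before the 4-gram pass, so the last level scans two sentences instead of three. A real system would additionally filter the frequent strings against the existing dictionary and apply further statistical checks before accepting them as unlisted words.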

Keywords


Unlisted Word, Apriori Algorithm, Transaction Compression


DOI
10.12783/dtetr/icmca2017/12347
