3.129.13.201
3.129.13.201
close menu
Identification of Chinese Personal Names in Unrestricted Texts
( Lawrence Cheung ) , ( Benjamin K Tsou ) , ( Maosong Sun )
국제 워크샵 2002권 28-35(8pages)
UCI I410-ECN-0102-2015-700-001895627

Automatic identification of Chinese personal names in unrestricted texts is a key task in Chinese word segmentation, and can affect other NLP tasks such as word segmentation and information retrieval, if it is not properly addressed. This paper (1) demonstrates the problems of Chinese personal name identification in some IT applications, (2) analyzes the structure of Chinese personal names, and (3) further presents the relevant processing strategies. The geographical differences of Chinese personal names between Beijing and Hong Kong are highlighted at the end. It shows that variation in names across different Chinese communities constitutes a critical factor in designing Chinese personal name identification algorithm.

[자료제공 : 네이버학술정보]
×