18.118.166.98
18.118.166.98
close menu
Sorting by Sound-Arbitrary Lexical Ordering for Transcribed Thai Text
( Doung Cooper )
국제 워크샵 1995권 273-279(7pages)
UCI I410-ECN-0102-2015-700-001894706

When either Thai or transcribed (RomanizedJ Thai is sorted alphabetically, words that sound very much alike usually end up far apart. maay and may are thrown to opposite ends of the letter m entries, even though mistaking one for the other causes problems for both foreign students who cannot speak clearly, and Thais who can``t spell. This paper explains how and why the difficulty occurs, and shows why both Thai and transcription are inherently difficult to sort by sound. It introduces a method of preprocessing - deriving phonemic signatures - that lets us define improved lexical or dictionary orders, yet does not require anything but standard sorting code. The method can be applied to other languages - Lno, Khmer, and Burmese - that, like Thai, distinguish words on the basis of vowel length and/or tone.

[자료제공 : 네이버학술정보]
×