NotesFAQContact Us
Search Tips
Peer reviewed Peer reviewed
ERIC Number: EJ647586
Record Type: Journal
Publication Date: 2002
Pages: N/A
Abstractor: N/A
Reference Count: N/A
ISSN: ISSN-3318-3324
Using Statistical and Contextual Information To Identify Two- and Three-Character Words in Chinese.
Khoo, Christopher S. G.; Dai, Yubin; Loh, Teck Ee
Journal of the American Society for Information Science and Technology, v53 n5 p365-77 Mar 2002
Describes the development of new statistical formulas for identifying two- and three-character words in Chinese text by performing stepwise logistic regression using a sample of sentences that had been manually segmented. Concludes that the new contextual information formulas are substantially better than the mutual information formula. (Author/LRW)
Publication Type: Journal Articles; Reports - Descriptive
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A