Peer reviewed
ERIC Number: EJ647562
Record Type: Journal
Publication Date: 2002
Pages: N/A
Abstractor: N/A
ISBN: N/A
ISSN: ISSN-0306-4573
EISSN: N/A
Integrated Multi-Strategic Web Document Pre-Processing for Sentence and Word Boundary Detection.
Shim, Junhyeok; Kim, Dongseok; Cha, Jeongwon; Lee, Gary Geunbae; Seo, Jungyun
Information Processing & Management, v38 n4 p509-27 Jul 2002
Discussion of natural language processing focuses on a multi-strategic integrated text preprocessing method for difficult problems of sentence boundary disambiguation and word boundary disambiguation of Web texts. Describes an evaluation of the method using Korean Web document collections. (Author/LRW)
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A