Peer reviewed
ERIC Number: EJ670089
Record Type: Journal
Publication Date: 2003
Pages: N/A
Abstractor: N/A
Reference Count: N/A
ISSN: 0306-4573
Unsupervised Learning of mDTD Extraction Patterns for Web Text Mining.
Kim, Dongseok; Jung, Hanmin; Lee, Gary Geunbae
Information Processing & Management, v39 n4 p623-37 Jul 2003
Presents a new extraction pattern, modified Document Type Definition (mDTD), which relies on analytical interpretation to identify extraction targets in the contents of Web documents. Experiments with 330 Korean and 220 English Web documents from audio and video shopping sites yielded an average extraction precision of 91.3% for Korean and 81.9% for English. (AEF)
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A