NotesFAQContact Us
Search Tips
Peer reviewed Peer reviewed
ERIC Number: EJ647563
Record Type: Journal
Publication Date: 2002
Pages: N/A
Abstractor: N/A
Reference Count: N/A
ISSN: ISSN-0306-4573
The Use of Bigrams To Enhance Text Categorization.
Tan, Chade-Meng; Wang, Yuan-Fang; Lee, Chan-Do
Information Processing & Management, v38 n4 p529-46 Jul 2002
Presents an efficient text categorization (or text classification) algorithm for document retrieval of natural language texts that generates bigrams (two-word phrases) and uses the information gain metric, combined with various frequency thresholds. Experimental results suggest that the bigrams can substantially raise the quality of feature sets. (Author/LRW)
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A