ERIC Number: EJ469114
Record Type: Journal
Publication Date: 1993
Reference Count: N/A
Generating and Evaluating Domain-Oriented Multi-Word Terms from Texts.
Damerau, Fred J.
Information Processing and Management, v29 n4 p433-47 Jul-Aug 1993
Examines the use of various statistical techniques for generating domain-oriented multiword vocabulary terms for natural language database systems. The study concludes that the vocabulary clustering effect should be taken into account when calculating significance, and that a simple ratio of a term's relative frequency in the subject-matter sample to its relative frequency in the total sample is adequate. (Contains 17 references.) (EAM)
Descriptors: Automatic Indexing, Cluster Analysis, Comparative Analysis, Database Design, Databases, Evaluation Methods, Information Retrieval, Intellectual Disciplines, Language Patterns, Predictor Variables, Probability, Ratios (Mathematics), Reliability, Statistical Analysis, Subject Index Terms, Validity
Publication Type: Reports - Research; Journal Articles
Education Level: N/A
Authoring Institution: N/A
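
Illustrative note: the ratio named in the abstract (relative frequency in the subject-matter sample divided by relative frequency in the total sample) might be computed along the following lines. This is a minimal sketch, not the article's procedure. The record does not give the exact formula, the tokenization, or the treatment of unseen terms, so the restriction to adjacent word pairs, the function names `bigrams` and `relative_frequency_ratio`, and the choice to skip pairs absent from the total sample are all assumptions. The vocabulary clustering effect mentioned in the abstract is not modeled here.

```python
from collections import Counter


def bigrams(tokens):
    """Return the adjacent word pairs of a token list (assumed term shape)."""
    return list(zip(tokens, tokens[1:]))


def relative_frequency_ratio(domain_tokens, total_tokens):
    """Score each bigram seen in the subject-matter sample.

    score = (count in domain / bigrams in domain)
            / (count in total / bigrams in total)

    Hypothetical helper: names and edge-case handling are assumptions,
    not taken from the article.
    """
    domain_counts = Counter(bigrams(domain_tokens))
    total_counts = Counter(bigrams(total_tokens))
    n_domain = sum(domain_counts.values())
    n_total = sum(total_counts.values())

    scores = {}
    for pair, count in domain_counts.items():
        total_count = total_counts.get(pair, 0)
        if total_count == 0:
            # Assumption: a pair missing from the total sample has no
            # defined ratio, so it is skipped rather than smoothed.
            continue
        scores[pair] = (count / n_domain) / (total_count / n_total)
    return scores


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    domain = "the interest rate and the interest rate".split()
    total = domain + "of the cat and the dog".split()
    ranked = sorted(relative_frequency_ratio(domain, total).items(),
                    key=lambda item: item[1], reverse=True)
    for pair, score in ranked:
        print(" ".join(pair), round(score, 3))
```

A score above 1 means the pair is relatively more frequent in the subject-matter sample than in the total sample, which is the sense in which such a ratio could rank candidate domain terms.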