ERIC Number: ED029651
Record Type: RIE
Publication Date: 1968-Mar
Reference Count: N/A
Automatic Informative Abstracting and Extracting. Annual Report.
Earl, L.L.; Robison, H.R.
This fourth annual report summarizes the investigation of (1) a "sentence dictionary" and (2) a "word government dictionary" for use in automatic abstracting and extracting systems. The theory behind the sentence dictionary and its compilation is that significant sentences can be separated from nonsignificant ones on the basis of syntactic sentence type. In the initial experiments on implementing the sentence dictionary, part-of-speech strings were used to designate the syntactic classification of sentences. As this classification proved too discriminative, a new system of syntactic classification was devised in which the part-of-speech strings were replaced by phrase structure representations of the sentences, assigned automatically by a parsing program. The degree of complexity needed for the phrase structure is currently being determined by experiment. The word government dictionary, also discussed in this report, was compiled on the assumption that the set of government words is a small subset of all English words and that word government information would be useful in computational linguistics and information retrieval. This investigation has led to the conclusion that a word's government pattern is often linked to its meaning. (RM)
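The sentence-dictionary idea described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the dictionary entries, the phrase labels (NP, VP, PP), the function name `extract`, and the sample sentences are hypothetical, and the parser that would assign phrase structures is not shown. It is not the report's actual program or data.

```python
# Hypothetical sentence dictionary: maps a syntactic sentence type
# (here, a tuple of phrase labels) to a significance flag.
SENTENCE_DICTIONARY = {
    ("NP", "VP"): True,
    ("NP", "VP", "PP"): True,
    ("PP", "NP", "VP"): False,
}


def extract(sentences, dictionary):
    """Return the text of sentences whose syntactic type is marked significant.

    `sentences` is a list of (text, phrase_structure) pairs. The phrase
    structure is assumed to come from a parser, which is not shown here.
    Types absent from the dictionary are treated as nonsignificant.
    """
    return [
        text
        for text, structure in sentences
        if dictionary.get(tuple(structure), False)
    ]


# Hand-labeled sample input (illustrative only).
parsed = [
    ("The parser assigns a structure.", ["NP", "VP"]),
    ("In this section we digress.", ["PP", "NP", "VP"]),
    ("The method works on abstracts.", ["NP", "VP", "PP"]),
]

selected = extract(parsed, SENTENCE_DICTIONARY)
```

Keying the lookup on a coarse phrase-level type rather than a full part-of-speech string means that many distinct sentences share one dictionary entry, which is one way to read the abstract's remark that the part-of-speech classification was too discriminative.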
Descriptors: Abstracting, Computational Linguistics, Computer Programs, Dictionaries, Information Retrieval, Sentences, Syntax
Clearinghouse for Federal Scientific and Technical Information, Springfield, Va. 22151 (AD 667 473, MF-$0.65, HC-$3.00)
Publication Type: N/A
Education Level: N/A
Sponsor: Office of Naval Research, Washington, DC. Information Systems Research.
Authoring Institution: Lockheed Missiles and Space Co., Palo Alto, CA. Electronic Sciences Lab.
Note: Contains flowcharts and dictionary.