ERIC Number: ED027919
Record Type: RIE
Publication Date: 1968-Aug
Reference Count: N/A
Research on Automatic Classification, Indexing and Extracting: A General-Purpose Frequency Program.
Baker, F. T.; Williams, John H., Jr.
To support studies in automatic indexing, classification and extracting, a general-purpose frequency program was developed to further theoretical and practical understanding of text word distributions. While the program is primarily designed for counting strings of character-oriented data, it can be used without change for counting any items that can be represented in an integral number of characters. Counts may be obtained simultaneously at several levels of detail, such as for sentences, paragraphs, chapters and entire documents. Both printed outputs and outputs for further computer processing may be obtained, and a variety of summary and detailed outputs are available. The program, titled FRQNCY, is written in the Fortran IV language and was compiled and run on the IBM System/360 using Fortran IV (G) and the System/360 Operating System. It uses at least one feature of the IBM System/360 Fortran IV language (LOGICAL*1 and INTEGER*2 variables) that is not in USASI Fortran, and hence it may not compile or run under other Fortran systems. The program is extensively parametrized to allow its efficient use on computers with varying amounts of immediate-access storage and input/output equipment. The report is a complete writeup of the frequency program. It covers the purpose and usage of the program and also describes its organization and internal operation. Guidelines for modifying the program or adapting it to different computers are included. (Author)
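Illustrative note (not part of the original record): the idea of obtaining counts simultaneously at several levels of detail can be sketched in a few lines of Python. This is a minimal sketch only, not the FRQNCY algorithm, which the abstract does not describe. The function name `count_levels`, the regular expressions used to split paragraphs, sentences and words, and the sample text are all assumptions made for illustration.

```python
import re
from collections import Counter


def count_levels(text):
    """Count word frequencies per sentence, per paragraph and per document.

    Hypothetical helper: the splitting rules below are simple assumptions
    (blank line = paragraph break, ., ! or ? = sentence end).
    """
    document = Counter()
    paragraphs = []
    sentences = []
    for para in re.split(r"\n\s*\n", text.strip()):
        para_counts = Counter()
        for sent in re.split(r"(?<=[.!?])\s+", para.strip()):
            words = re.findall(r"[a-z0-9']+", sent.lower())
            if not words:
                continue
            sent_counts = Counter(words)
            sentences.append(sent_counts)
            # Roll the sentence counts up into the enclosing paragraph.
            para_counts.update(sent_counts)
        if para_counts:
            paragraphs.append(para_counts)
            # Roll the paragraph counts up into the document total.
            document.update(para_counts)
    return {"sentences": sentences, "paragraphs": paragraphs, "document": document}


sample = "The cat sat. The dog sat too.\n\nThe cat ran."
counts = count_levels(sample)
print(counts["document"].most_common(3))
```

Each lower-level count is added into the level above it, so the text is tokenized once and every level stays consistent with the others.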
Descriptors: Abstracting, Automation, Classification, Computer Programs, Computers, Correlation, Documentation, Indexing, Information Processing, Vocabulary
Available from: Clearinghouse for Federal Scientific and Technical Information, Springfield, Va. 22151 (AD 673 428, MF $0.65, HC $3.00).
Publication Type: N/A
Education Level: N/A
Sponsor: Office of Naval Research, Washington, DC.
Authoring Institution: International Business Machines Corp., Bethesda, MD. Federal Systems Div.