ERIC Number: ED100137
Record Type: RIE
Publication Date: 1973-Apr
Pages: 14
Abstractor: N/A
Reference Count: 0
ISBN: N/A
ISSN: N/A
Variable-Length Character String Analyses of Three Data-Bases, and their Application for File Compression.
Barton, Ian J.; And Others
A novel text analysis and characterization method involves the generation from text samples of sets of variable-length character strings. These sets are intermediate in number between the character set and the total number of words in a data base; their distributions are less disparate than those of either characters or words. The size of the sets of character strings (key-sets) can be varied arbitrarily by changing parameters. The characteristics of three scientific data bases (two disciplinary, one interdisciplinary) are compared in terms of key-sets of different sizes. Application of the key-sets for file compression, using a variable-to-fixed-length coding strategy, is discussed. (Author)
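The variable-to-fixed-length idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the abstract does not say how key-sets are selected, so the selection rule here (all single characters plus the most frequent longer substrings) is an assumption, as are the function names `build_key_set`, `encode`, and `decode`, the parameters `max_len` and `size`, and the greedy longest-match parse.

```python
from collections import Counter


def build_key_set(sample, max_len=4, size=64):
    """Hypothetical key-set selection (an assumption, not the paper's rule):
    every single character in the sample, then the most frequent substrings
    of length 2..max_len until the key-set holds `size` entries."""
    keys = sorted(set(sample))  # single characters guarantee full coverage
    counts = Counter(
        sample[i:i + n]
        for n in range(2, max_len + 1)
        for i in range(len(sample) - n + 1)
    )
    for substring, _ in counts.most_common():
        if len(keys) >= size:
            break
        keys.append(substring)
    return keys


def encode(text, keys):
    """Greedy longest-match parse: each variable-length key that matches
    is replaced by its index, which is stored as one fixed-width code."""
    index = {key: code for code, key in enumerate(keys)}
    max_len = max(len(key) for key in keys)
    codes, pos = [], 0
    while pos < len(text):
        # Try the longest candidate first, then shorter ones.
        for n in range(min(max_len, len(text) - pos), 0, -1):
            code = index.get(text[pos:pos + n])
            if code is not None:
                codes.append(code)
                pos += n
                break
        else:
            raise ValueError(f"character {text[pos]!r} not covered by key-set")
    return codes


def decode(codes, keys):
    """Invert the encoding by concatenating the keys the codes refer to."""
    return "".join(keys[code] for code in codes)


sample = "the cat sat on the mat, the rat sat on the hat"
keys = build_key_set(sample, max_len=4, size=32)
codes = encode(sample, keys)
bits_per_code = max(1, (len(keys) - 1).bit_length())  # fixed code width
```

Under these assumptions, changing `size` and `max_len` varies the size of the key-set, and the encoded length is `len(codes) * bits_per_code` bits, which can be compared against a fixed-width per-character encoding of the same text.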
Available from: ASLIB, 3 Belgrave Square, London SW1, England (6 pounds, 75 pence, for proceedings of conference)
Publication Type: Speeches/Meeting Papers
Education Level: N/A
Audience: N/A
Language: N/A
Sponsor: N/A
Authoring Institution: N/A
Note: Paper presented at the ASLIB Annual Conference, University of Durham, April, 1973