ERIC Number: ED459829
Record Type: Non-Journal
Publication Date: 2001-Jun
Reference Count: N/A
Database Selection for Processing k Nearest Neighbors Queries in Distributed Environments.
Yu, Clement; Sharma, Prasoon; Meng, Weiyi; Qin, Yan
This paper considers the processing of digital library queries, consisting of a text component and a structured component in distributed environments. The paper concentrates on the processing of the structured component of a distributed query. A method is proposed to identify the databases that are likely to be useful for processing any given query and to determine the tuples from each useful site which are necessary for answering the query. In this way, both the communication cost and the local processing costs are saved. One common characteristic of these "k" nearest neighbors queries is that it is not necessary to obtain all the "k" nearest neighbors; it is often sufficient to get most of the "k" neighbors. Experimental results are provided to demonstrate that most of the "k" nearest neighbors (85% to 100%) are obtained using this approach. An average accuracy rate of 94.7% is achieved when the 20 closest neighbors are desired. (Contains 15 references.) (AEF)
Descriptors: Data Processing, Databases, Electronic Libraries, Information Processing, Information Retrieval, Information Seeking, Online Searching, Online Systems
Association for Computing Machinery, 1515 Broadway, New York NY 10036. Tel: 800-342-6626 (Toll Free); Tel: 212-626-0500; e-mail: email@example.com. For full text: http://www1.acm.org/pubs/contents/proceedings/dl/379437/.
Publication Type: Numerical/Quantitative Data; Reports - Research; Speeches/Meeting Papers
Education Level: N/A
Sponsor: National Science Foundation, Arlington, VA.
Authoring Institution: N/A