ERIC Number: ED533639
Record Type: Non-Journal
Publication Date: 2011
Pages: 145
Abstractor: As Provided
ISBN: ISBN-978-1-1248-9428-7
ISSN: N/A
EISSN: N/A
An Evolutionary Algorithm for Feature Subset Selection in Hard Disk Drive Failure Prediction
Bhasin, Harpreet
ProQuest LLC, Ph.D. Dissertation, Nova Southeastern University
Hard disk drives are used in everyday life to store critical data. Although they are reliable, failure of a hard disk drive can be catastrophic, especially in applications like medicine, banking, air traffic control systems, missile guidance systems, computer numerical controlled machines, and more. The use of Self-Monitoring, Analysis and Reporting Technology (SMART) can aid in failure prediction by monitoring specific drive attributes and warning the user of an impending failure so that the user can backup data while there is still time. As a consequence, hard drive failure prediction has become an important problem and the subject of active research. The best available approaches for hard drive failure prediction achieve acceptably low false alarm rates by first selecting a subset of features using non-parametric statistical methods such as reverse arrangements and then using the multiple-instance naive Bayes classifier for the prediction task. However, the prediction accuracy of this approach is not sufficiently high. The focus of this dissertation was to improve the drive failure prediction accuracy while maintaining a low false alarm rate by using a genetic algorithm for feature set reduction in conjunction with the multiple-instance naive Bayes classifier for the prediction task. This research achieved a failure detection rate of 81% with a 0% false alarm rate on 12 attributes selected by the genetic algorithm. As a secondary contribution, this dissertation investigated the tradeoff between feature subset reduction and prediction accuracy in the hard drive prediction problem. This research found that as the number of features decreased below 10, the detection accuracy decreased significantly. [The dissertation citations contained here are published with the permission of ProQuest LLC. Further reproduction is prohibited without permission. Copies of dissertations may be obtained by Telephone (800) 1-800-521-0600. Web page: http://www.proquest.com/en-US/products/dissertations/individuals.shtml.]
Descriptors: Prediction, Computer Peripherals, Mathematics, Researchers, Statistical Analysis, Computers, Information Technology
ProQuest LLC. 789 East Eisenhower Parkway, P.O. Box 1346, Ann Arbor, MI 48106. Tel: 800-521-0600; Web site: http://www.proquest.com/en-US/products/dissertations/individuals.shtml
Publication Type: Dissertations/Theses - Doctoral Dissertations
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A