skip to main content
Automatic rule discovery and generalization in supervised and unsupervised learning tasks
Publisher:
  • Carleton University
  • 1125 Colonel-By Drive Ottawa, Ont. K1S 5B6
  • Canada
ISBN:978-0-494-43888-6
Order Number:AAINR43888
Pages:
331
Reflects downloads up to 18 Jan 2025Bibliometrics
Skip Abstract Section
Abstract

Data Mining algorithms have been the focus of much research in recent years and new techniques are being developed regularly. This thesis describes EvRFind, an application for rule discovery in the task of Data Mining.

EvRFind is a hybrid Genetic Algorithm that also employs techniques from statistics and machine learning to improve efficiency and performance of the search. Among the non-evolutionary components are algorithms such as gradient ascent local search (Hill Climbing), optimization methods designed to improve search speed, automatic concept generalization, and automatic expansion of the description language.

EvRFind creates predictive models in the form of a default hierarchy. Each hierarchy is comprised of a set of rules that are ordered by generality, and selected with a bias towards minimum-length and comprehensibility.

Experiments on several datasets are run to evaluate EvRFind, and the results are compared to published work. To properly evaluate and illustrate the features and expressive power of EvRFind, the Poker Hand Dataset was created. This dataset represents a very large, imbalanced, and challenging domain. There are several target concepts, each with varying distribution within the dataset. The results achieved by EvRFind are compared to those generated by several other machine learning algorithms.

Contributors
  • Carleton University

Recommendations