Handling Categorical data in Rule Induction

Burgess, M., Janacek, G. J. and Rayward-Smith, V. J. (2003) Handling Categorical data in Rule Induction. In: Proceedings of the International Conference on Artificial Neural Nets and Genetic Algorithms (ICANNGA), 2003-01-01.

Full text not available from this repository. (Request a copy)


In this paper we address problems arising from the use of categorical valued data in rule induction. By naively using categorical values in rule induction, we risk reducing the chances of finding a good rule in terms both of confidence (accuracy) and of support or coverage. In this paper we introduce a technique called arcsin transformation where categorical valued data is replaced with numeric values. Our results show that on relatively large databases, containing many unordered categorical attributes, larger databases incorporating both unordered and numeric data, and especially those databases that are small containing rare cases, this technique is highly effective when dealing with categorical valued data.

Item Type: Conference or Workshop Item (Paper)
Faculty \ School: Faculty of Science > School of Computing Sciences
Depositing User: Vishal Gautam
Date Deposited: 22 Jul 2011 12:47
Last Modified: 25 Aug 2021 23:46
URI: https://ueaeprints.uea.ac.uk/id/eprint/22530

Actions (login required)

View Item View Item