Classification techniques with minimal labelling effort and application to medical reports

Saad, Fathi H., Bell, G. Duncan and de la Iglesia, Beatriz (2008) Classification techniques with minimal labelling effort and application to medical reports. International Journal of Data Mining and Bioinformatics, 2 (3). pp. 268-287. ISSN 1748-5673

Full text not available from this repository. (Request a copy)

Abstract

There are a number of approaches to classify text documents. Here, we use Partially Supervised Classification (PSC) and argue that it is an effective and efficient approach for real-world problems. PSC uses a two-step strategy to cut down on the labelling effort. There are a number of methods that have been proposed for each step. An evaluation of various methods is conducted using real-world medical documents. The results show that using EM to build the classifier yields better results than SVM. We also experimentally show that careful selection of a subset of features to represent the documents can improve performance.

Item Type: Article
Faculty \ School: Faculty of Science > School of Computing Sciences
Related URLs:
Depositing User: Vishal Gautam
Date Deposited: 10 Mar 2011 10:54
Last Modified: 21 Apr 2020 19:06
URI: https://ueaeprints.uea.ac.uk/id/eprint/22377
DOI:

Actions (login required)

View Item View Item