Saad, Fathi H., Bell, G. Duncan and de la Iglesia, Beatriz ORCID: https://orcid.org/0000-0003-2675-5826
(2008)
Classification techniques with minimal labelling effort and application to medical reports.
International Journal of Data Mining and Bioinformatics, 2 (3).
pp. 268-287.
ISSN 1748-5673
Abstract
There are a number of approaches to classify text documents. Here, we use Partially Supervised Classification (PSC) and argue that it is an effective and efficient approach for real-world problems. PSC uses a two-step strategy to cut down on the labelling effort. There are a number of methods that have been proposed for each step. An evaluation of various methods is conducted using real-world medical documents. The results show that using EM to build the classifier yields better results than SVM. We also experimentally show that careful selection of a subset of features to represent the documents can improve performance.
Item Type: | Article |
---|---|
Faculty \ School: | Faculty of Science > School of Computing Sciences |
Depositing User: | Vishal Gautam |
Date Deposited: | 10 Mar 2011 10:54 |
Last Modified: | 22 Apr 2023 01:08 |
URI: | https://ueaeprints.uea.ac.uk/id/eprint/22377 |
DOI: |
Actions (login required)
![]() |
View Item |