Hybrid Feature Selection Method for Improving File Fragment Classification

Algurashi, Alia and Wang, Wenjia (2019) Hybrid Feature Selection Method for Improving File Fragment Classification. In: Artificial Intelligence XXXVI - 39th SGAI International Conference on Artificial Intelligence, AI 2019, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) . UNSPECIFIED, pp. 379-391. ISBN 978-3-030-34884-7

Full text not available from this repository. (Request a copy)


Identifying types of file fragments in isolation from their context is an essential task in digital forensic analysis and can be done with several methods. One common approach is to extract various types of features from file fragments as inputs for classification algorithms. However, this approach suffers from dimensionality curse as the number of the extracted features is too high, which causes the learning and classification to be both inefficient and inaccurate. This paper proposes a hybrid method to address this issue by using filters and wrappers to significantly reduce the number of features and also improve the accuracy of file type classification. First, it uses and combines three appropriate filters to filter out a large number of irrelevant and/or less important features, and then some wrappers to reduce the number of features further to the most salient ones. Our method was tested on some benchmark datasets - GovDocs, and the experimental results indicated that our method was able to not only reduce the number of features from 66,313 to 11–32, but also improve the accuracy of the classification, compared with other methods that used all the features.

Item Type: Book Section
Uncontrolled Keywords: feature selection,file fragment classification,filters,forensics,hybrid method,wrappers,theoretical computer science,computer science(all) ,/dk/atira/pure/subjectarea/asjc/2600/2614
Faculty \ School: Faculty of Science > School of Computing Sciences
UEA Research Groups: Faculty of Science > Research Groups > Data Science and Statistics
Related URLs:
Depositing User: LivePure Connector
Date Deposited: 19 May 2020 00:24
Last Modified: 24 Oct 2023 01:51
URI: https://ueaeprints.uea.ac.uk/id/eprint/75222
DOI: 10.1007/978-3-030-34885-4_29

Actions (login required)

View Item View Item