Clustering Imputation for Air Pollution Data

Alahamade, Wedad, Lake, Iain ORCID: https://orcid.org/0000-0003-4407-5357, Reeves, Claire E. ORCID: https://orcid.org/0000-0003-4071-1926 and De La Iglesia, Beatriz ORCID: https://orcid.org/0000-0003-2675-5826 (2020) Clustering Imputation for Air Pollution Data. In: Hybrid Artificial Intelligent Systems. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) . Springer, Cham, pp. 585-597. ISBN 978-3-030-61705-9

[thumbnail of Accepted_Manuscript]
Preview
PDF (Accepted_Manuscript) - Accepted Version
Download (2MB) | Preview

Abstract

Air pollution is a global problem. The assessment of air pollution concentration data is important for evaluating human exposure and the associated risk to health. Unfortunately, air pollution monitoring stations often have periods of missing data or do not measure all pollutants. In this study, we experiment with different approaches to estimate the whole time series for a missing pollutant at a monitoring station as well as missing values within a time series. The main goal is to reduce the uncertainty in air quality assessment. To develop our approach we combine single and multiple imputation, nearest neighbour geographical distance methods and a clustering algorithm for time series. For each station that measures ozone, we produce various imputations for this pollutant and measure the similarity/error between the imputed and the real values. Our results show that imputation by average based on clustering results combined with multiple imputation for missing values is the most reliable and is associated with lower average error and standard deviation.

Item Type: Book Section
Uncontrolled Keywords: air quality,imputation,time series clustering,uncertainty,theoretical computer science,computer science(all),sdg 3 - good health and well-being ,/dk/atira/pure/subjectarea/asjc/2600/2614
Faculty \ School: Faculty of Science > School of Computing Sciences
Faculty of Science > School of Environmental Sciences
University of East Anglia Research Groups/Centres > Theme - ClimateUEA
UEA Research Groups: University of East Anglia Schools > Faculty of Science > Tyndall Centre for Climate Change Research
Faculty of Science > Research Centres > Tyndall Centre for Climate Change Research
Faculty of Science > Research Groups > Environmental Social Sciences
Faculty of Science > Research Groups > Centre for Ocean and Atmospheric Sciences
Faculty of Medicine and Health Sciences > Research Centres > Norwich Institute for Healthy Aging
Faculty of Medicine and Health Sciences > Research Centres > Business and Local Government Data Research Centre (former - to 2023)
Faculty of Science > Research Groups > Data Science and Statistics
Related URLs:
Depositing User: LivePure Connector
Date Deposited: 12 Nov 2020 01:22
Last Modified: 20 Apr 2023 19:37
URI: https://ueaeprints.uea.ac.uk/id/eprint/77669
DOI: 10.1007/978-3-030-61705-9_48

Downloads

Downloads per month over past year

Actions (login required)

View Item View Item