Attention-Based Recurrent Neural Networks (RNNs) for Short Text Classification: An Application in Public Health Monitoring

Edo-Osagie, Osagioduwa and De La Iglesia, Beatriz (2019) Attention-Based Recurrent Neural Networks (RNNs) for Short Text Classification: An Application in Public Health Monitoring. In: International Work-Conference on Artificial Neural Networks, 2019-06-12 - 2019-06-14.

PDF (Accepted_Manuscript) - Accepted Version
Download (1MB) | Preview


In this paper, we propose an attention-based approach to short text classification, which we have created for the practical application of Twitter mining for public health monitoring. Our goal is to automatically filter Tweets which are relevant to the syndrome of asthma/difficulty breathing. We describe a bi-directional Recurrent Neural Network architecture with an attention layer (termed ABRNN) which allows the network to weigh words in a Tweet differently based on their perceived importance. We further distinguish between two variants of the ABRNN based on the Long Short Term Memory and Gated Recurrent Unit architectures respectively, termed the ABLSTM and ABGRU. We apply the ABLSTM and ABGRU, along with popular deep learning text classification models, to a Tweet relevance classification problem and compare their performances. We find that the ABLSTM outperforms the other models, achieving an accuracy of 0.906 and an F1-score of 0.710. The attention vectors computed as a by-product of our models were also found to be meaningful representations of the input Tweets. As such, the described models have the added utility of computing document embeddings which could be used for other tasks besides classification. To further validate the approach, we demonstrate the ABLSTM’s performance in the real world application of public health surveillance and compare the results with real-world syndromic surveillance data provided by Public Health England (PHE). A strong positive correlation was observed between the ABLSTM surveillance signal and the real-world asthma/difficulty breathing syndromic surveillance data. The ABLSTM is a useful tool for the task of public health surveillance.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: syndromic surveillance,sequence modelling,machine learning,deep learning,natural language processing,short-text classification,sdg 3 - good health and well-being ,/dk/atira/pure/sustainabledevelopmentgoals/good_health_and_well_being
Faculty \ School: Faculty of Science > School of Computing Sciences
Faculty of Science > School of Environmental Sciences
University of East Anglia > Faculty of Science > Research Centres > Tyndall Centre for Climate Change Research
Depositing User: LivePure Connector
Date Deposited: 01 Jul 2019 10:30
Last Modified: 01 Oct 2021 00:44
DOI: 10.1007/978-3-030-20521-8_73

Actions (login required)

View Item View Item