Wickramasinghe, Sandamali, Howe, Jacob and Daviaud, Laure (2025) Extracting deterministic finite automata from RNNs via hyperplane partitioning and learning. In: 2nd International Conference on Explainable AI for Neural and Symbolic Methods (EXPLAIN) 2025, 2025-10-22 - 2025-10-24. (In Press)
![]() |
PDF (Extracting_DFAs_from_RNNs_paper_camera_ready_version)
Restricted to Repository staff only until 1 August 2026. Request a copy |
Abstract
Recurrent Neural Networks (RNNs) have achieved remarkable success in handling sequential data. However, they lack interpretability. Extracting Deterministic Finite Automata (DFAs) from black-box models can provide insight into their decision-making processes. This research focuses on extracting DFAs from RNNs trained on regular languages using an exact learning framework. The proposed approach employs the L∗ algorithm to learn a DFA, and it demonstrates how a hyperplane-based method can be used to partition the RNN state space when evaluating equivalence queries.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Faculty \ School: | Faculty of Science > School of Computing Sciences |
Depositing User: | LivePure Connector |
Date Deposited: | 06 Aug 2025 15:30 |
Last Modified: | 06 Aug 2025 15:30 |
URI: | https://ueaeprints.uea.ac.uk/id/eprint/100078 |
DOI: |
Actions (login required)
![]() |
View Item |