Extracting deterministic finite automata from RNNs via hyperplane partitioning and learning

Wickramasinghe, Sandamali, Howe, Jacob and Daviaud, Laure (2025) Extracting deterministic finite automata from RNNs via hyperplane partitioning and learning. In: 2nd International Conference on Explainable AI for Neural and Symbolic Methods (EXPLAIN) 2025, 2025-10-22 - 2025-10-24. (In Press)

[thumbnail of Extracting_DFAs_from_RNNs_paper_camera_ready_version] PDF (Extracting_DFAs_from_RNNs_paper_camera_ready_version)
Restricted to Repository staff only until 1 August 2026.

Request a copy

Abstract

Recurrent Neural Networks (RNNs) have achieved remarkable success in handling sequential data. However, they lack interpretability. Extracting Deterministic Finite Automata (DFAs) from black-box models can provide insight into their decision-making processes. This research focuses on extracting DFAs from RNNs trained on regular languages using an exact learning framework. The proposed approach employs the L∗ algorithm to learn a DFA, and it demonstrates how a hyperplane-based method can be used to partition the RNN state space when evaluating equivalence queries.

Item Type: Conference or Workshop Item (Paper)
Faculty \ School: Faculty of Science > School of Computing Sciences
Depositing User: LivePure Connector
Date Deposited: 06 Aug 2025 15:30
Last Modified: 06 Aug 2025 15:30
URI: https://ueaeprints.uea.ac.uk/id/eprint/100078
DOI:

Actions (login required)

View Item View Item