Extracting automata from neural networks using active learning

Deep learning is one of the most advanced forms of machine learning. Most modern deep learning models are based on an artificial neural network, and benchmarking studies reveal that neural networks have produced results comparable to and in some cases superior to human experts. However, the generate...

Full description

Bibliographic Details
Main Authors: Zhiwu Xu, Cheng Wen, Shengchao Qin, Mengda He
Format: Article
Language:English
Published: PeerJ Inc. 2021-04-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-436.pdf
id doaj-5a7f8775b08b4e5fbad6573fcb6e8cb0
record_format Article
spelling doaj-5a7f8775b08b4e5fbad6573fcb6e8cb02021-04-21T15:05:09ZengPeerJ Inc.PeerJ Computer Science2376-59922021-04-017e43610.7717/peerj-cs.436Extracting automata from neural networks using active learningZhiwu Xu0Cheng Wen1Shengchao Qin2Mengda He3College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, ChinaCollege of Computer Science and Software Engineering, Shenzhen University, Shenzhen, ChinaCollege of Computer Science and Software Engineering, Shenzhen University, Shenzhen, ChinaSchool of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, United KingdomDeep learning is one of the most advanced forms of machine learning. Most modern deep learning models are based on an artificial neural network, and benchmarking studies reveal that neural networks have produced results comparable to and in some cases superior to human experts. However, the generated neural networks are typically regarded as incomprehensible black-box models, which not only limits their applications, but also hinders testing and verifying. In this paper, we present an active learning framework to extract automata from neural network classifiers, which can help users to understand the classifiers. In more detail, we use Angluin’s L* algorithm as a learner and the neural network under learning as an oracle, employing abstraction interpretation of the neural network for answering membership and equivalence queries. Our abstraction consists of value, symbol and word abstractions. The factors that may affect the abstraction are also discussed in the paper. We have implemented our approach in a prototype. To evaluate it, we have performed the prototype on a MNIST classifier and have identified that the abstraction with interval number 2 and block size 1 × 28 offers the best performance in terms of F1 score. We also have compared our extracted DFA against the DFAs learned via the passive learning algorithms provided in LearnLib and the experimental results show that our DFA gives a better performance on the MNIST dataset.https://peerj.com/articles/cs-436.pdfAutomata learningNeural networkActive learning
collection DOAJ
language English
format Article
sources DOAJ
author Zhiwu Xu
Cheng Wen
Shengchao Qin
Mengda He
spellingShingle Zhiwu Xu
Cheng Wen
Shengchao Qin
Mengda He
Extracting automata from neural networks using active learning
PeerJ Computer Science
Automata learning
Neural network
Active learning
author_facet Zhiwu Xu
Cheng Wen
Shengchao Qin
Mengda He
author_sort Zhiwu Xu
title Extracting automata from neural networks using active learning
title_short Extracting automata from neural networks using active learning
title_full Extracting automata from neural networks using active learning
title_fullStr Extracting automata from neural networks using active learning
title_full_unstemmed Extracting automata from neural networks using active learning
title_sort extracting automata from neural networks using active learning
publisher PeerJ Inc.
series PeerJ Computer Science
issn 2376-5992
publishDate 2021-04-01
description Deep learning is one of the most advanced forms of machine learning. Most modern deep learning models are based on an artificial neural network, and benchmarking studies reveal that neural networks have produced results comparable to and in some cases superior to human experts. However, the generated neural networks are typically regarded as incomprehensible black-box models, which not only limits their applications, but also hinders testing and verifying. In this paper, we present an active learning framework to extract automata from neural network classifiers, which can help users to understand the classifiers. In more detail, we use Angluin’s L* algorithm as a learner and the neural network under learning as an oracle, employing abstraction interpretation of the neural network for answering membership and equivalence queries. Our abstraction consists of value, symbol and word abstractions. The factors that may affect the abstraction are also discussed in the paper. We have implemented our approach in a prototype. To evaluate it, we have performed the prototype on a MNIST classifier and have identified that the abstraction with interval number 2 and block size 1 × 28 offers the best performance in terms of F1 score. We also have compared our extracted DFA against the DFAs learned via the passive learning algorithms provided in LearnLib and the experimental results show that our DFA gives a better performance on the MNIST dataset.
topic Automata learning
Neural network
Active learning
url https://peerj.com/articles/cs-436.pdf
work_keys_str_mv AT zhiwuxu extractingautomatafromneuralnetworksusingactivelearning
AT chengwen extractingautomatafromneuralnetworksusingactivelearning
AT shengchaoqin extractingautomatafromneuralnetworksusingactivelearning
AT mengdahe extractingautomatafromneuralnetworksusingactivelearning
_version_ 1721516208183836672