Response Characterization for Auditing Cell Dynamics in Long Short-term Memory Networks

In this paper, we introduce a novel method to interpret recurrent neural networks (RNNs), particularly long short-term memory networks (LSTMs) at the cellular level. We propose a systematic pipeline for interpreting individual hidden state dynamics within the network using response characterization...

Full description

Bibliographic Details
Main Authors: Hasani, Ramin (Author), Amini, Alexander A (Author), Lechner, Mathias (Author), Naser, Felix M (Author), Grosu, Radu (Author), Rus, Daniela L (Author)
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory (Contributor), Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor)
Format: Article
Language:English
Published: Institute of Electrical and Electronics Engineers (IEEE), 2021-05-04T14:56:09Z.
Subjects:
Online Access:Get fulltext
LEADER 01842 am a22002293u 4500
001 130553
042 |a dc 
100 1 0 |a Hasani, Ramin  |e author 
100 1 0 |a Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory  |e contributor 
100 1 0 |a Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science  |e contributor 
700 1 0 |a Amini, Alexander A  |e author 
700 1 0 |a Lechner, Mathias  |e author 
700 1 0 |a Naser, Felix M  |e author 
700 1 0 |a Grosu, Radu  |e author 
700 1 0 |a Rus, Daniela L  |e author 
245 0 0 |a Response Characterization for Auditing Cell Dynamics in Long Short-term Memory Networks 
260 |b Institute of Electrical and Electronics Engineers (IEEE),   |c 2021-05-04T14:56:09Z. 
856 |z Get fulltext  |u https://hdl.handle.net/1721.1/130553 
520 |a In this paper, we introduce a novel method to interpret recurrent neural networks (RNNs), particularly long short-term memory networks (LSTMs) at the cellular level. We propose a systematic pipeline for interpreting individual hidden state dynamics within the network using response characterization methods. The ranked contribution of individual cells to the network's output is computed by analyzing a set of interpretable metrics of their decoupled step and sinusoidal responses. As a result, our method is able to uniquely identify neurons with insightful dynamics, quantify relationships between dynamical properties and test accuracy through ablation analysis, and interpret the impact of network capacity on a network's dynamical distribution. Finally, we demonstrate the generalizability and scalability of our method by evaluating a series of different benchmark sequential datasets. 
546 |a en 
655 7 |a Article 
773 |t 2019 International Joint Conference on Neural Networks