Using Summary Layers to Probe Neural Network Behaviour
No framework exists that can explain and predict the generalisation ability of deep neural networks in general circumstances. In fact, this question has not been answered for some of the least complicated of neural network architectures: fully-connected feedforward networks with rectified linear act...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
South African Institute of Computer Scientists and Information Technologists
2020-12-01
|
Series: | South African Computer Journal |
Online Access: | https://sacj.cs.uct.ac.za/index.php/sacj/article/view/861 |
id |
doaj-65d6a87a9b334d6db8ca631152ee4829 |
---|---|
record_format |
Article |
spelling |
doaj-65d6a87a9b334d6db8ca631152ee48292020-12-08T07:46:49ZengSouth African Institute of Computer Scientists and Information TechnologistsSouth African Computer Journal1015-79992313-78352020-12-0132210.18489/sacj.v32i2.861775Using Summary Layers to Probe Neural Network BehaviourMarelie Hattingh Davel0https://orcid.org/0000-0003-3103-5858Multilingual Speech Technologies, North-West University, South Africa and CAIRNo framework exists that can explain and predict the generalisation ability of deep neural networks in general circumstances. In fact, this question has not been answered for some of the least complicated of neural network architectures: fully-connected feedforward networks with rectified linear activations and a limited number of hidden layers. For such an architecture, we show how adding a summary layer to the network makes it more amenable to analysis, and allows us to define the conditions that are required to guarantee that a set of samples will all be classified correctly. This process does not describe the generalisation behaviour of these networks, but produces a number of metrics that are useful for probing their learning and generalisation behaviour. We support the analytical conclusions with empirical results, both to confirm that the mathematical guarantees hold in practice, and to demonstrate the use of the analysis process.https://sacj.cs.uct.ac.za/index.php/sacj/article/view/861 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Marelie Hattingh Davel |
spellingShingle |
Marelie Hattingh Davel Using Summary Layers to Probe Neural Network Behaviour South African Computer Journal |
author_facet |
Marelie Hattingh Davel |
author_sort |
Marelie Hattingh Davel |
title |
Using Summary Layers to Probe Neural Network Behaviour |
title_short |
Using Summary Layers to Probe Neural Network Behaviour |
title_full |
Using Summary Layers to Probe Neural Network Behaviour |
title_fullStr |
Using Summary Layers to Probe Neural Network Behaviour |
title_full_unstemmed |
Using Summary Layers to Probe Neural Network Behaviour |
title_sort |
using summary layers to probe neural network behaviour |
publisher |
South African Institute of Computer Scientists and Information Technologists |
series |
South African Computer Journal |
issn |
1015-7999 2313-7835 |
publishDate |
2020-12-01 |
description |
No framework exists that can explain and predict the generalisation ability of deep neural networks in general circumstances. In fact, this question has not been answered for some of the least complicated of neural network architectures: fully-connected feedforward networks with rectified linear activations and a limited number of hidden layers. For such an architecture, we show how adding a summary layer to the network makes it more amenable to analysis, and allows us to define the conditions that are required to guarantee that a set of samples will all be classified correctly. This process does not describe the generalisation behaviour of these networks, but produces a number of metrics that are useful for probing their learning and generalisation behaviour. We support the analytical conclusions with empirical results, both to confirm that the mathematical guarantees hold in practice, and to demonstrate the use of the analysis process. |
url |
https://sacj.cs.uct.ac.za/index.php/sacj/article/view/861 |
work_keys_str_mv |
AT mareliehattinghdavel usingsummarylayerstoprobeneuralnetworkbehaviour |
_version_ |
1724391174098124800 |