Generalization of deep neural networks to unseen attribute combinations
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, February, 2020 === Cataloged from student-submitted PDF of thesis. === Includes bibliographical references (pages 71-73). === Visual understanding results from a combined understanding...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | English |
Published: |
Massachusetts Institute of Technology
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/1721.1/129905 |
id |
ndltd-MIT-oai-dspace.mit.edu-1721.1-129905 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-MIT-oai-dspace.mit.edu-1721.1-1299052021-02-21T05:17:09Z Generalization of deep neural networks to unseen attribute combinations Henry, Timothy G. Tomaso Poggio. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Electrical Engineering and Computer Science. Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, February, 2020 Cataloged from student-submitted PDF of thesis. Includes bibliographical references (pages 71-73). Visual understanding results from a combined understanding of primitive visual attributes such as color, texture, and shape. This allows humans and other primates to generalize their understanding of objects to new combinations of attributes. For instance, one can understand that a pink elephant is an elephant even if they have never seen this particular combination of color and shape before. However, is it the case that deep neural networks (DNNs) are able to generalize to such novel combinations in object recognition or other related vision tasks? This thesis demonstrates that (1) the ability of DNNs to generalize to unseen attribute combinations increases with the increased diversity of combinations seen in training as a percentage of the total combination space, (2) this effect is largely independent of the specifics of the DNN architecture used, (3) while single-task and multi-task formulations of supervised attribute classification problems may lead to similar performance on seen combinations, single-task formulations have a superior ability to generalize to unseen combinations, and (4) DNNs demonstrating the ability to generalize well in this setting learn to do so by leveraging emergent hidden units that exhibit properties of attribute selectivity and invariance. by Timothy G. Henry. M. Eng. M.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science 2021-02-19T20:49:40Z 2021-02-19T20:49:40Z 2020 2020 Thesis https://hdl.handle.net/1721.1/129905 1237411492 eng MIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided. http://dspace.mit.edu/handle/1721.1/7582 73 pages application/pdf Massachusetts Institute of Technology |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
Electrical Engineering and Computer Science. |
spellingShingle |
Electrical Engineering and Computer Science. Henry, Timothy G. Generalization of deep neural networks to unseen attribute combinations |
description |
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, February, 2020 === Cataloged from student-submitted PDF of thesis. === Includes bibliographical references (pages 71-73). === Visual understanding results from a combined understanding of primitive visual attributes such as color, texture, and shape. This allows humans and other primates to generalize their understanding of objects to new combinations of attributes. For instance, one can understand that a pink elephant is an elephant even if they have never seen this particular combination of color and shape before. However, is it the case that deep neural networks (DNNs) are able to generalize to such novel combinations in object recognition or other related vision tasks? This thesis demonstrates that (1) the ability of DNNs to generalize to unseen attribute combinations increases with the increased diversity of combinations seen in training as a percentage of the total combination space, (2) this effect is largely independent of the specifics of the DNN architecture used, (3) while single-task and multi-task formulations of supervised attribute classification problems may lead to similar performance on seen combinations, single-task formulations have a superior ability to generalize to unseen combinations, and (4) DNNs demonstrating the ability to generalize well in this setting learn to do so by leveraging emergent hidden units that exhibit properties of attribute selectivity and invariance. === by Timothy G. Henry. === M. Eng. === M.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science |
author2 |
Tomaso Poggio. |
author_facet |
Tomaso Poggio. Henry, Timothy G. |
author |
Henry, Timothy G. |
author_sort |
Henry, Timothy G. |
title |
Generalization of deep neural networks to unseen attribute combinations |
title_short |
Generalization of deep neural networks to unseen attribute combinations |
title_full |
Generalization of deep neural networks to unseen attribute combinations |
title_fullStr |
Generalization of deep neural networks to unseen attribute combinations |
title_full_unstemmed |
Generalization of deep neural networks to unseen attribute combinations |
title_sort |
generalization of deep neural networks to unseen attribute combinations |
publisher |
Massachusetts Institute of Technology |
publishDate |
2021 |
url |
https://hdl.handle.net/1721.1/129905 |
work_keys_str_mv |
AT henrytimothyg generalizationofdeepneuralnetworkstounseenattributecombinations |
_version_ |
1719377894924353536 |