Generalization of deep neural networks to unseen attribute combinations

Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, February, 2020 === Cataloged from student-submitted PDF of thesis. === Includes bibliographical references (pages 71-73). === Visual understanding results from a combined understanding...

Full description

Bibliographic Details
Main Author: Henry, Timothy G.
Other Authors: Tomaso Poggio.
Format: Others
Language:English
Published: Massachusetts Institute of Technology 2021
Subjects:
Online Access:https://hdl.handle.net/1721.1/129905
id ndltd-MIT-oai-dspace.mit.edu-1721.1-129905
record_format oai_dc
spelling ndltd-MIT-oai-dspace.mit.edu-1721.1-1299052021-02-21T05:17:09Z Generalization of deep neural networks to unseen attribute combinations Henry, Timothy G. Tomaso Poggio. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Electrical Engineering and Computer Science. Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, February, 2020 Cataloged from student-submitted PDF of thesis. Includes bibliographical references (pages 71-73). Visual understanding results from a combined understanding of primitive visual attributes such as color, texture, and shape. This allows humans and other primates to generalize their understanding of objects to new combinations of attributes. For instance, one can understand that a pink elephant is an elephant even if they have never seen this particular combination of color and shape before. However, is it the case that deep neural networks (DNNs) are able to generalize to such novel combinations in object recognition or other related vision tasks? This thesis demonstrates that (1) the ability of DNNs to generalize to unseen attribute combinations increases with the increased diversity of combinations seen in training as a percentage of the total combination space, (2) this effect is largely independent of the specifics of the DNN architecture used, (3) while single-task and multi-task formulations of supervised attribute classification problems may lead to similar performance on seen combinations, single-task formulations have a superior ability to generalize to unseen combinations, and (4) DNNs demonstrating the ability to generalize well in this setting learn to do so by leveraging emergent hidden units that exhibit properties of attribute selectivity and invariance. by Timothy G. Henry. M. Eng. M.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science 2021-02-19T20:49:40Z 2021-02-19T20:49:40Z 2020 2020 Thesis https://hdl.handle.net/1721.1/129905 1237411492 eng MIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided. http://dspace.mit.edu/handle/1721.1/7582 73 pages application/pdf Massachusetts Institute of Technology
collection NDLTD
language English
format Others
sources NDLTD
topic Electrical Engineering and Computer Science.
spellingShingle Electrical Engineering and Computer Science.
Henry, Timothy G.
Generalization of deep neural networks to unseen attribute combinations
description Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, February, 2020 === Cataloged from student-submitted PDF of thesis. === Includes bibliographical references (pages 71-73). === Visual understanding results from a combined understanding of primitive visual attributes such as color, texture, and shape. This allows humans and other primates to generalize their understanding of objects to new combinations of attributes. For instance, one can understand that a pink elephant is an elephant even if they have never seen this particular combination of color and shape before. However, is it the case that deep neural networks (DNNs) are able to generalize to such novel combinations in object recognition or other related vision tasks? This thesis demonstrates that (1) the ability of DNNs to generalize to unseen attribute combinations increases with the increased diversity of combinations seen in training as a percentage of the total combination space, (2) this effect is largely independent of the specifics of the DNN architecture used, (3) while single-task and multi-task formulations of supervised attribute classification problems may lead to similar performance on seen combinations, single-task formulations have a superior ability to generalize to unseen combinations, and (4) DNNs demonstrating the ability to generalize well in this setting learn to do so by leveraging emergent hidden units that exhibit properties of attribute selectivity and invariance. === by Timothy G. Henry. === M. Eng. === M.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science
author2 Tomaso Poggio.
author_facet Tomaso Poggio.
Henry, Timothy G.
author Henry, Timothy G.
author_sort Henry, Timothy G.
title Generalization of deep neural networks to unseen attribute combinations
title_short Generalization of deep neural networks to unseen attribute combinations
title_full Generalization of deep neural networks to unseen attribute combinations
title_fullStr Generalization of deep neural networks to unseen attribute combinations
title_full_unstemmed Generalization of deep neural networks to unseen attribute combinations
title_sort generalization of deep neural networks to unseen attribute combinations
publisher Massachusetts Institute of Technology
publishDate 2021
url https://hdl.handle.net/1721.1/129905
work_keys_str_mv AT henrytimothyg generalizationofdeepneuralnetworkstounseenattributecombinations
_version_ 1719377894924353536