Compositional Learning of Human Activities With a Self-Organizing Neural Architecture

An important step for assistive systems and robot companions operating in human environments is to learn the compositionality of human activities, i.e., recognize both activities and their comprising actions. Most existing approaches address action and activity recognition as separate tasks, i.e., a...

Full description

Bibliographic Details
Main Authors:	Luiza Mici, German I. Parisi, Stefan Wermter
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2019-08-01
Series:	Frontiers in Robotics and AI
Subjects:	human activity recognition self-organizing networks hierarchical learning compositionality of human activities RGB-D perception
Online Access:	https://www.frontiersin.org/article/10.3389/frobt.2019.00072/full

id	doaj-7a9b8b71104040f88d4154a4240fbf38
record_format	Article
spelling	doaj-7a9b8b71104040f88d4154a4240fbf382020-11-25T00:57:30ZengFrontiers Media S.A.Frontiers in Robotics and AI2296-91442019-08-01610.3389/frobt.2019.00072475893Compositional Learning of Human Activities With a Self-Organizing Neural ArchitectureLuiza MiciGerman I. ParisiStefan WermterAn important step for assistive systems and robot companions operating in human environments is to learn the compositionality of human activities, i.e., recognize both activities and their comprising actions. Most existing approaches address action and activity recognition as separate tasks, i.e., actions need to be inferred before the activity labels, and are thus highly sensitive to the correct temporal segmentation of the activity sequences. In this paper, we present a novel learning approach that jointly learns human activities on two levels of semantic and temporal complexity: (1) transitive actions such as reaching and opening, e.g., a cereal box, and (2) high-level activities such as having breakfast. Our model consists of a hierarchy of GWR networks which process and learn inherent spatiotemporal dependencies of multiple visual cues extracted from the human body skeletal representation and the interaction with objects. The neural architecture learns and semantically segments input RGB-D sequences of high-level activities into their composing actions, without supervision. We investigate the performance of our architecture with a set of experiments on a publicly available benchmark dataset. The experimental results show that our approach outperforms the state of the art with respect to the classification of the high-level activities. Additionally, we introduce a novel top-down modulation mechanism to the architecture which uses the actions and activity labels as constraints during the learning phase. In our experiments, we show how this mechanism can be used to control the network's neural growth without decreasing the overall performance.https://www.frontiersin.org/article/10.3389/frobt.2019.00072/fullhuman activity recognitionself-organizing networkshierarchical learningcompositionality of human activitiesRGB-D perception
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Luiza Mici German I. Parisi Stefan Wermter
spellingShingle	Luiza Mici German I. Parisi Stefan Wermter Compositional Learning of Human Activities With a Self-Organizing Neural Architecture Frontiers in Robotics and AI human activity recognition self-organizing networks hierarchical learning compositionality of human activities RGB-D perception
author_facet	Luiza Mici German I. Parisi Stefan Wermter
author_sort	Luiza Mici
title	Compositional Learning of Human Activities With a Self-Organizing Neural Architecture
title_short	Compositional Learning of Human Activities With a Self-Organizing Neural Architecture
title_full	Compositional Learning of Human Activities With a Self-Organizing Neural Architecture
title_fullStr	Compositional Learning of Human Activities With a Self-Organizing Neural Architecture
title_full_unstemmed	Compositional Learning of Human Activities With a Self-Organizing Neural Architecture
title_sort	compositional learning of human activities with a self-organizing neural architecture
publisher	Frontiers Media S.A.
series	Frontiers in Robotics and AI
issn	2296-9144
publishDate	2019-08-01
description	An important step for assistive systems and robot companions operating in human environments is to learn the compositionality of human activities, i.e., recognize both activities and their comprising actions. Most existing approaches address action and activity recognition as separate tasks, i.e., actions need to be inferred before the activity labels, and are thus highly sensitive to the correct temporal segmentation of the activity sequences. In this paper, we present a novel learning approach that jointly learns human activities on two levels of semantic and temporal complexity: (1) transitive actions such as reaching and opening, e.g., a cereal box, and (2) high-level activities such as having breakfast. Our model consists of a hierarchy of GWR networks which process and learn inherent spatiotemporal dependencies of multiple visual cues extracted from the human body skeletal representation and the interaction with objects. The neural architecture learns and semantically segments input RGB-D sequences of high-level activities into their composing actions, without supervision. We investigate the performance of our architecture with a set of experiments on a publicly available benchmark dataset. The experimental results show that our approach outperforms the state of the art with respect to the classification of the high-level activities. Additionally, we introduce a novel top-down modulation mechanism to the architecture which uses the actions and activity labels as constraints during the learning phase. In our experiments, we show how this mechanism can be used to control the network's neural growth without decreasing the overall performance.
topic	human activity recognition self-organizing networks hierarchical learning compositionality of human activities RGB-D perception
url	https://www.frontiersin.org/article/10.3389/frobt.2019.00072/full
work_keys_str_mv	AT luizamici compositionallearningofhumanactivitieswithaselforganizingneuralarchitecture AT germaniparisi compositionallearningofhumanactivitieswithaselforganizingneuralarchitecture AT stefanwermter compositionallearningofhumanactivitieswithaselforganizingneuralarchitecture
_version_	1725223883936628736

Compositional Learning of Human Activities With a Self-Organizing Neural Architecture

Similar Items