Embodied Evolution of Learning Ability

Embodied evolution is a methodology for evolutionary robotics that mimics the distributed, asynchronous, and autonomous properties of biological evolution. The evaluation, selection, and reproduction are carried out by cooperation and competition of the robots, without any need for human interventio...

Full description

Bibliographic Details
Main Author: Elfwing, Stefan
Format: Doctoral Thesis
Language:English
Published: KTH, Datorseende och robotik, CVAP 2007
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-4515
http://nbn-resolving.de/urn:isbn:978-91-7178-787-3
id ndltd-UPSALLA1-oai-DiVA.org-kth-4515
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-kth-45152013-01-08T13:06:38ZEmbodied Evolution of Learning AbilityengElfwing, StefanKTH, Datorseende och robotik, CVAPStockholm : KTH2007Embodied EvolutionEvolutionary RoboticsReinforcement LearningShaping RewardsMeta-parametersHierarchical Reinforcement LearningComputer scienceDatalogiEmbodied evolution is a methodology for evolutionary robotics that mimics the distributed, asynchronous, and autonomous properties of biological evolution. The evaluation, selection, and reproduction are carried out by cooperation and competition of the robots, without any need for human intervention. An embodied evolution framework is therefore well suited to study the adaptive learning mechanisms for artificial agents that share the same fundamental constraints as biological agents: self-preservation and self-reproduction. The main goal of the research in this thesis has been to develop a framework for performing embodied evolution with a limited number of robots, by utilizing time-sharing of subpopulations of virtual agents inside each robot. The framework integrates reproduction as a directed autonomous behavior, and allows for learning of basic behaviors for survival by reinforcement learning. The purpose of the evolution is to evolve the learning ability of the agents, by optimizing meta-properties in reinforcement learning, such as the selection of basic behaviors, meta-parameters that modulate the efficiency of the learning, and additional and richer reward signals that guides the learning in the form of shaping rewards. The realization of the embodied evolution framework has been a cumulative research process in three steps: 1) investigation of the learning of a cooperative mating behavior for directed autonomous reproduction; 2) development of an embodied evolution framework, in which the selection of pre-learned basic behaviors and the optimization of battery recharging are evolved; and 3) development of an embodied evolution framework that includes meta-learning of basic reinforcement learning behaviors for survival, and in which the individuals are evaluated by an implicit and biologically inspired fitness function that promotes reproductive ability. The proposed embodied evolution methods have been validated in a simulation environment of the Cyber Rodent robot, a robotic platform developed for embodied evolution purposes. The evolutionarily obtained solutions have also been transferred to the real robotic platform. The evolutionary approach to meta-learning has also been applied for automatic design of task hierarchies in hierarchical reinforcement learning, and for co-evolving meta-parameters and potential-based shaping rewards to accelerate reinforcement learning, both in regards to finding initial solutions and in regards to convergence to robust policies. QC 20100706Doctoral thesis, comprehensive summaryinfo:eu-repo/semantics/doctoralThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-4515urn:isbn:978-91-7178-787-3Trita-CSC-A, 1653-5723 ; 2007:16application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Doctoral Thesis
sources NDLTD
topic Embodied Evolution
Evolutionary Robotics
Reinforcement Learning
Shaping Rewards
Meta-parameters
Hierarchical Reinforcement Learning
Computer science
Datalogi
spellingShingle Embodied Evolution
Evolutionary Robotics
Reinforcement Learning
Shaping Rewards
Meta-parameters
Hierarchical Reinforcement Learning
Computer science
Datalogi
Elfwing, Stefan
Embodied Evolution of Learning Ability
description Embodied evolution is a methodology for evolutionary robotics that mimics the distributed, asynchronous, and autonomous properties of biological evolution. The evaluation, selection, and reproduction are carried out by cooperation and competition of the robots, without any need for human intervention. An embodied evolution framework is therefore well suited to study the adaptive learning mechanisms for artificial agents that share the same fundamental constraints as biological agents: self-preservation and self-reproduction. The main goal of the research in this thesis has been to develop a framework for performing embodied evolution with a limited number of robots, by utilizing time-sharing of subpopulations of virtual agents inside each robot. The framework integrates reproduction as a directed autonomous behavior, and allows for learning of basic behaviors for survival by reinforcement learning. The purpose of the evolution is to evolve the learning ability of the agents, by optimizing meta-properties in reinforcement learning, such as the selection of basic behaviors, meta-parameters that modulate the efficiency of the learning, and additional and richer reward signals that guides the learning in the form of shaping rewards. The realization of the embodied evolution framework has been a cumulative research process in three steps: 1) investigation of the learning of a cooperative mating behavior for directed autonomous reproduction; 2) development of an embodied evolution framework, in which the selection of pre-learned basic behaviors and the optimization of battery recharging are evolved; and 3) development of an embodied evolution framework that includes meta-learning of basic reinforcement learning behaviors for survival, and in which the individuals are evaluated by an implicit and biologically inspired fitness function that promotes reproductive ability. The proposed embodied evolution methods have been validated in a simulation environment of the Cyber Rodent robot, a robotic platform developed for embodied evolution purposes. The evolutionarily obtained solutions have also been transferred to the real robotic platform. The evolutionary approach to meta-learning has also been applied for automatic design of task hierarchies in hierarchical reinforcement learning, and for co-evolving meta-parameters and potential-based shaping rewards to accelerate reinforcement learning, both in regards to finding initial solutions and in regards to convergence to robust policies. === QC 20100706
author Elfwing, Stefan
author_facet Elfwing, Stefan
author_sort Elfwing, Stefan
title Embodied Evolution of Learning Ability
title_short Embodied Evolution of Learning Ability
title_full Embodied Evolution of Learning Ability
title_fullStr Embodied Evolution of Learning Ability
title_full_unstemmed Embodied Evolution of Learning Ability
title_sort embodied evolution of learning ability
publisher KTH, Datorseende och robotik, CVAP
publishDate 2007
url http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-4515
http://nbn-resolving.de/urn:isbn:978-91-7178-787-3
work_keys_str_mv AT elfwingstefan embodiedevolutionoflearningability
_version_ 1716509005797392384