Statistical models and decision making for robotic scientific information gathering
Thesis: S.M., Joint Program in Applied Ocean Physics and Engineering (Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science; and the Woods Hole Oceanographic Institution), 2018. === This electronic version was submitted by the student author. The certified...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | English |
Published: |
Massachusetts Institute of Technology
2019
|
Subjects: | |
Online Access: | http://hdl.handle.net/1721.1/120607 |
id |
ndltd-MIT-oai-dspace.mit.edu-1721.1-120607 |
---|---|
record_format |
oai_dc |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
Joint Program in Applied Ocean Physics and Engineering. Electrical Engineering and Computer Science. Woods Hole Oceanographic Institution. Oceanography Robotics Algorithms Artificial intelligence Data collection platforms Automatic data collection systems |
spellingShingle |
Joint Program in Applied Ocean Physics and Engineering. Electrical Engineering and Computer Science. Woods Hole Oceanographic Institution. Oceanography Robotics Algorithms Artificial intelligence Data collection platforms Automatic data collection systems Flaspohler, Genevieve Elaine Statistical models and decision making for robotic scientific information gathering |
description |
Thesis: S.M., Joint Program in Applied Ocean Physics and Engineering (Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science; and the Woods Hole Oceanographic Institution), 2018. === This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. === Cataloged from student-submitted PDF version of thesis. === Includes bibliographical references (pages 97-107). === Mobile robots and autonomous sensors have seen increasing use in scientific applications, from planetary rovers surveying for signs of life on Mars, to environmental buoys measuring and logging oceanographic conditions in coastal regions. This thesis makes contributions in both planning algorithms and model design for autonomous scientific information gathering, demonstrating how theory from machine learning, decision theory, theory of optimal experimental design, and statistical inference can be used to develop online algorithms for robotic information gathering that are robust to modeling errors, account for spatiotemporal structure in scientific data, and have probabilistic performance guarantees. This thesis first introduces a novel sample selection algorithm for online, irrevocable sampling in data streams that have spatiotemporal structure, such as those that commonly arise in robotics and environmental monitoring. Given a limited sampling capacity, the proposed periodic secretary algorithm uses an information-theoretic reward function to select samples in real-time that maximally reduce posterior uncertainty in a given scientific model. Additionally, we provide a lower bound on the quality of samples selected by the periodic secretary algorithm by leveraging the submodularity of the information-theoretic reward function. Finally, we demonstrate the robustness of the proposed approach by employing the periodic secretary algorithm to select samples irrevocably from a seven-year oceanographic data stream collected at the Martha's Vineyard Coastal Observatory off the coast of Cape Cod, USA. Secondly, we consider how scientific models can be specified in environments - such as the deep sea or deep space - where domain scientists may not have enough a priori knowledge to formulate a formal scientific model and hypothesis. These domains require scientific models that start with very little prior information and construct a model of the environment online as observations are gathered. We propose unsupervised machine learning as a technique for science model-learning in these environments. To this end, we introduce a hybrid Bayesian-deep learning model that learns a nonparametric topic model of a visual environment. We use this semantic visual model to identify observations that are poorly explained in the current model, and show experimentally that these highly perplexing observations often correspond to scientifically interesting phenomena. On a marine dataset collected by the SeaBED AUV on the Hannibal Sea Mount, images of high perplexity in the learned model corresponded, for example, to a scientifically novel crab congregation in the deep sea. The approaches presented in this thesis capture the depth and breadth of the problems facing the field of autonomous science. Developing robust autonomous systems that enhance our ability to perform exploratory science in environments such as the oceans, deep space, agricultural and disaster-relief zones will require insight and techniques from classical areas of robotics, such as motion and path planning, mapping, and localization, and from other domains, including machine learning, spatial statistics, optimization, and theory of experimental design. This thesis demonstrates how theory and practice from these diverse disciplines can be unified to address problems in autonomous scientific information gathering. === by Genevieve Elaine Flaspohler. === S.M. |
author2 |
Yogesh Girdhar and Nicholas Roy. |
author_facet |
Yogesh Girdhar and Nicholas Roy. Flaspohler, Genevieve Elaine |
author |
Flaspohler, Genevieve Elaine |
author_sort |
Flaspohler, Genevieve Elaine |
title |
Statistical models and decision making for robotic scientific information gathering |
title_short |
Statistical models and decision making for robotic scientific information gathering |
title_full |
Statistical models and decision making for robotic scientific information gathering |
title_fullStr |
Statistical models and decision making for robotic scientific information gathering |
title_full_unstemmed |
Statistical models and decision making for robotic scientific information gathering |
title_sort |
statistical models and decision making for robotic scientific information gathering |
publisher |
Massachusetts Institute of Technology |
publishDate |
2019 |
url |
http://hdl.handle.net/1721.1/120607 |
work_keys_str_mv |
AT flaspohlergenevieveelaine statisticalmodelsanddecisionmakingforroboticscientificinformationgathering |
_version_ |
1719031678677024768 |
spelling |
ndltd-MIT-oai-dspace.mit.edu-1721.1-1206072019-05-02T15:57:38Z Statistical models and decision making for robotic scientific information gathering Flaspohler, Genevieve Elaine Yogesh Girdhar and Nicholas Roy. Woods Hole Oceanographic Institution. Joint Program in Applied Ocean Physics and Engineering. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Woods Hole Oceanographic Institution. Joint Program in Applied Ocean Physics and Engineering. Electrical Engineering and Computer Science. Woods Hole Oceanographic Institution. Oceanography Robotics Algorithms Artificial intelligence Data collection platforms Automatic data collection systems Thesis: S.M., Joint Program in Applied Ocean Physics and Engineering (Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science; and the Woods Hole Oceanographic Institution), 2018. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 97-107). Mobile robots and autonomous sensors have seen increasing use in scientific applications, from planetary rovers surveying for signs of life on Mars, to environmental buoys measuring and logging oceanographic conditions in coastal regions. This thesis makes contributions in both planning algorithms and model design for autonomous scientific information gathering, demonstrating how theory from machine learning, decision theory, theory of optimal experimental design, and statistical inference can be used to develop online algorithms for robotic information gathering that are robust to modeling errors, account for spatiotemporal structure in scientific data, and have probabilistic performance guarantees. This thesis first introduces a novel sample selection algorithm for online, irrevocable sampling in data streams that have spatiotemporal structure, such as those that commonly arise in robotics and environmental monitoring. Given a limited sampling capacity, the proposed periodic secretary algorithm uses an information-theoretic reward function to select samples in real-time that maximally reduce posterior uncertainty in a given scientific model. Additionally, we provide a lower bound on the quality of samples selected by the periodic secretary algorithm by leveraging the submodularity of the information-theoretic reward function. Finally, we demonstrate the robustness of the proposed approach by employing the periodic secretary algorithm to select samples irrevocably from a seven-year oceanographic data stream collected at the Martha's Vineyard Coastal Observatory off the coast of Cape Cod, USA. Secondly, we consider how scientific models can be specified in environments - such as the deep sea or deep space - where domain scientists may not have enough a priori knowledge to formulate a formal scientific model and hypothesis. These domains require scientific models that start with very little prior information and construct a model of the environment online as observations are gathered. We propose unsupervised machine learning as a technique for science model-learning in these environments. To this end, we introduce a hybrid Bayesian-deep learning model that learns a nonparametric topic model of a visual environment. We use this semantic visual model to identify observations that are poorly explained in the current model, and show experimentally that these highly perplexing observations often correspond to scientifically interesting phenomena. On a marine dataset collected by the SeaBED AUV on the Hannibal Sea Mount, images of high perplexity in the learned model corresponded, for example, to a scientifically novel crab congregation in the deep sea. The approaches presented in this thesis capture the depth and breadth of the problems facing the field of autonomous science. Developing robust autonomous systems that enhance our ability to perform exploratory science in environments such as the oceans, deep space, agricultural and disaster-relief zones will require insight and techniques from classical areas of robotics, such as motion and path planning, mapping, and localization, and from other domains, including machine learning, spatial statistics, optimization, and theory of experimental design. This thesis demonstrates how theory and practice from these diverse disciplines can be unified to address problems in autonomous scientific information gathering. by Genevieve Elaine Flaspohler. S.M. 2019-03-01T19:33:40Z 2019-03-01T19:33:40Z 2018 2018 Thesis http://hdl.handle.net/1721.1/120607 1088412583 eng MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. http://dspace.mit.edu/handle/1721.1/7582 107 pages application/pdf Massachusetts Institute of Technology |