Modern Sensory Substitution for Vision in Dynamic Environments

abstract: Societal infrastructure is built with vision at the forefront of daily life. For those with severe visual impairments, this creates countless barriers to the participation and enjoyment of life’s opportunities. Technological progress has been both a blessing and a curse in this regard....

Full description

Bibliographic Details
Other Authors: Fakhri, Bijan (Author)
Format: Doctoral Thesis
Language:English
Published: 2020
Subjects:
Online Access:http://hdl.handle.net/2286/R.I.57172
id ndltd-asu.edu-item-57172
record_format oai_dc
spelling ndltd-asu.edu-item-571722020-06-02T03:01:19Z Modern Sensory Substitution for Vision in Dynamic Environments abstract: Societal infrastructure is built with vision at the forefront of daily life. For those with severe visual impairments, this creates countless barriers to the participation and enjoyment of life’s opportunities. Technological progress has been both a blessing and a curse in this regard. Digital text together with screen readers and refreshable Braille displays have made whole libraries readily accessible and rideshare tech has made independent mobility more attainable. Simultaneously, screen-based interactions and experiences have only grown in pervasiveness and importance, precluding many of those with visual impairments. Sensory Substituion, the process of substituting an unavailable modality with another one, has shown promise as an alternative to accomodation, but in recent years meaningful strides in Sensory Substitution for vision have declined in frequency. Given recent advances in Computer Vision, this stagnation is especially disconcerting. Designing Sensory Substitution Devices (SSDs) for vision for use in interactive settings that leverage modern Computer Vision techniques presents a variety of challenges including perceptual bandwidth, human-computer-interaction, and person-centered machine learning considerations. To surmount these barriers an approach called Per- sonal Foveated Haptic Gaze (PFHG), is introduced. PFHG consists of two primary components: a human visual system inspired interaction paradigm that is intuitive and flexible enough to generalize to a variety of applications called Foveated Haptic Gaze (FHG), and a person-centered learning component to address the expressivity limitations of most SSDs. This component is called One-Shot Object Detection by Data Augmentation (1SODDA), a one-shot object detection approach that allows a user to specify the objects they are interested in locating visually and with minimal effort realizing an object detection model that does so effectively. The Personal Foveated Haptic Gaze framework was realized in a virtual and real- world application: playing a 3D, interactive, first person video game (DOOM) and finding user-specified real-world objects. User study results found Foveated Haptic Gaze to be an effective and intuitive interface for interacting with dynamic visual world using solely haptics. Additionally, 1SODDA achieves competitive performance among few-shot object detection methods and high-framerate many-shot object de- tectors. The combination of which paves the way for modern Sensory Substitution Devices for vision. Dissertation/Thesis Fakhri, Bijan (Author) Panchanathan, Sethuraman (Advisor) McDaniel, Troy L (Committee member) Venkateswara, Hemanth (Committee member) Amor, Heni (Committee member) Arizona State University (Publisher) Computer engineering Computer science Assistive Technology Computer Vision Haptics Human-Computer-Interaction Sensory Substitution Visual Impairments eng 141 pages Doctoral Dissertation Computer Engineering 2020 Doctoral Dissertation http://hdl.handle.net/2286/R.I.57172 http://rightsstatements.org/vocab/InC/1.0/ 2020
collection NDLTD
language English
format Doctoral Thesis
sources NDLTD
topic Computer engineering
Computer science
Assistive Technology
Computer Vision
Haptics
Human-Computer-Interaction
Sensory Substitution
Visual Impairments
spellingShingle Computer engineering
Computer science
Assistive Technology
Computer Vision
Haptics
Human-Computer-Interaction
Sensory Substitution
Visual Impairments
Modern Sensory Substitution for Vision in Dynamic Environments
description abstract: Societal infrastructure is built with vision at the forefront of daily life. For those with severe visual impairments, this creates countless barriers to the participation and enjoyment of life’s opportunities. Technological progress has been both a blessing and a curse in this regard. Digital text together with screen readers and refreshable Braille displays have made whole libraries readily accessible and rideshare tech has made independent mobility more attainable. Simultaneously, screen-based interactions and experiences have only grown in pervasiveness and importance, precluding many of those with visual impairments. Sensory Substituion, the process of substituting an unavailable modality with another one, has shown promise as an alternative to accomodation, but in recent years meaningful strides in Sensory Substitution for vision have declined in frequency. Given recent advances in Computer Vision, this stagnation is especially disconcerting. Designing Sensory Substitution Devices (SSDs) for vision for use in interactive settings that leverage modern Computer Vision techniques presents a variety of challenges including perceptual bandwidth, human-computer-interaction, and person-centered machine learning considerations. To surmount these barriers an approach called Per- sonal Foveated Haptic Gaze (PFHG), is introduced. PFHG consists of two primary components: a human visual system inspired interaction paradigm that is intuitive and flexible enough to generalize to a variety of applications called Foveated Haptic Gaze (FHG), and a person-centered learning component to address the expressivity limitations of most SSDs. This component is called One-Shot Object Detection by Data Augmentation (1SODDA), a one-shot object detection approach that allows a user to specify the objects they are interested in locating visually and with minimal effort realizing an object detection model that does so effectively. The Personal Foveated Haptic Gaze framework was realized in a virtual and real- world application: playing a 3D, interactive, first person video game (DOOM) and finding user-specified real-world objects. User study results found Foveated Haptic Gaze to be an effective and intuitive interface for interacting with dynamic visual world using solely haptics. Additionally, 1SODDA achieves competitive performance among few-shot object detection methods and high-framerate many-shot object de- tectors. The combination of which paves the way for modern Sensory Substitution Devices for vision. === Dissertation/Thesis === Doctoral Dissertation Computer Engineering 2020
author2 Fakhri, Bijan (Author)
author_facet Fakhri, Bijan (Author)
title Modern Sensory Substitution for Vision in Dynamic Environments
title_short Modern Sensory Substitution for Vision in Dynamic Environments
title_full Modern Sensory Substitution for Vision in Dynamic Environments
title_fullStr Modern Sensory Substitution for Vision in Dynamic Environments
title_full_unstemmed Modern Sensory Substitution for Vision in Dynamic Environments
title_sort modern sensory substitution for vision in dynamic environments
publishDate 2020
url http://hdl.handle.net/2286/R.I.57172
_version_ 1719315774370217984