Endophasia: Utilizing Acoustic-Based Images for Dropping Contact-Free Silent Speech Commands

碩士 === 國立臺灣科技大學 === 資訊工程系 === 107 === Using silent speech to drop commands has received growing attention, as users can utilize existing command set from voicebased interface without evoking other people's attention. Such interaction keeps the privacy and social acceptance from others. However,...

Full description

Bibliographic Details
Main Authors: Wei-Hsiang Huang, 黃威翔
Other Authors: Chih-Yuan Yao
Format: Others
Language:en_US
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/u533m3
id ndltd-TW-107NTUS5392044
record_format oai_dc
spelling ndltd-TW-107NTUS53920442019-10-23T05:46:03Z http://ndltd.ncl.edu.tw/handle/u533m3 Endophasia: Utilizing Acoustic-Based Images for Dropping Contact-Free Silent Speech Commands 以超聲波下達無聲指令 Wei-Hsiang Huang 黃威翔 碩士 國立臺灣科技大學 資訊工程系 107 Using silent speech to drop commands has received growing attention, as users can utilize existing command set from voicebased interface without evoking other people's attention. Such interaction keeps the privacy and social acceptance from others. However, current solutions for recognizing silent speech mainly relies on vision-based data or attaching the microphone on the throat. These solutions are either power-consuming and have potential privacy issues. In this paper, we propose a sensing technique that only needs a pair of microphone and speaker, which not only consumes only few powers but also have less privacy concerns. We chose 10 commands for experimentation and used the built-in speaker and microphone to detect the mouth movement of the user in front of the phone. Through the deep learning of CNN and the command data collection from 15 users, we trained within-user and cross-user model, and through the accuracy of 81.41\% and 81.80\%, the feasibility of using ultrasonic technology to drop silently command to the mobile phone is verified. Chih-Yuan Yao Da-Yuan Huang 姚智原 黃大源 2019 學位論文 ; thesis 38 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 資訊工程系 === 107 === Using silent speech to drop commands has received growing attention, as users can utilize existing command set from voicebased interface without evoking other people's attention. Such interaction keeps the privacy and social acceptance from others. However, current solutions for recognizing silent speech mainly relies on vision-based data or attaching the microphone on the throat. These solutions are either power-consuming and have potential privacy issues. In this paper, we propose a sensing technique that only needs a pair of microphone and speaker, which not only consumes only few powers but also have less privacy concerns. We chose 10 commands for experimentation and used the built-in speaker and microphone to detect the mouth movement of the user in front of the phone. Through the deep learning of CNN and the command data collection from 15 users, we trained within-user and cross-user model, and through the accuracy of 81.41\% and 81.80\%, the feasibility of using ultrasonic technology to drop silently command to the mobile phone is verified.
author2 Chih-Yuan Yao
author_facet Chih-Yuan Yao
Wei-Hsiang Huang
黃威翔
author Wei-Hsiang Huang
黃威翔
spellingShingle Wei-Hsiang Huang
黃威翔
Endophasia: Utilizing Acoustic-Based Images for Dropping Contact-Free Silent Speech Commands
author_sort Wei-Hsiang Huang
title Endophasia: Utilizing Acoustic-Based Images for Dropping Contact-Free Silent Speech Commands
title_short Endophasia: Utilizing Acoustic-Based Images for Dropping Contact-Free Silent Speech Commands
title_full Endophasia: Utilizing Acoustic-Based Images for Dropping Contact-Free Silent Speech Commands
title_fullStr Endophasia: Utilizing Acoustic-Based Images for Dropping Contact-Free Silent Speech Commands
title_full_unstemmed Endophasia: Utilizing Acoustic-Based Images for Dropping Contact-Free Silent Speech Commands
title_sort endophasia: utilizing acoustic-based images for dropping contact-free silent speech commands
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/u533m3
work_keys_str_mv AT weihsianghuang endophasiautilizingacousticbasedimagesfordroppingcontactfreesilentspeechcommands
AT huángwēixiáng endophasiautilizingacousticbasedimagesfordroppingcontactfreesilentspeechcommands
AT weihsianghuang yǐchāoshēngbōxiàdáwúshēngzhǐlìng
AT huángwēixiáng yǐchāoshēngbōxiàdáwúshēngzhǐlìng
_version_ 1719276241113055232