Hardware-Accelerated, Short-Term Processing Voice and Nonvoice Sound Recognitions for Electric Equipment Control

We proposed and implemented a sound recognition system for electric equipment control. In recent years, industry 4.0 has propelled a rapid growth in intelligent human–machine interactions. User acoustic voice commands for machine control have been examined the most by researchers. The targeted machi...

Full description

Bibliographic Details
Main Authors: Wen-Chung Tsai, You-Jyun Shih, Nien-Ting Huang
Format: Article
Language:English
Published: MDPI AG 2019-08-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/8/9/924
id doaj-b23ddafa9bc94c55ac2e661087e3dbc8
record_format Article
spelling doaj-b23ddafa9bc94c55ac2e661087e3dbc82020-11-25T00:43:59ZengMDPI AGElectronics2079-92922019-08-018992410.3390/electronics8090924electronics8090924Hardware-Accelerated, Short-Term Processing Voice and Nonvoice Sound Recognitions for Electric Equipment ControlWen-Chung Tsai0You-Jyun Shih1Nien-Ting Huang2Department of Information and Communication Engineering, Chaoyang University of Technology, Taichung 41349, TaiwanDepartment of Information and Communication Engineering, Chaoyang University of Technology, Taichung 41349, TaiwanDepartment of Information and Communication Engineering, Chaoyang University of Technology, Taichung 41349, TaiwanWe proposed and implemented a sound recognition system for electric equipment control. In recent years, industry 4.0 has propelled a rapid growth in intelligent human–machine interactions. User acoustic voice commands for machine control have been examined the most by researchers. The targeted machine can be controlled through voice without the use of any hand-held device. However, compared with human voice recognition, limited research has been conducted on nonhuman voice (e.g., mewing sounds) or nonvoice sound recognition (e.g., clapping). Processing of such short-term, biometric nonvoice sounds for electric equipment control requires a rapid response with correct recognition. In practice, this could lead to a trade-off between recognition accuracy and processing performance for conventional software-based implementations. Therefore, we realized a field-programmable gate array-based embedded system, such a hardware-accelerated platform, can enhance information processing performance using a dynamic time warping accelerator. Furthermore, information processing was refined for two specific applications (i.e., mewing sounds and clapping) to enhance system performance including recognition accuracy and execution speed. Performance analyses and demonstrations on real products were conducted to validate the proposed system.https://www.mdpi.com/2079-9292/8/9/924dynamic time warpingfield-programmable gate arraymel-scale frequency cepstral coefficientsshort-term processingequipment controlinformation processing
collection DOAJ
language English
format Article
sources DOAJ
author Wen-Chung Tsai
You-Jyun Shih
Nien-Ting Huang
spellingShingle Wen-Chung Tsai
You-Jyun Shih
Nien-Ting Huang
Hardware-Accelerated, Short-Term Processing Voice and Nonvoice Sound Recognitions for Electric Equipment Control
Electronics
dynamic time warping
field-programmable gate array
mel-scale frequency cepstral coefficients
short-term processing
equipment control
information processing
author_facet Wen-Chung Tsai
You-Jyun Shih
Nien-Ting Huang
author_sort Wen-Chung Tsai
title Hardware-Accelerated, Short-Term Processing Voice and Nonvoice Sound Recognitions for Electric Equipment Control
title_short Hardware-Accelerated, Short-Term Processing Voice and Nonvoice Sound Recognitions for Electric Equipment Control
title_full Hardware-Accelerated, Short-Term Processing Voice and Nonvoice Sound Recognitions for Electric Equipment Control
title_fullStr Hardware-Accelerated, Short-Term Processing Voice and Nonvoice Sound Recognitions for Electric Equipment Control
title_full_unstemmed Hardware-Accelerated, Short-Term Processing Voice and Nonvoice Sound Recognitions for Electric Equipment Control
title_sort hardware-accelerated, short-term processing voice and nonvoice sound recognitions for electric equipment control
publisher MDPI AG
series Electronics
issn 2079-9292
publishDate 2019-08-01
description We proposed and implemented a sound recognition system for electric equipment control. In recent years, industry 4.0 has propelled a rapid growth in intelligent human–machine interactions. User acoustic voice commands for machine control have been examined the most by researchers. The targeted machine can be controlled through voice without the use of any hand-held device. However, compared with human voice recognition, limited research has been conducted on nonhuman voice (e.g., mewing sounds) or nonvoice sound recognition (e.g., clapping). Processing of such short-term, biometric nonvoice sounds for electric equipment control requires a rapid response with correct recognition. In practice, this could lead to a trade-off between recognition accuracy and processing performance for conventional software-based implementations. Therefore, we realized a field-programmable gate array-based embedded system, such a hardware-accelerated platform, can enhance information processing performance using a dynamic time warping accelerator. Furthermore, information processing was refined for two specific applications (i.e., mewing sounds and clapping) to enhance system performance including recognition accuracy and execution speed. Performance analyses and demonstrations on real products were conducted to validate the proposed system.
topic dynamic time warping
field-programmable gate array
mel-scale frequency cepstral coefficients
short-term processing
equipment control
information processing
url https://www.mdpi.com/2079-9292/8/9/924
work_keys_str_mv AT wenchungtsai hardwareacceleratedshorttermprocessingvoiceandnonvoicesoundrecognitionsforelectricequipmentcontrol
AT youjyunshih hardwareacceleratedshorttermprocessingvoiceandnonvoicesoundrecognitionsforelectricequipmentcontrol
AT nientinghuang hardwareacceleratedshorttermprocessingvoiceandnonvoicesoundrecognitionsforelectricequipmentcontrol
_version_ 1725277215184125952