The Design and Implementation of an Interfacing Framework for Bridging Speech Recognizers to Application Systems


Bibliographic Details
Main Authors: Jan Karel Ruzicka, 蔣加洛
Other Authors: Deng-Jyi Chen
Format: Others
Language: en_US
Published: 2005
Online Access: http://ndltd.ncl.edu.tw/handle/22449046228642510406
Description
Summary: Master's thesis === National Chiao Tung University === Department of Computer Science and Information Engineering === 93 === Current solutions for bridging speech recognizers with applications are ad hoc and lack a generic, systematic approach. Such recognizer-interfacing approaches usually lead to tightly coupled systems in which one application is wrapped by a specific recognizer through a low-level programming implementation, which makes future modifications very difficult. Also, without mechanisms for abstracting groups of actions into single reusable macro-level commands that simplify user interaction tasks, end users face intense and time-consuming overhead. Applications, especially multimedia-oriented ones, deal with highly dynamic content, and interfacing with and keeping track of this kind of content has not yet been addressed. This thesis proposes an interface framework that bridges speech recognizers to applications through a generic and systematic approach in order to overcome these challenges and limitations. Specifically, a script language is designed and implemented that allows users to define the interfacing commands between a speech recognizer and application software. These commands are executed in a user-composed visual interfacing environment that sits on top of applications and acts as a reference layer for interaction. With this approach, interaction commands can be dynamically scripted to simplify user interaction and allow more natural spoken commands. Moreover, an application's interfacing environment can be modified immediately by simply drawing and registering application zones, with no low-level programming needed for the changes to take effect. The approach also allows multiple application environments to coexist, so speech recognition can be integrated with more than one application at once.
A prototype interface framework system has been constructed and used to demonstrate the feasibility and applicability of the proposed interface framework.
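
The summary does not show the thesis's script language or its API, so the following is only a minimal Python sketch of the general idea, under stated assumptions: zones are registered over an application, a macro groups several zone actions under one spoken phrase, and a bridge routes recognized phrases to the matching application environment. Every name here (`Zone`, `Environment`, `Bridge`, `on_recognized`, the `click` callback) is hypothetical and is not taken from the thesis.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class Zone:
    """A named rectangle drawn over an application window (hypothetical)."""
    name: str
    x: int
    y: int
    width: int
    height: int

    def center(self) -> Tuple[int, int]:
        return (self.x + self.width // 2, self.y + self.height // 2)


@dataclass
class Environment:
    """One interfacing environment layered over a single application."""
    app: str
    zones: Dict[str, Zone] = field(default_factory=dict)
    macros: Dict[str, List[str]] = field(default_factory=dict)

    def register_zone(self, zone: Zone) -> None:
        self.zones[zone.name] = zone

    def define_macro(self, phrase: str, zone_names: List[str]) -> None:
        # A macro groups several zone actions under one spoken phrase.
        missing = [n for n in zone_names if n not in self.zones]
        if missing:
            raise KeyError(f"unregistered zones: {missing}")
        self.macros[phrase.strip().lower()] = list(zone_names)


class Bridge:
    """Routes recognized phrases to the environment of the target application."""

    def __init__(self, click: Callable[[str, int, int], None]) -> None:
        # `click` stands in for whatever actually drives the application.
        self._click = click
        self._envs: Dict[str, Environment] = {}

    def add_environment(self, env: Environment) -> None:
        self._envs[env.app] = env

    def on_recognized(self, app: str, phrase: str) -> bool:
        """Run the macro bound to `phrase`; return False if nothing matches."""
        env = self._envs.get(app)
        if env is None:
            return False
        steps = env.macros.get(phrase.strip().lower())
        if steps is None:
            return False
        for name in steps:
            x, y = env.zones[name].center()
            self._click(app, x, y)
        return True


# Usage: record clicks in a list instead of driving a real application.
clicks: List[Tuple[str, int, int]] = []
bridge = Bridge(click=lambda app, x, y: clicks.append((app, x, y)))

player = Environment("player")
player.register_zone(Zone("playlist", 0, 0, 100, 40))
player.register_zone(Zone("play", 100, 0, 40, 40))
player.define_macro("play first song", ["playlist", "play"])
bridge.add_environment(player)

bridge.on_recognized("player", "Play first song")
```

In this sketch, changing what a phrase does means re-registering a zone or redefining a macro rather than editing recognizer-specific code, and several `Environment` objects can be attached to one `Bridge`, which loosely mirrors the coexistence of multiple application environments described in the summary.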