Networked humanoid animation driven by human voice using extensible 3D (X3D), H-Anim and JAVA speech open standards

Approved for public release, distribution is unlimited === Speech-recognition technology is beginning to be used in automobiles, telephones, personal digital assistants (PDAs), medical records, e-commerce, text dictation and editing. Speech recognition can also be integrated into Virtual Environment...

Full description

Bibliographic Details
Main Author: Apaydin, Ozan
Other Authors: Brutzman, Don
Published: Monterey, California. Naval Postgraduate School 2012
Online Access:http://hdl.handle.net/10945/6101
id ndltd-nps.edu-oai-calhoun.nps.edu-10945-6101
record_format oai_dc
spelling ndltd-nps.edu-oai-calhoun.nps.edu-10945-61012015-01-26T15:55:24Z Networked humanoid animation driven by human voice using extensible 3D (X3D), H-Anim and JAVA speech open standards Apaydin, Ozan Brutzman, Don Yun, Xiaoping Computer Science Approved for public release, distribution is unlimited Speech-recognition technology is beginning to be used in automobiles, telephones, personal digital assistants (PDAs), medical records, e-commerce, text dictation and editing. Speech recognition can also be integrated into Virtual Environments (VEs) to create responsive virtual entities. Like the mouse, keyboard, and the trackball, Speech-recognition technology can enhance the control of a computer and improve communication. Dramatically expanding interest in the Internet and VEs has been gated by limited interactivity with human-avatar models. As more users begin interacting with avatars in VEs, designers are prompted to create more realistic, humanlike avatars. This quest for realism needs to go beyond visual aspects to include speechrecognition technology, which can greatly augment the realism of these avatars. This thesis presents design and development of a Voice User Interface (VUI), which maps to a set of behavioral motions for humanoid avatars using Extensible 3D (X3D) graphics, the Virtual Reality Modeling Language (VRML), Humanoid Animation (H-Anim) Standard and Java Speech API. The VUI includes a suitable speech-recognition component for application-command vocabularies. This thesis also demonstrates interchangeability of both avatars and animation behaviors, and creates networked humanoid animation driven by a human voice. 2012-03-14T17:47:46Z 2012-03-14T17:47:46Z 2002-03 Thesis http://hdl.handle.net/10945/6101 Copyright is reserved by the copyright owner Monterey, California. Naval Postgraduate School
collection NDLTD
sources NDLTD
description Approved for public release, distribution is unlimited === Speech-recognition technology is beginning to be used in automobiles, telephones, personal digital assistants (PDAs), medical records, e-commerce, text dictation and editing. Speech recognition can also be integrated into Virtual Environments (VEs) to create responsive virtual entities. Like the mouse, keyboard, and the trackball, Speech-recognition technology can enhance the control of a computer and improve communication. Dramatically expanding interest in the Internet and VEs has been gated by limited interactivity with human-avatar models. As more users begin interacting with avatars in VEs, designers are prompted to create more realistic, humanlike avatars. This quest for realism needs to go beyond visual aspects to include speechrecognition technology, which can greatly augment the realism of these avatars. This thesis presents design and development of a Voice User Interface (VUI), which maps to a set of behavioral motions for humanoid avatars using Extensible 3D (X3D) graphics, the Virtual Reality Modeling Language (VRML), Humanoid Animation (H-Anim) Standard and Java Speech API. The VUI includes a suitable speech-recognition component for application-command vocabularies. This thesis also demonstrates interchangeability of both avatars and animation behaviors, and creates networked humanoid animation driven by a human voice.
author2 Brutzman, Don
author_facet Brutzman, Don
Apaydin, Ozan
author Apaydin, Ozan
spellingShingle Apaydin, Ozan
Networked humanoid animation driven by human voice using extensible 3D (X3D), H-Anim and JAVA speech open standards
author_sort Apaydin, Ozan
title Networked humanoid animation driven by human voice using extensible 3D (X3D), H-Anim and JAVA speech open standards
title_short Networked humanoid animation driven by human voice using extensible 3D (X3D), H-Anim and JAVA speech open standards
title_full Networked humanoid animation driven by human voice using extensible 3D (X3D), H-Anim and JAVA speech open standards
title_fullStr Networked humanoid animation driven by human voice using extensible 3D (X3D), H-Anim and JAVA speech open standards
title_full_unstemmed Networked humanoid animation driven by human voice using extensible 3D (X3D), H-Anim and JAVA speech open standards
title_sort networked humanoid animation driven by human voice using extensible 3d (x3d), h-anim and java speech open standards
publisher Monterey, California. Naval Postgraduate School
publishDate 2012
url http://hdl.handle.net/10945/6101
work_keys_str_mv AT apaydinozan networkedhumanoidanimationdrivenbyhumanvoiceusingextensible3dx3dhanimandjavaspeechopenstandards
_version_ 1716728594678415360