Design Space for Voice-Based Professional Reporting

Speech technology has matured so that voice-based reporting utilizing speech-to-text can be applied in various domains. Speech has two major benefits: it enables efficient reporting and speech input improves the quality of the reports since reporting can be done as a part of the workflow without del...

Full description

Bibliographic Details
Main Authors: Jaakko Hakulinen, Tuuli Keskinen, Markku Turunen, Sanni Siltanen
Format: Article
Language:English
Published: MDPI AG 2021-01-01
Series:Multimodal Technologies and Interaction
Subjects:
Online Access:https://www.mdpi.com/2414-4088/5/1/3
id doaj-f6ed11d8af10444095b615bfecb5b466
record_format Article
spelling doaj-f6ed11d8af10444095b615bfecb5b4662021-01-12T00:04:59ZengMDPI AGMultimodal Technologies and Interaction2414-40882021-01-0153310.3390/mti5010003Design Space for Voice-Based Professional ReportingJaakko Hakulinen0Tuuli Keskinen1Markku Turunen2Sanni Siltanen3Faculty of Information Technology and Communication Sciences, Tampere University, P.O. Box 1001, FI-33014 Tampere, FinlandFaculty of Information Technology and Communication Sciences, Tampere University, P.O. Box 1001, FI-33014 Tampere, FinlandFaculty of Information Technology and Communication Sciences, Tampere University, P.O. Box 1001, FI-33014 Tampere, FinlandKONE Corporation, KONE Technology and Innovation, Myllykatu 3, 05800 Hyvinkää, FinlandSpeech technology has matured so that voice-based reporting utilizing speech-to-text can be applied in various domains. Speech has two major benefits: it enables efficient reporting and speech input improves the quality of the reports since reporting can be done as a part of the workflow without delays between work and reporting. However, designing reporting voice user interfaces (VUIs) for professional use is challenging, as there are numerous aspects from technology to organization and language that need to be considered. Based on our experience in developing professional reporting VUIs with different stakeholders representing both commercial and public sector, we define a design space for voice-based reporting systems. The design space consists of 28 dimensions grouped into five categories: Language Processing, Structure of Reporting, Technical Limitations in the Work Domain, Interaction Related Aspects in the Work Domain, and Organization. We illustrate the design space by discussing four voice-based reporting systems, designed and implemented by us, and describing a design process that utilizes it. The design space enables designers to identify critical aspects of professional reporting VUIs and optimize those for their target domain. The design space can be used as a practical tool especially by designers with limited experience on speech technologies.https://www.mdpi.com/2414-4088/5/1/3design spacevoice user interfacesprofessional reportingvoice-based reportingspeech recognitionspeech-to-text
collection DOAJ
language English
format Article
sources DOAJ
author Jaakko Hakulinen
Tuuli Keskinen
Markku Turunen
Sanni Siltanen
spellingShingle Jaakko Hakulinen
Tuuli Keskinen
Markku Turunen
Sanni Siltanen
Design Space for Voice-Based Professional Reporting
Multimodal Technologies and Interaction
design space
voice user interfaces
professional reporting
voice-based reporting
speech recognition
speech-to-text
author_facet Jaakko Hakulinen
Tuuli Keskinen
Markku Turunen
Sanni Siltanen
author_sort Jaakko Hakulinen
title Design Space for Voice-Based Professional Reporting
title_short Design Space for Voice-Based Professional Reporting
title_full Design Space for Voice-Based Professional Reporting
title_fullStr Design Space for Voice-Based Professional Reporting
title_full_unstemmed Design Space for Voice-Based Professional Reporting
title_sort design space for voice-based professional reporting
publisher MDPI AG
series Multimodal Technologies and Interaction
issn 2414-4088
publishDate 2021-01-01
description Speech technology has matured so that voice-based reporting utilizing speech-to-text can be applied in various domains. Speech has two major benefits: it enables efficient reporting and speech input improves the quality of the reports since reporting can be done as a part of the workflow without delays between work and reporting. However, designing reporting voice user interfaces (VUIs) for professional use is challenging, as there are numerous aspects from technology to organization and language that need to be considered. Based on our experience in developing professional reporting VUIs with different stakeholders representing both commercial and public sector, we define a design space for voice-based reporting systems. The design space consists of 28 dimensions grouped into five categories: Language Processing, Structure of Reporting, Technical Limitations in the Work Domain, Interaction Related Aspects in the Work Domain, and Organization. We illustrate the design space by discussing four voice-based reporting systems, designed and implemented by us, and describing a design process that utilizes it. The design space enables designers to identify critical aspects of professional reporting VUIs and optimize those for their target domain. The design space can be used as a practical tool especially by designers with limited experience on speech technologies.
topic design space
voice user interfaces
professional reporting
voice-based reporting
speech recognition
speech-to-text
url https://www.mdpi.com/2414-4088/5/1/3
work_keys_str_mv AT jaakkohakulinen designspaceforvoicebasedprofessionalreporting
AT tuulikeskinen designspaceforvoicebasedprofessionalreporting
AT markkuturunen designspaceforvoicebasedprofessionalreporting
AT sannisiltanen designspaceforvoicebasedprofessionalreporting
_version_ 1724340952566333440