Automated generation of ‘good enough’ transcripts as a first step to transcription of audio-recorded data

In the last decade, automated captioning services have appeared in mainstream technology use. Until now, the focus of these services have been on the technical aspects, supporting pupils with special educational needs and supporting teaching and learning of second language students. Only limited exp...

Full description

Bibliographic Details
Main Authors:	Christian Bokhove, Christopher Downey
Format:	Article
Language:	English
Published:	SAGE Publishing 2018-08-01
Series:	Methodological Innovations
Online Access:	https://doi.org/10.1177/2059799118790743

id	doaj-d4204e29d9484b4486e7b1194c62da26
record_format	Article
spelling	doaj-d4204e29d9484b4486e7b1194c62da262020-11-25T03:03:33ZengSAGE PublishingMethodological Innovations2059-79912018-08-011110.1177/2059799118790743Automated generation of ‘good enough’ transcripts as a first step to transcription of audio-recorded dataChristian BokhoveChristopher DowneyIn the last decade, automated captioning services have appeared in mainstream technology use. Until now, the focus of these services have been on the technical aspects, supporting pupils with special educational needs and supporting teaching and learning of second language students. Only limited explorations have been attempted regarding its use for research purposes: transcription of audio recordings. This article presents a proof-of-concept exploration utilising three examples of automated transcription of audio recordings from different contexts; an interview, a public hearing and a classroom setting, and compares them against ‘manual’ transcription techniques in each case. It begins with an overview of literature on automated captioning and the use of voice recognition tools for the purposes of transcription. An account is provided of the specific processes and tools used for the generation of the automated captions followed by some basic processing of the captions to produce automated transcripts. Originality checking software was used to determine a percentage match between the automated transcript and a manual version as a basic measure of the potential usability of each of the automated transcripts. Some analysis of the more common and persistent mismatches observed between automated and manual transcripts is provided, revealing that the majority of mismatches would be easily identified and rectified in a review and edit of the automated transcript. Finally, some of the challenges and limitations of the approach are considered. These limitations notwithstanding, we conclude that this form of automated transcription provides ‘good enough’ transcription for first versions of transcripts. The time and cost advantages of this could be considerable, even for the production of summary or gisted transcripts.https://doi.org/10.1177/2059799118790743
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Christian Bokhove Christopher Downey
spellingShingle	Christian Bokhove Christopher Downey Automated generation of ‘good enough’ transcripts as a first step to transcription of audio-recorded data Methodological Innovations
author_facet	Christian Bokhove Christopher Downey
author_sort	Christian Bokhove
title	Automated generation of ‘good enough’ transcripts as a first step to transcription of audio-recorded data
title_short	Automated generation of ‘good enough’ transcripts as a first step to transcription of audio-recorded data
title_full	Automated generation of ‘good enough’ transcripts as a first step to transcription of audio-recorded data
title_fullStr	Automated generation of ‘good enough’ transcripts as a first step to transcription of audio-recorded data
title_full_unstemmed	Automated generation of ‘good enough’ transcripts as a first step to transcription of audio-recorded data
title_sort	automated generation of ‘good enough’ transcripts as a first step to transcription of audio-recorded data
publisher	SAGE Publishing
series	Methodological Innovations
issn	2059-7991
publishDate	2018-08-01
description	In the last decade, automated captioning services have appeared in mainstream technology use. Until now, the focus of these services have been on the technical aspects, supporting pupils with special educational needs and supporting teaching and learning of second language students. Only limited explorations have been attempted regarding its use for research purposes: transcription of audio recordings. This article presents a proof-of-concept exploration utilising three examples of automated transcription of audio recordings from different contexts; an interview, a public hearing and a classroom setting, and compares them against ‘manual’ transcription techniques in each case. It begins with an overview of literature on automated captioning and the use of voice recognition tools for the purposes of transcription. An account is provided of the specific processes and tools used for the generation of the automated captions followed by some basic processing of the captions to produce automated transcripts. Originality checking software was used to determine a percentage match between the automated transcript and a manual version as a basic measure of the potential usability of each of the automated transcripts. Some analysis of the more common and persistent mismatches observed between automated and manual transcripts is provided, revealing that the majority of mismatches would be easily identified and rectified in a review and edit of the automated transcript. Finally, some of the challenges and limitations of the approach are considered. These limitations notwithstanding, we conclude that this form of automated transcription provides ‘good enough’ transcription for first versions of transcripts. The time and cost advantages of this could be considerable, even for the production of summary or gisted transcripts.
url	https://doi.org/10.1177/2059799118790743
work_keys_str_mv	AT christianbokhove automatedgenerationofgoodenoughtranscriptsasafirststeptotranscriptionofaudiorecordeddata AT christopherdowney automatedgenerationofgoodenoughtranscriptsasafirststeptotranscriptionofaudiorecordeddata
_version_	1724685098706534400

Automated generation of ‘good enough’ transcripts as a first step to transcription of audio-recorded data

Similar Items