The Spoken British National Corpus 2014 : design, compilation and analysis
The ESRC-funded Centre for Corpus Approaches to Social Science at Lancaster University (CASS) and the English Language Teaching group at Cambridge University Press (CUP) have compiled a new, publicly-accessible corpus of spoken British English from the 2010s, known as the Spoken British National Cor...
Main Author: | |
---|---|
Published: |
Lancaster University
2018
|
Online Access: | http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.733547 |
id |
ndltd-bl.uk-oai-ethos.bl.uk-733547 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-bl.uk-oai-ethos.bl.uk-7335472018-06-12T04:07:19ZThe Spoken British National Corpus 2014 : design, compilation and analysisLove, Robbie2018The ESRC-funded Centre for Corpus Approaches to Social Science at Lancaster University (CASS) and the English Language Teaching group at Cambridge University Press (CUP) have compiled a new, publicly-accessible corpus of spoken British English from the 2010s, known as the Spoken British National Corpus 2014 (Spoken BNC2014). The 11.5 million-word corpus, gathered solely in informal contexts, is the first freely-accessible corpus of its kind since the spoken component of the original British National Corpus (the Spoken BNC1994), which, despite its age, is still used as a proxy for present-day English in research today. This thesis presents a detailed account of each stage of the Spoken BNC2014’s construction, including its conception, design, transcription, processing and dissemination. It also demonstrates the research potential of the corpus, by presenting a diachronic analysis of ‘bad language’ in spoken British English, comparing the 1990s to the 2010s. The thesis shows how the research team struck a delicate balance between backwards compatibility with the Spoken BNC1994 and optimal practice in the context of compiling a new corpus. Although comparable with its predecessor, the Spoken BNC2014 is shown to represent innovation in approaches to the compilation of spoken corpora. This thesis makes several useful contributions to the linguistic research community. The Spoken BNC2014 itself should be of use to many researchers, educators and students in the corpus linguistics and English language communities and beyond. In addition, the thesis represents an example of good practice with regards to academic collaboration with a commercial stakeholder. Thirdly, although not a ‘user guide’, the methodological discussions and analysis presented in this thesis are intended to help the Spoken BNC2014 to be as useful to as many people, and for as many purposes, as possible.Lancaster Universityhttp://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.733547http://eprints.lancs.ac.uk/90068/Electronic Thesis or Dissertation |
collection |
NDLTD |
sources |
NDLTD |
description |
The ESRC-funded Centre for Corpus Approaches to Social Science at Lancaster University (CASS) and the English Language Teaching group at Cambridge University Press (CUP) have compiled a new, publicly-accessible corpus of spoken British English from the 2010s, known as the Spoken British National Corpus 2014 (Spoken BNC2014). The 11.5 million-word corpus, gathered solely in informal contexts, is the first freely-accessible corpus of its kind since the spoken component of the original British National Corpus (the Spoken BNC1994), which, despite its age, is still used as a proxy for present-day English in research today. This thesis presents a detailed account of each stage of the Spoken BNC2014’s construction, including its conception, design, transcription, processing and dissemination. It also demonstrates the research potential of the corpus, by presenting a diachronic analysis of ‘bad language’ in spoken British English, comparing the 1990s to the 2010s. The thesis shows how the research team struck a delicate balance between backwards compatibility with the Spoken BNC1994 and optimal practice in the context of compiling a new corpus. Although comparable with its predecessor, the Spoken BNC2014 is shown to represent innovation in approaches to the compilation of spoken corpora. This thesis makes several useful contributions to the linguistic research community. The Spoken BNC2014 itself should be of use to many researchers, educators and students in the corpus linguistics and English language communities and beyond. In addition, the thesis represents an example of good practice with regards to academic collaboration with a commercial stakeholder. Thirdly, although not a ‘user guide’, the methodological discussions and analysis presented in this thesis are intended to help the Spoken BNC2014 to be as useful to as many people, and for as many purposes, as possible. |
author |
Love, Robbie |
spellingShingle |
Love, Robbie The Spoken British National Corpus 2014 : design, compilation and analysis |
author_facet |
Love, Robbie |
author_sort |
Love, Robbie |
title |
The Spoken British National Corpus 2014 : design, compilation and analysis |
title_short |
The Spoken British National Corpus 2014 : design, compilation and analysis |
title_full |
The Spoken British National Corpus 2014 : design, compilation and analysis |
title_fullStr |
The Spoken British National Corpus 2014 : design, compilation and analysis |
title_full_unstemmed |
The Spoken British National Corpus 2014 : design, compilation and analysis |
title_sort |
spoken british national corpus 2014 : design, compilation and analysis |
publisher |
Lancaster University |
publishDate |
2018 |
url |
http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.733547 |
work_keys_str_mv |
AT loverobbie thespokenbritishnationalcorpus2014designcompilationandanalysis AT loverobbie spokenbritishnationalcorpus2014designcompilationandanalysis |
_version_ |
1718695396607262720 |