The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition

碩士 === 國立陽明大學 === 生物醫學資訊研究所 === 97 === The speech recognition technology has improved in last two decades, and some of them were used frequently in health industry of English speaking country. However, multilingual speech recognition is a challenging necessity when it comes to Mandarin based nursing...

Full description

Bibliographic Details
Main Authors: Pin-Jen Huang, 黃品甄
Other Authors: Polun Chang
Format: Others
Language:zh-TW
Published: 2009
Online Access:http://ndltd.ncl.edu.tw/handle/91874768499608773576
id ndltd-TW-097YM005114004
record_format oai_dc
spelling ndltd-TW-097YM0051140042016-05-04T04:16:30Z http://ndltd.ncl.edu.tw/handle/91874768499608773576 The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition 護理記錄語料及辭典之建置與應用於語音辨識之可行性評估 Pin-Jen Huang 黃品甄 碩士 國立陽明大學 生物醫學資訊研究所 97 The speech recognition technology has improved in last two decades, and some of them were used frequently in health industry of English speaking country. However, multilingual speech recognition is a challenging necessity when it comes to Mandarin based nursing information system. In order to develop the foundation of nursing record entry interface with speech recognition technology, the aim of this study is to build a nursing record corpus and lexicon in Mandarin. We selected electronic nursing record from a medical center in Taiwan as text training data. The data were recorded during Jul, 07 to Mar, 08, included 7 ICU wards and 5 general wards. We used word segmentation and unknown-word extraction system from ACADEMIA SINICA to extract nursing record lexicon and training language models with and without the new lexicon, then calculate the perplexity of language models as evaluation. In this study, we build a 974-words nursing record lexicon. The relative perplexity reduction is 15.541% from the model without nursing record lexicon to the model with 974-words nursing record lexicon. Polun Chang 張博論 2009 學位論文 ; thesis 56 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立陽明大學 === 生物醫學資訊研究所 === 97 === The speech recognition technology has improved in last two decades, and some of them were used frequently in health industry of English speaking country. However, multilingual speech recognition is a challenging necessity when it comes to Mandarin based nursing information system. In order to develop the foundation of nursing record entry interface with speech recognition technology, the aim of this study is to build a nursing record corpus and lexicon in Mandarin. We selected electronic nursing record from a medical center in Taiwan as text training data. The data were recorded during Jul, 07 to Mar, 08, included 7 ICU wards and 5 general wards. We used word segmentation and unknown-word extraction system from ACADEMIA SINICA to extract nursing record lexicon and training language models with and without the new lexicon, then calculate the perplexity of language models as evaluation. In this study, we build a 974-words nursing record lexicon. The relative perplexity reduction is 15.541% from the model without nursing record lexicon to the model with 974-words nursing record lexicon.
author2 Polun Chang
author_facet Polun Chang
Pin-Jen Huang
黃品甄
author Pin-Jen Huang
黃品甄
spellingShingle Pin-Jen Huang
黃品甄
The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition
author_sort Pin-Jen Huang
title The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition
title_short The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition
title_full The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition
title_fullStr The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition
title_full_unstemmed The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition
title_sort feasibility study of building a routine nursing records corpus, lexicon and its application in the speech recognition
publishDate 2009
url http://ndltd.ncl.edu.tw/handle/91874768499608773576
work_keys_str_mv AT pinjenhuang thefeasibilitystudyofbuildingaroutinenursingrecordscorpuslexiconanditsapplicationinthespeechrecognition
AT huángpǐnzhēn thefeasibilitystudyofbuildingaroutinenursingrecordscorpuslexiconanditsapplicationinthespeechrecognition
AT pinjenhuang hùlǐjìlùyǔliàojícídiǎnzhījiànzhìyǔyīngyòngyúyǔyīnbiànshízhīkěxíngxìngpínggū
AT huángpǐnzhēn hùlǐjìlùyǔliàojícídiǎnzhījiànzhìyǔyīngyòngyúyǔyīnbiànshízhīkěxíngxìngpínggū
AT pinjenhuang feasibilitystudyofbuildingaroutinenursingrecordscorpuslexiconanditsapplicationinthespeechrecognition
AT huángpǐnzhēn feasibilitystudyofbuildingaroutinenursingrecordscorpuslexiconanditsapplicationinthespeechrecognition
_version_ 1718255273307537408