The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition
Master's === National Yang-Ming University === Institute of Biomedical Informatics === 97 === Speech recognition technology has improved over the last two decades, and some systems are used frequently in the health industry of English-speaking countries. However, multilingual speech recognition is a challenging necessity when it comes to Mandarin-based nursing...
Main Authors: | Pin-Jen Huang, 黃品甄 |
---|---|
Other Authors: | Polun Chang |
Format: | Others |
Language: | zh-TW |
Published: | 2009 |
Online Access: | http://ndltd.ncl.edu.tw/handle/91874768499608773576 |
id |
ndltd-TW-097YM005114004 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-097YM005114004 2016-05-04T04:16:30Z http://ndltd.ncl.edu.tw/handle/91874768499608773576 The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition 護理記錄語料及辭典之建置與應用於語音辨識之可行性評估 Pin-Jen Huang 黃品甄 Master's National Yang-Ming University Institute of Biomedical Informatics 97 Polun Chang 張博論 2009 thesis 56 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's === National Yang-Ming University === Institute of Biomedical Informatics === 97 === Speech recognition technology has improved over the last two decades, and some systems are used frequently in the health industry of English-speaking countries. However, multilingual speech recognition is a challenging necessity for Mandarin-based nursing information systems. To lay the foundation for a nursing record entry interface with speech recognition, this study aims to build a nursing record corpus and lexicon in Mandarin. We selected electronic nursing records from a medical center in Taiwan as text training data. The data were recorded from July 2007 to March 2008 and covered 7 ICU wards and 5 general wards. We used the word segmentation and unknown-word extraction system from Academia Sinica to extract a nursing record lexicon, trained language models with and without the new lexicon, and calculated the perplexity of the language models as the evaluation measure. In this study, we built a 974-word nursing record lexicon. The relative perplexity reduction from the model without the nursing record lexicon to the model with the 974-word lexicon is 15.541%.
|
author2 |
Polun Chang |
author_facet |
Polun Chang Pin-Jen Huang 黃品甄 |
author |
Pin-Jen Huang 黃品甄 |
author_sort |
Pin-Jen Huang |
title |
The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition |
title_sort |
feasibility study of building a routine nursing records corpus, lexicon and its application in the speech recognition |
publishDate |
2009 |
url |
http://ndltd.ncl.edu.tw/handle/91874768499608773576 |
_version_ |
1718255273307537408 |
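
The description above evaluates language models by perplexity and reports a relative perplexity reduction of 15.541%. The record does not give the underlying perplexity values or the exact model type, so the following is only a minimal sketch of how such a figure is commonly computed. It assumes the usual definition, (baseline - new) / baseline, and uses an add-one-smoothed unigram model as a stand-in for whatever language model the thesis trained. The function names and the numeric inputs are hypothetical.

```python
import math
from collections import Counter


def unigram_perplexity(train_tokens, test_tokens):
    """Perplexity of an add-one-smoothed unigram model on test_tokens.

    Assumption: the vocabulary is the union of training and test tokens,
    a simplification so that every test token has non-zero probability.
    """
    counts = Counter(train_tokens)
    vocab = set(train_tokens) | set(test_tokens)
    denom = len(train_tokens) + len(vocab)
    # Sum of log2 probabilities of the test tokens under the model.
    log_prob = sum(math.log2((counts[t] + 1) / denom) for t in test_tokens)
    # Perplexity is 2 raised to the negative average log2 probability.
    return 2 ** (-log_prob / len(test_tokens))


def relative_reduction(pp_baseline, pp_new):
    """Relative perplexity reduction in percent (assumed standard definition)."""
    return (pp_baseline - pp_new) / pp_baseline * 100.0


# Hypothetical perplexities, chosen only to illustrate the arithmetic;
# the actual values from the thesis are not given in this record.
pp_without_lexicon = 200.0
pp_with_lexicon = 168.918
print(round(relative_reduction(pp_without_lexicon, pp_with_lexicon), 3))
```

In a comparison like the one described, the two perplexities would come from models trained on the same nursing record text segmented without and with the 974-word lexicon.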