The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition

碩士 === 國立陽明大學 === 生物醫學資訊研究所 === 97 === The speech recognition technology has improved in last two decades, and some of them were used frequently in health industry of English speaking country. However, multilingual speech recognition is a challenging necessity when it comes to Mandarin based nursing...

Full description

Bibliographic Details
Main Authors:	Pin-Jen Huang, 黃品甄
Other Authors:	Polun Chang
Format:	Others
Language:	zh-TW
Published:	2009
Online Access:	http://ndltd.ncl.edu.tw/handle/91874768499608773576

id	ndltd-TW-097YM005114004
record_format	oai_dc
spelling	ndltd-TW-097YM0051140042016-05-04T04:16:30Z http://ndltd.ncl.edu.tw/handle/91874768499608773576 The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition 護理記錄語料及辭典之建置與應用於語音辨識之可行性評估 Pin-Jen Huang 黃品甄碩士國立陽明大學生物醫學資訊研究所 97 The speech recognition technology has improved in last two decades, and some of them were used frequently in health industry of English speaking country. However, multilingual speech recognition is a challenging necessity when it comes to Mandarin based nursing information system. In order to develop the foundation of nursing record entry interface with speech recognition technology, the aim of this study is to build a nursing record corpus and lexicon in Mandarin. We selected electronic nursing record from a medical center in Taiwan as text training data. The data were recorded during Jul, 07 to Mar, 08, included 7 ICU wards and 5 general wards. We used word segmentation and unknown-word extraction system from ACADEMIA SINICA to extract nursing record lexicon and training language models with and without the new lexicon, then calculate the perplexity of language models as evaluation. In this study, we build a 974-words nursing record lexicon. The relative perplexity reduction is 15.541％ from the model without nursing record lexicon to the model with 974-words nursing record lexicon. Polun Chang 張博論 2009 學位論文 ; thesis 56 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立陽明大學 === 生物醫學資訊研究所 === 97 === The speech recognition technology has improved in last two decades, and some of them were used frequently in health industry of English speaking country. However, multilingual speech recognition is a challenging necessity when it comes to Mandarin based nursing information system. In order to develop the foundation of nursing record entry interface with speech recognition technology, the aim of this study is to build a nursing record corpus and lexicon in Mandarin. We selected electronic nursing record from a medical center in Taiwan as text training data. The data were recorded during Jul, 07 to Mar, 08, included 7 ICU wards and 5 general wards. We used word segmentation and unknown-word extraction system from ACADEMIA SINICA to extract nursing record lexicon and training language models with and without the new lexicon, then calculate the perplexity of language models as evaluation. In this study, we build a 974-words nursing record lexicon. The relative perplexity reduction is 15.541％ from the model without nursing record lexicon to the model with 974-words nursing record lexicon.
author2	Polun Chang
author_facet	Polun Chang Pin-Jen Huang 黃品甄
author	Pin-Jen Huang 黃品甄
spellingShingle	Pin-Jen Huang 黃品甄 The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition
author_sort	Pin-Jen Huang
title	The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition
title_short	The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition
title_full	The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition
title_fullStr	The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition
title_full_unstemmed	The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition
title_sort	feasibility study of building a routine nursing records corpus, lexicon and its application in the speech recognition
publishDate	2009
url	http://ndltd.ncl.edu.tw/handle/91874768499608773576
work_keys_str_mv	AT pinjenhuang thefeasibilitystudyofbuildingaroutinenursingrecordscorpuslexiconanditsapplicationinthespeechrecognition AT huángpǐnzhēn thefeasibilitystudyofbuildingaroutinenursingrecordscorpuslexiconanditsapplicationinthespeechrecognition AT pinjenhuang hùlǐjìlùyǔliàojícídiǎnzhījiànzhìyǔyīngyòngyúyǔyīnbiànshízhīkěxíngxìngpínggū AT huángpǐnzhēn hùlǐjìlùyǔliàojícídiǎnzhījiànzhìyǔyīngyòngyúyǔyīnbiànshízhīkěxíngxìngpínggū AT pinjenhuang feasibilitystudyofbuildingaroutinenursingrecordscorpuslexiconanditsapplicationinthespeechrecognition AT huángpǐnzhēn feasibilitystudyofbuildingaroutinenursingrecordscorpuslexiconanditsapplicationinthespeechrecognition
_version_	1718255273307537408

The Feasibility Study of Building a Routine Nursing Records Corpus, Lexicon and Its Application in the Speech Recognition

Similar Items