Ellipsis Handling in A Medical Diagnosis Dialog System
碩士 === 國立臺灣海洋大學 === 資訊工程學系 === 100 === Computerized virtual patient (CVP) is a domain specific dialog system. In this system, we handle ellipsis in medical diagnosis. Virtual patient is an important teaching method for medical college’s student. It can help student to learn how to judge patient’s co...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2012
|
Online Access: | http://ndltd.ncl.edu.tw/handle/50928352094819149040 |
id |
ndltd-TW-100NTOU5394004 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-100NTOU53940042015-10-13T22:01:07Z http://ndltd.ncl.edu.tw/handle/50928352094819149040 Ellipsis Handling in A Medical Diagnosis Dialog System 醫療問診對話系統中省略現象之處理 鮑建威 碩士 國立臺灣海洋大學 資訊工程學系 100 Computerized virtual patient (CVP) is a domain specific dialog system. In this system, we handle ellipsis in medical diagnosis. Virtual patient is an important teaching method for medical college’s student. It can help student to learn how to judge patient’s condition from medical diagnosis. CVP need to resolve oral phenomenon something like our goal ellipsis. If we don’t handle ellipsis which is not easy to find corresponding problems and answers in the standard problem set of teaching text. Ellipsis handling includes ellipsis detection, type classification and recovery. There are many domain specific dialog system, but no one similar ours. Ellipses in our thesis are classified according to omitted element. Medical diagnosis template saves necessary information from dialogs. Our system is a hybrid system, rule-based module and machine learning. Rule-based module uses information from template to detect, classify and recover ellipsis. If some ellipsis can’t be detected by rule-based module, machine learning will implement. We learn a classifier for detecting ellipsis. Features include lexical surface, word information, POS, verb tense, punctuation and special terms from observation. These features also can be combined. After detection, rule-based module classifies and recovers ellipsis. The training and testing data are from virtual patient’s teaching record and medical diagnosis record in the hospital. Our machine learning method is Condition Random Field(CRF). Training is performed in 10-fold-cross-validation. In training, when using best features and feature combination, ellipsis detection classifier with a f-value of 86.73%, then recover by rule-based module with a f-value of 78.95%. Using information in diagnosis template to detect and recover ellipsis with a f-value of 82.58%. Total ellipsis system with a f-value of 85.54%. In testing, ellipsis system with a recall of 77.4%, a precision of 79.36% and a f-value of 78.35% Chuan-Jie Lin 林川傑 2012 學位論文 ; thesis 57 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣海洋大學 === 資訊工程學系 === 100 === Computerized virtual patient (CVP) is a domain specific dialog system. In this system, we handle ellipsis in medical diagnosis.
Virtual patient is an important teaching method for medical college’s student. It can help student to learn how to judge patient’s condition from medical diagnosis. CVP need to resolve oral phenomenon something like our goal ellipsis. If we don’t handle ellipsis which is not easy to find corresponding problems and answers in the standard problem set of teaching text.
Ellipsis handling includes ellipsis detection, type classification and recovery. There are many domain specific dialog system, but no one similar ours. Ellipses in our thesis are classified according to omitted element. Medical diagnosis template saves necessary information from dialogs.
Our system is a hybrid system, rule-based module and machine learning. Rule-based module uses information from template to detect, classify and recover ellipsis. If some ellipsis can’t be detected by rule-based module, machine learning will implement. We learn a classifier for detecting ellipsis. Features include lexical surface, word information, POS, verb tense, punctuation and special terms from observation. These features also can be combined. After detection, rule-based module classifies and recovers ellipsis.
The training and testing data are from virtual patient’s teaching record and medical diagnosis record in the hospital. Our machine learning method is Condition Random Field(CRF). Training is performed in 10-fold-cross-validation.
In training, when using best features and feature combination, ellipsis detection classifier with a f-value of 86.73%, then recover by rule-based module with a f-value of 78.95%. Using information in diagnosis template to detect and recover ellipsis with a f-value of 82.58%. Total ellipsis system with a f-value of 85.54%. In testing, ellipsis system with a recall of 77.4%, a precision of 79.36% and a f-value of 78.35%
|
author2 |
Chuan-Jie Lin |
author_facet |
Chuan-Jie Lin 鮑建威 |
author |
鮑建威 |
spellingShingle |
鮑建威 Ellipsis Handling in A Medical Diagnosis Dialog System |
author_sort |
鮑建威 |
title |
Ellipsis Handling in A Medical Diagnosis Dialog System |
title_short |
Ellipsis Handling in A Medical Diagnosis Dialog System |
title_full |
Ellipsis Handling in A Medical Diagnosis Dialog System |
title_fullStr |
Ellipsis Handling in A Medical Diagnosis Dialog System |
title_full_unstemmed |
Ellipsis Handling in A Medical Diagnosis Dialog System |
title_sort |
ellipsis handling in a medical diagnosis dialog system |
publishDate |
2012 |
url |
http://ndltd.ncl.edu.tw/handle/50928352094819149040 |
work_keys_str_mv |
AT bàojiànwēi ellipsishandlinginamedicaldiagnosisdialogsystem AT bàojiànwēi yīliáowènzhěnduìhuàxìtǒngzhōngshěnglüèxiànxiàngzhīchùlǐ |
_version_ |
1718071621129863168 |