sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook
碩士 === 國立臺灣師範大學 === 資訊教育學系 === 99 === Language plays an important part in every reign. And the most efficient way to enhance our ability is to read. Readability can estimate whether an article is suitable for one reader. Past researches claim that readability is a mean to adjust the level of articl...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2011
|
Online Access: | http://ndltd.ncl.edu.tw/handle/05892956462226005886 |
id |
ndltd-TW-099NTNU5395023 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-099NTNU53950232015-10-19T04:05:07Z http://ndltd.ncl.edu.tw/handle/05892956462226005886 sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook 使用支援向量機進行中文文本可讀性分類-以國小國語課文為例 胡夢珂 碩士 國立臺灣師範大學 資訊教育學系 99 Language plays an important part in every reign. And the most efficient way to enhance our ability is to read. Readability can estimate whether an article is suitable for one reader. Past researches claim that readability is a mean to adjust the level of article according to different kinds of educational attainment. The research of English readability has been on its way while Chinese has a little progression. However, Chinese is a trend in nowadays. It is important to find a suitable way to classify text readability. In the past researches, many western readability formulas do to the lack of technology use linear models on text classification, and linear readability formulas is a limit for the data in my research. Therefore, the purpose of this research is to use the predict model, which trained by the support vector machine, to classify the elementary Chinese textbook’s readability. And to check up that whether the text is matched with the predict text. At last, analyze the wrong text to improve the accuracy of text readability. This research was compiled by course expert and the experience materials( from first to sixth grades deleting the classical Chinese texts of three vision texts of private publish enterprise including vision H, K, and N) total 386 texts were examined by the national compilation organization. Part of the texts are used as training materials and the others are testing materials. Through the Chinese Word Segmentation processing and data format conversion, we at last do the text classification by SVM. The research conclusion is that the accuracy of predicting elementary texts is 47.92% while the fit rate is 80.31%. At the end, analyze the wrong prediction and understand the reason of this wrong prediction. 張國恩老師 宋曜廷老師 張道行老師 2011 學位論文 ; thesis 75 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣師範大學 === 資訊教育學系 === 99 === Language plays an important part in every reign. And the most efficient way to enhance our ability is to read. Readability can estimate whether an article is suitable for one reader. Past researches claim that readability is a mean to adjust the level of article according to different kinds of educational attainment. The research of English readability has been on its way while Chinese has a little progression. However, Chinese is a trend in nowadays. It is important to find a suitable way to classify text readability.
In the past researches, many western readability formulas do to the lack of technology use linear models on text classification, and linear readability formulas is a limit for the data in my research. Therefore, the purpose of this research is to use the predict model, which trained by the support vector machine, to classify the elementary Chinese textbook’s readability. And to check up that whether the text is matched with the predict text. At last, analyze the wrong text to improve the accuracy of text readability.
This research was compiled by course expert and the experience materials( from first to sixth grades deleting the classical Chinese texts of three vision texts of private publish enterprise including vision H, K, and N) total 386 texts were examined by the national compilation organization. Part of the texts are used as training materials and the others are testing materials. Through the Chinese Word Segmentation processing and data format conversion, we at last do the text classification by SVM. The research conclusion is that the accuracy of predicting elementary texts is 47.92% while the fit rate is 80.31%. At the end, analyze the wrong prediction and understand the reason of this wrong prediction.
|
author2 |
張國恩老師 |
author_facet |
張國恩老師 胡夢珂 |
author |
胡夢珂 |
spellingShingle |
胡夢珂 sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook |
author_sort |
胡夢珂 |
title |
sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook |
title_short |
sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook |
title_full |
sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook |
title_fullStr |
sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook |
title_full_unstemmed |
sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook |
title_sort |
sing the support vector machine to classify the chinese text readability – a case of elementary chinese textbook |
publishDate |
2011 |
url |
http://ndltd.ncl.edu.tw/handle/05892956462226005886 |
work_keys_str_mv |
AT húmèngkē singthesupportvectormachinetoclassifythechinesetextreadabilityacaseofelementarychinesetextbook AT húmèngkē shǐyòngzhīyuánxiàngliàngjījìnxíngzhōngwénwénběnkědúxìngfēnlèiyǐguóxiǎoguóyǔkèwénwèilì |
_version_ |
1718095405509509120 |