sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook

碩士 === 國立臺灣師範大學 === 資訊教育學系 === 99 === Language plays an important part in every reign. And the most efficient way to enhance our ability is to read. Readability can estimate whether an article is suitable for one reader. Past researches claim that readability is a mean to adjust the level of articl...

Full description

Bibliographic Details
Main Author:	胡夢珂
Other Authors:	張國恩老師
Format:	Others
Language:	zh-TW
Published:	2011
Online Access:	http://ndltd.ncl.edu.tw/handle/05892956462226005886

id	ndltd-TW-099NTNU5395023
record_format	oai_dc
spelling	ndltd-TW-099NTNU53950232015-10-19T04:05:07Z http://ndltd.ncl.edu.tw/handle/05892956462226005886 sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook 使用支援向量機進行中文文本可讀性分類-以國小國語課文為例胡夢珂碩士國立臺灣師範大學資訊教育學系 99 Language plays an important part in every reign. And the most efficient way to enhance our ability is to read. Readability can estimate whether an article is suitable for one reader. Past researches claim that readability is a mean to adjust the level of article according to different kinds of educational attainment. The research of English readability has been on its way while Chinese has a little progression. However, Chinese is a trend in nowadays. It is important to find a suitable way to classify text readability. In the past researches, many western readability formulas do to the lack of technology use linear models on text classification, and linear readability formulas is a limit for the data in my research. Therefore, the purpose of this research is to use the predict model, which trained by the support vector machine, to classify the elementary Chinese textbook’s readability. And to check up that whether the text is matched with the predict text. At last, analyze the wrong text to improve the accuracy of text readability. This research was compiled by course expert and the experience materials( from first to sixth grades deleting the classical Chinese texts of three vision texts of private publish enterprise including vision H, K, and N) total 386 texts were examined by the national compilation organization. Part of the texts are used as training materials and the others are testing materials. Through the Chinese Word Segmentation processing and data format conversion, we at last do the text classification by SVM. The research conclusion is that the accuracy of predicting elementary texts is 47.92% while the fit rate is 80.31%. At the end, analyze the wrong prediction and understand the reason of this wrong prediction. 張國恩老師宋曜廷老師張道行老師 2011 學位論文 ; thesis 75 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立臺灣師範大學 === 資訊教育學系 === 99 === Language plays an important part in every reign. And the most efficient way to enhance our ability is to read. Readability can estimate whether an article is suitable for one reader. Past researches claim that readability is a mean to adjust the level of article according to different kinds of educational attainment. The research of English readability has been on its way while Chinese has a little progression. However, Chinese is a trend in nowadays. It is important to find a suitable way to classify text readability. In the past researches, many western readability formulas do to the lack of technology use linear models on text classification, and linear readability formulas is a limit for the data in my research. Therefore, the purpose of this research is to use the predict model, which trained by the support vector machine, to classify the elementary Chinese textbook’s readability. And to check up that whether the text is matched with the predict text. At last, analyze the wrong text to improve the accuracy of text readability. This research was compiled by course expert and the experience materials( from first to sixth grades deleting the classical Chinese texts of three vision texts of private publish enterprise including vision H, K, and N) total 386 texts were examined by the national compilation organization. Part of the texts are used as training materials and the others are testing materials. Through the Chinese Word Segmentation processing and data format conversion, we at last do the text classification by SVM. The research conclusion is that the accuracy of predicting elementary texts is 47.92% while the fit rate is 80.31%. At the end, analyze the wrong prediction and understand the reason of this wrong prediction.
author2	張國恩老師
author_facet	張國恩老師胡夢珂
author	胡夢珂
spellingShingle	胡夢珂 sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook
author_sort	胡夢珂
title	sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook
title_short	sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook
title_full	sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook
title_fullStr	sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook
title_full_unstemmed	sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook
title_sort	sing the support vector machine to classify the chinese text readability – a case of elementary chinese textbook
publishDate	2011
url	http://ndltd.ncl.edu.tw/handle/05892956462226005886
work_keys_str_mv	AT húmèngkē singthesupportvectormachinetoclassifythechinesetextreadabilityacaseofelementarychinesetextbook AT húmèngkē shǐyòngzhīyuánxiàngliàngjījìnxíngzhōngwénwénběnkědúxìngfēnlèiyǐguóxiǎoguóyǔkèwénwèilì
_version_	1718095405509509120

sing the Support Vector Machine to classify the Chinese text readability – A Case of Elementary Chinese Textbook

Similar Items