A Web-based Online Document Classification System for Food contamination News – Take Melamine as the Example
碩士 === 臺北醫學大學 === 醫學資訊研究所 === 97 === Food safety is essential to human health, and also to guard the food safety as well as the quality of food is an important issue. As long as the international food and food materials trade being more frequently, the flow of them are far-reaching, and will spread...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2009
|
Online Access: | http://ndltd.ncl.edu.tw/handle/67993068737518264946 |
id |
ndltd-TW-097TMC05674010 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-097TMC056740102016-05-04T04:31:29Z http://ndltd.ncl.edu.tw/handle/67993068737518264946 A Web-based Online Document Classification System for Food contamination News – Take Melamine as the Example 食品汙染之網路媒體即時分類系統-以三聚氰胺為例 Chia-Yi Lin 林佳頤 碩士 臺北醫學大學 醫學資訊研究所 97 Food safety is essential to human health, and also to guard the food safety as well as the quality of food is an important issue. As long as the international food and food materials trade being more frequently, the flow of them are far-reaching, and will spread all over the world once they get contaminated. Turns out, it will bring unrest and panic to the public (such as the 2008 outbreak of melamine food contamination). This study is to build an online food contamination monitoring and automatic classification system to regularly search Google news about food contamination (we took melamine-related news as an example) in Taiwan and obtain the most complete information and stored them in the database. The system will classify news into correct categories and can help users find relevant information. In this study, we used Gibbs LDA++, which is a C/C++ implementation of Latent Dirichlet Allocation (LDA) to train news documents by unsupervised learning and supervised learning. The classifier was built by the parameter estimations and inferences from LDA training results and then adjusted manually by human expert. We defined the melamine news as five categories including "contamination", "analysis", "medical and health", "law and policy" and “others”. 521 news documents were used as training data to train the classifier and 793 documents were used to test and evaluate the classifier. The assessment of the effectiveness for the classifier is based on precision, recall and F-measure. According to the evaluation for the classifier, the macro-precision is 69.66%, the macro-recall is 64.52% and the F-measure is 0.68. According to the evaluation results, we estimate the performance of classification system and will improve the system in the future research. We expect the system could save time for reading complexity news, and help people get prepared for food contamination. 徐建業 2009 學位論文 ; thesis 73 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 臺北醫學大學 === 醫學資訊研究所 === 97 === Food safety is essential to human health, and also to guard the food safety as well as the quality of food is an important issue. As long as the international food and food materials trade being more frequently, the flow of them are far-reaching, and will spread all over the world once they get contaminated. Turns out, it will bring unrest and panic to the public (such as the 2008 outbreak of melamine food contamination).
This study is to build an online food contamination monitoring and automatic classification system to regularly search Google news about food contamination (we took melamine-related news as an example) in Taiwan and obtain the most complete information and stored them in the database. The system will classify news into correct categories and can help users find relevant information.
In this study, we used Gibbs LDA++, which is a C/C++ implementation of Latent Dirichlet Allocation (LDA) to train news documents by unsupervised learning and supervised learning. The classifier was built by the parameter estimations and inferences from LDA training results and then adjusted manually by human expert. We defined the melamine news as five categories including "contamination", "analysis", "medical and health", "law and policy" and “others”. 521 news documents were used as training data to train the classifier and 793 documents were used to test and evaluate the classifier. The assessment of the effectiveness for the classifier is based on precision, recall and F-measure. According to the evaluation for the classifier, the macro-precision is 69.66%, the macro-recall is 64.52% and the F-measure is 0.68.
According to the evaluation results, we estimate the performance of classification system and will improve the system in the future research. We expect the system could save time for reading complexity news, and help people get prepared for food contamination.
|
author2 |
徐建業 |
author_facet |
徐建業 Chia-Yi Lin 林佳頤 |
author |
Chia-Yi Lin 林佳頤 |
spellingShingle |
Chia-Yi Lin 林佳頤 A Web-based Online Document Classification System for Food contamination News – Take Melamine as the Example |
author_sort |
Chia-Yi Lin |
title |
A Web-based Online Document Classification System for Food contamination News – Take Melamine as the Example |
title_short |
A Web-based Online Document Classification System for Food contamination News – Take Melamine as the Example |
title_full |
A Web-based Online Document Classification System for Food contamination News – Take Melamine as the Example |
title_fullStr |
A Web-based Online Document Classification System for Food contamination News – Take Melamine as the Example |
title_full_unstemmed |
A Web-based Online Document Classification System for Food contamination News – Take Melamine as the Example |
title_sort |
web-based online document classification system for food contamination news – take melamine as the example |
publishDate |
2009 |
url |
http://ndltd.ncl.edu.tw/handle/67993068737518264946 |
work_keys_str_mv |
AT chiayilin awebbasedonlinedocumentclassificationsystemforfoodcontaminationnewstakemelamineastheexample AT línjiāyí awebbasedonlinedocumentclassificationsystemforfoodcontaminationnewstakemelamineastheexample AT chiayilin shípǐnwūrǎnzhīwǎnglùméitǐjíshífēnlèixìtǒngyǐsānjùqíngànwèilì AT línjiāyí shípǐnwūrǎnzhīwǎnglùméitǐjíshífēnlèixìtǒngyǐsānjùqíngànwèilì AT chiayilin webbasedonlinedocumentclassificationsystemforfoodcontaminationnewstakemelamineastheexample AT línjiāyí webbasedonlinedocumentclassificationsystemforfoodcontaminationnewstakemelamineastheexample |
_version_ |
1718259674606731264 |