Summary: | Master's === National Cheng Kung University === Department of Engineering Science, In-service Master's Program === 104 === The popularity of the Internet has changed how information is broadcast and has reshaped the ecology of media services. Because information spreads so quickly over networks, the amount we receive every day is beyond imagination. Faced with this flood of information, we begin to wonder whether its content meets a certain standard of quality and whether what we receive is correct and useful or merely spam. This research proposes a systematic approach for judging whether a piece of news from the Internet is exclusive, from which one can also begin to think about and judge the correctness and truth of that news. The implementation of the proposed approach uses Google Custom Search to search for Chinese news articles on specific news websites; the Chinese Word Segmentation System - CKIP to extract keywords from the title of the targeted exclusive news and to segment the contents of the retrieved news articles so that word frequencies can be counted; PHP cURL and HTML DOM Parser to fetch news articles and filter out irrelevant words in the webpages; and the cosine similarity algorithm to calculate the similarity between two news articles. Although the system still needs improvement, the judgement of exclusive news in this research can prompt people to think about and judge the correctness and truth of a piece of news, and it points the way toward investigating the automatic judgement of the truth or correctness of news.
|
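
The similarity step named in the summary can be illustrated with a minimal sketch. It is written in Python for brevity and is not the thesis's own PHP implementation. It assumes each article has already been segmented into a list of words (the CKIP call is not shown), and the function name `cosine_similarity` and the sample tokens are hypothetical.

```python
import math
from collections import Counter


def cosine_similarity(tokens_a, tokens_b):
    """Cosine similarity between the word-frequency vectors of two token lists.

    Assumption: the inputs are already-segmented words; segmentation itself
    (done with CKIP in the summary) is outside this sketch.
    """
    freq_a = Counter(tokens_a)
    freq_b = Counter(tokens_b)

    # Dot product over the words that the two articles share.
    shared = freq_a.keys() & freq_b.keys()
    dot = sum(freq_a[w] * freq_b[w] for w in shared)

    # Euclidean norms of the two frequency vectors.
    norm_a = math.sqrt(sum(c * c for c in freq_a.values()))
    norm_b = math.sqrt(sum(c * c for c in freq_b.values()))

    # An empty article has no direction; treat its similarity as 0.
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Hypothetical, already-segmented articles.
    article_1 = ["mayor", "announces", "budget", "budget"]
    article_2 = ["mayor", "rejects", "budget"]
    print(cosine_similarity(article_1, article_2))
```

A score near 1 means the two articles use much the same words with similar frequencies, and a score near 0 means they share almost none. How such scores would be turned into an "exclusive" verdict is not stated in the summary and is left open here.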