A Feature-Based Automatic Speech Digest Generator

碩士 === 國立交通大學 === 資訊管理研究所 === 100 === As the number of speech and video documents is increasing on the Internet and portable devices, speech summarization has become more important in these years. In usual, the research domain focused on the domain of broadcast and news. Unfortunately, the method o...

Full description

Bibliographic Details
Main Authors: Wu, Yu-Rou, 吳御柔
Other Authors: Lo, Chi-Chun
Format: Others
Language:en_US
Published: 2012
Online Access:http://ndltd.ncl.edu.tw/handle/28950894916160238237
id ndltd-TW-100NCTU5396020
record_format oai_dc
spelling ndltd-TW-100NCTU53960202016-03-28T04:20:36Z http://ndltd.ncl.edu.tw/handle/28950894916160238237 A Feature-Based Automatic Speech Digest Generator 一個基於特徵值的演講語音自動摘要產生器 Wu, Yu-Rou 吳御柔 碩士 國立交通大學 資訊管理研究所 100 As the number of speech and video documents is increasing on the Internet and portable devices, speech summarization has become more important in these years. In usual, the research domain focused on the domain of broadcast and news. Unfortunately, the method of automatic summarization used in the past may not suit to other speech domains (e.g. lecture speech). Therefore, this thesis focuses on the research of lecture speech domain. We analyze the features used in past research, choose the suitable features through experimental, and propose a three-phase Real-Time Speech Summarizer (RTSS). Phase one chooses independent features (e.g. centrality, resemblance to the title, sentence length, term frequency, and thematic word) and calculates the independent features-scores; phase two calculates the dependent feature such as position with above-mentioned independent features-scores; phase three compares the above-mentioned feature-scores, weighted average the function-scores to find the top score sentence, and get the summary. With the experimental, RTSS are evaluated by comparing the summary sentence set selecting from RTSS and five experts. RTSS is a useful that the Macro F-Measure score is 52%, and the Macro Accuracy is 70% that can help users to get the key information of speech. Lo, Chi-Chun 羅濟群 2012 學位論文 ; thesis 37 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 資訊管理研究所 === 100 === As the number of speech and video documents is increasing on the Internet and portable devices, speech summarization has become more important in these years. In usual, the research domain focused on the domain of broadcast and news. Unfortunately, the method of automatic summarization used in the past may not suit to other speech domains (e.g. lecture speech). Therefore, this thesis focuses on the research of lecture speech domain. We analyze the features used in past research, choose the suitable features through experimental, and propose a three-phase Real-Time Speech Summarizer (RTSS). Phase one chooses independent features (e.g. centrality, resemblance to the title, sentence length, term frequency, and thematic word) and calculates the independent features-scores; phase two calculates the dependent feature such as position with above-mentioned independent features-scores; phase three compares the above-mentioned feature-scores, weighted average the function-scores to find the top score sentence, and get the summary. With the experimental, RTSS are evaluated by comparing the summary sentence set selecting from RTSS and five experts. RTSS is a useful that the Macro F-Measure score is 52%, and the Macro Accuracy is 70% that can help users to get the key information of speech.
author2 Lo, Chi-Chun
author_facet Lo, Chi-Chun
Wu, Yu-Rou
吳御柔
author Wu, Yu-Rou
吳御柔
spellingShingle Wu, Yu-Rou
吳御柔
A Feature-Based Automatic Speech Digest Generator
author_sort Wu, Yu-Rou
title A Feature-Based Automatic Speech Digest Generator
title_short A Feature-Based Automatic Speech Digest Generator
title_full A Feature-Based Automatic Speech Digest Generator
title_fullStr A Feature-Based Automatic Speech Digest Generator
title_full_unstemmed A Feature-Based Automatic Speech Digest Generator
title_sort feature-based automatic speech digest generator
publishDate 2012
url http://ndltd.ncl.edu.tw/handle/28950894916160238237
work_keys_str_mv AT wuyurou afeaturebasedautomaticspeechdigestgenerator
AT wúyùróu afeaturebasedautomaticspeechdigestgenerator
AT wuyurou yīgèjīyútèzhēngzhídeyǎnjiǎngyǔyīnzìdòngzhāiyàochǎnshēngqì
AT wúyùróu yīgèjīyútèzhēngzhídeyǎnjiǎngyǔyīnzìdòngzhāiyàochǎnshēngqì
AT wuyurou featurebasedautomaticspeechdigestgenerator
AT wúyùróu featurebasedautomaticspeechdigestgenerator
_version_ 1718212757476605952