Graph Models For Query Focused Text Summarization And Assessment Of Machine Translation Using Stopwords

Text summarization is the task of generating a shortened version of the original text where core ideas of the original text are retained. In this work, we focus on query focused summarization. The task is to generate the summary from a set of documents which answers the query. Query focused summariz...

Full description

Bibliographic Details
Main Author: Rama, B
Other Authors: Veni Madhavan, C E
Language:en_US
Published: 2014
Subjects:
Online Access:http://etd.iisc.ernet.in/handle/2005/2294
http://etd.ncsi.iisc.ernet.in/abstracts/2952/G25317-Abs.pdf
id ndltd-IISc-oai-etd.ncsi.iisc.ernet.in-2005-2294
record_format oai_dc
spelling ndltd-IISc-oai-etd.ncsi.iisc.ernet.in-2005-22942018-01-10T03:36:37ZGraph Models For Query Focused Text Summarization And Assessment Of Machine Translation Using StopwordsRama, BNatural Language ProcessingAbstractingQuery OptimizationMachine TranslationText SummarizationQuery Focused SummarizationMachine TranslatorsComputer ScienceText summarization is the task of generating a shortened version of the original text where core ideas of the original text are retained. In this work, we focus on query focused summarization. The task is to generate the summary from a set of documents which answers the query. Query focused summarization is a hard task because it expects the summary to be biased towards the query and at the same time important concepts in the original documents must be preserved with high degree of novelty. Graph based ranking algorithms which use biased random surfer model like Topic-sensitive LexRank have been applied to query focused summarization. In our work, we propose look-ahead version of Topic-sensitive LexRank. We incorporate the option of look-ahead in the random walk model and we show that it helps in generating better quality summaries. Next, we consider assessment of machine translation. Assessment of a machine translation output is important for establishing benchmarks for translation quality. An obvious way to assess the quality of machine translation is through the perception of human subjects. Though highly reliable, this approach is not scalable and is time consuming. Hence mechanisms have been devised to automate the assessment process. All such assessment methods are essentially a study of correlations between human translation and the machine translation. In this work, we present a scalable approach to assess the quality of machine translation that borrows features from the study of writing styles, popularly known as Stylometry. Towards this, we quantify the characteristic styles of individual machine translators and compare them with that of human generated text. The translator whose style is closest to human style is deemed to generate a higher quality translation. We show that our approach is scalable and does not require actual source text translations for evaluation.Veni Madhavan, C E2014-04-09T10:57:00Z2014-04-09T10:57:00Z2014-04-092012-06Thesishttp://etd.iisc.ernet.in/handle/2005/2294http://etd.ncsi.iisc.ernet.in/abstracts/2952/G25317-Abs.pdfen_USG25317
collection NDLTD
language en_US
sources NDLTD
topic Natural Language Processing
Abstracting
Query Optimization
Machine Translation
Text Summarization
Query Focused Summarization
Machine Translators
Computer Science
spellingShingle Natural Language Processing
Abstracting
Query Optimization
Machine Translation
Text Summarization
Query Focused Summarization
Machine Translators
Computer Science
Rama, B
Graph Models For Query Focused Text Summarization And Assessment Of Machine Translation Using Stopwords
description Text summarization is the task of generating a shortened version of the original text where core ideas of the original text are retained. In this work, we focus on query focused summarization. The task is to generate the summary from a set of documents which answers the query. Query focused summarization is a hard task because it expects the summary to be biased towards the query and at the same time important concepts in the original documents must be preserved with high degree of novelty. Graph based ranking algorithms which use biased random surfer model like Topic-sensitive LexRank have been applied to query focused summarization. In our work, we propose look-ahead version of Topic-sensitive LexRank. We incorporate the option of look-ahead in the random walk model and we show that it helps in generating better quality summaries. Next, we consider assessment of machine translation. Assessment of a machine translation output is important for establishing benchmarks for translation quality. An obvious way to assess the quality of machine translation is through the perception of human subjects. Though highly reliable, this approach is not scalable and is time consuming. Hence mechanisms have been devised to automate the assessment process. All such assessment methods are essentially a study of correlations between human translation and the machine translation. In this work, we present a scalable approach to assess the quality of machine translation that borrows features from the study of writing styles, popularly known as Stylometry. Towards this, we quantify the characteristic styles of individual machine translators and compare them with that of human generated text. The translator whose style is closest to human style is deemed to generate a higher quality translation. We show that our approach is scalable and does not require actual source text translations for evaluation.
author2 Veni Madhavan, C E
author_facet Veni Madhavan, C E
Rama, B
author Rama, B
author_sort Rama, B
title Graph Models For Query Focused Text Summarization And Assessment Of Machine Translation Using Stopwords
title_short Graph Models For Query Focused Text Summarization And Assessment Of Machine Translation Using Stopwords
title_full Graph Models For Query Focused Text Summarization And Assessment Of Machine Translation Using Stopwords
title_fullStr Graph Models For Query Focused Text Summarization And Assessment Of Machine Translation Using Stopwords
title_full_unstemmed Graph Models For Query Focused Text Summarization And Assessment Of Machine Translation Using Stopwords
title_sort graph models for query focused text summarization and assessment of machine translation using stopwords
publishDate 2014
url http://etd.iisc.ernet.in/handle/2005/2294
http://etd.ncsi.iisc.ernet.in/abstracts/2952/G25317-Abs.pdf
work_keys_str_mv AT ramab graphmodelsforqueryfocusedtextsummarizationandassessmentofmachinetranslationusingstopwords
_version_ 1718603724463538176