Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering
The increasing usage of the Internet and social networks has produced a significant amount of online textual data. These online textual data led to information overload and redundancy. It is important to eliminate the information redundancy and preserve the time required for reading these online tex...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9303358/ |
id |
doaj-68f2698fd810493fa3b87a3993d9f18a |
---|---|
record_format |
Article |
spelling |
doaj-68f2698fd810493fa3b87a3993d9f18a2021-03-30T04:20:23ZengIEEEIEEE Access2169-35362020-01-01822820622822410.1109/ACCESS.2020.30464949303358Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid ClusteringRana Alqaisi0https://orcid.org/0000-0003-4298-803XWasel Ghanem1https://orcid.org/0000-0003-2409-4729Aziz Qaroush2https://orcid.org/0000-0002-0274-4121Department of Electrical and Computer Engineering, Birzeit University, Birzeit, PalestineDepartment of Electrical and Computer Engineering, Birzeit University, Birzeit, PalestineDepartment of Electrical and Computer Engineering, Birzeit University, Birzeit, PalestineThe increasing usage of the Internet and social networks has produced a significant amount of online textual data. These online textual data led to information overload and redundancy. It is important to eliminate the information redundancy and preserve the time required for reading these online textual data. Thus, there is a persistent need for an automatic text summarization system, which extract the relevant and salient information from a collection of documents, that sharing the same or related topics. Then, presenting this extracted information in a condensed form to preserve the main topics. This paper proposes an automatic, generic, and extractive Arabic multi-document summarization system. The proposed system employs the clustering-based and evolutionary multi-objective optimization methods. The clustering-based method discovers the main topics in the text, while the evolutionary multi-objective optimization method optimizes three objectives based on coverage, diversity/redundancy, and relevancy. The performance of the proposed system is evaluated using TAC 2011 and DUC 2002 datasets. The experimental results are compared using ROUGE evaluation measure. The obtained results showed the effectiveness of the proposed system compared to other peer systems. The proposed system outperformed other peer systems for all ROUGE metrics using TAC 2011. We achieved an F-measure of 38.9%, 17.7%, 35.4%, and 15.8% for Rouge-1, Rouge-2, Rouge-L, and Rouge-SU4, respectively. In addition, the proposed system with DUC 2002 dataset achieved an F-measure of 47.1%, 23.7%, 47.1%, 20.4% for Rouge-1, Rouge-2, Rouge-L, and Rouge-SU4, respectively.https://ieeexplore.ieee.org/document/9303358/Natural language processingextractive text summarizationmulti-objective optimizationmaximum coverage and relevancyless redundancy |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Rana Alqaisi Wasel Ghanem Aziz Qaroush |
spellingShingle |
Rana Alqaisi Wasel Ghanem Aziz Qaroush Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering IEEE Access Natural language processing extractive text summarization multi-objective optimization maximum coverage and relevancy less redundancy |
author_facet |
Rana Alqaisi Wasel Ghanem Aziz Qaroush |
author_sort |
Rana Alqaisi |
title |
Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering |
title_short |
Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering |
title_full |
Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering |
title_fullStr |
Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering |
title_full_unstemmed |
Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering |
title_sort |
extractive multi-document arabic text summarization using evolutionary multi-objective optimization with k-medoid clustering |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
The increasing usage of the Internet and social networks has produced a significant amount of online textual data. These online textual data led to information overload and redundancy. It is important to eliminate the information redundancy and preserve the time required for reading these online textual data. Thus, there is a persistent need for an automatic text summarization system, which extract the relevant and salient information from a collection of documents, that sharing the same or related topics. Then, presenting this extracted information in a condensed form to preserve the main topics. This paper proposes an automatic, generic, and extractive Arabic multi-document summarization system. The proposed system employs the clustering-based and evolutionary multi-objective optimization methods. The clustering-based method discovers the main topics in the text, while the evolutionary multi-objective optimization method optimizes three objectives based on coverage, diversity/redundancy, and relevancy. The performance of the proposed system is evaluated using TAC 2011 and DUC 2002 datasets. The experimental results are compared using ROUGE evaluation measure. The obtained results showed the effectiveness of the proposed system compared to other peer systems. The proposed system outperformed other peer systems for all ROUGE metrics using TAC 2011. We achieved an F-measure of 38.9%, 17.7%, 35.4%, and 15.8% for Rouge-1, Rouge-2, Rouge-L, and Rouge-SU4, respectively. In addition, the proposed system with DUC 2002 dataset achieved an F-measure of 47.1%, 23.7%, 47.1%, 20.4% for Rouge-1, Rouge-2, Rouge-L, and Rouge-SU4, respectively. |
topic |
Natural language processing extractive text summarization multi-objective optimization maximum coverage and relevancy less redundancy |
url |
https://ieeexplore.ieee.org/document/9303358/ |
work_keys_str_mv |
AT ranaalqaisi extractivemultidocumentarabictextsummarizationusingevolutionarymultiobjectiveoptimizationwithkmedoidclustering AT waselghanem extractivemultidocumentarabictextsummarizationusingevolutionarymultiobjectiveoptimizationwithkmedoidclustering AT azizqaroush extractivemultidocumentarabictextsummarizationusingevolutionarymultiobjectiveoptimizationwithkmedoidclustering |
_version_ |
1724181979161690112 |