Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering

The increasing usage of the Internet and social networks has produced a significant amount of online textual data. These online textual data led to information overload and redundancy. It is important to eliminate the information redundancy and preserve the time required for reading these online tex...

Full description

Bibliographic Details
Main Authors: Rana Alqaisi, Wasel Ghanem, Aziz Qaroush
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9303358/
id doaj-68f2698fd810493fa3b87a3993d9f18a
record_format Article
spelling doaj-68f2698fd810493fa3b87a3993d9f18a2021-03-30T04:20:23ZengIEEEIEEE Access2169-35362020-01-01822820622822410.1109/ACCESS.2020.30464949303358Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid ClusteringRana Alqaisi0https://orcid.org/0000-0003-4298-803XWasel Ghanem1https://orcid.org/0000-0003-2409-4729Aziz Qaroush2https://orcid.org/0000-0002-0274-4121Department of Electrical and Computer Engineering, Birzeit University, Birzeit, PalestineDepartment of Electrical and Computer Engineering, Birzeit University, Birzeit, PalestineDepartment of Electrical and Computer Engineering, Birzeit University, Birzeit, PalestineThe increasing usage of the Internet and social networks has produced a significant amount of online textual data. These online textual data led to information overload and redundancy. It is important to eliminate the information redundancy and preserve the time required for reading these online textual data. Thus, there is a persistent need for an automatic text summarization system, which extract the relevant and salient information from a collection of documents, that sharing the same or related topics. Then, presenting this extracted information in a condensed form to preserve the main topics. This paper proposes an automatic, generic, and extractive Arabic multi-document summarization system. The proposed system employs the clustering-based and evolutionary multi-objective optimization methods. The clustering-based method discovers the main topics in the text, while the evolutionary multi-objective optimization method optimizes three objectives based on coverage, diversity/redundancy, and relevancy. The performance of the proposed system is evaluated using TAC 2011 and DUC 2002 datasets. The experimental results are compared using ROUGE evaluation measure. The obtained results showed the effectiveness of the proposed system compared to other peer systems. The proposed system outperformed other peer systems for all ROUGE metrics using TAC 2011. We achieved an F-measure of 38.9%, 17.7%, 35.4%, and 15.8% for Rouge-1, Rouge-2, Rouge-L, and Rouge-SU4, respectively. In addition, the proposed system with DUC 2002 dataset achieved an F-measure of 47.1%, 23.7%, 47.1%, 20.4% for Rouge-1, Rouge-2, Rouge-L, and Rouge-SU4, respectively.https://ieeexplore.ieee.org/document/9303358/Natural language processingextractive text summarizationmulti-objective optimizationmaximum coverage and relevancyless redundancy
collection DOAJ
language English
format Article
sources DOAJ
author Rana Alqaisi
Wasel Ghanem
Aziz Qaroush
spellingShingle Rana Alqaisi
Wasel Ghanem
Aziz Qaroush
Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering
IEEE Access
Natural language processing
extractive text summarization
multi-objective optimization
maximum coverage and relevancy
less redundancy
author_facet Rana Alqaisi
Wasel Ghanem
Aziz Qaroush
author_sort Rana Alqaisi
title Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering
title_short Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering
title_full Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering
title_fullStr Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering
title_full_unstemmed Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering
title_sort extractive multi-document arabic text summarization using evolutionary multi-objective optimization with k-medoid clustering
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description The increasing usage of the Internet and social networks has produced a significant amount of online textual data. These online textual data led to information overload and redundancy. It is important to eliminate the information redundancy and preserve the time required for reading these online textual data. Thus, there is a persistent need for an automatic text summarization system, which extract the relevant and salient information from a collection of documents, that sharing the same or related topics. Then, presenting this extracted information in a condensed form to preserve the main topics. This paper proposes an automatic, generic, and extractive Arabic multi-document summarization system. The proposed system employs the clustering-based and evolutionary multi-objective optimization methods. The clustering-based method discovers the main topics in the text, while the evolutionary multi-objective optimization method optimizes three objectives based on coverage, diversity/redundancy, and relevancy. The performance of the proposed system is evaluated using TAC 2011 and DUC 2002 datasets. The experimental results are compared using ROUGE evaluation measure. The obtained results showed the effectiveness of the proposed system compared to other peer systems. The proposed system outperformed other peer systems for all ROUGE metrics using TAC 2011. We achieved an F-measure of 38.9%, 17.7%, 35.4%, and 15.8% for Rouge-1, Rouge-2, Rouge-L, and Rouge-SU4, respectively. In addition, the proposed system with DUC 2002 dataset achieved an F-measure of 47.1%, 23.7%, 47.1%, 20.4% for Rouge-1, Rouge-2, Rouge-L, and Rouge-SU4, respectively.
topic Natural language processing
extractive text summarization
multi-objective optimization
maximum coverage and relevancy
less redundancy
url https://ieeexplore.ieee.org/document/9303358/
work_keys_str_mv AT ranaalqaisi extractivemultidocumentarabictextsummarizationusingevolutionarymultiobjectiveoptimizationwithkmedoidclustering
AT waselghanem extractivemultidocumentarabictextsummarizationusingevolutionarymultiobjectiveoptimizationwithkmedoidclustering
AT azizqaroush extractivemultidocumentarabictextsummarizationusingevolutionarymultiobjectiveoptimizationwithkmedoidclustering
_version_ 1724181979161690112