Classification of Software Engineering Documents Based on Artificial Immune Systems

Care about automated documents classification has increased since the appearance of the digital documents and the wide diffusion of Internet. In the 1990's, the computer performance has greatly improved and has led to the methods of machine learning to establish automated classifiers. These met...

Full description

Bibliographic Details
Main Authors: Nada Saleem, Rasha Saeed
Format: Article
Language:Arabic
Published: Mosul University 2013-09-01
Series:Al-Rafidain Journal of Computer Sciences and Mathematics
Subjects:
Online Access:https://csmj.mosuljournals.com/article_163538_c8073393aec4ca660427a3715374f9bc.pdf
id doaj-31f7e8e6c72a478f9937d9bb9b3b5c7f
record_format Article
spelling doaj-31f7e8e6c72a478f9937d9bb9b3b5c7f2020-11-25T04:07:52ZaraMosul UniversityAl-Rafidain Journal of Computer Sciences and Mathematics 1815-48162311-79902013-09-011039111210.33899/csmj.2013.163538163538Classification of Software Engineering Documents Based on Artificial Immune SystemsNada Saleem0Rasha Saeed1College of Computer Sciences and Mathematics University of Mosul, Mosul, IraqCollege of Computer Sciences and Mathematics University of Mosul, Mosul, IraqCare about automated documents classification has increased since the appearance of the digital documents and the wide diffusion of Internet. In the 1990's, the computer performance has greatly improved and has led to the methods of machine learning to establish automated classifiers. These methods have achieved good speed and classification's accuracy and researchers still investigate in this field to accomplish more accuracy and less time. Artificial immunologic systems have shown high performance in such as data clustering and anomaly detection which can be ascribed to the nature of the immunologic system in protecting the body. Some of the present methods and ways used in the training process of the document classification are time consuming and others have less accuracy rate concerned the classification of the related document as software engineering document classes For these reasons, this research deals with the study of Natural Immune System and using the dynamic process of the Innate Immune System work based on danger theory and Dendritic Cell (DC) technique  in building Dendritic Cell Model (DCM) to classify Software Engineering documents as they comprise information related to developing the software systems, that makes it easy for the software engineer who works in maintenance. DCM has high classification's speed and accuracy besides easy and flexible use by designing interfaces that makes it easy for the user to deal with the system. In order to improve the quality and the efficiency of DCM method, it was compared to one of the best and well-known methods of classification referred to as, Naive Bayes(NB). After conducting several experiments on a various group of software engineering documents, evaluations results have shown that the accuracy of the innate immunologic method (DCM) has reached (DCM) (95%), whereas Naïve classification method has reached (90 %) with training and classification speed that doesn’t exceed one minutes. This shows the feasibility of using the algorithms of AIS systems in the field of information recovery and documents classification. This system was built and programmed in Java language and was implemented under an operating system environment Microsoft Windows7.https://csmj.mosuljournals.com/article_163538_c8073393aec4ca660427a3715374f9bc.pdfartificial immune systemdanger theorydendritic cellsoftware engineering documentsdocument classification
collection DOAJ
language Arabic
format Article
sources DOAJ
author Nada Saleem
Rasha Saeed
spellingShingle Nada Saleem
Rasha Saeed
Classification of Software Engineering Documents Based on Artificial Immune Systems
Al-Rafidain Journal of Computer Sciences and Mathematics
artificial immune system
danger theory
dendritic cell
software engineering documents
document classification
author_facet Nada Saleem
Rasha Saeed
author_sort Nada Saleem
title Classification of Software Engineering Documents Based on Artificial Immune Systems
title_short Classification of Software Engineering Documents Based on Artificial Immune Systems
title_full Classification of Software Engineering Documents Based on Artificial Immune Systems
title_fullStr Classification of Software Engineering Documents Based on Artificial Immune Systems
title_full_unstemmed Classification of Software Engineering Documents Based on Artificial Immune Systems
title_sort classification of software engineering documents based on artificial immune systems
publisher Mosul University
series Al-Rafidain Journal of Computer Sciences and Mathematics
issn 1815-4816
2311-7990
publishDate 2013-09-01
description Care about automated documents classification has increased since the appearance of the digital documents and the wide diffusion of Internet. In the 1990's, the computer performance has greatly improved and has led to the methods of machine learning to establish automated classifiers. These methods have achieved good speed and classification's accuracy and researchers still investigate in this field to accomplish more accuracy and less time. Artificial immunologic systems have shown high performance in such as data clustering and anomaly detection which can be ascribed to the nature of the immunologic system in protecting the body. Some of the present methods and ways used in the training process of the document classification are time consuming and others have less accuracy rate concerned the classification of the related document as software engineering document classes For these reasons, this research deals with the study of Natural Immune System and using the dynamic process of the Innate Immune System work based on danger theory and Dendritic Cell (DC) technique  in building Dendritic Cell Model (DCM) to classify Software Engineering documents as they comprise information related to developing the software systems, that makes it easy for the software engineer who works in maintenance. DCM has high classification's speed and accuracy besides easy and flexible use by designing interfaces that makes it easy for the user to deal with the system. In order to improve the quality and the efficiency of DCM method, it was compared to one of the best and well-known methods of classification referred to as, Naive Bayes(NB). After conducting several experiments on a various group of software engineering documents, evaluations results have shown that the accuracy of the innate immunologic method (DCM) has reached (DCM) (95%), whereas Naïve classification method has reached (90 %) with training and classification speed that doesn’t exceed one minutes. This shows the feasibility of using the algorithms of AIS systems in the field of information recovery and documents classification. This system was built and programmed in Java language and was implemented under an operating system environment Microsoft Windows7.
topic artificial immune system
danger theory
dendritic cell
software engineering documents
document classification
url https://csmj.mosuljournals.com/article_163538_c8073393aec4ca660427a3715374f9bc.pdf
work_keys_str_mv AT nadasaleem classificationofsoftwareengineeringdocumentsbasedonartificialimmunesystems
AT rashasaeed classificationofsoftwareengineeringdocumentsbasedonartificialimmunesystems
_version_ 1724427577153552384