An Efficient and Unique TF/IDF Algorithmic Model-Based Data Analysis for Handling Applications with Big Data Streaming
As the field of data science grows, document analytics has become a more challenging task for rough classification, response analysis, and text summarization. These tasks are used for the analysis of text data from various intelligent sensing systems. The conventional approach for data analytics and...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2019-11-01
|
Series: | Electronics |
Subjects: | |
Online Access: | https://www.mdpi.com/2079-9292/8/11/1331 |
id |
doaj-0d5c99ca1dee4cbc9c452cf989640542 |
---|---|
record_format |
Article |
spelling |
doaj-0d5c99ca1dee4cbc9c452cf9896405422020-11-25T01:50:57ZengMDPI AGElectronics2079-92922019-11-01811133110.3390/electronics8111331electronics8111331An Efficient and Unique TF/IDF Algorithmic Model-Based Data Analysis for Handling Applications with Big Data StreamingCelestine Iwendi0Suresh Ponnan1Revathi Munirathinam2Kathiravan Srinivasan3Chuan-Yu Chang4Department of Electronics, Bangor College of Central South University of Forestry and Technology, Changsha, 410004 ChinaDepartment of ECE, Vel Tech Rangarajan Dr. Sagunthala R & D Institute of Science and Technology, Avadi 600062, IndiaEngineering Division, Scientific Society, Tiruvannamalai 606805, IndiaSchool of Information Technology and Engineering, Vellore Institute of Technology, Vellore 632 014, IndiaDepartment of Computer Science and Information Engineering, National Yunlin University of Science and Technology, Yunlin 64002, TaiwanAs the field of data science grows, document analytics has become a more challenging task for rough classification, response analysis, and text summarization. These tasks are used for the analysis of text data from various intelligent sensing systems. The conventional approach for data analytics and text processing is not useful for big data coming from intelligent systems. This work proposes a novel TF/IDF algorithm with the temporal Louvain approach to solve the above problem. Such an approach is supposed to help the categorization of documents into hierarchical structures showing the relationship between variables, which is a boon to analysts making essential decisions. This paper used public corpora, such as Reuters-21578 and 20 Newsgroups for massive-data analytic experimentation. The result shows the efficacy of the proposed algorithm in terms of accuracy and execution time across six datasets. The proposed approach is validated to bring value to big text data analysis. Big data handling with map-reduce has led to tremendous growth and support for tasks like categorization, sentiment analysis, and higher-quality accuracy from the input data. Outperforming the state-of-the-art approach in terms of accuracy and execution time for six datasets ensures proper validation.https://www.mdpi.com/2079-9292/8/11/1331big data analyticsdocument gatheringefficiencyhierarchical structural categoriesdata fusionintelligent systems |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Celestine Iwendi Suresh Ponnan Revathi Munirathinam Kathiravan Srinivasan Chuan-Yu Chang |
spellingShingle |
Celestine Iwendi Suresh Ponnan Revathi Munirathinam Kathiravan Srinivasan Chuan-Yu Chang An Efficient and Unique TF/IDF Algorithmic Model-Based Data Analysis for Handling Applications with Big Data Streaming Electronics big data analytics document gathering efficiency hierarchical structural categories data fusion intelligent systems |
author_facet |
Celestine Iwendi Suresh Ponnan Revathi Munirathinam Kathiravan Srinivasan Chuan-Yu Chang |
author_sort |
Celestine Iwendi |
title |
An Efficient and Unique TF/IDF Algorithmic Model-Based Data Analysis for Handling Applications with Big Data Streaming |
title_short |
An Efficient and Unique TF/IDF Algorithmic Model-Based Data Analysis for Handling Applications with Big Data Streaming |
title_full |
An Efficient and Unique TF/IDF Algorithmic Model-Based Data Analysis for Handling Applications with Big Data Streaming |
title_fullStr |
An Efficient and Unique TF/IDF Algorithmic Model-Based Data Analysis for Handling Applications with Big Data Streaming |
title_full_unstemmed |
An Efficient and Unique TF/IDF Algorithmic Model-Based Data Analysis for Handling Applications with Big Data Streaming |
title_sort |
efficient and unique tf/idf algorithmic model-based data analysis for handling applications with big data streaming |
publisher |
MDPI AG |
series |
Electronics |
issn |
2079-9292 |
publishDate |
2019-11-01 |
description |
As the field of data science grows, document analytics has become a more challenging task for rough classification, response analysis, and text summarization. These tasks are used for the analysis of text data from various intelligent sensing systems. The conventional approach for data analytics and text processing is not useful for big data coming from intelligent systems. This work proposes a novel TF/IDF algorithm with the temporal Louvain approach to solve the above problem. Such an approach is supposed to help the categorization of documents into hierarchical structures showing the relationship between variables, which is a boon to analysts making essential decisions. This paper used public corpora, such as Reuters-21578 and 20 Newsgroups for massive-data analytic experimentation. The result shows the efficacy of the proposed algorithm in terms of accuracy and execution time across six datasets. The proposed approach is validated to bring value to big text data analysis. Big data handling with map-reduce has led to tremendous growth and support for tasks like categorization, sentiment analysis, and higher-quality accuracy from the input data. Outperforming the state-of-the-art approach in terms of accuracy and execution time for six datasets ensures proper validation. |
topic |
big data analytics document gathering efficiency hierarchical structural categories data fusion intelligent systems |
url |
https://www.mdpi.com/2079-9292/8/11/1331 |
work_keys_str_mv |
AT celestineiwendi anefficientanduniquetfidfalgorithmicmodelbaseddataanalysisforhandlingapplicationswithbigdatastreaming AT sureshponnan anefficientanduniquetfidfalgorithmicmodelbaseddataanalysisforhandlingapplicationswithbigdatastreaming AT revathimunirathinam anefficientanduniquetfidfalgorithmicmodelbaseddataanalysisforhandlingapplicationswithbigdatastreaming AT kathiravansrinivasan anefficientanduniquetfidfalgorithmicmodelbaseddataanalysisforhandlingapplicationswithbigdatastreaming AT chuanyuchang anefficientanduniquetfidfalgorithmicmodelbaseddataanalysisforhandlingapplicationswithbigdatastreaming AT celestineiwendi efficientanduniquetfidfalgorithmicmodelbaseddataanalysisforhandlingapplicationswithbigdatastreaming AT sureshponnan efficientanduniquetfidfalgorithmicmodelbaseddataanalysisforhandlingapplicationswithbigdatastreaming AT revathimunirathinam efficientanduniquetfidfalgorithmicmodelbaseddataanalysisforhandlingapplicationswithbigdatastreaming AT kathiravansrinivasan efficientanduniquetfidfalgorithmicmodelbaseddataanalysisforhandlingapplicationswithbigdatastreaming AT chuanyuchang efficientanduniquetfidfalgorithmicmodelbaseddataanalysisforhandlingapplicationswithbigdatastreaming |
_version_ |
1724999305074311168 |