Missing Value Estimation Methods Research for Arrhythmia Classification Using the Modified Kernel Difference-Weighted KNN Algorithms

Electrocardiogram (ECG) signal is critical to the classification of cardiac arrhythmia using some machine learning methods. In practice, the ECG datasets are usually with multiple missing values due to faults or distortion. Unfortunately, many established algorithms for classification require a full...

Full description

Bibliographic Details
Main Authors: Fei Yang, Jiazhi Du, Jiying Lang, Weigang Lu, Lei Liu, Changlong Jin, Qinma Kang
Format: Article
Language:English
Published: Hindawi Limited 2020-01-01
Series:BioMed Research International
Online Access:http://dx.doi.org/10.1155/2020/7141725
id doaj-02a5d65f8a644365b18d71f18d1c98ec
record_format Article
spelling doaj-02a5d65f8a644365b18d71f18d1c98ec2020-11-25T03:25:10ZengHindawi LimitedBioMed Research International2314-61332314-61412020-01-01202010.1155/2020/71417257141725Missing Value Estimation Methods Research for Arrhythmia Classification Using the Modified Kernel Difference-Weighted KNN AlgorithmsFei Yang0Jiazhi Du1Jiying Lang2Weigang Lu3Lei Liu4Changlong Jin5Qinma Kang6School of Computer Science and Technology, Shandong University, Qingdao, ChinaSchool of Computer Science and Technology, Harbin Institute of Technology, Harbin, ChinaSchool of Mechanical, Electrical and Information Engineering, Shandong University, Weihai, ChinaDepartment of Educational Technology, Ocean University of China, Qingdao, ChinaBeijing Institute of New Technology Applications, Beijing Academy of Science and Technology, Beijing, ChinaSchool of Mechanical, Electrical and Information Engineering, Shandong University, Weihai, ChinaSchool of Mechanical, Electrical and Information Engineering, Shandong University, Weihai, ChinaElectrocardiogram (ECG) signal is critical to the classification of cardiac arrhythmia using some machine learning methods. In practice, the ECG datasets are usually with multiple missing values due to faults or distortion. Unfortunately, many established algorithms for classification require a fully complete matrix as input. Thus it is necessary to impute the missing data to increase the effectiveness of classification for datasets with a few missing values. In this paper, we compare the main methods for estimating the missing values in electrocardiogram data, e.g., the “Zero method”, “Mean method”, “PCA-based method”, and “RPCA-based method” and then propose a novel KNN-based classification algorithm, i.e., a modified kernel Difference-Weighted KNN classifier (MKDF-WKNN), which is fit for the classification of imbalance datasets. The experimental results on the UCI database indicate that the “RPCA-based method” can successfully handle missing values in arrhythmia dataset no matter how many values in it are missing and our proposed classification algorithm, MKDF-WKNN, is superior to other state-of-the-art algorithms like KNN, DS-WKNN, DF-WKNN, and KDF-WKNN for uneven datasets which impacts the accuracy of classification.http://dx.doi.org/10.1155/2020/7141725
collection DOAJ
language English
format Article
sources DOAJ
author Fei Yang
Jiazhi Du
Jiying Lang
Weigang Lu
Lei Liu
Changlong Jin
Qinma Kang
spellingShingle Fei Yang
Jiazhi Du
Jiying Lang
Weigang Lu
Lei Liu
Changlong Jin
Qinma Kang
Missing Value Estimation Methods Research for Arrhythmia Classification Using the Modified Kernel Difference-Weighted KNN Algorithms
BioMed Research International
author_facet Fei Yang
Jiazhi Du
Jiying Lang
Weigang Lu
Lei Liu
Changlong Jin
Qinma Kang
author_sort Fei Yang
title Missing Value Estimation Methods Research for Arrhythmia Classification Using the Modified Kernel Difference-Weighted KNN Algorithms
title_short Missing Value Estimation Methods Research for Arrhythmia Classification Using the Modified Kernel Difference-Weighted KNN Algorithms
title_full Missing Value Estimation Methods Research for Arrhythmia Classification Using the Modified Kernel Difference-Weighted KNN Algorithms
title_fullStr Missing Value Estimation Methods Research for Arrhythmia Classification Using the Modified Kernel Difference-Weighted KNN Algorithms
title_full_unstemmed Missing Value Estimation Methods Research for Arrhythmia Classification Using the Modified Kernel Difference-Weighted KNN Algorithms
title_sort missing value estimation methods research for arrhythmia classification using the modified kernel difference-weighted knn algorithms
publisher Hindawi Limited
series BioMed Research International
issn 2314-6133
2314-6141
publishDate 2020-01-01
description Electrocardiogram (ECG) signal is critical to the classification of cardiac arrhythmia using some machine learning methods. In practice, the ECG datasets are usually with multiple missing values due to faults or distortion. Unfortunately, many established algorithms for classification require a fully complete matrix as input. Thus it is necessary to impute the missing data to increase the effectiveness of classification for datasets with a few missing values. In this paper, we compare the main methods for estimating the missing values in electrocardiogram data, e.g., the “Zero method”, “Mean method”, “PCA-based method”, and “RPCA-based method” and then propose a novel KNN-based classification algorithm, i.e., a modified kernel Difference-Weighted KNN classifier (MKDF-WKNN), which is fit for the classification of imbalance datasets. The experimental results on the UCI database indicate that the “RPCA-based method” can successfully handle missing values in arrhythmia dataset no matter how many values in it are missing and our proposed classification algorithm, MKDF-WKNN, is superior to other state-of-the-art algorithms like KNN, DS-WKNN, DF-WKNN, and KDF-WKNN for uneven datasets which impacts the accuracy of classification.
url http://dx.doi.org/10.1155/2020/7141725
work_keys_str_mv AT feiyang missingvalueestimationmethodsresearchforarrhythmiaclassificationusingthemodifiedkerneldifferenceweightedknnalgorithms
AT jiazhidu missingvalueestimationmethodsresearchforarrhythmiaclassificationusingthemodifiedkerneldifferenceweightedknnalgorithms
AT jiyinglang missingvalueestimationmethodsresearchforarrhythmiaclassificationusingthemodifiedkerneldifferenceweightedknnalgorithms
AT weiganglu missingvalueestimationmethodsresearchforarrhythmiaclassificationusingthemodifiedkerneldifferenceweightedknnalgorithms
AT leiliu missingvalueestimationmethodsresearchforarrhythmiaclassificationusingthemodifiedkerneldifferenceweightedknnalgorithms
AT changlongjin missingvalueestimationmethodsresearchforarrhythmiaclassificationusingthemodifiedkerneldifferenceweightedknnalgorithms
AT qinmakang missingvalueestimationmethodsresearchforarrhythmiaclassificationusingthemodifiedkerneldifferenceweightedknnalgorithms
_version_ 1715221385763094528