Int J Performability Eng ›› 2021, Vol. 17 ›› Issue (3): 263-275.doi: 10.23940/ijpe.21.03.p2.263275
• Original article • Previous Articles Next Articles
Tianhao Wanga,b, Peng Chenc, Tianjiazhi Baoc, Jiaheng Lid, and Xiaosheng Yuc,*()
Contact:
Yu Xiaosheng
E-mail:yuxiaosheng@ctgu.edu.cn
Supported by:
Tianhao Wang, Peng Chen, Tianjiazhi Bao, Jiaheng Li, and Xiaosheng Yu. Arrhythmia Classification Algorithm based on SMOTE and Feature Selection [J]. Int J Performability Eng, 2021, 17(3): 263-275.
Add to citation manager EndNote|Reference Manager|ProCite|BibTeX|RefWorks
Table 2.
Arrhythmia datasets"
Class No. | Arrhythmia types | Sample size |
---|---|---|
01 | Normal | 245 |
02 | Ischemic changes | 44 |
03 | Old Anterior Myocardial Infarction | 15 |
04 | Old Inferior Myocardial Infarction | 15 |
05 | Sinus tachycardy | 13 |
06 | Sinus bradycardy | 25 |
07 | Ventricular Premature Contraction | 3 |
08 | Supraventricular Premature Contraction | 2 |
09 | Left bundle branch block | 9 |
10 | Right bundle branch block | 50 |
11 | 1. degree AtrioVentricular block | 0 |
12 | 2. degree AV block | 0 |
13 | 3. degree AV block | 0 |
14 | Left ventricule hypertrophy | 4 |
15 | Atrial Fibrillation or Flutter | 5 |
16 | Others | 22 |
Table 6.
Accuracy comparison between the proposed algorithm and other algorithms under study"
Algorithm | Train-test split | Number of selected Features | Accuracy (%) |
---|---|---|---|
Proposed | 10-fold CV | 89 | 98.68 |
MLP NN | 3-fold CV | 88.24 | |
Filter+Wrapper+Knn | 20-fold CV | 60 | 73.80 |
Filter+Wrapper+SVM | 20-fold CV | 148 | 68.80 |
RF+SVM | 90-10 | 94 | 92.07 |
GA+C4.5 | 75-25 | — | 78.76 |
[1]. | YangB.F.and BenzhiC. Advances in the study of arrhythmogenic mechanisms. Journal of International Pharmaceutical Research, 37(2), pp.81-8, 2010. |
[2]. | DengL. and FuR. Heartbeat-based end-to-end classification of arrhythmias. Journal of Southern Medical University, 39(9), pp. 1071-1077, 2019. |
[3]. | GuvenirH.A., AcarB., DemirozG. and CekinA. A supervised machine learning algorithm for arrhythmia analysis. In Computers in Cardiology 1997, pp.433-436, 1997. |
[4]. | RautR.D.and DudulS.V. Arrhythmias classification with MLP neural network and statistical analysis. In 2008 First International Conference on Emerging Trends in Engineering and Technology, pp.553-558, 2008. |
[5]. | ZhaoY., HongW. and SunS. Cardiac arrhythmia classification based on multi-features and support vector machines. Journal of Biomedical Engineering, 28(2), pp.292-295, 2011. |
[6]. | MitraM. and SamantaR.K. Cardiac arrhythmia classification using neural networks with selected features. Procedia Technology, 10, pp.76-84, 2013. |
[7]. | NiaziK.A.K., KhanS.A., ShaukatA. and AkhtarM. Identifying best feature subset for cardiac arrhythmia classification. In 2015 Science and Information Conference ( SAI), pp.494-499, 2015. |
[8]. | MustaqeemA., AnwarS.M., MajidM. and KhanA.R. Wrapper method for feature selection to classify cardiac arrhythmia. In 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp.3656-3659, 2017. |
[9]. | MustaqeemA., AnwarS.M.and MajidM. Multiclass classification of cardiac arrhythmia using improved feature selection and SVM invariants. Computational and mathematical methods in medicine, 2018. |
[10]. | YangM., XiaoJ., and CaiH. Multicollinearity and its treatment in multivariate analysis. Chinese Journal of Health Statistics, 29(4), pp.620-624, 2012. |
[11]. | ZhaoN., ZhangX.F.and ZHANGL.J. Overview of imbalanced data classification. Computer Science, 45(S1), pp.22-27, 2018. |
[12]. | ChawlaN.V., BowyerK.W., HallL.O.and KegelmeyerW.P. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research, 16, pp.321-357, 2002. |
[13]. | LiY.X., ChaiY., HuY.Q.and YINH.P. Review of imbalanced data classification methods. Control and Decision, 34(4), pp.673-688, 2019. |
[14]. | ChawlaN.V., JapkowiczN. and KotczA. Special issue on learning from imbalanced data sets. ACM SIGKDD explorations newsletter, 6(1), pp.1-6, 2004. |
[15]. | ShiH.H., ChenY.W.and ChenX. Summary of research on SMOTE oversampling and its improved algorithms. CAAI transactions on intelligent systems, 14, pp. 1073-1083, 2019. |
[16]. | TibshiraniR. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological), 58(1), pp.267-288, 1996. |
[17]. | LiZ., DuJ., NieB., XiongW., HuangC. and LiH. Summary of feature selection methods. Computer Engineering and Applications, 55(24), pp.10-19, 2019. |
[18]. | MallowsC.L. Some comments on Cp. Technometrics, 42(1), pp.87-94, 2000. |
[19]. | YaoY. and CaiS. Study on wine evaluation based on LASSO regression. Journal of Yunnan Agricultural University, 31(2), pp.294-302, 2016. |
[20]. | BreimanL. Random forests. Machine learning, 45(1), pp.5-32, 2001. |
[21]. | GregoruttiB., MichelB. and Saint-PierreP. Correlation and variable importance in random forests. Statistics and Computing, 27(3), pp.659-678, 2017. |
[22]. | YaoD., YangJ., and ZhanX. Feature selection algorithm based on random forest. Journal of Jilin University (Engineering and Technology Edition), 44(1), pp.137-141, 2014. |
[23]. | ShiW., HuX. and YuK. K-part Lasso based on feature selection algorithm for high-dimensional data. Computer Engineering and Applications, 48(1), pp.157-161, 2012. |
[24]. | SvetnikV., LiawA., TongC. and WangT. Application of Breiman’s random forest to modeling structure-activity relationships of pharmaceutical molecules. InInternational Workshop on Multiple Classifier Systems, pp.334-343, 2004. |
[25]. | GenuerR., PoggiJ.M.and Tuleau-MalotC. VSURF: an R package for variable selection using random forests. The R Journal, 7(2), pp.19-33, 2015. |
[26]. | GuyonI., WestonJ., BarnhillS. and VapnikV. Gene selection for cancer classification using support vector machines. Machine learning, 46(1), pp.389-422, 2002. |
[27]. | WangX. and HuX. Overview on feature selection in high-dimensional and small-sample-size classification. Journal of Computer Applications, 37(9), pp.2433-2438, 2017. |
[28]. | LiawA. and WienerM. Classification and regression by randomForest. R news, 2(3), pp.18-22, 2002. |
[29]. | KraskovA., StögbauerH. and GrassbergerP. Estimating mutual information. Physical review E, 69(6), p.066138, 2004. |
[30]. | ChaudhuriA., DeK. and ChatterjeeD. October. A comparative study of kernels for the multi-class support vector machine. In 2008 Fourth International Conference on Natural Computation, 2, pp.3-7, 2008,. |
[31]. | PolatK. and GüneşS. A novel hybrid intelligent method based on C4. 5 decision tree classifier and one-against-all approach for multi-class classification problems. Expert Systems with Applications, 36(2), pp. 1587-1592, 2009. |
[32]. | AyarM. and SabamoniriS. An ECG-based feature selection and heartbeat classification model using a hybrid heuristic algorithm. Informatics in Medicine Unlocked, 13, pp.167-175, 2018. |
[1] | C. Rohith Bhat and Madhusundar Nelson. Artificial Intelligence Based Credit Card Fraud Detection for Online Transactions Optimized with Sparrow Search Algorithm [J]. Int J Performability Eng, 2023, 19(9): 624-632. |
[2] | Pranshu Kumar Soni and Leema Nelson. PCP: Profit-Driven Churn Prediction using Machine Learning Techniques in Banking Sector [J]. Int J Performability Eng, 2023, 19(5): 303-311. |
[3] | Harshita Batra and Leema Nelson. DCADS: Data-Driven Computer Aided Diagnostic System using Machine Learning Techniques for Polycystic Ovary Syndrome [J]. Int J Performability Eng, 2023, 19(3): 193-202. |
[4] | Priyanshu Verma, Ishan Sharma, Sonia Deshmukh, and Rohit Vashisht. Customer Churn Analysis using Spark and Hadoop [J]. Int J Performability Eng, 2023, 19(10): 663-675. |
[5] | Shobhanam Krishna and Sumati Sidharth. HR Analytics: Employee Attrition Analysis using Random Forest [J]. Int J Performability Eng, 2022, 18(4): 275-281. |
[6] | Lakshmi Kala Pampana, and Manjula Sri Rayudu. Multi-Class Classification of Retinal Abnormality using Machine Learning Algorithms [J]. Int J Performability Eng, 2022, 18(11): 826-832. |
[7] | C Chandana and G Parthasarathy. Efficient Machine Learning Regression Algorithm using Naïve Bayes Classifier for Crop Yield Prediction and Optimal Utilization of Fertilizer [J]. Int J Performability Eng, 2022, 18(1): 47-55. |
[8] | Ngan Tran, Haihua Chen, Janet Jiang, Jay Bhuyan, Junhua Ding. Effect of Class Imbalance on the Performance of Machine Learning-based Network Intrusion Detection [J]. Int J Performability Eng, 2021, 17(9): 741-755. |
[9] | Kavitha Ravindran, Srinivasan Rajkumar, and Kavitha Muthuvel. An Investigation on Cervical Cancer with Image Processing and Hybrid Classification [J]. Int J Performability Eng, 2021, 17(11): 918-925. |
[10] | Jiali Yang, Jiarui Chen, and Sheng Li. Decomposition and Identification of Non-Intrusive Load Equipment Group [J]. Int J Performability Eng, 2021, 17(1): 74-84. |
[11] | Fang Li, Yubin Qu, Junxia Ji, Dejun Zhang, and Long Li. Active Learning Empirical Research on Cross-Version Software Defect Prediction Datasets [J]. Int J Performability Eng, 2020, 16(4): 609-617. |
[12] | Yubin Qu, Fang Li, and Xiang Chen. LAL: Meta-Active Learning-based Software Defect Prediction [J]. Int J Performability Eng, 2020, 16(2): 203-213. |
[13] | Wenjie Li. Imbalanced Data Optimization Combining K-Means and SMOTE [J]. Int J Performability Eng, 2019, 15(8): 2173-2181. |
[14] | Xiao Chen, Li Han, Meng Leng, and Xiao Pan. Similarity based on the Importance of Common Features in Random Forest [J]. Int J Performability Eng, 2019, 15(4): 1171-1180. |
[15] | Wei Jiang, Jiayi Wu, and Chao Yao. Can Machine Automatically Discover Text Image from Overall Perspective [J]. Int J Performability Eng, 2019, 15(1): 281-287. |
|