Analysis of K-Nearest Neighbor (KNN), Naive Bayes ands Decision Tree C4.5 Algorithm With Classification Method In Breast Cancer Using RapidMiner

Authors

Abstract

Breast cancer is cancer that forms in the cells of the breast. It is the most common cancer in women and the leading cause of cancer deaths in women worldwide. Breast cancer is usually divided into two types: benign, or usually called benign and malignant, or usually called malignant. Benign cancers are usually characterized by small, round, tender lumps. In the fields of medicine, finance, marketing, and social science, data mining is a popular tool for performing proven analysis. This study will compare K-Nearest Neighbor (KNN), Naive Bayes, and Decision Tree C4.5 approaches for classifying breast cancer. The problem of this research is which algorithm has a high level of accuracy that can be used with breast cancer datasets and can provide information about patterns or models for early detection of breast cancer. The results of the research conducted using CRISP-DM show that K-Nearest Neighbor (KNN) has the highest accuracy value with 97.14% and its AUC value is 0.976. The AUC value also showed excellent classification, with an AUC value between 0.90 and 1.00.

Author Biography

Muhammad Iqbal, University Pelita Bangsa

Department of Informatic Engineering

Downloads

Published

2024-08-14