Predicting Breast Cancer Survivability: A Comparison of Three Data Mining Methods

Source: Cihan University-Erbil Journal of Humanities and Social Sciences
Publisher: Cihan University-Erbil
Main author: Hussain, Omead I. (Author)
Volume/Issue: Vol. 4, No. 1
Peer-reviewed: Yes
Country: Iraq
Publication year: 2020
Pages: 17-30
ISSN: 2709-8648
MD number: 1431120
Content type: Research and articles
Language: English
Databases: EduSearch, HumanIndex
Author keywords:
Predicting Breast Cancer | Data Mining | SEER Database | Artificial Neural Network
Abstract: This study focuses on predicting breast cancer survivability using data mining and compares three main predictive modeling tools. Specifically, we used three popular data mining methods: two from machine learning (artificial neural network [ANN] and decision trees) and one from statistics (logistic regression). We aimed to choose the best model based on the efficiency of each model, the variables most effective for these models, and the most important common predictor. We defined three main modeling aims and used them to demonstrate the purpose of the modeling. Data mining makes it possible to characterize and describe the trends and patterns that reside in data and information. The dataset, taken from the SEER database, contained 87 variables and 457,389 records before preprocessing and 93 variables and 90,308 records afterward. We investigated more than three data mining techniques and finally chose to focus on ANN, decision trees, and logistic regression, using SAS Enterprise Miner 5.2, which in our view is a suitable system given its facilities and the results it gave us. Several experiments were conducted using these algorithms, and the resulting predictions were evaluated by comparing the techniques with one another. We found that the neural network performed much better than the other two techniques. Finally, the model we chose has the highest accuracy of the three, and specialists in the breast cancer field can use and depend on it.
