Advancements in Phishing Website Detection: A Comprehensive Analysis of Machine Learning and Deep Learning Models

Mousavi S., Bahaghighat M., ÖZEN F.

32nd IEEE Conference on Signal Processing and Communications Applications, SIU 2024, Mersin, Türkiye, 15 - 18 Mayıs 2024, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Doi Numarası: 10.1109/siu61531.2024.10601036
Basıldığı Şehir: Mersin
Basıldığı Ülke: Türkiye
Anahtar Kelimeler: CNN, Deep Learning, LSTM, Machine Learning, MLP, Phishing website detection
Yıldız Teknik Üniversitesi Adresli: Evet

Özet

Today, machine learning (ML) and deep learning (DL) have become potent tools for detecting phishing websites. This paper has employed a bunch of ML algorithms such as Random Forest, Extra Tree, XGBoost, LGBM, and XGBoost-bagging, along with some proposed neural network architectures found on Multilayer Perceptron (MLP), Convolutional Neural Network (CNN), and Long Short-Term Memory (LSTM) for identifying website phishing. Implementing a comprehensive approach, we devised our model based on 56 proposed features extracted from each target website using Feature Importance. Then, we conducted five strategic steps to enhance the performance of ML and DL models. Through these steps, we improved DL model accuracy remarkably, notably elevating LSTM accuracy from the lowest accuracy of 96.18% to 98.78% with an AUC-ROC LSTM of about 99.85%.