Advancements in Phishing Website Detection: A Comprehensive Analysis of Machine Learning and Deep Learning Models


Mousavi S., Bahaghighat M., ÖZEN F.

32nd IEEE Conference on Signal Processing and Communications Applications, SIU 2024, Mersin, Türkiye, 15 - 18 Mayıs 2024, (Tam Metin Bildiri) identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Doi Numarası: 10.1109/siu61531.2024.10601036
  • Basıldığı Şehir: Mersin
  • Basıldığı Ülke: Türkiye
  • Anahtar Kelimeler: CNN, Deep Learning, LSTM, Machine Learning, MLP, Phishing website detection
  • Yıldız Teknik Üniversitesi Adresli: Evet

Özet

Today, machine learning (ML) and deep learning (DL) have become potent tools for detecting phishing websites. This paper has employed a bunch of ML algorithms such as Random Forest, Extra Tree, XGBoost, LGBM, and XGBoost-bagging, along with some proposed neural network architectures found on Multilayer Perceptron (MLP), Convolutional Neural Network (CNN), and Long Short-Term Memory (LSTM) for identifying website phishing. Implementing a comprehensive approach, we devised our model based on 56 proposed features extracted from each target website using Feature Importance. Then, we conducted five strategic steps to enhance the performance of ML and DL models. Through these steps, we improved DL model accuracy remarkably, notably elevating LSTM accuracy from the lowest accuracy of 96.18% to 98.78% with an AUC-ROC LSTM of about 99.85%.