Advancements in Phishing Website Detection: A Comprehensive Analysis of Machine Learning and Deep Learning Models


Mousavi S., Bahaghighat M., ÖZEN F.

32nd IEEE Conference on Signal Processing and Communications Applications, SIU 2024, Mersin, Turkey, 15 - 18 May 2024, (Full Text) identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/siu61531.2024.10601036
  • City: Mersin
  • Country: Turkey
  • Keywords: CNN, Deep Learning, LSTM, Machine Learning, MLP, Phishing website detection
  • Yıldız Technical University Affiliated: Yes

Abstract

Today, machine learning (ML) and deep learning (DL) have become potent tools for detecting phishing websites. This paper has employed a bunch of ML algorithms such as Random Forest, Extra Tree, XGBoost, LGBM, and XGBoost-bagging, along with some proposed neural network architectures found on Multilayer Perceptron (MLP), Convolutional Neural Network (CNN), and Long Short-Term Memory (LSTM) for identifying website phishing. Implementing a comprehensive approach, we devised our model based on 56 proposed features extracted from each target website using Feature Importance. Then, we conducted five strategic steps to enhance the performance of ML and DL models. Through these steps, we improved DL model accuracy remarkably, notably elevating LSTM accuracy from the lowest accuracy of 96.18% to 98.78% with an AUC-ROC LSTM of about 99.85%.