Skip to main content

Research Repository

Advanced Search

Phishing website detection from URLs using classical machine learning ANN model

Salloum, S; Gaber, T; Vadera, S; Shaalan, K

Authors

S Salloum

T Gaber

K Shaalan



Contributors

J Garcia-Alfaro
Editor

S Li
Editor

R Poovendran
Editor

H Debar
Editor

M Yung
Editor

Abstract

Phishing is a serious form of online fraud made up of spoofed websites that attempt to gain users’ sensitive information by tricking them into believing that they are visiting a legitimate site. Phishing attacks can be detected many ways, including a user's awareness of fraud protection, blacklisting websites, analyzing the suspected characteristics, or comparing them to recent attempts that followed similar patterns. The purpose of this paper is to create classification models using features extracted from websites to study and classify phishing websites. In order to train the system, we use two datasets consisting of 58,645 and 88,647 URLs labeled as “Phishing” or “Legitimate”. A diverse range of machine learning models such as “XGBOOST, Support Vector Machine (SVM), Random Forest (RF), k-nearest neighbor (KNN), Artificial neural network (ANN), Logistic Regression (LR), Decision tree (DT), and Gaussian naïve Bayes (NB)” classifiers are evaluated. ANN provided the best performance with 97.63% accuracy for detecting phishing URLs in experiments. Such a study would be valuable to the scientific community, especially to researchers who work on phishing attack detection and prevention.

Citation

Salloum, S., Gaber, T., Vadera, S., & Shaalan, K. (2021). Phishing website detection from URLs using classical machine learning ANN model. Lecture notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering (Internet), 2, 509-523. https://doi.org/10.1007/978-3-030-90022-9_28

Journal Article Type Conference Paper
Conference Name International Conference on Security and Privacy in Communication Systems, SECURECOMM 2021
Conference Location Online
End Date Sep 9, 2021
Online Publication Date Nov 4, 2021
Publication Date Dec 24, 2021
Deposit Date Feb 9, 2022
Journal Security and Privacy in Communication Networks : 17th EAI International Conference, SecureComm 2021, Virtual Event, September 6–9, 2021, Proceedings, Part II
Print ISSN 1867-8211
Electronic ISSN 1867-822X
Volume 2
Pages 509-523
Series Title Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
Series Number 399
Book Title Security and Privacy in Communication Networks : 17th EAI International Conference, SecureComm 2021, Virtual Event, September 6–9, 2021, Proceedings
ISBN 9783030900212-(paperback);-9783030900229-(ebook)
DOI https://doi.org/10.1007/978-3-030-90022-9_28
Publisher URL https://doi.org/10.1007/978-3-030-90022-9_28
Related Public URLs https://doi.org/10.1007/978-3-030-90022-9
Additional Information Event Type : Conference