CFP last date
20 May 2025
Reseach Article

Phishing URL Attack Detection using Logistic Regression and Convolutional Neural Network

by Umejuru Daniel, Eke Bartholomew, Fubara Egbono
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 187 - Number 1
Year of Publication: 2025
Authors: Umejuru Daniel, Eke Bartholomew, Fubara Egbono
10.5120/ijca2025924611

Umejuru Daniel, Eke Bartholomew, Fubara Egbono . Phishing URL Attack Detection using Logistic Regression and Convolutional Neural Network. International Journal of Computer Applications. 187, 1 ( May 2025), 8-14. DOI=10.5120/ijca2025924611

@article{ 10.5120/ijca2025924611,
author = { Umejuru Daniel, Eke Bartholomew, Fubara Egbono },
title = { Phishing URL Attack Detection using Logistic Regression and Convolutional Neural Network },
journal = { International Journal of Computer Applications },
issue_date = { May 2025 },
volume = { 187 },
number = { 1 },
month = { May },
year = { 2025 },
issn = { 0975-8887 },
pages = { 8-14 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume187/number1/phishing-url-attack-detection-using-logistic-regression-and-convolutional-neural-network/ },
doi = { 10.5120/ijca2025924611 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2025-05-17T02:45:23.241104+05:30
%A Umejuru Daniel
%A Eke Bartholomew
%A Fubara Egbono
%T Phishing URL Attack Detection using Logistic Regression and Convolutional Neural Network
%J International Journal of Computer Applications
%@ 0975-8887
%V 187
%N 1
%P 8-14
%D 2025
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Phishing is a problem that is quickly spreading worldwide and costs internet users billions of dollars each year. It is illegal to gather sensitive information from internet users using social engineering techniques combined with technology. The overall performance is unreliable, inefficient, and requiring improvement in terms of prediction accuracy, time complexity, misclassification error and robustness. Phishing strategies can be recognized in a variety of communication channels, including email, instant chats, pop-up messages, and web pages itself. Over time, existing methods and approaches have been unable to detect all connected dangers and provide an all-encompassing solution. Although it is widely believed that a successful phishing attack entails developing a website that looks exactly like the target site in order to trick the internet user, this idea has not been incorporated into existing approaches to assess the dangers and thoroughly analyze the gaps. In this study, the aim is to create an enhanced Phishing attack detection system utilizing logistic regression and Convolutional Neural Network (CNN). This study outlined a novel method capable of identifying malicious phishing URLs with an emphasis on using features primarily obtained from the phishing and real URL addresses. The model kernel, weights, and bias values were tuned with a penalty term which increased model detection accuracy. The experimental findings show that CNN performed better incorporating penalty term which recorded a detection accuracy of 98.20% and LR yielded a recommendable prediction accuracy of 89.85%.

References
  1. Iwendi, C. Jalil, Z. Javed, A. R. Reddy, T. Kaluri, R., and Srivastava, G. J. O. (2020) Keysplitwatermark: Zero watermarking algorithm for software protection against cyber-attacks. IEEE Access, 8, 72650–72660. doi: 10.1109/ACCESS.2020.2988160
  2. Kumar, A., Chatterjee, J. M., & Díaz, V. G. (2020). A novel hybrid approach of svm combined with nlp and probabilistic neural network for email phishing. International Journal of Electrical and Computer Engineering, 10(1), 486.
  3. Safi, A. and Singh, S.(2023), A systematic literature review on phishing website detection techniques, Journal of King Saud University – Computer and Information Sciences, 35(2), 591-611.
  4. Basit, A., Zafar, M., Liu, X., Javed, A.R., Jalil, Z., Kifayat, K., (2021). A comprehensive survey of AI-enabled phishing attacks detection techniques. Telecommun. Syst. 76(1), 139–154. https://doi.org/10.1007/s11235-020-00733-2.
  5. Javed, A. R., Jalil, Z., Moqurrab, S. A., Abbas, S., and Liu, X. (2020), Ensemble adaboost classifier for accurate and fast detection of botnet attacks in connected vehicles, Transactions on Emerging Telecommunications Technologies, 45.
  6. Jain, A. K., Parashar, S., Katare, P., & Sharma, I. (2020). Phishskape: A content based approach to escape phishing attacks. Procedia Computer Science, 171, 1102–1109.
  7. Mittal, M., Iwendi, C., Khan, S., and Rehman-Javed, A. (2020). Analysis of security and energy efficiency for shortest route discovery in low-energy adaptive clustering hierarchy protocol using Levenberg–Marquardt neural network and gated recurrent unit for intrusion detection system. Transactions on Emerging Telecommunications Technologies, p. e3997.
  8. Fu, A. Y., Liu, W. & Deng, X. T.(2021). Detecting Phishing web Pages with Visual Similarity Assessment based on Earth Mover’s Distance (EMD), IEEE Transactions on Dependable and Secure Computing, 3(4), 301-311.
  9. Liu, G., Qiu, B. & Wenyin, L. (2020) Automation of Phishing Target from Phishing Web-pages, International Conference on Pattern Recognition, 50, 4153-4156.
  10. Zhu, E., Ju, Y., Chen, Z., Liu, F., Fang, X.,( 2020). DTOF-ANN: an artificial neural network phishing detection model based on decision tree and optimal features. Appl. Soft Comput. J. 95,. https://doi.org/10.1016/j.asoc.2020.106505 106505.
  11. Al-Sarem, M., Saeed, F., Al-Mekhlafi, Z.G., Mohammed, B.A., Al-Hadhrami, T., Alshammari, M.T., & Alshammari, T.S. (2021). An Optimized Stacking Ensemble Model for Phishing Websites Detection. Electronics, 10(11), 1285.
  12. Barlow, L., Bendiab, G., Shiaeles, S. and Savage, N.(2020) A Novel Approach to Detect Phishing Attacks using Binary Visualisation and Machine Learning,” in Proceedings - 2020 IEEE World Congress on Services, SERVICES, 177–182.
  13. Liu, X., and Fu, J.,(2020). SPWalk: Similar Property Oriented Feature Learning for Phishing Detection. IEEE Access 8, 87031–87045. https://doi.org/10.1109/ ACCESS, 2992381.
  14. Chiew, K.L., Tan, C.L., Wong, K., Yong, K.S., and Tiong, W.K. (2019). A new hybrid ensemble feature selection framework for machine learning-based phishing detection system. Information Sciences, 484, 153-166.
  15. Hong J., Kim T., Liu J., Park N., Kim S. W. (2021) Phishing URL Detection with Lexical Features and Blacklisted Domains, Autonomous Secure Cyber Systems. Springer, 1-12
  16. Whittaker, C., Ryner, B. and Nazif, M.(2020), Large-Scale Automatic Classification of Phishing Pages., Conference: Proceedings of the Network and Distributed System Security Symposium, NDSS 2010, San Diego, California, USA,1- 20
  17. Basnet, R. B., Mukkamala, S. and Sung, A. H.(2021). Detection of Phishing Attacks: A Machine Learning Approach. In Soft Computing Applications in Industry: Studies in Fuzziness and Soft Computing, 226(10), 373–383.
  18. Afroz, A. and Greenstadt,R.(2020) PhishZoo detecting phishing websites by looking at them, in Proceedings of IEEE Fifth International Conference on Semantic Computing, 368–375.
  19. Gowtham, R. Krishnamurthi, L. and Kumar, S.(2021) An efficacious method for detecting phishing webpages through target domain identification, Journal of Decision Support Systems, Elsevier Press, 1-20.
  20. Huang, C. Ma, S. Chen, K.(2020) Using one-time passwords to prevent password phishing attacks, Journal of Network and Computer Applications, Elsevier Press, 1-10.
  21. Sheng, S. Holbrook, M. Kumaraguru, P. and Downs, L. F. J. (2020) Who falls for phish? A demographic analysis of phishing susceptibility and effectiveness of interventions, in Proceedings of the 28th International Conference on Human Factors in Computing Systems, USA, 2-6.
Index Terms

Computer Science
Information Sciences

Keywords

Logistic regression CNN phishing URL attack feature extraction penalty term