CFP last date
21 July 2025
Reseach Article

Phishing and Spam Detection: based on URL Heuristics and Email Text Analysis

by Aditya Dusane, Sanket Dhonde, Aayush Dumbre, Omkar Indore, Rizwana Shaikh
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 187 - Number 14
Year of Publication: 2025
Authors: Aditya Dusane, Sanket Dhonde, Aayush Dumbre, Omkar Indore, Rizwana Shaikh
10.5120/ijca2025925157

Aditya Dusane, Sanket Dhonde, Aayush Dumbre, Omkar Indore, Rizwana Shaikh . Phishing and Spam Detection: based on URL Heuristics and Email Text Analysis. International Journal of Computer Applications. 187, 14 ( Jun 2025), 48-52. DOI=10.5120/ijca2025925157

@article{ 10.5120/ijca2025925157,
author = { Aditya Dusane, Sanket Dhonde, Aayush Dumbre, Omkar Indore, Rizwana Shaikh },
title = { Phishing and Spam Detection: based on URL Heuristics and Email Text Analysis },
journal = { International Journal of Computer Applications },
issue_date = { Jun 2025 },
volume = { 187 },
number = { 14 },
month = { Jun },
year = { 2025 },
issn = { 0975-8887 },
pages = { 48-52 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume187/number14/phishing-and-spam-detection-based-on-url-heuristics-and-email-text-analysis/ },
doi = { 10.5120/ijca2025925157 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2025-06-26T19:04:43.383755+05:30
%A Aditya Dusane
%A Sanket Dhonde
%A Aayush Dumbre
%A Omkar Indore
%A Rizwana Shaikh
%T Phishing and Spam Detection: based on URL Heuristics and Email Text Analysis
%J International Journal of Computer Applications
%@ 0975-8887
%V 187
%N 14
%P 48-52
%D 2025
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Phishing attacks continue to compromise cybersecurity by exploiting deceptive URLs and fraudulent emails to extract confidential user information. Traditional systems relying on static heuristics and blacklists are challenged by novel phishing tactics—especially the use of dynamically generated session URLs and subtle email cues. In this paper, we propose a dual-model approach that integrates URL-based heuristics with email text analysis using machine learning (ML) and deep learning (DL) techniques. The system extracts lexical and host-based features from URLs and leverages natural language processing (NLP) to analyze email messages. Experiments on an 11,054-sample phishing URL dataset and a 5,572-sample email dataset reveal that our method achieves a URL classification accuracy of 96.8% and an email spam detection accuracy of 99.2%, with a combined system accuracy of 98.5%. These results demonstrate the robustness of the integrated approach in addressing challenges such as flagging new links and handling dynamic URL patterns.

References
  1. A. Safi and S. Singh, “A systematic literature review on phishing website detection techniques,” J. King Saud Univ. – Computer. Inf. Sci., vol. 35, pp. 590–611, 2023.
  2. H. A. Shaik et al., “Phishing URL detection using machine learning methods,” Adv. Eng. Softw., vol. 173, p. 103288, Jan. 2022.
  3. S. Sallouma et al., “Phishing Email Detection Using Natural Language Processing Techniques: A Literature Survey,” Procedia Computer. Sci., vol. 189, pp. 19–28, 2021.
  4. Aniket Garje et al., “Detecting Phishing Websites Using Machine Learning,” Int. J. Creative Res. Thoughts (IJCRT), Nov. 2021.
  5. A. Krishna V. et al., “Phishing Detection using Machine Learning based URL Analysis: A Survey,” IJERT, 2021.
  6. R. Mahajan and I. Siddavatam, “Phishing Website Detection using Machine Learning Algorithms,” Int. J. Computer. Appl., Oct. 2018.
  7. H. Ghalechyan et al., “Phishing URL detection with neural networks: an empirical study,” Sci. Rep., 2024.
  8. S. Atawneh and H. Aljehani, “Phishing Email Detection Model Using Deep Learning,” Electronics, vol. 12, p. 4261, 2023.
  9. B. Sucharitha et al., “Detecting Phishing Websites Using Machine Learning,” Int. J. Adv. Res. Computer. Commun. Eng., vol. 13, Issue 4, Apr. 2024.
  10. A. El Aassal et al., “An In-Depth Benchmarking of Phishing Detection Research,” IEEE Access, 2020
Index Terms

Computer Science
Information Sciences

Keywords

Phishing detection; URL analysis; email spam; machine learning; deep learning; natural language processing