CFP last date
22 April 2024
Reseach Article

Spam Filtration using Boyer Moore Algorithm and Naïve Method

by Aastha Baranwal, Gunjan Gaur, Akanksha Bhasker, Rishabh Jain
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 180 - Number 42
Year of Publication: 2018
Authors: Aastha Baranwal, Gunjan Gaur, Akanksha Bhasker, Rishabh Jain
10.5120/ijca2018917119

Aastha Baranwal, Gunjan Gaur, Akanksha Bhasker, Rishabh Jain . Spam Filtration using Boyer Moore Algorithm and Naïve Method. International Journal of Computer Applications. 180, 42 ( May 2018), 35-38. DOI=10.5120/ijca2018917119

@article{ 10.5120/ijca2018917119,
author = { Aastha Baranwal, Gunjan Gaur, Akanksha Bhasker, Rishabh Jain },
title = { Spam Filtration using Boyer Moore Algorithm and Naïve Method },
journal = { International Journal of Computer Applications },
issue_date = { May 2018 },
volume = { 180 },
number = { 42 },
month = { May },
year = { 2018 },
issn = { 0975-8887 },
pages = { 35-38 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume180/number42/29414-2018917119/ },
doi = { 10.5120/ijca2018917119 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T01:03:25.794888+05:30
%A Aastha Baranwal
%A Gunjan Gaur
%A Akanksha Bhasker
%A Rishabh Jain
%T Spam Filtration using Boyer Moore Algorithm and Naïve Method
%J International Journal of Computer Applications
%@ 0975-8887
%V 180
%N 42
%P 35-38
%D 2018
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Emails are primarily being used for transporting information in a quicker and well-organized way. They are favored in professional as well as personal space because of its attribute of time saving and trading data over huge distances. According to the statistics of the past few years, forgery and fraudulent activities are frequent in emails and these are categorized as ‘spams’. Spam emails utilize time, transmission capacity and storage section; hence it is essential to spot these mails to shield our treasured data and time from being distorted. There are numerous approaches that have been formulated to screen the emails and organize them as spam and non-spam. The motive of scripting this paper is to analyze recent works done in spam detection and also present a new technique using graylist filter, Boyer Moore string searching algorithm and Naïve Bayes algorithm. Also, observe its working in contrast with traditional Naïve Bayes algorithm.

References
  1. S. Aski and N. K. Sourati, -Proposed efficient algorithm to filter spam using machine learning techniques, in International Journal of Innovative Research in Computer and Communication Engineering, 2017
  2. Akash Iyengar, G.Kalpana, Kalyankumar.S, S.GunaNandhini, “Integrated Spam Detection for Pacific Science Review- A Natural Science Engineering- Elsevier., vol. 18, no. 2, pp. 145-149, 2016
  3. V. Suganya, “A Review on Phishing Attacks and Various Anti Phishing Techniques” in International Journal of Computer Applications, 2016
  4. Sunil B. Rathod, Tareek M. Pattewar, “Content Based Spam Detection in Email using Bayesian Classifier” in International Conference on Cryptography, Security and Privacy, 2015
  5. Lin Li, Chi Li, “Research and Improvement of a Spam Filter based on Naïve Bayes” in International Conference on Intelligent Human-Machine Systems and Cybernetics, 2015
  6. R. Malarvizhi, K. Saraswathi, “Content-Based Spam Filtering and Detection Algorithm- An Efficient Analysis & Comparison” in International Journal of Engineering Trends and Technology, 2013
  7. Omar Saad, Ashraf Darwish, Ramadan Faraj, “A survey of machine learning techniques for Spam filtering” in International Journal of Computer Science and Network Security, 2012
  8. Zhengda Xiong, “A Composite Boyer-Moore Algorithm for the String Matching Problem” in International Conference on Parallel and Distributed Computing, Applications and Technologies, 2010
  9. Kajaree Das, Rabi Narayan Behera, “A Survey on Machine Learning: Concept, Algorithms and Applications.” Multilingual Emails” in International Conference on Information, Communication, and Embedded Systems, 2017
  10. Akhtar Rasool, Amrita Tiwari, Gunjan Singla, Nilay Khare, “String Matching Methodologies: A Comaparative Analysis” in International Journal of Computer Science and Information Technologies, 2012
  11. Anjali Sharma, Manisha, Dr. Rekha Jain, Dr. Manisha, “Data Preprocessing in Spam Detection” in International Journal of Science Technology and Engineering, 2015
  12. William S. Yerazunis, Shalendra Chhabra, Christian Siefkes, Fidelis Assis, Dimitrios, “A Unified Model of Spam Filtration” in Mitsubishi Electric Research Laboratories, 2005
  13. Saadat Nazirova, “Survey on Spam Filtering Techniques” in Communications and Network, Scientific Research, 2011
  14. Mamoru Kato, Joseph Langeway, Yimin Wu, William S. Yerazunis, “Three Non-Bayesian Methods of Spam Filtration” in TREC, 2007
  15. Neha Roy, Rishabh Jain, “Virtual Machine Scheduling on Clouds Using DVFS” in International Journal of Advanced Research in Computer Science and Software Engineering ISSN: 2277 128X, Volume 5, Issue 5, May 2015.
  16. Pooja Ahlawat, Poonam, Rishabh Jain, “An Improvement to Life of Wireless Sensor Network Using Leach Design a Cluster Head”- IJCSMS (International Journal of Computer Science & Management Studies) ISSN(Online): 2231-5268, Volume 15, Issue 06, June
Index Terms

Computer Science
Information Sciences

Keywords

Spam Ham Tokenization Classifier.