![]() |
10.5120/15485-4265 |
Sarju S, Riju Thomas and Emilin Shyni C. Article: Spam Email Detection using Structural Features. International Journal of Computer Applications 89(3):38-41, March 2014. Full text available. BibTeX
@article{key:article, author = {Sarju S and Riju Thomas and Emilin Shyni C}, title = {Article: Spam Email Detection using Structural Features}, journal = {International Journal of Computer Applications}, year = {2014}, volume = {89}, number = {3}, pages = {38-41}, month = {March}, note = {Full text available} }
Abstract
In recent years, we have witnessed a dramatic raise in the use of web and thus email becomes an inevitable mode of communication. This is the scenario where the attackers take advantage by the mode of spam mails to the email users and misguide them to some phished sites or the users unwittingly install some malwares to their machine. This shows the importance of research activities being carried out in the field of spam mail detection. In this paper we tend to project a replacement methodology to segregate spam emails from non-spam (legitimate) emails using the distinct structural features available in them. The experiments with 8000 emails show that that our methodology preserves an accuracy of the spam detection up to 99. 4% with at the most 0. 6 % false positives.
References
- S. Abu-Nimeh, D. Nappa, X Wang, S. Nair, "A comparison of machine learning techniques for phishing detection. " In Proceedings of the anti-phishing working groups 2nd annual eCrime researchers summit, pp. 60-69. ACM, 2007.
- Apache James Mime4, http://james. apache. org/mime4j/
- L . Breiman, "Random Forests. ," InMachine Learning,Vol 45 No. 1,pp. 5–32,2001.
- X. Carreras,L. Marquez and J. G Salgado ,"Boosting trees for anti spam filtering," In International conference on Recent Advances in Natural Language Processing, , 2001,pp. 58-64.
- J Clark, I Koprinska, J Poon," A neural network based approach to automated e-mail classification," In Web Intelligence, 2003. WI 2003. Proceedings. IEEE/WIC International Conference on, pp. 702-705. 2003.
- I Fette, N Sadeh, A Tomasic, "Learning to detect phishing emails. " Proceedings of the 16th international conference on World Wide Web. ACM, 2007.
- G . Forman, "An extensive empirical study of feature selection metrics for text classification. ,"InThe Journal of machine learning research,pp. 1289-1305,2003.
- Y. Freund , R. E. Schapire, "A Short Introduction to Boosting. ,"1999.
- Kaspersky Spam Trends and Statistics Report ( 2013), http://www. securelist. com/en/analysis/204792297/Spam_in_Q2_2013
- S. Kiritchenko,S. Matwin,SAbu-Hakima. "Email Classification with Temporal Features," In Intelligent Information Systems,2004,pp. 523-533
- M . Rathi, V. Pareek, "Spam Mail Detection through Data Mining-A Comparative Performance Analysis. ," InInternational Journal of Modern Education & Computer ScienceVol 5 No. 12,2013.
- M. Sahami, S. Dumasi, D. Heckerman, and E. Horvitz, "A Bayesian approach to filtering junk e- mail: In Learning for text categorization," InInternational Journal of Modern Education and Computer Science (IJMECS), Vol. 5 No. 12,pp. 31-39,1998.
- S. Shankar and G. Karypis, "Weight adjustment schemes for a centroid based classifier," Computer Science Technical Report TR00-035, 2000.
- B. Thomas, and P. Richard, "An Essay towards solving a Problem in the Doctrine of Chance. By the late Rev. Mr. Bayes, communicated by Mr. Price, in a letter to John Canton, A. M. F. R. S. ," InPhilosophical Transactions of the Royal Society of LondonVol 53 No. 0,pp. 370–418,1763.
- L Zhang, J Zhu, T Yao," An evaluation of statistical spam filtering techniques. ," In ACM Transactions on Asian Language Information Processing (TALIP) Vol 3, No. 4,pp. 243-269,2004.