CFP last date
20 May 2024
Reseach Article

Fuzzy Improved Decision Tree Approach for Outlier Detection in SMS

by Priyanka Maan, Meghna Sharma
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 119 - Number 16
Year of Publication: 2015
Authors: Priyanka Maan, Meghna Sharma
10.5120/21149-4130

Priyanka Maan, Meghna Sharma . Fuzzy Improved Decision Tree Approach for Outlier Detection in SMS. International Journal of Computer Applications. 119, 16 ( June 2015), 6-10. DOI=10.5120/21149-4130

@article{ 10.5120/21149-4130,
author = { Priyanka Maan, Meghna Sharma },
title = { Fuzzy Improved Decision Tree Approach for Outlier Detection in SMS },
journal = { International Journal of Computer Applications },
issue_date = { June 2015 },
volume = { 119 },
number = { 16 },
month = { June },
year = { 2015 },
issn = { 0975-8887 },
pages = { 6-10 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume119/number16/21149-4130/ },
doi = { 10.5120/21149-4130 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:04:11.171816+05:30
%A Priyanka Maan
%A Meghna Sharma
%T Fuzzy Improved Decision Tree Approach for Outlier Detection in SMS
%J International Journal of Computer Applications
%@ 0975-8887
%V 119
%N 16
%P 6-10
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Spam is one of the serious problems faced by internet community globally. Spam Detection is a critical issue in business world. In this paper an intelligent three stage model is presented to perform the spam inclusive outlier identification. The SMS textual dataset is taken as input and than its filtration is done. After that this textual information is converted to the statistical information using fuzzy and assign the weights to dataset. The decision tree algorithm is than applied on this fuzzy weighed dataset to classify the dataset. This algorithm is defined to separate the spam and non spam data values. A comparison of existing Bayesian and proposed Fuzzy based decision tree approach is done. The results shows that the recognition rate is improved using the proposed approach. The work is implemented in weka integrated java environment.

References
  1. Koosha Golmohammadi, Osmar R. Zaiane, Data Mining Applications for Fraud Detection in Securities Market,2012 European Intelligence and Security Informatics Conference 978-0-7695-4782-4/12 © 2012 IEEE.
  2. V. Hodge and J. Austin. A survey of outlier detection methodologies. Artif. Intell. Rev. , 22(2), 2004.
  3. M. Markou and S. Singh. Novelty detection: A review- part 1: Statistical approaches. Signal Processing, 83(12), 2003.
  4. Nikita Spirin, Jiawei Han, Survey on Web Spam Detection: Principles and Algorithms, SIGKDD Explorations Volume 13, Issue 2.
  5. M. Markou and S. Singh. Novelty detection: A review - part 2: Neural network based approaches. Signal Processing, 83(12), 2003.
  6. Priyanka Maan, Meghna Sharma, A Study on Mining Approach under Cyber Crime Analysis , IJETCAS 15-183; 2015.
  7. Sahil Puri, Dishant Gosain, Mehak Ahuja, Ishita Kathuria, Nishtha Jatana, Comparison and Analysis of Spam Detection Algorithms, IJAIEM, Volume 2, Issue 4, April 2013.
  8. N. S. Kumar, D. P. Rana, R. G. Mehta, Detecting E-mail Spam Using Spam Word Associations, IJETAE Volume 2, Issue 4, April 2012.
  9. R. Malarvizhi, K. Saraswathi, Content-Based Spam Filtering and Detection Algorithms- An Efficient Analysis & Comparison, IJETT–Volume 4 Issue 9- Sep 2013.
  10. Zhen Yang, Xiangfei Nie, Weiran Xu, and Jun Guo, An Approach to Spam Detection by Naive Bayes Ensemble Based on Decision Induction, ISDA'06, 2006.
  11. Matthias Reif, Markus Goldstein, Armin Stahl, Anomaly Detection by Combining Decision Trees and Parametric Densities, 2008 IEEE.
  12. Kelton Costa; Patricia Ribeiro; Atair Camargo; Victor Rossi; Henrique Martins; Miguel Neves; Ricardo Fabris, Comparison of the Techniques Decision Tree and MLP for Data Mining in Spams Detection To Computer Networks, 2013 IEEE.
  13. Semih Ergin, Sahin Isik, The Investigation on the Effect of Feature Vector Dimension for Spam Email Detection with a New Framework.
  14. CHUNG-I CHANG," An Integrated Sequential Patterns Mining with Fuzzy Time-Intervals", 2012 International Conference on Systems and Informatics (ICSAI 2012) 978-1-4673-0199-2/12 ©2012 IEEE
  15. Mohammad Zaid Pasha, Nitin Umesh, A Comparative Study on Outlier Detection Techniques, International Journal of Computer Applications, Volume 66– No. 24, March 2013.
  16. N N R Ranga Suri,"An Algorithm for Mining Outliers in Categorical Data through Ranking", 978-1-4673-5116-4/12@2012 IEEE
  17. Amir A. Sheibani," Opinion Mining and Opinion Spam", 6'th International Symposium on Telecommunications (IST'2012) 978-1-4673-2073-3/12©2012 IEEE
  18. Ms. K. Mouthami," Sentiment Analysis and Classification Based On Textual Reviews".
  19. Farkhund Iqbal," Mining Criminal Networks from Chat Log", 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology 978-0-7695-4880-7/12 © 2012 IEEE.
Index Terms

Computer Science
Information Sciences

Keywords

Spam Detection Outlier detection Data mining fuzzy logic decision tree (DT) weka