CFP last date
20 May 2024
Reseach Article

Novel and Recurring Class Detection using Ensemble of Classfiers: A Class-based Approach

by Mohammad Raihanul Islam
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 84 - Number 6
Year of Publication: 2013
Authors: Mohammad Raihanul Islam
10.5120/14577-2806

Mohammad Raihanul Islam . Novel and Recurring Class Detection using Ensemble of Classfiers: A Class-based Approach. International Journal of Computer Applications. 84, 6 ( December 2013), 1-9. DOI=10.5120/14577-2806

@article{ 10.5120/14577-2806,
author = { Mohammad Raihanul Islam },
title = { Novel and Recurring Class Detection using Ensemble of Classfiers: A Class-based Approach },
journal = { International Journal of Computer Applications },
issue_date = { December 2013 },
volume = { 84 },
number = { 6 },
month = { December },
year = { 2013 },
issn = { 0975-8887 },
pages = { 1-9 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume84/number6/14577-2806/ },
doi = { 10.5120/14577-2806 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:00:11.129603+05:30
%A Mohammad Raihanul Islam
%T Novel and Recurring Class Detection using Ensemble of Classfiers: A Class-based Approach
%J International Journal of Computer Applications
%@ 0975-8887
%V 84
%N 6
%P 1-9
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Over the recent years, concept-evolution has received a lot of attention to the research community because of its importance in the context of mining data streams. Mining data stream has become a crucial task due to its wide range of applications such as network intrusion detection, credit card fraud identification, identifying trends in the social networks etc. Concept-evolution means introduction of novel class in the data stream. Many recent works address this phenomenon. In addition, a class may appear in the stream, disappears for a while and then reemerges. This scenario is known as recurring classes and also remained unaddressed in most of the cases. As a result, generally where a novel class detection system is present, any recurring class is falsely detected as novel class. This results in unnecessary waste of human and computational resources. In this paper, we have investigated the idea of a class-based ensemble of classification model addressing the issues of recurring and novel class in the presence of concept drift. Our approach has shown impressive performance compared to the state-of-art methods in the literature.

References
  1. Tahseen Al-Khateeb, Mohammad M. Masud, Latifur Khan, Charu Aggarwal, Jiawei Han, and Bhavani Thuraisingham. Stream classification with recurring and novel class detection using class-based ensemble. In Proceedings of the 2012 IEEE 12th International Conference on Data Mining, ICDM '12, pages 31–40. IEEE Computer Society, 2012.
  2. Manuel Baena-Garc ia, Raul Fidalgo Jose del Campo-A vila and, Albert Bifet, Ricard Gavald a, and Rafael Morales- Bueno. Early drift detection method. In ECML PKDD 2006 Workshop on Knowledge Discovery from Data Streams, 2006.
  3. Jing Gao, Wei Fan, and Jiawei Han. On appropriate assumptions to mine data streams: Analysis and practice. In Proceedings of the 2007 Seventh IEEE International Conference on Data Mining, ICDM '07, pages 143–152, Washington, DC, USA, 2007. IEEE Computer Society.
  4. S. Hashemi, Ying Yang, Z. Mirzamomen, and M. Kangavari. Adapted one-versus-all decision trees for data stream classification. IEEE Transactions on Knowledge and Data Engineering, 21(5):624–637, 2009.
  5. Geoff Hulten, Laurie Spencer, and Pedro Domingos. Mining time-changing data streams. In Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '01, pages 97–106, New York, NY, USA, 2001. ACM.
  6. Jeremy Z. Kolter and Marcus A. Maloof. Using additive expert ensembles to cope with concept drift. In Proceedings of the 22nd international conference on Machine learning, ICML '05, pages 449–456, New York, NY, USA, 2005. ACM.
  7. M. M. Masud, Jing Gao, Latifur Khan, Jiawei Han, and Bhavani Thuraisingham. Classification and novel class detection in concept-drifting data streams under time constraints. IEEE Transactions on Knowledge and Data Engineering, 23(6):859–874, 2011.
  8. Mohammad M. Masud, Tahseen M. Al-Khateeb, Latifur Khan, Charu Aggarwal, Jing Gao, Jiawei Han, and Bhavani Thuraisingham. Detecting recurring and novel classes in concept-drifting data streams. In Proceedings of the 2011 IEEE 11th International Conference on Data Mining, ICDM '11, pages 1176–1181, Washington, DC, USA, 2011. IEEE Computer Society.
  9. Mohammad M. Masud, Qing Chen, Latifur Khan, Charu Aggarwal, Jing Gao, Jiawei Han, and Bhavani Thuraisingham. Addressing concept-evolution in concept-drifting data streams. In Proceedings of the 2010 IEEE International Conference on Data Mining, ICDM '10, pages 929–934, Washington, DC, USA, 2010. IEEE Computer Society.
  10. Mohammad M. Masud, Jing Gao, Latifur Khan, Jiawei Han, and Bhavani Thuraisingham. Integrating novel class detection with classification for concept-drifting data streams. In Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II, ECML PKDD '09, pages 79–94, Berlin, Heidelberg, 2009. Springer- Verlag.
  11. L. L. Minku and Xin Yao. Ddd: A new ensemble approach for dealing with concept drift. Knowledge and Data Engineering, IEEE Transactions on, 24(4):619–633, 2012.
  12. Robi Polikar. Ensemble based systems in decision making. IEEE Circuits and Systems Magazine, 6(3):21–45, 2006.
  13. Eduardo J. Spinosa, Andr´e Ponce de Leon F. , and Jo ao Gama. Cluster-based novel concept detection in data streams appliedto intrusion detection in computer networks. In Proceedings of the 2008 ACM symposium on Applied computing, SAC '08, pages 976–980, New York, NY, USA, 2008. ACM.
  14. Haixun Wang, Wei Fan, Philip S. Yu, and Jiawei Han. Mining concept-drifting data streams using ensemble classifiers. In Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '03, pages 226–235, New York, NY, USA, 2003. ACM.
  15. Peng Wang, Haixun Wang, Xiaochen Wu, Wei Wang, and Baile Shi. A low-granularity classifier for data streams with concept drifts and biased class distribution. IEEE Transactions on Knowledge and Data Engineering, 19(9):1202–1213, 2007.
  16. Ying Yang, Xindong Wu, and Xingquan Zhu. Combining proactive and reactive predictions for data streams. In Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, KDD '05, pages 710–715, New York, NY, USA, 2005. ACM.
  17. Peng Zhang, Xingquan Zhu, and Li Guo. Mining data streams with labeled and unlabeled training examples. In Ninth IEEE International Conference on Data Mining, ICDM '09, pages 627–636, 2009.
Index Terms

Computer Science
Information Sciences

Keywords

Novel Class Recurring Class Concept Evolution Stream Classification