CFP last date
20 May 2024
Reseach Article

Review on Class Imbalance Learning: Binary and Multiclass

by Ranjana Singh, Roshani Raut (Ade)
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 131 - Number 16
Year of Publication: 2015
Authors: Ranjana Singh, Roshani Raut (Ade)
10.5120/ijca2015907573

Ranjana Singh, Roshani Raut (Ade) . Review on Class Imbalance Learning: Binary and Multiclass. International Journal of Computer Applications. 131, 16 ( December 2015), 4-8. DOI=10.5120/ijca2015907573

@article{ 10.5120/ijca2015907573,
author = { Ranjana Singh, Roshani Raut (Ade) },
title = { Review on Class Imbalance Learning: Binary and Multiclass },
journal = { International Journal of Computer Applications },
issue_date = { December 2015 },
volume = { 131 },
number = { 16 },
month = { December },
year = { 2015 },
issn = { 0975-8887 },
pages = { 4-8 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume131/number16/23531-2015907573/ },
doi = { 10.5120/ijca2015907573 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:27:32.701425+05:30
%A Ranjana Singh
%A Roshani Raut (Ade)
%T Review on Class Imbalance Learning: Binary and Multiclass
%J International Journal of Computer Applications
%@ 0975-8887
%V 131
%N 16
%P 4-8
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The application area of technology is expanding the span of information size is also additionally increases. Classification gets to be troublesome in view of unbounded size and imbalance nature of data. Class imbalance where one of the two classes having more sample than other years. There are typical strategies for an imbalance data set which is zoned into three main categories, the algorithmic methodology, data pre-processing approach and feature selection approach. In this paper every methodology is characterize which gives the right bearing for exploration in the class imbalance problem. This Paper also examines the three basic divisions of class Imbalance learning like data-preprocessing, the algorithmic approach, and feature selection approach.

References
  1. Shuo Wang, Member, IEEE, and Xin Yao, “Using Class Imbalance Learning for Software Defect Prediction”, IEEE TRANSACTIONS ON RELIABILITY, VOL. 62, NO. 2, JUNE 2013.
  2. Haibo He and Yunqian Ma, “Imbalanced Learning: Foundations, Algorithms, and Applications.”
  3. Mikel Galar, Alberto Fern´andez, Edurne Barrenechea, Humberto Bustince, “A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches”, IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART C: APPLICATIONS AND REVIEWS
  4. J. Zhang and M. Zulkernine, “Network intrusion detection using random forests,” in Proc. 3rd Annu. Conf. Privacy, Secur. Trust, 2005, pp. 53–61.
  5. G. Wang, “Asymmetric random subspace method for imbalanced credit risk evaluation,” in Proc. Softw. Eng. Knowl. Eng., Theory Pract., 2012, pp. 1047–1053.
  6. M. A. Mazurowski, P. A. Habas, J. M. Zurada, J. Y. Lo, J. A. Baker, and G. D. Tourassi, “Training neural network classifiers for medical decision making: The effects of imbalanced datasets on classificationperformance,” Neural Netw., vol. 21, nos. 2–3, pp. 427–436, 2008.
  7. Minlong Lin, Ke Tang, . ”Dynamic Sampling Approach to Training Neural Networks for Multiclass Imbalance Classification”, IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, VOL. 24, NO. 4, APRIL 2013.
  8. Ishani Aroraa , Vivek Tetarwala,*, Anju Sahaa, “Open Issues in Software Defect Prediction”, International Conference on Information and Communication Technologies (ICICT 2014), Elsevier, Procedia Computer Science 46 (2015) 906 – 912.
  9. Jiang Y, Lin J, Cukic B, Menzies, T. Variance analysis in software fault prediction models. In: 20th International Symposium on Software Reliability Engineering. Mysuru; 2009. p. 99-108.
  10. Arora I, Saha A. A literature review on software defect prediction. In: Second International Conference on Emerging Research in Computing, Information, Communication and Applications. Bangalore; 2014. p. 478-487.
  11. Batista GEAPA, Prati RC, Monard MC. A study of the behavior of several methods for balancing machine learning training data. SIGKDD Explorations 2004; 6:20-9.
  12. Galar M, Fernandez A, Barrenechea E, Bustince H, Herrera F. A review on ensembles for the class imbalance problem: bagging-, boosting-,and hybrid-based approaches. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 2012; 42:463-84.
  13. Barandela R, Valdovinos RM, Sánchez JS, Ferri FJ. The imbalanced training sample problem: Under or over sampling? Structural, Syntactic, and Statistical Pattern Recognition. Springer Berlin Heidelberg; 2004. p. 806-814.
  14. Zhou ZH, Liu XY. Training cost-sensitive neural networks with methods addressing the class imbalance problem. IEEE Transactions on Knowledge and Data Engineering 2006; 18:63-77.
  15. Seiffert C, Khoshgoftaar TM, Van Hulse J, Napolitano A. Building useful models from imbalanced data with sampling and boosting. In: FLAIRS Conference. 2008. p. 306-311.
  16. .Seiffert C, Khoshgoftaar TM, Van Hulse J, Napolitano A. RUSBoost: A hybrid approach to alleviating class imbalance. IEEE Transactions on Systems, Man and Cybernetics, Part A: Systems and Humans 2010; 40:185-97.
  17. Galar M, Fernandez A, Barrenechea E, Bustince H, Herrera F. A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 2012; 42:463-84.
  18. Wang S, Minku LL, Yao X. A learning framework for online class imbalance learning. In: IEEE Symposium on Computational Intelligence and Ensemble Learning (CIEL). 2013. p. 36-45.
  19. V. García J.S. Sánchez R.A. Mollineda R. Alejo J.M. Sotoca, “The class imbalance problem in pattern classification and learning”, Pattern Analysis and Learning Group.
  20. Mike Wasikowski, Member and Xue-wen Chen, “Combating the Small Sample Class Imbalance Problem Using FeatureSelection”, IEEE Transactions on Knowledge and Data Engineering, Vol. 22, No. 10, October 2010.
  21. Peters F, Menzies T, Gong L, Zhang H. Balancing privacy and utility in cross-company defect prediction. IEEE Transactions on Software Engineering 2013; 39:1054-68.
  22. S. Wang and X. Yao, “Multiclass imbalance problems: Analysis and potential solutions,” IEEE Trans. Syst., Man Cybern. B, vol. 42, no. 4, pp. 1119–1130, Apr. 2012.
  23. G. E. A. P. A. Batista, R. C. Prati, and M. C. Monard, “A study of the behavior of several methods for balancing machine learning training data,” SIGKDD Expl. Newslett., vol. 6, pp. 20–29, 2004.
  24. Y. Sun, M. S. Kamel, and Y. Wang, “Boosting for learning multiple classes with imbalanced class distribution,” in Proc. Int. Conf. Data Min., 2006, pp. 592–602.
  25. N. V. Chawla, A. Lazarevic, L. O. Hall, and K. W. Bowyer, “SMOTEBoost: Improving prediction of the minority class in boosting,” in Proc. Knowl. Dis. Databas. Conf., 2003, pp. 107–119.
  26. N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, “SMOTE: Synthetic minority over-sampling technique,” J. Artif. Intell. Res., vol. 16, no. 1, pp. 321–357, 2002.
  27. H. Han, W. Y. Wang, and B. H. Mao, “Borderline-SMOTE: A new oversampling method in imbalanced data sets learning,” Adv. Intell. Comput.,vol. 2, no. 5, pp. 878–887, 2005.
  28. Mr.Rushi Longadge, Ms. Snehlata S. Dongre, Dr. Latesh Malik”, Class Imbalance Problem in Data Mining: Review “ International Journal of Computer Science and Network (IJCSN), Volume 2, Issue 1, February 2013 www.ijcsn.org ISSN 2277-5420
Index Terms

Computer Science
Information Sciences

Keywords

Machine learning Imbalanced Data Binary Classification Multiclass Classification Dynamic Sampling