CFP last date
20 May 2024
Reseach Article

Machine Learning Classifier for Internet Traffic from Academic Perspective

Published on April 2012 by S. Agrawal, Jaspreet Kaur, Jaspreet Kaur, B.s.sohi
International Conference on Recent Advances and Future Trends in Information Technology (iRAFIT 2012)
Foundation of Computer Science USA
IRAFIT - Number 4
April 2012
Authors: S. Agrawal, Jaspreet Kaur, Jaspreet Kaur, B.s.sohi
dbf9b225-ad67-4c75-80e2-b481c0dc653b

S. Agrawal, Jaspreet Kaur, Jaspreet Kaur, B.s.sohi . Machine Learning Classifier for Internet Traffic from Academic Perspective. International Conference on Recent Advances and Future Trends in Information Technology (iRAFIT 2012). IRAFIT, 4 (April 2012), 4-9.

@article{
author = { S. Agrawal, Jaspreet Kaur, Jaspreet Kaur, B.s.sohi },
title = { Machine Learning Classifier for Internet Traffic from Academic Perspective },
journal = { International Conference on Recent Advances and Future Trends in Information Technology (iRAFIT 2012) },
issue_date = { April 2012 },
volume = { IRAFIT },
number = { 4 },
month = { April },
year = { 2012 },
issn = 0975-8887,
pages = { 4-9 },
numpages = 6,
url = { /proceedings/irafit/number4/5870-1026/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference on Recent Advances and Future Trends in Information Technology (iRAFIT 2012)
%A S. Agrawal
%A Jaspreet Kaur
%A Jaspreet Kaur
%A B.s.sohi
%T Machine Learning Classifier for Internet Traffic from Academic Perspective
%J International Conference on Recent Advances and Future Trends in Information Technology (iRAFIT 2012)
%@ 0975-8887
%V IRAFIT
%N 4
%P 4-9
%D 2012
%I International Journal of Computer Applications
Abstract

The infinite number of websites in the internet world can be classified into different categories in different ways. But if we talk about the educational institutions, websites can be classified into two categories, educational websites and non-educational websites. Educational websites are those websites which are used by the students to acquire knowledge, to explore educational topics, for the research work etc. The non-educational websites are used for entertainment and to keep in touch with people and to get to know more people. In educational institutes for the optimum use of network resources and for the welfare of the students, the use of non-educational websites should be banned while only the educational websites should be allowed to access. Recent trends are use of ML (machine learning) algorithms for internet traffic classification. In this paper, we use three ML classifiers Bayes Net, C4.5 and Radial basis function (RBF) neural network to classify the educational and non-educational websites and compare their performances. Results show that Bayes Net gives best performance for intended classification of internet traffic in terms of classification accuracy, training time of classifiers, recall and precision.

References
  1. Arthur Callado, Carlos Kamienski, Géza Szabó, Balázs Péter Ger?o, Judith Kelner, Stênio Fernandes and Djamel Sadok. Third Quarter 2009. A Survey on Internet Traffic Identification. IEEE Communications Survey & tutorials, vol. 11, no. 3, pp. 37-52.
  2. Thuy T.T. Nguyen and Grenville Armitage. Fourth Quarter 2008. A Survey of Techniques for Internet Traffic Classification using Machine Learning. IEEE Communications Survey & tutorials, vol. 10, no. 4, pp. 56-76.
  3. Runyuan Sun, Bo Yang, Lizhi Peng, Zhenxiang Chen, Lei Zhang, and Shan Jing. 2010. Traffic Classification Using Probabilistic Neural Network. In Sixth International Conference on Natural Computation (ICNC 2010), pp. 1914-1919.
  4. http:/www.iana.org/assignments/port numbers.
  5. Andrew W. Moore, Denis Zuev, Michael L. Crogan. August 2005. Discriminators for use in flow-based classification. Queen Mary University of London, Department of Computer Science, RR-05-13, ISSN 1470-5559.
  6. S. Zander, T. Nguyen and G. Armitage. November 2005. Automated Traffic Classification and Application Identification using Machine Learning. In LCN'05, Sydney, Australia.
  7. S. Zander, T. Nguyen and G. Armitage. March 2005. Self-Learning IP Traffic Classification Based on Statistical Flow Characteristics. In PAM'05, Boston, USA.
  8. Murat Soysal, Ece Guran Schmidt. 2010. Machine learning algorithms for accurate flow-based network traffic classification: Evaluation and comparison. Performance Evaluation Elsevier Journal, Vol. 67, pp. 451-467.
  9. Indra Bhan Arya and Rachna Mishra. 2011. Internet Traffic Classification: An Enhancement in Performance using Classifiers Combination. International Journal of Computer Science and Information Technologies, Vol. 2 (2), pp. 663-667.
  10. Kuldeep Singh and Sunil Agrawal. 2011. Internet Traffic Classification using RBF Neural Network. In Proceedings of International Conference on Communication and Computing technologies (ICCCT-2011), (Jalandhar, Punjab, India) 39-43.
  11. Yongli Ma, Zongjue Qian, Guochu Shou, Yihong Hu. 2008. Study on Preliminary Performance of Algorithms for Network Traffic Identification. 978-0-7695-3336-0/08 $25.00 © IEEE DOI 10.1109/CSSE.1277, pp.629-633.
  12. Kuldeep Singh and Sunil Agrawal. 2011. Comparative Analysis of five Machine Learning Algorithms for IP Traffic Classification. International Conference on Emerging Trends in Networks and Computing Communications (ENCTT-2011), Udaipur, Rajasthan, India.
  13. Y.L. Chong and K. Sundaraj. 2009. A Study of Back Propagation and Radial Basis Neural Networks on ECG signal classification. In 6th International Symposium on Mechatronics and its Applications (ISMA09), (Sharjah, UAE).
  14. Simon Haykin. 2005. Neural Networks: A Comprehensive foundation. 2th edition, Pearson Prentice Hall, New Delhi.
  15. Thales Sehn Korting. C4.5 algorithm and Multivariate Decision Trees. Image Processing Division, National Institute for Space Research – INPE, SP, Brazil.
  16. Weka website 2011. http://www.cs.waikato.ac.nz/ml/weka/
  17. Ian H, Witten and Eibe Frank.. 2005. Data Mining: Practical Machine Learning Tools and Techniques. 2th edition, Morgan Kaufmann Publishers, San Francisco, CA.
  18. Jie Cheng, Russell Greiner. Learning Bayesian Belief Network Classifiers: Algorithms and System. Department of Computing Science, University of Alberta, Edmonton, Alberta, Canada.
  19. Wireshark, Available: http:// www.wireshark.org/
Index Terms

Computer Science
Information Sciences

Keywords

Rbf C4.5 Bayes Net Features