CFP last date
20 May 2024
Call for Paper
June Edition
IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2024

Submit your paper
Know more
Reseach Article

Performance Comparison for Mining Large Data from the Internet and Learning using ID3 Algorithm in a Docker versus Virtual Machine Environment

by Abishek Ravichandran, Aishwarya Sundararajan, V. Balasubramanian
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 153 - Number 7
Year of Publication: 2016
Authors: Abishek Ravichandran, Aishwarya Sundararajan, V. Balasubramanian
10.5120/ijca2016912100

Abishek Ravichandran, Aishwarya Sundararajan, V. Balasubramanian . Performance Comparison for Mining Large Data from the Internet and Learning using ID3 Algorithm in a Docker versus Virtual Machine Environment. International Journal of Computer Applications. 153, 7 ( Nov 2016), 18-22. DOI=10.5120/ijca2016912100

@article{ 10.5120/ijca2016912100,
author = { Abishek Ravichandran, Aishwarya Sundararajan, V. Balasubramanian },
title = { Performance Comparison for Mining Large Data from the Internet and Learning using ID3 Algorithm in a Docker versus Virtual Machine Environment },
journal = { International Journal of Computer Applications },
issue_date = { Nov 2016 },
volume = { 153 },
number = { 7 },
month = { Nov },
year = { 2016 },
issn = { 0975-8887 },
pages = { 18-22 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume153/number7/26415-2016912100/ },
doi = { 10.5120/ijca2016912100 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:58:30.730775+05:30
%A Abishek Ravichandran
%A Aishwarya Sundararajan
%A V. Balasubramanian
%T Performance Comparison for Mining Large Data from the Internet and Learning using ID3 Algorithm in a Docker versus Virtual Machine Environment
%J International Journal of Computer Applications
%@ 0975-8887
%V 153
%N 7
%P 18-22
%D 2016
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Every day, 2.5 quintillion bytes of data are generated. A sizeable portion of the data is available through the internet. The efficacy of the decisions being made revolves around the extent to which analysis is performed on the procured data. Containers provide Operating System Virtualization and Linux Containers present secure execution environments by independently executing processes.[1]This paper aims at proving that the performance of Docker Container in mining large data from the internet and learning using ID3 algorithm to generate a decision tree to predict useful results is much better than the performance in a Virtual Machine Environment.

References
  1. Major Hayden, “Securing Linux Containers”, GIAC (GCUX) Gold Certification, July 26, 2015.
  2. Samuel T. King, George W. Dunlap, Peter M. Chen, “Operating System Support for Virtual Machines,” Proceedings of the 2003 USENIX Technical Conference, 2003.
  3. Young, C.J., “Extended Architecture and Hypervisor Performance,” Proceedings IEEE Computer Society Conference, Boston, MA, September 1971.
  4. James Turnbull,2014,’The Docker Book’
  5. http://faculty.simpson.edu/lydia.sinapova/www/cmsc310/LN310_AIMA/L13-ML-ID3.pdf
  6. ID3Algorithm,https://en.wikipedia.org/wiki/ID3_algorithm
  7. https://www.cise.ufl.edu/~ddd/cap6635/Fall-97/Short-papers/2.htm
  8. Jeffrey Dean and Sanjay Ghemawat, “MapReduce: Simplified Data Processing on Large Clusters,” Google,Inc.,OSDI 2004.
  9. Carl Boettiger, An introduction to Docker for reproducible research, with examples from the R environment, (2015) ACM SIGOPS Operating Systems Review, Special Issue on Repeatability and Sharing of Experimental Artifacts. 49(1), pp.71-79.
Index Terms

Computer Science
Information Sciences

Keywords

Docker Container Virtual Machine.