CFP last date
20 May 2024
Reseach Article

A Life Cycle on Processing Large Dataset - LCPL

by Rajit Nair, Amit Bhagat
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 179 - Number 53
Year of Publication: 2018
Authors: Rajit Nair, Amit Bhagat
10.5120/ijca2018917382

Rajit Nair, Amit Bhagat . A Life Cycle on Processing Large Dataset - LCPL. International Journal of Computer Applications. 179, 53 ( Jun 2018), 27-34. DOI=10.5120/ijca2018917382

@article{ 10.5120/ijca2018917382,
author = { Rajit Nair, Amit Bhagat },
title = { A Life Cycle on Processing Large Dataset - LCPL },
journal = { International Journal of Computer Applications },
issue_date = { Jun 2018 },
volume = { 179 },
number = { 53 },
month = { Jun },
year = { 2018 },
issn = { 0975-8887 },
pages = { 27-34 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume179/number53/29541-2018917382/ },
doi = { 10.5120/ijca2018917382 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:59:08.182845+05:30
%A Rajit Nair
%A Amit Bhagat
%T A Life Cycle on Processing Large Dataset - LCPL
%J International Journal of Computer Applications
%@ 0975-8887
%V 179
%N 53
%P 27-34
%D 2018
%I Foundation of Computer Science (FCS), NY, USA
Abstract

We all know that today’s era is of big data, we can also call it as large data set where data from different sources are collected at one place and it is very difficult to process these type of data. Mainly these types of data are collected where there is huge volume, high velocity and different varieties of data. These types of data are processed for analytical purpose because if we did not analyze it then there is no sense of collecting these type of data. So in this paper we are explaining about the life cycle of processing large data set and propose a new term for it i.e. LCPLD(Life Cycle on Processing Large Data set. Here the discussion will mainly focus on the steps which are involved during the life cycle of processing that are pre-processing, feature selection, feature extraction, classification, clustering and many more.

References
  1. P. Rheingans and M. DesJardins, “Visualizing high-dimensional predictive model quality,” in Proceedings Visualization 2000. VIS 2000 (Cat. No.00CH37145), 2000, p. 493–496,.
  2. C.-W. Tsai, C.-F. Lai, H.-C. Chao, and A. V. Vasilakos, “Big data analytics: a survey,” J. Big Data, vol. 2, no. 1, p. 21, 2015.
  3. N. Tang, “Big data cleaning,” in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, vol. 8709 LNCS, pp. 13–24.
  4. C. J. Zhang, L. Chen, Y. Tong, and Z. Liu, “Cleaning uncertain data with a noisy crowd,” in Proceedings - International Conference on Data Engineering, 2015, vol. 2015–May, pp. 6–17.
  5. D. Peralta, S. Del Río, S. Ramírez-Gallego, I. Triguero, J. M. Benitez, and F. Herrera, “Evolutionary Feature Selection for Big Data Classification: A MapReduce Approach,” Math. Probl. Eng., vol. 2015, 2015.
  6. A. K. Jain, M. N. Murty, and P. J. Flynn, “Data clustering: a review,” ACM Comput. Surv., vol. 31, no. 3, pp. 264–323, 1999.
  7. M. Verleysen, “Learning high-dimensional data,” Limitations Futur. Trends Neural Comput., pp. 141–162, 2003.
  8. P. Rheingans and M. DesJardins, “Visualizing high-dimensional predictive model quality,” in Proceedings Visualization 2000. VIS 2000 (Cat. No.00CH37145), 2000, p. 493–496,.
  9. C.-W. Tsai, C.-F. Lai, H.-C. Chao, and A. V. Vasilakos, “Big data analytics: a survey,” J. Big Data, vol. 2, no. 1, p. 21, 2015.
  10. N. Tang, “Big data cleaning,” in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, vol. 8709 LNCS, pp. 13–24.
  11. C. J. Zhang, L. Chen, Y. Tong, and Z. Liu, “Cleaning uncertain data with a noisy crowd,” in Proceedings - International Conference on Data Engineering, 2015, vol. 2015–May, pp. 6–17.
Index Terms

Computer Science
Information Sciences

Keywords

Preprocessing Features Classification Clustering Dimensions Features.