CFP last date
22 July 2024
Reseach Article

Data Engineering: using Data Analysis Techniques in Producing Data Driven Products

by V. I. Nnebedum
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 161 - Number 1
Year of Publication: 2017
Authors: V. I. Nnebedum
10.5120/ijca2017912712

V. I. Nnebedum . Data Engineering: using Data Analysis Techniques in Producing Data Driven Products. International Journal of Computer Applications. 161, 1 ( Mar 2017), 13-16. DOI=10.5120/ijca2017912712

@article{ 10.5120/ijca2017912712,
author = { V. I. Nnebedum },
title = { Data Engineering: using Data Analysis Techniques in Producing Data Driven Products },
journal = { International Journal of Computer Applications },
issue_date = { Mar 2017 },
volume = { 161 },
number = { 1 },
month = { Mar },
year = { 2017 },
issn = { 0975-8887 },
pages = { 13-16 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume161/number1/27112-2017912712/ },
doi = { 10.5120/ijca2017912712 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:06:34.324012+05:30
%A V. I. Nnebedum
%T Data Engineering: using Data Analysis Techniques in Producing Data Driven Products
%J International Journal of Computer Applications
%@ 0975-8887
%V 161
%N 1
%P 13-16
%D 2017
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Data analysis is prominent in data science researches, but by each day data usage is expanding, and in recent times the usage is becoming indispensable and inseparable in all works of life including engineering profession. This is why data engineering as a discipline sprang up - using the data analysis techniques from statistics, machine learning, pattern recognition or neural networks, together with other technologies such as visualization, optimization, database systems, knowledge discovery etc to produce systems needed in diverse business, science and social science domains. This paper is a novel presentation of data analysis and data engineering discipline, focusing on critical issues that are relevant to both, but divulging more the new trend of moving data science beyond data analysis, to data engineering. Data engineering is a multi-disciplinary field with applications in control, decision theory, and in the emerging areas like bioinformatics. Data engineering is needed in critical activities for business, engineering, and scientific organizations, since service oriented architecture and web services has moved into full swing.

References
  1. Hector Cuesta (2013) Practical Data Analysis; Packt Publishing; ISBN: 978-1-78328-099-5
  2. Calvin Andrus, Jon Cook, Suresh Sood; 2016; “Data Science: An Introduction”; WikiBook (last modified on 1 November 2016): https://en.wikibooks.org/wiki/Data_Science:_An_Introduction
  3. Judd, Charles and, McCleland, Gary (1989). “Data Analysis”. Harcourt Brace Jovanovich Publication ISBN 0-15-516765-0; http://en.wikipedia.org/wiki/Data_analysis
  4. Resnik, D. (2000). Statistics, ethics, and research: an agenda for educations and reform. Accountability in Research. 8: 163-88
  5. Chris Olsen, Roxy Peck, Jey L Devore; Introduction to Statistics and Data Analysis; Chegg Books EISBN-13: 9781305445963
  6. J. Scott Long, 2009, “The Workflow of Data Analysis Using Stata”, Stata Press, ISBN-13:978-1-59718-047-4
  7. DJ Patil, Hilary Mason (2015) Data Driven Publisher: O'Reilly Media
  8. Brian Shive (2013), “Data Engineering”, Technics Publications, LLC; ISBN-13: 978-1935504603
  9. Yupo Chan, John Talburt Terry M. Talley, 2010,  Data Engineering: Mining, Information and Intelligence (International Series in Operations Research & Management Science) ISBN-13: 978-1441901750 ; Publisher: Springer
  10. Viktor Mayer-Schönberger, Kenneth Cukier (2013) “Big Data: A Revolution That Will Transform How We Live, Work, and Think”, publisher- Eamon Dolan/Houghton Mifflin Harcourt; ISBN-13: 978-0544002692
Index Terms

Computer Science
Information Sciences

Keywords

Data Science Data Engineering Data analysis Data pipelines Data infrastructure