Research Article

An Effective Data Preprocessing Technique for Improved Data Management in a Distributed Environment

Published on July 2012 by Sharon Christa, V. Suma, Lakshmi Maduri
Advanced Computing and Communication Technologies for HPC Applications
Foundation of Computer Science USA
ACCTHPCA - Number 3
July 2012
Authors: Sharon Christa, V. Suma, Lakshmi Maduri

Sharon Christa, V. Suma, Lakshmi Maduri. An Effective Data Preprocessing Technique for Improved Data Management in a Distributed Environment. Advanced Computing and Communication Technologies for HPC Applications. ACCTHPCA, 3 (July 2012), 25-29.

@article{
author = { Sharon Christa, V. Suma, Lakshmi Maduri },
title = { An Effective Data Preprocessing Technique for Improved Data Management in a Distributed Environment },
journal = { Advanced Computing and Communication Technologies for HPC Applications },
issue_date = { July 2012 },
volume = { ACCTHPCA },
number = { 3 },
month = { July },
year = { 2012 },
issn = { 0975-8887 },
pages = { 25-29 },
numpages = 5,
url = { /specialissues/accthpca/number3/7568-1022/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Special Issue Article
%1 Advanced Computing and Communication Technologies for HPC Applications
%A Sharon Christa
%A V. Suma
%A Lakshmi Maduri
%T An Effective Data Preprocessing Technique for Improved Data Management in a Distributed Environment
%J Advanced Computing and Communication Technologies for HPC Applications
%@ 0975-8887
%V ACCTHPCA
%N 3
%P 25-29
%D 2012
%I International Journal of Computer Applications
Abstract

With the evolution of distributed computing, databases are inherently distributed across the globe, so analysing data from multiple sources is essential to decision making. The core need in the current industrial environment is therefore to extract information from huge, complex and dynamic data through data mining techniques. Integrating data from multiple sources and analysing large, complex, dynamic data is tedious and difficult work. Additionally, databases contain inconsistent and noisy data. Further, as the quality of the data to be mined decreases, the quality of the knowledge model obtained from it also decreases, which in turn affects the decision-making process. However, optimizing data preprocessing can resolve these issues. This paper presents the design and development of data preprocessing software based on intelligent agents. The software enables data preprocessing operations to be performed in an automated mode, and gives accurate results in less time than manual data preprocessing.

Index Terms

Computer Science
Information Sciences

Keywords

Discretization Agent, Transformation Agent