CFP last date
20 May 2024
Reseach Article

Query Optimization using SQL Approach for Data Mining Analysis

Published on April 2012 by M. Stella Inba Mary, V. Kalaivani
International Conference in Recent trends in Computational Methods, Communication and Controls
Foundation of Computer Science USA
ICON3C - Number 3
April 2012
Authors: M. Stella Inba Mary, V. Kalaivani
c98e26f6-83ca-439b-b9e9-8d811834cba5

M. Stella Inba Mary, V. Kalaivani . Query Optimization using SQL Approach for Data Mining Analysis. International Conference in Recent trends in Computational Methods, Communication and Controls. ICON3C, 3 (April 2012), 17-21.

@article{
author = { M. Stella Inba Mary, V. Kalaivani },
title = { Query Optimization using SQL Approach for Data Mining Analysis },
journal = { International Conference in Recent trends in Computational Methods, Communication and Controls },
issue_date = { April 2012 },
volume = { ICON3C },
number = { 3 },
month = { April },
year = { 2012 },
issn = 0975-8887,
pages = { 17-21 },
numpages = 5,
url = { /proceedings/icon3c/number3/6019-1020/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference in Recent trends in Computational Methods, Communication and Controls
%A M. Stella Inba Mary
%A V. Kalaivani
%T Query Optimization using SQL Approach for Data Mining Analysis
%J International Conference in Recent trends in Computational Methods, Communication and Controls
%@ 0975-8887
%V ICON3C
%N 3
%P 17-21
%D 2012
%I International Journal of Computer Applications
Abstract

Relational databases are acceptable repository for structured data; integrating data mining algorithms with a relational DBMS is an essential research issue for database programmers. In a relational database, a significant effort is required to prepare a summary data set that can be used as input for the data mining process. It requires many complex SQL queries, joining tables and aggregating columns. This paper realizes the research on extending SQL code for data mining processing and related work on query optimization. Also the paper proposes the following approaches, transposition, pivoting and cross tabulation. The approaches exhibit efficient optimizations with SQL extensions using aggregated Queries

References
  1. Carlos Ordonez," Statistical model computation with UDFs", IEEE Transactions on Knowledge and Data Engineering, vol. 22, no. 22, pp. 1752 - 1765, Dec. 2010.
  2. Carlos Ordonez, Pitchaimalai. S. K, "Bayesian Classifiers Programmed in SQL", IEEE Trans. Knowledge and Data Eng, vol. 22, no. 1, pp. 909-921, Jan. 2010.
  3. Carlos Ordonez, "Integrating K-means clustering with a relational DBMS using SQL" IEEE Trans. Knowledge and Data Eng, vol. 18 no. 2, pp. 181-201, Feb. 2006
  4. Carlos Ordonez, Omiecinski. E, "Efficient Disk-Based K-Means Clustering for Relational Databases", IEEE Trans. Knowledge and Data Eng. , vol. 16, no. 8, pp. 909-921, Aug. 2004.
  5. Carlos Ordonez, Zhibo Chen, "Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis", IEEE Trans. Knowledge and Data Eng. , vol. PP, no. 99, Jan. 2011.
  6. Carrasco, R. A. ; Vila, M. A. ; Araque, F. ," dmFSQL: a Language for Data Mining", DEXA '06. 17th International Workshop on 2006, pp-440-444, 2006
  7. Charu C. Aggarwal, Philip S. Yu. "A Survey of Uncertain Data Algorithms and Applications", IEEE Transactions on Knowledge and Data Engineering, Vol. 21, No. 5. pp. 609-623, May 2009.
  8. Cunningham. C, Graefe. G, and Galindo-Legaria. C. A, PIVOT and UNPIVOT: Optimization and execution strategies in an RDBMS, In Proc. VLDB Conference, pages 998–1009, 2004.
  9. Elena Baralis, Tania Cerquitelli, Silvia Chiusano, "IMine: Index Support for Item Set Mining," IEEE Transactions on Knowledge and Data Engineering, vol. 21, no. 4, pp 493-506, April 2009
  10. Hendrik Decker, "Inconsistency – Tolerant Integrity Checking", IEEE Transactions on Knowledge and Data Engineering, Vol. 23, No. 2. , pp- 218 – 234, Feb 2011
  11. McClean, S. Scotney, B. Shapcott, M. "Aggregation of Imprecise and Uncertain nformation in Databases", Knowledge and Data Engineering, IEEE Transactions , Vol. 13, No. 6, pp 902 – 912, Nov/Dec 2001 .
  12. Netz, A, Chaudhuri. S, Fayyad. U, Bernhardt. J, "Integrating Data Mining with SQL Databases: OLE DB for Data Mining",17th International Conference on 2001, pp. 379-387, 2001
  13. Pitchaimalai, S. , Ordonez, C. , Garcia-Alvarado, C. , "Efficient Distance computation Using SQL Queries and UDFs", IEEE HPDM (High Performance Data Mining Workshop, at ICDM), 2008.
  14. Wang. H, Zaniolo. C, and Luo. C. R, "ATLaS: A small but complete SQL extension for data mining and data streams". In Proc. VLDB Conference, pages -1113–1116, 2003
  15. Yin, X. Han, J. Yang, J. Yu, P. S. "Efficient Classification across Multiple Database Relations: A Cross Mine Approach" IEEE Trans. Knowledge and Data Eng. , vol. 18, no. 6, pp. 770-783, Jun. 2006.
  16. A. Witkowski, S. Bellamkonda, T. Bozkaya, G. Dorman, N. Folkert, A. Gupta, L. Sheng, and S. bramanian. "Spreadsheets in RDBMS for OLAP" In Proc. ACM SIGMOD Conference, pages 52–63, 2003.
  17. S. Sarawagi, S. Thomas, and R. Agrawal. Integrating association rule mining with relational database systems: lternatives and implications. ' In Proc. ACM SIGMOD Conference, pages 343–354, 1998
Index Terms

Computer Science
Information Sciences

Keywords

Relational Dbms Sql Aggregation Query Optimization