CFP last date
20 May 2024
Reseach Article

An Algorithm to Evaluate Iceberg Query using Compacted Bitmap Vector

by V. Shankar, C. V. Guru Rao
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 60 - Number 15
Year of Publication: 2012
Authors: V. Shankar, C. V. Guru Rao
10.5120/9765-1422

V. Shankar, C. V. Guru Rao . An Algorithm to Evaluate Iceberg Query using Compacted Bitmap Vector. International Journal of Computer Applications. 60, 15 ( December 2012), 1-7. DOI=10.5120/9765-1422

@article{ 10.5120/9765-1422,
author = { V. Shankar, C. V. Guru Rao },
title = { An Algorithm to Evaluate Iceberg Query using Compacted Bitmap Vector },
journal = { International Journal of Computer Applications },
issue_date = { December 2012 },
volume = { 60 },
number = { 15 },
month = { December },
year = { 2012 },
issn = { 0975-8887 },
pages = { 1-7 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume60/number15/9765-1422/ },
doi = { 10.5120/9765-1422 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:06:49.912644+05:30
%A V. Shankar
%A C. V. Guru Rao
%T An Algorithm to Evaluate Iceberg Query using Compacted Bitmap Vector
%J International Journal of Computer Applications
%@ 0975-8887
%V 60
%N 15
%P 1-7
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The data storing and retrieving are playing a major role in the data clustering and data warehousing techniques. The effectiveness of a data retrieving method depends upon the data specific queries for retrieving the data from the database. Iceberg query is a unique class of aggregation query, which computes aggregate values above a given threshold. Many data mining queries are basically ice berg queries. The major part taken into the consideration about the AND operation in the iceberg queries. The reduced number of AND operation increases the effectiveness of the iceberg query. In this work, an efficient iceberg query evaluation process is proposed by reducing the bitwise AND operations needed to find the item pairs. In the proposed approach two solutions are introduced to reduce the bitwise AND operations. Randomly identifying 'N' 1-bit positions instead of first 1-bit position and reducing the zero bit values from the Most Significant Side so that the bit map vector will be reduced in such a way, the bitwise operations needed is reduced. The experimentation is conducted on two datasets in order to evaluate the performance of the proposed iceberg query evaluation algorithm. In the case of retail datasets, the time required for processing 100000 tuples is 864 milliseconds. On the other hand, the same for synthetic dataset, T10I4D100K, is 910 milliseconds.

References
  1. Kevin S. Beyer and Raghu Ramakrishnan "Bottom-up computation of sparse and iceberg cubes". In Proc. of the Int. Conf. on Management of Data (ACM SIGMOD), pages 359-370, 1999.
  2. S. Brin, R. Motwani, J. D. Ullman, and S. Tsur. Dynamic itemset counting and implication rules for market basket data. In Proc. of the Int. Conf. on Management of Data (ACM SIGMOD), pages 255-264, 1997.
  3. Leela, Krishna P and Tolani, Pankaj M and Haritsa, Jayant R, "On Incorporating Iceberg Queries in Query Processors. ", In: 9th International Conference on Database Systems for Advanced Applications:DASFAA, vol. 2973, pp. 431-442, 2004.
  4. G. Graefe. "Query Evaluation Techniques for Large Databases", ACM, pp. 73–170, 1993.
  5. W. P. Yan and Larson, "Data Reduction through EarlyGrouping", In CASCON, page 74, 1994.
  6. P. -A?. Larson, "Grouping and Duplicate Elimination: Benefits of Early Aggregation" Technical Report MSR-TR-97-36, Microsoft Research, 1997.
  7. A. Broder, S. C. Glassman and M. S. Manasse, "Syntactic clustering of the web. InProc. of the 6th Int. World Wide Web Conference," 1997.
  8. S. Agarwal, R. Agrawal, P. Deshpande, A. Gupta, J. F. Naughton,R. Ramakrishnan, and S. Sarawagi. "On the Computation of Multidimensional Aggregates. In VLDB", pp. 506–521, 1996.
  9. M. Fang, N. Shivakumar, H. Garcia-Molina, R. Motwani, and J. D. Ullman. Computing Iceberg Queries Efficiently. In VLDB, pages299–310, 1998.
  10. Bin He, Hui-I Hsiao, Ziyang Liu, Yu Huang and Yi Chen, "Efficient Iceberg Query Evaluation using Compressed Bitmap Index", IEEE Transactions On Knowledge And Data Engineering, pp. 1-2, 2011.
  11. Hsiao H, Liu Z, Huang Y, Chen Y, "Efficient Iceberg Query Evaluation using Compressed Bitmap Index" , Knowledge and Data Engineering, IEEE, Issue: 99, pp. 1, 2011.
  12. Jinuk Bae, Sukho Lee, "Partitioning Algorithms for the Computation of Average Iceberg Queries", Springer-Verlag, ISBN:3-540-67980-4,pp. 276 – 286, 2000.
  13. Frequent Item set Mining Dataset Repository http://fimi. ua. ac. be/data/
  14. Alfredo Cuzzocrea, Filippo Furfaro and Giuseppe M. Mazzeo, "A Probabilistic Approach for Computing Approximate Iceberg Cubes", Lecture Notes in Computer Science, vol. 5181, pp. 348-361, 2008.
  15. Fianny Ming-fei Jiang, Jian Pei and Ada Wai-chee Fu, "Ix-cubes: iceberg cubes for data warehousing and olap on xml data", in Proceedings of the sixteenth ACM conference on Conference on information and knowledge management, pp. 905-908, 2007.
  16. Stockinger, Kurt. (2009). Using Bitmap Index for Joint Queries on Structured and Text Data. Lawrence Berkeley National Laboratory: Lawrence Berkeley National Laboratory. Retrieved from: http://www. escholarship. org/uc/item/0db7c07n
Index Terms

Computer Science
Information Sciences

Keywords

Database iceberg query bitwise-AND operation dynamic pruning bitmap table