CFP last date
20 May 2024
Reseach Article

Fast Retrieval with Column Store using RLE Compression Algorithm

by Ishtiaq Ahmed, Sheesh Ahmad, Durga Shankar Shukla
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 111 - Number 5
Year of Publication: 2015
Authors: Ishtiaq Ahmed, Sheesh Ahmad, Durga Shankar Shukla
10.5120/19537-1193

Ishtiaq Ahmed, Sheesh Ahmad, Durga Shankar Shukla . Fast Retrieval with Column Store using RLE Compression Algorithm. International Journal of Computer Applications. 111, 5 ( February 2015), 30-34. DOI=10.5120/19537-1193

@article{ 10.5120/19537-1193,
author = { Ishtiaq Ahmed, Sheesh Ahmad, Durga Shankar Shukla },
title = { Fast Retrieval with Column Store using RLE Compression Algorithm },
journal = { International Journal of Computer Applications },
issue_date = { February 2015 },
volume = { 111 },
number = { 5 },
month = { February },
year = { 2015 },
issn = { 0975-8887 },
pages = { 30-34 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume111/number5/19537-1193/ },
doi = { 10.5120/19537-1193 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:47:05.801421+05:30
%A Ishtiaq Ahmed
%A Sheesh Ahmad
%A Durga Shankar Shukla
%T Fast Retrieval with Column Store using RLE Compression Algorithm
%J International Journal of Computer Applications
%@ 0975-8887
%V 111
%N 5
%P 30-34
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Column oriented database have continued to grow over the past few decades. C-Store, Vertica Monet DB and Lucid DB are popular open source column oriented database. Column-store in a nutshell, store each attribute values belonging to same column contiguously. Since column data is uniform type therefore, there are some opportunities for storage size optimization in Column-store, many renowned compression schemes such as RLE & LZW that make use of similarity of adjacent data to compress. Good Compression can also be achieved using bitmap index by order of magnitude through the sorting. The Run Length Encoding works best for the columns of ordered data, or data with few distinct values. This ensures long runs of identical values which RLE compresses quite well. In this paper we have put an effort to build a simulation of Column-Store and applied the best bitmap compression technique RLE which further improves the retrieval time.

References
  1. S. Idreos, F. Groffen, N. Nes, S. Manegold, S. Mullender, M. Kersten. MonetDB: Two Decades of Research in Column-oriented Database Artitectures. 2012. W. -K. Chen, Linear Networks and Systems. Belmont, Calif. : Wadsworth, pp. 123-135, 1993. (Book style)
  2. P. Boncz, M. Zukowski, N. Nes. MonetDB/X100: Hyper-pipeliningquery execution. In CIDR, 2005K. Elissa, "An Overview of Decision Theory," unpublished. (Unplublished manuscript)
  3. M. Stonebraker, D. J. Abadi, A. Batkin, X. Chen, M. Cherniack, M. Ferreira, E. Lau, A. Lin, S. R. Madden, E. J. O'Neil, P. E. O'Neil, A. Rasin, N. Tran, S. B. Zdonik. C-Store: A Column-Oriented DBMS. In VLDB, pages 553–564, 2005
  4. http:///www. Data Business Intelligence and Column Database Technology/InfinitDB/by Calpont. mht
  5. Column-oriented DBMS-Wikipedia, the free encyclopedia. mht
  6. Gheorghe MATEI: Column-Oriented Databases, an Alternative for Analytical Environment
  7. Data/HRG/Home. htm
  8. D. J. Abadi, S. R. Madden, N. Hachem. Column-stores vs. row-stores: how different are they really? In Proc. SIGMOD, 2008.
  9. P. E. O'Neil, X. Chen, E. J. O'Neil. Adjoined Dimension Column Index (ADC Index) to Improve Star Schema Query Performance. In ICDE, 2008
  10. P. E. O'Neil, E. J. O'Neil, X. Chen. The Star Schema Benchmark (SSB). http: //www. cs. umb. edu
  11. G. P. Copeland, S. Khosafian. A Decomposition Storage Model. Proceedingsof ACM SIGMOD 1985.
  12. Daniel J. Abadi, Samuel R Madden,Miguel C. Ferreira. :Integrating Compression and Execution in Column –Oriented Database Systems
  13. Daniel J. Abadi, Samuel R Madden,Miguel C. Ferreira. :Integrating Compression and Execution in Column –Oriented Database Systems
  14. G. Graefe and L. Shapiro. Data compression and database performance. In ACM/IEEE-CS Symp. On Applied computing pages 22 -27, April 1991.
  15. M. A. Roth and S. J. V. Horn. Database compression. SIGMOD Rec. , 22(3):31{39, 1993.
  16. Z. Chen, J. Gehrke, and F. Korn. Query optimization in compressed database systems. In SIGMOD '01, Pages 271- 282, 2001
  17. M. Zukowski, S. Heman, N. Nes, and P. Boncz. Super-scalar ram-cpu cache compression. In ICDE, 2006.
  18. Shish Ahmad. Evaluation of security risk associated with different network layers' published in International Journal of computer application Jul 2012
Index Terms

Computer Science
Information Sciences

Keywords

Bitmap Column-Store LZW OLAP OLTP RLE