CFP last date
20 June 2024
Call for Paper
July Edition
IJCA solicits high quality original research papers for the upcoming July edition of the journal. The last date of research paper submission is 20 June 2024

Submit your paper
Know more
Reseach Article

Optimized Cloud Storage with High Throughput Deduplication Approach

Published on None 2011 by Y. V. Lokeshwari, B. Prabavathy, Chitra Babu
International Conference on Emerging Technology Trends
Foundation of Computer Science USA
ICETT2011 - Number 1
None 2011
Authors: Y. V. Lokeshwari, B. Prabavathy, Chitra Babu
c902bf71-b737-4130-b7ef-7c7df65050cb

Y. V. Lokeshwari, B. Prabavathy, Chitra Babu . Optimized Cloud Storage with High Throughput Deduplication Approach. International Conference on Emerging Technology Trends. ICETT2011, 1 (None 2011), 32-37.

@article{
author = { Y. V. Lokeshwari, B. Prabavathy, Chitra Babu },
title = { Optimized Cloud Storage with High Throughput Deduplication Approach },
journal = { International Conference on Emerging Technology Trends },
issue_date = { None 2011 },
volume = { ICETT2011 },
number = { 1 },
month = { None },
year = { 2011 },
issn = 0975-8887,
pages = { 32-37 },
numpages = 6,
url = { /proceedings/icett2011/number1/3493-icett003/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference on Emerging Technology Trends
%A Y. V. Lokeshwari
%A B. Prabavathy
%A Chitra Babu
%T Optimized Cloud Storage with High Throughput Deduplication Approach
%J International Conference on Emerging Technology Trends
%@ 0975-8887
%V ICETT2011
%N 1
%P 32-37
%D 2011
%I International Journal of Computer Applications
Abstract

Cloud computing has revolutionised e-commerce by facilitating the consolidation of computing and storage resources. Many organizations have set up private clouds as it results in better utilization of resources. Private cloud storage can be built from the unused resources to store the data that belongs to the organization. Since private cloud storage has a limited amount of hardware resources, they need to be optimally utilized to accommodate maximum data. Deduplication is an effective technique to optimize the utilization of storage space. Two methods adopted for deduplication, namely, chunk level and file level, are studied here. This paper discusses the implementation of both these methods in cloud storage through a case study. The present work also proposes a variation in file level deduplication to further increase the throughput.

References
  1. Wei, J. Jiang, H. Zhou, K. and Feng, D. 2010. Mad2: A scalable High-throughput exact deduplication approach for network backup services, Mass Storage Systems and Technologies, IEEE / NASA Goddard Conference on, 0:1– 14.
  2. Abe, Y. and Gibson, G. 2010 pwalrus: Towards better integration of parallel file systems into cloud storage. In Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS), IEEE International Conference, pp. 1 –7.
  3. Bhagwat, D., Eshghi, K., Long, D. D. E., and Lillibridge, M. 2009. Extreme binning: Scalable, parallel deduplication for chunk-based file backup.
  4. Zhu, B., Li, K., and Patterson, H. 2008. Avoiding the disk bottleneck in the data domain deduplication file system. In Proceedings of the 6th USENIX Conference on File and Storage Technologies, FAST’08, Berkeley, CA, USA. USENIX Association, pp. 18:1–18:14,
  5. Stallings, W. 2002 Cryptography and Network Security, Principles and Practice, Pearson Education, 3rd edition.
  6. Policroniades, C. and Pratt, I. 2004. Alternatives for detecting redundancy in storage systems data, ATEC ’04: Proceedings of the annual conference on USENIX Annual Technical Conference, pp. 1-15.
  7. Zeng W, Zhao Y, Ou K and Song W, 2009, Research on cloud storage architecture and key technologies, ICIS ’09: Proceedings of the second International Conference on Interaction Sciences, pp.1044-1048.
  8. Open Source Eucalyptus manual. Eucalyptus Installation [Online]Available: http://open.eucalyptus.com/wiki/EucalyptusInstall_v2.0.
  9. Amazon’s Elastic Compute Cloud. Elastic Compute Cloud, [Online] Available : http://aws.amazon.com/ec2/.
  10. Open Source Eucalyptus manual. Interacting with Walrus, [Online] Available : http://open.eucalyptus.com/wiki/EucalyptusWalrusInteracti ng_v2.0.
  11. Amazon’s Simple Storage Service. Simple Storage Service, [Online] Available : http://aws.amazon.com/s3/.
  12. Amazon’s Elastic Block Storage. Elastic Block Storage, [Online] Available : http://aws.amazon.com/ebs/.
  13. Open Source Eucalyptus manual. Third Party tool to interact with walrus, [Online] Available : http://open.eucalyptus.com/wiki/EucalyptusWalrusS3Curl_ v2.0.
  14. Gluster file system. http://www.gluster.org.
  15. Gluster file system documentation. http://gluster.com/community/documentation/index.php/MainPa ge
Index Terms

Computer Science
Information Sciences

Keywords

Privatev cloud storage duplicate detection methods deduplication Eucalyptus Gluster file system