IJSTE - International Journal of Science Technology & Engineering | Volume 2 | Issue 09 | March 2016 ISSN (online): 2349-784X
Secure Data Deduplication using Efficient Convergent Key Management D. Jeyakumari Associate Professor Department of Electronics & Communication Engineering RVS College of Engineering & Technology, Coimbatore, Tamil Nadu, India
D. Suganthi Assistant Professor Department of Information Technology RVS College of Engineering & Technology, Coimbatore, Tamil Nadu, India
S. Anitha Department of Information Technology RVS College of Engineering & Technology, Coimbatore, Tamil Nadu, India
Vindhyamol. C.V Department of Information Technology RVS College of Engineering & Technology, Coimbatore, Tamil Nadu, India
Abstract In cloud computing, data deduplication is used to reduce the amount of duplicate files. Data deduplication is a specialized data compression technique for eliminating duplicate copies of repeating data. To protect the confidentiality of sensitive data convergent encryption technique is used. Privileges are provided to users for accessing the cloud storage. Initially tags are generated for each data copies to be uploaded in the cloud server. User who has the tag can only access the data copies in the cloud storage. Cloud Service Provider performs duplicate check by comparing the contents of two files. Duplicate files are detected to reduce the storage space in cloud server. Keywords: Deduplication, convergent encryption algorithm, authorized duplicate check, confidentiality, hybrid cloud ________________________________________________________________________________________________________ I.
INTRODUCTION
Cloud computing is a type of computing that relies on sharing computing resources rather than having local servers or personal devices to handle applications. Cloud computing services can be private, public or hybrid. Here we are using hybrid cloud. Hybrid cloud is a combination of public cloud services and on-premises private cloud – with orchestration and automation between the two. In the administration of a multi-user computer system, a privilege is an identified right that a particular user has to a particular system resource, such as a file folder, the use of certain system commands, or an amount of storage. Generally, a system administrator or, in the case of network resources such as access to a particular device, a network administrator assigns privileges to users. System software then automatically enforces these privileges. Data deduplication is a method of reducing storage needs by eliminating redundant data. Only one unique instance of the data is actually retained on storage media, such as disk or tape. Redundant data is replaced with a pointer to the unique data copy. Data security is another area of increasing importance in modern storage systems and, unfortunately, deduplication and encryption are, to a great extent, diametrically opposed to one another. Deduplication takes advantage of data similarity in order to achieve a reduction in storage space. Convergent encryption, also known as content hash keying, is a cryptosystem that produces identical cipher text from identical plaintext files. This has applications in cloud computing to remove duplicate files from storage without the provider having access to the encryption keys. II. LITERATURE SURVEY Jin Li, Yan Kit Li, Xiaofeng Chen, Patrick P.C.Lee, and Wenjing Lou[1] proposed a method for advanced deduplication system supporting authorized duplicate check. It is secure because the private keys for privileges will not be issued to users directly, which will be kept and managed by the private cloud server instead. In this way, the users cannot share these private keys of privileges, which mean that it can prevent the privilege key sharing among uses in the above straight forward construction. To get a file, the user needs to send a request to the private cloud server. The motive of this research is to address the problem of authorized data deduplication. Paul Anderson, Le Zhang [2] proposed and investigated an algorithm to increase the speed of backups, and reduce the storage requirements. It also supports a unique feature which allows immediate detection of common subtrees, avoiding the need to query the backup system for every file. Pasquale puzio secludit and eurecom, refik molva eurecom, and melek o nen eurecom[3] proposed and investigated ClouDedup, a secure and efficient storage service which assures block-level deduplication and data confidentiality at the same
All rights reserved by www.ijste.org
293