An Efficient Reversible Watermarking Technique for Textual Data

IJIRST â&#x20AC;&#x201C;International Journal for Innovative Research in Science & Technology| Volume 3 | Issue 02 | July 2016 ISSN (online): 2349-6010

An Efficient Reversible Watermarking Technique for Textual Data Vaidyanathan A N M. Tech Student Department of Computer Science & Engineering NCERC, Pampady Thiruvilwamala, Thrissur, Kerala

Dr. S Subasree Professor & Head of Dept. Department of Computer Science & Engineering NCERC, Pampady Thiruvilwamala, Thrissur, Kerala

Ms. Preethymol B Assistant Professor Department of Computer Science & Engineering NCERC, Pampady Thiruvilwamala, Thrissur, Kerala

Abstract Database is a collection of large set of data and information which are organized so that it can be accessed efficiently for knowledge discovery. Many real world applications uses open databases which are available in the internet to extract information based on their needs. The relational databases which are freely available are used by research community for mining new information regarding to their research works. These databases are vulnerable to security issues related to ownership and data tampering. The reliability of the data source must be verified before using it for any research or application purpose. In order to ensure ownership and reliability, watermarking is done to the data. When watermark is embedded to the database it reduces the quality of the data thereby making it unfit for information retrieval .In order to avoid this scenario reversible watermarking is deployed which preserves data quality by recovering the original data along with data security .There are many effective approaches that performs reversible watermarking to ensure ownership along with data recovery. But the main problems with these techniques are, they only focus on numerical databases. Due to this, many of the databases which contain textual data cannot be watermarked with the existing approaches. In order to watermark the textual database an efficient method is proposed here, that uses the Unicode and ASCII value of the alphabets to watermark the textual data. It encodes the textual data with numeric values but retrieves the original textual data at the receiving end. Since a numerical value replaces the textual data field during transmission it makes it difficult for the attacker to retrieve the original information held in the database. Keywords: Reversible watermarking, genetic algorithm, relational data, Textual data _______________________________________________________________________________________________________ I.

INTRODUCTION

The advancement of information technology has boosted the growth of business and research. In many fields, data are extracted widely from various sources for information retrieval and decision making. Many real world application mine data available in different formats like text, audio, video, images and relational data to gather new ideas and information. Especially relational data which is more prominent among the scholar community is shared extensively by the researchers. Open databases are surplus in the internet which helps the scholars to refer different sources. However these databases are viable to many attacks. The data are illegally copied by the attackers thereby posing threat to its ownership rights. The personal information of customer is also retrieved by the attacker causing major security issue for the data. In order to resolve these issues, and to enforce ownership to data, watermarking technique is being used for many years which effectively denies illegal copyrighting. The watermark generated will be embedded to the original data which helps to identify the ownership of data. The data owner can easily identify their data if it contains a unique watermark. The issue regarding watermark is that, while embedding the watermark to the data, the database undergoes certain modification based on the bandwidth of the watermark causing the quality to be compromised. To resolve this scenario reversible watermarking technique is introduced in which the embedded watermark can be revised by the data owner and the original data can be decoded from the watermarked data thereby the data quality is kept intact. Moreover in reversible watermarking the data owner can specify the distortion tolerance i.e. the amount of change in the data that can be allowed by owner while embedding watermark. Based on the distortion tolerance the watermark is embedded to the data. The Deferential Expansion Watermarking (DEW), Genetic algorithm based on difference expansion watermarking (GADEW), A robust and reversible watermarking technique for relation data (RRW) are the main reversible watermarking approaches used in which all the technique uses different method to watermark and decode the data. Since these approaches follow reversible watermarking it is possible to recover the original data from them. But the fact is that none of these focus on textual database. It all focuses on numerical data and does not watermark textual fields. Here an efficient technique is proposed, by which the textual database can be watermarked and can be made secured against heavy attacks.

316