Multi-Language Character Encoding Technique for DNA Storage Wei Wang*1, Zhengxu Zhao2, Wei Zhang3 School of Information Science and Technology, Shijiazhuang Tiedao University, Shijiazhuang, China
1,2
Beijing Aerospace Control Center, Beijing, China
3
wangwei@stdu.edu.cn; 2zhaozx @stdu.edu.cn; 363858606@qq.com
*1
Abstract In 1994, Dr Adleman solved problem using DNA as computational mechanism. He proved the principle that DNA computing could be used to solve computationally complex problems. Recent 20 years with the rapid development of biological molecular computer, scientist have set a series of theoretical model and succeed in biochemical experiment. DNA computing has become an important research direction of the computer science and molecular biology. This research present a novel approach in which character could be encoded by the permutation and combination of the four nitrogenous bases (Adenine, Guanine, Cytosine and Thymine) in DNA molecules. And the character encoding should support multi-language and unique identifier. Keywords DNA Storag; Character Encoding; DNA Computing
Introduction The rapid development of science and information industry, especially the development of multimedia technology, cloud computer and computer network, computer storage equipment not only has a larger data storage capacity, higher data transmission rate and more reliable data storage quality. Also on how to make the data more economic and safe storage, storage in time and space on the extensibility, have put forward higher requirements. Current computer storage system the birth defects are revealed and the subsequent development of lack of power, has become one of the bottleneck of the computer promotion. Whether the HDD or optical storage technology is unable to cope with the future demand for storage of computer. It is estimated that in the future semiconductor, disk, and CD-ROM data storage density will achieve its physical limit, it is urgent need to develop a new generation of alternative storage technology. On the other hand, Biological molecular computer
which Adleman completed the first experimental verification has been rapid development. Nearly two decades, a variety of theoretical models and experimental methods emerge in endlessly, such as Adleman model, Splicing System model, InsertionDeletion System model and DNA-EC model. DNA storage as an important branch in the field of biological molecular computer, because it has high storage density and low hardware cost, access procedure parallelizable, good scalability and integration, and long term storage. In the foreseeable future DNA storage system will be likely to replace the traditional storage systems. DNA molecule is a powerful and effective natural information storage medium, it has been widely used since 1985 when DNA molecule was synthesized for the first time. There are obvious similarities between DNA storage system and traditional storage system, both of two storage system are sequential storage devices, and use special symbols to indicate the beginning and end of a single information section, and the data error correction coding is used to ensure the integrity of their information. As a result, DNA molecules can be used as a medium of the information is stored. DNA storage technology is based on the DNA molecule storage medium. The four nitrogenous bases (Adenine, Guanine, Cytosine and Thymine) what are contained within DNA molecule can be used to encode information. With the existing biochemical experiment method, it's easily complete the clone operation of DNA molecules and the modify operation of the nitrogenous bases what has been encode in the DNA molecules, these operations are similar with the traditional storage system which read and write operations. Because of the advantages of DNA storage system such as stable and reliable work, no wear, huge information capacity, long life, high quality, low price of bits of information and access procedure parallelizable, DNA storage system is seen as high
International Journal of Automation and Control Engineering, Vol. 4, No. 1—April 2015 2325-7407/15/01 019-3 Š 2015 DEStech Publications, Inc. doi:10.12783/ijace.2015.0401.05
19