International Journal of Engineering and Advanced Research Technology (IJEART) ISSN: 2454-9290, Volume-1, Issue-6, December 2015
Multifratal Analysis of the Pathogenesis of Alzheimer Disease Tong-han Lan, Zi-yang Lan, Xiao Li, Hao Cai, Fei –fei Yu
mathematical terms. In addition, related concepts are those of self-similarity and sub-divisibility. The length of the coastline of Britain or the length of the perimeter of the Koch curve increases as we measure it at finer spatial resolution.The use of fractal analysis has been applied to many field[17]. The main purpose of this paper is to analyze the pathogenesis of Alzheimer Disease by means of fratal method.DNA research has previously focused on searching for low level patterns directly visible within the sequence,while ignoring high level patterns.The multifractal analysis of the pathogenesis of Alzheimer’s Disease is intended to demonstrate that a higer level of pattern information may be availible within the the pathogenesis of Alzheimer’s Disease[18].Thus the research has technically significance. This paper is divided into four parts: the first part is the introduction of the pathogenesis of Alzheimer Disease and DNA series analysis basic state; in the second part we briefly review materials and the fratal method; the third and four part are the results and discussion.
Abstract—In this paper the fractal analysis method is introduced to characterize the self-similarity of DNA sequences. Using this method, four whole genomes of the pathogenesis of Alzheimer’s Disease provided by NCBI are analyzed. the pathogenesis of Alzheimer’s Disease were exhibited multifractal characteristics when calculating the probability measure for four difference the pathogenesis sequences of each base indidually.From the results we find that Renyi dimension range between 0.0090 to 3.5043. the multifractal characteristic properties in the whole the pathogenesis of Alzheimer’s Disease are verified. The results of this paper suggest that the the multifractal characteristic property is one of the natural properties that DNA sequences possess, and is directly related to the structure and function of the whole DNA molecule. Index Terms—mutilfractal,pathogenesis Disease,Renyi dimension
of
Alzheimer’s
I. INTRODUCTION Alzheimer’s disease (AD) is the most common form of dementia and the most frequent degenerative brain disorder encountered in old age, The risk of developing AD substantially increases after 65 years of age [1]. AD is quickly becoming one of the major universal healthcare problems.While the cause of this disease remains unknown, there is evidence for substantial genetic influence [1]-[5]. With the unclear pathogenesis, several hypotheses about the pathogenesis of AD, such as the Abnormal protein hypothesis, cholinergic hypothesis, oxidative stress theory and the estrogen hypothesis, etc. were proposed. Mutations in three genes, a myloid precursorprote in ( APP ), presenilin 1 and 2( PSEN1 , PSEN2 ), result in an early onset, autosomal dominant form of the disease beginning in the third or fourth decade. The ɛ4 allele of apolipoprotein E (APOE) increases the risk of both sporadic and familial AD occurring later in life around the sixth decade. Each of these genes is involved in the production or processing of the amyloid β peptide, which is deposited in the brain as dense plaques that are characteristic of the disease [3]. Today, however, there are neither precise diagnostic approaches nor effective therapeutic agents available for Alzheimer disease.
II. MATERIALS AND METHODS A. Data Resources We will use the tools of the World Wide Web to search the GenBank DNA sequence database (http://www.ncbi.nlm.nih.gov). Homo sapiens amyloid beta (A4) precursor protein (APP), RefSeqGene on chromosome 21(NCBI Reference Sequence: NG_007376.1, GI:166795291); Homo sapiens apolipoprotein E (APOE), RefSeqGene on chromosome 19(NCBI Reference Sequence: NG_007084.2, GI:163954918); Mus musculus presenilin-1 gene, alternatively spliced transcripts, complete cds,GenBank: AF007560.1, GI:2463667; Mus musculus presenilin-2 gene,strain 129X1/SvJ chromosome 12 CRA_211000022007779, whole genome shotgun sequence,GenBank: AAHY01101600.1, GI:69874353. B. Mapping rule
n
A DNA sequence i (i=1,2,…,L)of length L is comprised a series of 4 types of base as follows: adenine(A); thymine(T); guanine(G); and cytosine(C). In order to apply numerical methods to nucleotide sequence, We can also use(-2,-1,1,2) to replace {A,G,C,T},We expect it to reveal more information than one dimensional DNA walk [19].
In the past decade or so there has been a ground swell of interest in unraveling the mysteries of DNA. In order to distinguish coding regions from non-coding ones, many approaches have been proposed[6]-[11], at the same times, the nonlinear scaling method, such as complexity(12)and fractal analysis[13]−[17]were used. The word―fractal‖ was coined by Benoit Mandelbrot in the late 1970s,but objects now defined as fractal in form have been known to artists and mathematicians for centuries. Mandelbrot’s definition---―a set whose hausdorff dimension is not integer‖—is clear in
C. Fractal analysis The use of fractal analysis has been applied to many fields[Kins94],such as ion channel kinetics analysis etal, Fractals are sets which exhibit self-similar properties at different scales and are characterized by their fractal
13
www.ijeart.com
Multifratal Analysis of the Pathogenesis of Alzheimer Disease dimension. The word―fractal‖ was coined by Benoit Mandelbrot in the late 1970s,but objects now defined as fractal in form have been known to artists and mathematicians for centuries. Mandelbrot’s definition---―a set whose hausdorff dimension is not integer‖—is clear in mathematical terms. In addition, related concepts are those of self-similarity and sub-divisibility. The length of the coastline of Britain or the length of the perimeter of the Koch curve increases as we measure it at finer spatial resolution.The use of fractal analysis has been applied to many field[19]-[21].
1.0 80
y(n)
y(n)
0.2 0.0
0
white noise
-0.2 0
2000
4000
6000
n
8000
10000
4000
4400n
apolipoprotein E adenine(A) window length 100
600
15 10
300 200
5
300
100 0
0.20 0.30 probability
0.40
0.1
0.2 0.3 0.4 probability
25 apolipoprotein E thymine(T) window length 100
25
20 Number
20 15
apolipoprotein E cytosine(C) window length 100
10
10
0.1
0.2 0.3 0.4 probability
0.5
0.6
0.0
APP cytosine(C) window length 100
0.5
0.1
0.2
0.3 0.4 probability
0.5
100
0
0.5
0 0.0
80
APP thymine(T) window length 100
200
100
0.4
0.4
300
5
0.2 0.3 probability
0.2 0.3 probability
400
300 200
5
0.1
500 400
15
Number
30
0 0.0
0.5
Number
0.10
0.1
0.2 0.3 probability
0.4
0.1
0.2
0.3 0.4 probability
0.5
0.6
200
presenilin-1 adenine(A) window length 100
60
80
250
presenilin-1 guanine(G) window length 100
presenilin-2 adenine(A) window length 100
150
Number
Number
40
100
presenilin-2 guanine(G) window length 100
200
60
40
150 100
20
log r
50
20
50 0
(1)
0.1
0.2 0.3 probability
0.4
0.5
0.1
0.2 0.3 probability
0 0.2
0.4
0.3 0.4 probability
0.5
0.10
60 40
200
60 40
presenilin-2 cytosine(C) window length 100
150 100
20
y ( n)
0.3 0.4 probability
100
50
0
0.5
0.1
0.2 0.3 probability
0.4
0.05
0.10
0.15 0.20 0.25 probability
0.30
0.35
0.1
0.2
0.3 0.4 probability
0.5
0.6
Fig 3 show the pathogenesis of Alzheimer Disease difference base at window 100 condition probability contribution.
i
and
D. Estimating the multifractal dimension We compute Renyi dimension for difference the pathogenesis of Alzheimer Disease of the four bases.The results were shown as Fig 4. Renyi dimension for the amyloid beta precursor protein (APP)between 0.0090 to 3.5043, Renyi dimension for apolipoprotein E between0.0023 to 1.7731, Renyi dimension for presenilin-1 gene between0.0028 to2.6578 , Renyi dimension for presenilin-2 gene between0.0217 to 3.0965.
A. The characteristic of DNA walk We find apparent patchiness in real DNA sequences—both in the noncoding and coding regions. Figure1 shows a representative DNA walk for four different of the pathogenesis of Alzheimer Disease DNA sequences. amyloid beta precursor protein
3
0.2
presenilin-2 thymine(T) window length 100
150
20
0.1
n
u i 1
walker after n steps, y(n) is defined as will display on a graph of y(n) vs n .
15x10
0.30
250
presenilin-1 cytosine(C) window length 100 Number
Number
Number
80
Number
presenilin-1 thymine(T) window length 100
50
600
0.20 probability
100
A useful way of analyzing patchiness arising from the heterogeneous purine-pyrimidine content is DNA walk, defined as above 2.2 mapping rulers. The displacement of
apolipoprotein E (APOE) Y(n)
200
10
4
2
3
0
2
1
1
0
0
2
4
6
n
8
10x10
0
3
3000
50
100
150 n
200 250x10
0.5
0
3
0
-1 5
5000
2500
y
y
-200
4
4000
2000
presenilin-2
50
10
Y(n)
2000
500
1000
0
0 10
20
n
30
40x10
30 20
2
x
10 1
q
0
3
4
presenilin1
2.5
0
50 40
3
q
0
3000
1000
3
4
30 20
2
x
1
1500
-0.5 5
40
3
presenilin-1
APOESERIES
1.5
amyloid beta (A4) precursor protein
5
0
20
40 n 60
80
100x10
1.5
2
y
1
Fig1. shows a representative DNA walk for four different of the pathogenesis of Alzheimer Disease DNA sequences.
1
0.5 0
0
-0.5 5 4
50 40
3 x
-1 5 4
50
30 20
2
10 1
B. The characteristic of brown noise and white noise We use simulation method and produce two series,namely brown noise and white noise , The displacement of walker after n steps y(n) is used, Fig2.shown brown noise and white noise walker.
presenilin2
3
2
3 y
Y(n)
APP guanine(G) window length 100
400
100
0
Number
500
200
5
q j
APP adenine(A) window length 100
400
Number
10
apolipoprotein E guanine(G) window length 100
20
Number
15
number
Number
20
III. RESULTS
Y(n)
5200
C. Calculation of probabilities We can also use(-2,-1,1,2) to replace {A,G,C,T},The approach implemented is to calculate the probabilities for each individual base of the pathogenesis of Alzheimer Disease separately,Fig 3 show the pathogenesis of Alzheimer Disease difference base at window 100 condition probability distribution.The results defer guass distribution.
80
400
4800
Fig2.shown brown noise and white noise
Number
j 1
0.4
20
Number
1 Dq lim r 0 q 1
40
0.1
log p
0.6
60
D. Multifractal Dimension In practice,it is more useful to consider a signal as a multifractal, namely considering of more than one fractal.Using Renyi dimension can provide comprehension of complexity within the signal.The Renyi dimension is defined as(1)[19].Where r is the volume element size,Nr is the number of volume elements for a given volume element size,Pj is the probability of occurrence within a given volume element. The q value can be considered as a fractal dimension index. Nr
0.8
Brown noise
0
q
40
3 x
30 20
2 1
10
q
0
Fig 4.shown Renyi dimension for difference the pathogenesis of Alzheimer Disease E. After Sliding window compute multifractal dimension Multifractal characteristic of {A,G,C,T}computed over
14
www.ijeart.com
International Journal of Engineering and Advanced Research Technology (IJEART) ISSN: 2454-9290, Volume-1, Issue-6, December 2015 100 moving windows with a window size of 128 and offset of 1 was shown in Fig.5 for the amyloid beta precursor protein (APP), Multifractal value of A between 0.0357 to 1.0883, Multifractal value of G between 0.0345 to1.1860, Multifractal value of C between 0.0245 to 0.9557, Multifractal value of T between 0.0220 to 1.1364.The similary apolipoprotein E 、 presenilin-1 gene and presenilin-2 gene of Multifractal characteristic of {A,G,C,T} can be computed. Multifractal characteristic of A
promote transcriptional activation, while others form the protein basis of the amyloid plaques found in the brains of patients with Alzheimer disease. Mutations in this gene have been implicated in autosomal dominant Alzheimer disease and cerebroarterial amyloidosis. Multiple transcript variants encoding several different isoforms have been found for this gene[22].Chylomicron remnants and very low density lipoprotein (VLDL) remnants are rapidly removed from the circulation by receptor-mediated endocytosis in the liver. Apolipoprotein E, a main apoprotein of the chylomicron, binds to a specific receptor on liver cells and peripheral cells.ApoE is essential for the normal catabolism of triglyceride-rich lipoprotein constituents[23].act. these properties are considered to be related to the construction of the higher order structure of the DNA molecule. This paper is preliminary with respect to the multifractal characteristic Exist in potential relevance to the pathogenesis of Alzheimer’s Disease. Its aim is to provide the means for the analysis of the multifractal characteristic properties of DNA sequences.although the research resuls ahown the multifractal characteristic Exist the pathogenesis of Alzheimer’s Disease, This may led to series of questions, what is the physical phenomena and possible mechanism of the pathogenesis of Alzheimer’s Disease ? How to categorize different types of the pathogenesis of Alzheimer’s Disease, how to understand the function of the pathogenesis of Alzheimer’s Disease, how to build the pathogenesis of Alzheimer’s Disease kinetics model systematically? Although the data tell us that the multifractal characteristic does exist in the pathogenesis of Alzheimer’s Disease, we will still think about the physical properties of the pathogenesis of Alzheimer’s Disease.what kind of distribution form do these the pathogenesis of Alzheimer’s Disease have? All these studies should be the future direction of research.
Multifractal characteristic of G
0.2
0.4
0
0.2 -0.2
0 -0.2
-0.6
-Dq
-Dq
-0.4
-0.8
-0.4 -0.6
-1
-0.8
-1.2 60 100
40
80
-1 60
60
20
100
40
40
80
20 0
q
0
60
20
re
40 20 0
q
0
re
Multifractal characteristic of C Multifractal characteristic of T
0.4 0.2
0
0 -0.2
-0.4
-0.4
-Dq
-Dq
0.2
-0.2
-0.6
-0.6 -0.8
-0.8
-1
-1 60 100
40
80 60
20
40
-1.2 60 100
40
80
20 q
0
0
60
20
40
re
20 q
0
0
re
Fig.5 shown Multifractal characteristic of {A,G,C,T}computed over 100 moving windows with a window size of 128 and offset of 1.
IV. DISCUSSION Multifractal analysis was applied to analysize amyloid beta precursor protein 、 apolipoprotein E 、 presenilin-1 and presenilin-2 the sequences of the pathogenesis of Alzheimer’s Disease. the pathogenesis of Alzheimer’s Disease were exhibited multifractal characteristics,at the same times,it was calculated the probability measure for four difference the pathogenesis sequences of each base indidually.From the results we find that Renyi dimension range between 0.0090 to 3.5043.After Sliding window compute multifractal dimension,Multifractal characteristic of {A,G,C,T}computed over 100 moving windows with a window size of 128 and offset of 1 , for the amyloid beta precursor protein (APP), Multifractal value of {A,G,C,T} can be computed. The results were between 0.0357 to 1.0883, 0.0345 to1.1860, 0.0245 to 0.9557, 0.0220 to 1.1364 indvididly.The similary apolipoprotein E、presenilin-1 gene and presenilin-2 gene Multifractal value of {A,G,C,T}can be computed. In this paper the multifractal characteristic properties in the whole genomes of amyloid beta (A4) precursor protein (APP) 、apolipoprotein E、presenilin-1 gene、presenilin-2 gene are verified. The results of this paper suggest that the the multifractal characteristic property is one of the natural properties that DNA sequences possess, and is directly related to the structure and function of the whole DNA molecule.There are still many discussions concerning the biological meaning and the origin of the the multifractal characteristic properties in the DNA sequences. Such as, APP gene encodes a cell surface receptor and transmembrane precursor protein that is cleaved by secretases to form a number of peptides. Some of these peptides are secreted and can bind to the acetyltransferase complex APBB1/TIP60 to
ACKNOWLEDGMENT This project was supported by China National Nature Science Foundation No:30470413, No:31160200.Hubei province Nature Science Foundation No:2004ABA220 and China Postdoctoral Science Foundation. We are also indebted to sir lan zi yang for his valuable support and helpful discussions. REFERENCES [1]
Joseph H. Lee& Sa ndra Bar ral& Rong Cheng.etc.Age-at-ons et linkage analysis in Caribbea n Hispanics with familial late-onset Alzheimer’s disease. Neurogenetics (2008) 9:51 –60 [2] Nussbaum RL, Ellis CE (2003) Alzheimer ’s disease and Parkinson’sdisease. N Engl J Med 348:1356 –1364 [3] St George-Hyslop PH, Petit A (2005) Molecular biology andgenetics of Alzheimer ’s diseas e. C R Biol 328:119 –130 [4] Tanzi RE, Bertram L (2005) Twenty years of the Alzheimer ’s disease amyloid hypothesis: a genetic perspective. Cell 120:545 –555 [5] Pastor P, Goate AM (2004) Molecular genetics of Alzheimer’s disease. CurrPsychiatry Rep 6:125– 133. [6] Li W, Kaneko K.Long-range correlation and partial 1/ f spectrum in a non- coding DNA sequence.Europhys Lett,1992 17 :655~660. [7] Peng C K, Buldyrev S, Goldberg A L, et al. Long- range correlation in nucleotide sequence.Nature,1992 (356) :168~170. [8] Stanley H E, Buldyrev S V, Goldberg A L, et al. Analysis of DNA sequences Using methods of statistical physics. J.PhysicaA,1998.( 249) :430~438. [9] Herzel H, Grobe I. Correlations in DNA sequences – the role of protein coding segments J.PhysRevE,1997(55) :800~810. [10] Lou L F , Tsai Li, Zhou Y M. Informational parameters of nucleicacid and molecular evolution,J TheorBiol,1988(130) :351~361.
15
www.ijeart.com
Multifratal Analysis of the Pathogenesis of Alzheimer Disease [11] Luo L F, Tsai Li . Fractal dimension of nucleicacid sequences and its relation to evolutionary level.ChinPhysLett,1988(5) :421~424. [12] Jun Xu, Yang Chao, Rensheng Chen. Fractal geometry study of DNA binding proteins.J theorBiol,1994(171) :239~249 [13] Zheng G J,YU Z G.. Correlation dimension and Kolmogorov entropy of DNA sequences. Natural Science Journal of Xiangtan University.2000 Vol.22No.1.115-119. [14] Bai-linHao,Hoong-Chien Lee,and Shu-yu Zhang, Fractals related to long DNA sequences and complete genomes,Chaos,Solitons and Fractals, 2000.825-836. [15] Bassingth waighte, J.B. and Raymond, G.M. Evaluting rescaled range analysis for time series, Ann. Biomed. Eng. 1994;22:432-444. [16] Churilla,A.M.,Gottschalk,W.A.,Liebovitch,L.S.,Selector,L.Y.,Todoro v,A..L&Yeandle,S. Membrane potential fluctuations of human T-lymphocytes have fractal characteristics of fractional Brownian motion. Ann. Biomed. Eng.1996;24:99-108. [17] Caballero R, Jewson S, Brix A .Long memory in surface air temperature: detection, modeling, and application to weather derivative valuation. Clim Res.2002; 21:127-140. [18] R.Rifaat,W. Kinsner.and P.McAlpine.Multifratal analysis of DNA. [19] WilliamY.C.Chen and James D. Louck. ‖ Necklaces, MSS Seuences, DNA Sequences ‖.Adv.inAppl.Math.18(1) (1997)18-32. [20] Tong-Han Lan, Bang-Quan Xu, Hui-Jun yuan, Jia-Rui Lin Rescaled range analysis applied to the study delayed rectifier potassium channel kinetics, Biophysical Chemistry. 2003a;Vol106/1. 67-74. [21] Tong-Han Lan, Xiang-Ming Liu, Hui-Jun yuan, Jia-Rui Lin Gating kinetics of potassium channel in rat dorsal root ganglion neurons analyzed with fractal model. Biophysical chemistry. .2003b;Vol.106/3. 203-209. [22] Yazawa, T., Tanaka, K., Kato, A., & Katsuyama, T.. The scaling exponent calculated by the detrended fluctuation analysis, distinguishes the injured sick hearts against normal healthy hearts. Proceeding IAING (WCECS08) International Conference on Computational Biology (ICCB), 2008;V(2):7-12. 22-24 October, San Francisco, USA. [23] Yazawa, T., Tanaka, K., & Katsuyama, T. Alternans lowers the scaling exponent of heartbeat fluctuation dynamics: A detrended fluctuation analysis in animal models and humans‖. Proceeding CSIE2009, World Congress on Computer Science and Information Engineering. Computer Soc.2009;April, 221-225, Los Angeles, CA, USA.IEEE DOI 10.1109/CSIE.2009.784.
Tong-han Lan,Gongnongbin Road, Jiangan District Wuhan 430012,P R China Zi-yang Lan, Gongnongbin Road, Jiangan District Wuhan 430012,P R China Xiao Li, Gongnongbin Road, Jiangan District Wuhan 430012,P R China Hao Cai, Gongnongbin Road, Jiangan District Wuhan 430012,P R China Fei –fei Yu Gongnongbin Road, Jiangan District Wuhan 430012,P R China
16
www.ijeart.com