外文翻译--Host Specific Codon Usage Pattern of H1N1 Influenza A Viruses

上传人:pu****.1 文档编号:569416347 上传时间:2024-07-29 格式:PDF 页数:4 大小:423.20KB
返回 下载 相关 举报
外文翻译--Host Specific Codon Usage Pattern of H1N1 Influenza A Viruses_第1页
第1页 / 共4页
外文翻译--Host Specific Codon Usage Pattern of H1N1 Influenza A Viruses_第2页
第2页 / 共4页
外文翻译--Host Specific Codon Usage Pattern of H1N1 Influenza A Viruses_第3页
第3页 / 共4页
外文翻译--Host Specific Codon Usage Pattern of H1N1 Influenza A Viruses_第4页
第4页 / 共4页
亲,该文档总共4页,全部预览完了,如果喜欢就下载吧!
资源描述

《外文翻译--Host Specific Codon Usage Pattern of H1N1 Influenza A Viruses》由会员分享,可在线阅读,更多相关《外文翻译--Host Specific Codon Usage Pattern of H1N1 Influenza A Viruses(4页珍藏版)》请在金锄头文库上搜索。

1、Host Specific Codon Usage Pattern of H1N1 Influenza A Viruses Ming-Wei Su,1 P. C. Chen,2 Woei C. CHU1* 1Institute of Biomedical Engineering National Yang Ming University, Beitou, Taipei 112, Taiwan 2Department of Medical Imaging and Radiological Sciences, I-Shou University, Taiwan *wchuym.edu.tw Han

2、ns S. YUAN Institute of Molecular Biology Academia Sinica Nang Kang, Taipei 115, Taiwan AbstractCodon usage preference and the highly expressed genes have strong correlations occur in many organisms. Codon usage preference of viruses may evolve much similar with its infected host to increase the fit

3、ness. In this study we investigated differences in codon usage preferences among influenza A H1N1 viruses which infected avian, swine and human, and may cause major pandemic around world. The relative synonymous codon usage (RSCU) indices of HA gene in H1N1 viruses were calculated and we further inc

4、orporate the principal component analysis (PCA) to characterizing different host infected viruses. Host-specific codon usage pattern of H1N1 viruses were reported in this study. In 2009 influenza A H1N1 virus was a major epidemic challenge of disease control department. We propose to use the codon-b

5、ased method to gain a better understanding of the features of virus genome and evolutionary processes. Keywords-influenza A H1N1 virus; host; codon usage; principal component analysis I. INTRODUCTION Proteins are composed of 20 amino acids, and each amino acid is encoded by triplets of mRNA called c

6、odons. Because mRNA composed 4 nucleotides, so there are 64 possible triplet combinations to encode only 20 amino acids, most amino acids are encoded by more than one codon. Those codons that code the same amino acids are referred to as synonymous codons show in figure 1. Figure 1. Synonymous codons

7、 of serine between different species. Among different species we can easily find that the codons are not used with equal frequencies to code serine. For example, in dengue virus type 1, the codon UCA is much more frequently used than the other five codons. In contrast the codon UCG appears to be rar

8、ely used. The choice of a preference codon is different from one organism to another. It is observed that some codons are used more frequently than others in different organisms. Codon usage preference and the highly expressed genes have stronger selective correlation were first observed in bacteria

9、 1, 2. We hypothesize that virus codon usage preference may evolve differently due to the infected host species. Synonymous codon usage was nonrandom, and different genome had different “preferred” codons for a given amino acid. When sequence data are accumulated, how to explain this nonrandom use o

10、f synonymous codon became an important issue to biologists. Influenza A virus is a negative-stained RNA virus majorly infected in human, swine and avian and caused the 1918 Spanish flu pandemic that killed 50 to 100 million people around the word and cause the major pandemic in 2009. Swine has sugge

11、sted being an intermediary host species between human and avian influenza viruses 3, 4. In this study, we make choice of different host infected H1N1 viruses to analysis their codon usage preference. The surface glycoprotein hemagglutinin (HA) is the host binding protein to initialize virus infectio

12、n. HA proteins preferentially bind to specific receptors govern the virus particle into dissimilar host species 5. Due to the importance of HA protein, we chosen HA gene sequences to facilities our study of virus and host codon usage relationships. In Escherichia coli, Saccharomyces cerevisiae, Caen

13、orhabditis elegans, Drosophila and Arabidopsis thaliana, highly expressed genes have a strong selective preference for codons with a high concentration of the corresponding tRNA molecule, whereas genes expressed at lower levels display a more uniform pattern of codon usage 2, 6. These earlier result

14、s indicate that the codon usage preference of viruses also may influenced by the host environment. Thus these studies raise the possibility that the codon usage preference of a virus may be used as an indicator of its host species. An analysis of H3N2, H9N2 and H5N1 showed that there are no obvious

15、differences in codon usage patterns among virus isolated from human, avian and swine host 7. Synonymous 978-1-4244-4713-8/10/$25.00 2010 IEEEcodon usage between different subtypes of influenza A viruses is different and mutation pressure is most important in synonymous codon usage 8. We first reveal

16、ed the host-specific codon usage pattern in influenza A virus H1N1. In the Principal Component Analysis (PCA), three different host infected H1N1 viruses are clustering according to its infected host species to demonstrate the host-specific codon usage pattern in H1N1 viruses. II. MATERIAL AND METHO

17、D A. Sequence Analysis The HA gene sequences of 971 influenza A H1N1 viruses were genetically and ecologically diverse, epidemic in human, avian and swine collected from NCBI influenza virus resource 9. Whole data sets have 68 avian host H1N1 sequences, 761 human host H1N1 sequences and 142 swine ho

18、st H1N1 sequences. Sequences aligned by clustalw 10 software, and further genome composition analysis using DnaSP 11 software program. B. Relative Synonymous Codon Usage (RSCU) In order to analyze codon usage patterns of a virus genome, the first step is to calculate its Relative Synonymous Codon Us

19、age (RSCU) values 6. RSCU value for a specific codon (i) is given by RSCUi = Xi / (XI / n) Where Xi is the counting numbers of ith codon for a given amino acid; XI is the sum of the occurrence numbers for all the synonymous codons in a certain amino acid; and n is the number of synonymous codons for

20、 a specific amino acid 6. Methionine and tryptophan are only associated with one single codon, together with the three stop codons, were excluded from the codon usage preference analysis. A final 971 rows of virus strain and 59 columns of RSCU index to constructed the influenza A H1N1 codon profile

21、matrix show in figure 2. Figure 2. Codon profile matrix. This matrix was further feed to the principal component analysis to confer the codon usage analysis. C. Principal Component Analysis (PCA) Principal component analysis (PCA) 12 is a multivariate analysis method, where the columns of the data m

22、atrix are generally a set of variables and the rows are a relatively homogeneous sample objects. PCA can be used for the differentiation of genes according to their codon usage. It is a method of identifying patterns in data, and presenting the data set in such a way to highlight their similarities

23、and differences. It gives a more direct observation the space distribution of the complete data. PCA implementation can be divided into six steps: STEP 1: Get data set from measurements. STEP 2: Subtract the mean from each of the data dimensions, thus producing a zero-mean data set. STEP 3: Calculat

24、e the covariance matrix. STEP 4: Calculate the eigenvectors and eigenvalues of the covariance matrix. STEP 5: Determine how many components is used to forming a feature vector. STEP 6: Deriving the new data set. III. RESULTS A. Codon Usage Preferences of H1N1 Viruses The coding region of each viral

25、strain was extracted and the RSCU values were calculated by our developed Perl script. Average of RSCU values of H1N1 viruses HA gene among three different kinds of host infected viruses. Codon AGA has the highest RSCU value among all three different host viruses indicate that arginine is the most b

26、iased amino acid in codon usage preference. We can found that in two-fold degenerate amino acids the codon usage bias were unobvious than others. Besides amino acid phenylalanine, AT-ended codons were generally used more preferentially in all three different host viruses among all amino acids. Avian

27、 host H1N1 viruses used slight more GC-end codons than other host infected viruses. Human and swine H1N1 virus isolates reveal much similar codon usage preference than avian H1N1 viruses. B. Principal Component Analysis of H1N1 viruses To study codon usage preference of 971 influenza A H1N1 viruses

28、we utilize Principal Component Analysis (PCA) to visualizing all 59 codons in a three dimensional codon usage space. We applied the PCA on the H1N1 HA gene RSCU values for all three different host viruses to realize the codon usage relationships among influenza A H1N1 viruses. In order to avoid the

29、amino acid compositional effect, each virus is represented as a 59 dimensional RSCU vector to facility the principal component analysis. To explore the codon usage pattern among H1N1 viruses, a 971 59 RSCU matrix was processed by PCA to calculate the principal components (PCs) on the purpose of high

30、light the similarities and differences in codon usage patterns in all three different host H1N1 viruses. Figure 3 shows the accumulated variance explanted by first ten principal components. Figure 3. Accumulate variance of first ten principal components. First PC explained 46.83% of the variance amo

31、ng 59 RSCU indices. The first two PCs accounted for 60.57% of the variance and the first three PCs accounted for 69.3% of the variance in original dataset. Influenza A H1N1 codon usage space formed by first three principal component axes shows in figure 4. Figure 4. PCA of three different host infec

32、ted H1N1 virus HA genes. Avian host H1N1 virus showed in figure 4 plotted in triangular, human host H1N1 virus showed in square and swine H1N1 viruses represented by circle. Swine and human infected virus groups can be differentiated by first principal component axis. Second and third principal comp

33、onent can help us to further separate avian host H1N1 viruses more clearly. Figure showed that the codon usage preference categorized by the first three principal components possessed sufficient information to figure out the host specific codon usage pattern of influenza A virus H1N1. First three pr

34、incipal component axes primary affected by six-fold and four-fold degenerate amino acids due to the obviously bias of codon usage. In figure 4 avian host H1N1 viruses cluster, we can also found another small group of avian-like swine H1N1 viruses mainly composed by European isolates. IV. DISSCUSSION

35、 Influenza A H1N1 viruses are distributed worldwide and cause epidemic disease in various animal host including human, avian and swine. Virulence may influences by viral replication efficiency, transmissibility, host adaptation and tissue tropism. Viruses can overcome the host species barriers by ev

36、olutional constrain affected on the virus-host interaction to sustained transmission in a new host species 13. Transmission of influenza viruses from swine to human occurred occasionally in past years. Different with 2009 pandemic, most of these cross species infections human can be seen as the dead

37、-end host and well not further transmitted to second contact human to make epidemic. In 2009 H1N1 viruses of swine origin, first document case was on 11 March in Mexico City and as of 4 May over 1000 confirmed cases around 21 countries 14. An earlier study in bacteriophages revealed that they usuall

38、y presented the same codon usage preference of their bacterial hosts 15. Aims of our study is to investigate the possible similarities of codon usage between viruses and its host, and to conferred the prospect of shaping codon usage preference inside the viral genomes which might influenced by the h

39、ost. HA gene is hypothesized to be the major factor for species specificity of Influenza A viruses because it is the receptor binding protein 4. Principal component analysis identified three different host infected H1N1 virus cluster. The first three principal components are dominated by six-fold an

40、d four-fold amino acids due to the most biased in codon usage preference. Host specific codon usage preference was first determined among influenza A H1N1 viruses in our study, and not occurs in H3N2, H9N2 and H5N1 subtypes 7. In swine host viruses cluster, there are also appear five avian and eight

41、 human host H1N1 viruses, point out that the swine may be more sensitive in different host influenza viruses than human and avian. This result further supported that different host influenza viruses might occur gene reassortment in swine to generate novel synthesize strain causing pandemic. Another

42、small group of swine viruses occur in the avian cluster, this group of swine viruses represented the evolutional history of avian H1N1 virus introduced into swine populations in Europe around 1979 so called avian-like H1N1 viruses 16. Virus evolution influenced by host species is a very complex and

43、interest issue, in this study we use virus and host codon usage as an approach to reveal the virus host adapt evolution. Influenza A virus is negative strain RNA virus, rely on the higher mutation rate and purifying selection adapt to a certain host species. Comprehensive codon usage analysis we pro

44、posed in this study are very helpful to understand the process governing the virus evolution, especially the role of random mutation and selection. ACKNOWLEDGMENT This work was supported by research grants from Ministry of Education, aim for the Top University Plan; the Summit Projects of Academia S

45、inica, Taiwan, and the National Science Council, Taiwan (NSC 97-2320-B-010-003-MY3). REFERENCES 1 M. Gouy and C. Gautier, Codon usage in bacteria: correlation with gene expressivity, Nucleic Acids Res, vol. 10, pp. 7055-74, Nov 25 1982. 2 R. Grantham, C. Gautier, M. Gouy, R. Mercier, and A. Pave, Co

46、don catalog usage and the genome hypothesis, Nucleic Acids Res, vol. 8, pp. r49-r62, Jan 11 1980. 3 I. H. Brown, The epidemiology and evolution of influenza viruses in pigs, Vet Microbiol, vol. 74, pp. 29-46, May 22 2000. 4 D. van Riel, V. J. Munster, E. de Wit, G. F. Rimmelzwaan, R. A. Fouchier, A.

47、 D. Osterhaus, and T. Kuiken, H5N1 Virus Attachment to Lower Respiratory Tract, Science, vol. 312, p. 399, Apr 21 2006. 5 S. Karlin and J. Mrazek, What drives codon choices in human genes?, J Mol Biol, vol. 262, pp. 459-72, Oct 4 1996. 6 P. M. Sharp, T. M. Tuohy, and K. R. Mosurski, Codon usage in y

48、east: cluster analysis clearly differentiates highly and lowly expressed genes, Nucleic Acids Res, vol. 14, pp. 5125-43, Jul 11 1986. 7 T. Zhou, W. Gu, J. Ma, X. Sun, and Z. Lu, Analysis of synonymous codon usage in H5N1 virus and other influenza A viruses, Biosystems, vol. 81, pp. 77-86, Jul 2005.

49、8 Y. Bao, P. Bolotov, D. Dernovoy, B. Kiryutin, L. Zaslavsky, T. Tatusova, J. Ostell, and D. Lipman, The influenza virus resource at the National Center for Biotechnology Information, J Virol, vol. 82, pp. 596-601, Jan 2008. 9 J. D. Thompson, D. G. Higgins, and T. J. Gibson, CLUSTAL W: improving the

50、 sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice, Nucleic Acids Res, vol. 22, pp. 4673-80, Nov 11 1994. 10 J. Rozas, J. C. Sanchez-DelBarrio, X. Messeguer, and R. Rozas, DnaSP, DNA polymorphism analyses by th

51、e coalescent and other methods, Bioinformatics, vol. 19, pp. 2496-7, Dec 12 2003. 11 G. Perriere and J. Thioulouse, Use and misuse of correspondence analysis in codon usage studies, Nucleic Acids Res, vol. 30, pp. 4548-55, Oct 15 2002. 12 I. T. Jolliffe, Principal Component Analysis, 2nd ed., 2002.

52、13 J. Cohen, Swine flu outbreak. Out of Mexico? Scientists ponder swine flus origins, Science, vol. 324, pp. 700-2, May 8 2009. 14 T. Kunisawa, S. Kanaya, and E. Kutter, Comparison of synonymous codon distribution patterns of bacteriophage and host genomes, DNA Res, vol. 5, pp. 319-26, Dec 31 1998.

53、15 T. Ito, Interspecies transmission and receptor recognition of influenza A viruses, Microbiol Immunol, vol. 44, pp. 423-30, 2000. 16 C. Scholtissek, H. Burger, P. A. Bachmann, and C. Hannoun, Genetic relatedness of hemagglutinins of the H1 subtype of influenza A viruses isolated from swine and birds, Virology, vol. 129, pp. 521-3, Sep 1983

展开阅读全文
相关资源
正为您匹配相似的精品文档
相关搜索

最新文档


当前位置:首页 > 大杂烩/其它

电脑版 |金锄头文库版权所有
经营许可证:蜀ICP备13022795号 | 川公网安备 51140202000112号