key: cord-345493-3bb1zuqp authors: Itoyama, Satoru; Keicho, Naoto; Hijikata, Minako; Quy, Tran; Phi, Nguyen Chi; Long, Hoang Thuy; Ha, Le Dang; Ban, Vo Van; Matsushita, Ikumi; Yanai, Hideki; Kirikae, Fumiko; Kirikae, Teruo; Kuratsuji, Tadatoshi; Sasazuki, Takehiko title: Identification of an alternative 5′‐untranslated exon and new polymorphisms of angiotensin‐converting enzyme 2 gene: Lack of association with SARS in the Vietnamese population date: 2005-06-03 journal: Am J Med Genet A DOI: 10.1002/ajmg.a.30779 sha: doc_id: 345493 cord_uid: 3bb1zuqp We analyzed genetic variations of angiotensin‐converting enzyme 2 (ACE2), considering that it might influence patients' susceptibility to severe acute respiratory syndrome‐associated coronavirus (SARS‐CoV) or development of SARS as a functional receptor. By cloning of the full‐length cDNA of the ACE2 gene in the lung, where replication occurs on SARS‐CoV, it was shown that there are different splicing sites. All exons including the new alternative exon, exon‐intron boundaries, and the corresponding 5′‐flanking region of the gene were investigated and 19 single nucleotide polymorphisms (SNPs) were found. Out of these, 13 SNPs including one non‐synonymous substitution and three 3′‐UTR polymorphisms were newly identified. A case control study involving 44 SARS cases, 16 anti‐SARS‐CoV antibody‐positive contacts, 87 antibody‐negative contacts, and 50 non‐contacts in Vietnam, failed to obtain any evidence that the ACE2 gene polymorphisms are involved in the disease process in the population. Nevertheless, identification of new 5′‐untranslated exon and new SNPs is considered helpful in investigating regulation of ACE2 gene expression in the future. © 2005 Wiley‐Liss, Inc. We analyzed genetic variations of angiotensinconverting enzyme 2 (ACE2), considering that it might influence patients' susceptibility to severe acute respiratory syndrome-associated coronavirus (SARS-CoV) or development of SARS as a functional receptor. By cloning of the full-length cDNA of the ACE2 gene in the lung, where replication occurs on SARS-CoV, it was shown that there are different splicing sites. All exons including the new alternative exon, exon-intron boundaries, and the corresponding 5 0 -flanking region of the gene were investigated and 19 single nucleotide polymorphisms (SNPs) were found. Out of these, 13 SNPs including one non-synonymous substitution and three 3 0 -UTR polymorphisms were newly identified. A case control study involving 44 SARS cases, 16 anti-SARS-CoV antibodypositive contacts, 87 antibody-negative contacts, and 50 non-contacts in Vietnam, failed to obtain any evidence that the ACE2 gene polymorphisms are involved in the disease process in the population. Nevertheless, identification of new 5 0untranslated exon and new SNPs is considered helpful in investigating regulation of ACE2 gene expression in the future. Severe acute respiratory syndrome (SARS) is an emerging infectious disease characterized by systemic inflammation followed by atypical pneumonia [Peiris et al., 2003b] . Shortly after the initial worldwide outbreak in 2003, SARS-associated coronavirus (SARS-CoV) was discovered as an etiological agent of SARS Ksiazek et al., 2003; Kuiken et al., 2003; Peiris et al., 2003a] , and then angiotensin-converting enzyme 2 (ACE2) was identified as a functional receptor of this newly arrived virus [Li et al., 2003 ]. More recently, CD209L was reported as being another alternative receptor for the virus, but it appears to be a less efficient entry site than ACE2 [Jeffers et al., 2004] . Virus receptors generally play a key role in the entry of the pathogen into the host cells and may influence development or progression of viral diseases. For example, it is well known that genetic polymorphism of chemokine receptor 5 (CCR5), a coreceptor for human immunodeficiency virus-1 (HIV-1), influences the natural history of HIV-1 infection. The mutant allele CCR5-D32 does not produce a functional protein and has been shown to protect host cells against HIV-1 infection, and progression into acquired immunodeficiency syndrome is delayed after seroconversion takes place [Dean et al., 1996; Liu et al., 1996; Samson et al., 1996] . By analogy with the above, we considered that genetic polymorphisms of ACE2 could influence SARS-CoV infection or clinical manifestations of SARS. ACE2 is a homologue of ACE1 and exhibits 40% identity of amino acid sequence to its N-and C-terminal domains [Tipnis et al., 2000] . Similar to ACE1, ACE2 is a metalloprotease that constitutes a renin-angiotensin system. Human full-length ACE2 cDNAs have been cloned already from lymphoma (GenBank accession No. AF241254) [Tipnis et al., 2000] , cardiac left ventricle (AF291820) [Donoghue et al., 2000] and testis (AY623811) [Douglas et al., 2004] . Based on published data, it has been said that the ACE2 gene (ACE2) contains 18 exons, and spans approximately 40 kb of genomic DNA on the human X-chromosome. Although ACE2 mRNA expressions were demonstrated in the lung by the method of quantitative reverse transcription-PCR (RT/PCR) [Harmer et al., 2002] and its protein expression was obviously shown by immunohistochemistry [Hamming et al., 2004] , full-length ACE2 cDNA has not been cloned from the lung so far. This is considered to be very likely as being an important replication site of SARS-CoV [Haagmans et al., 2004] . In the present study, we attempted a full-length cloning of ACE2 cDNA from the human lung and found a new alternative, the 5 0 -untranslated exon. During this process, an extended region of the original exon 1 was identified in the testis' RNAs. Then, we explored genetic polymorphisms within 19 exons including new regions and the 5 0 -flanking region of ACE2 and tried to determine whether the polymorphisms of ACE2 are associated with SARS in Vietnamese. Cloning was performed by combination of RT/PCR and 5 0and 3 0 -rapid amplification of cDNA ends (RACE) procedures, using human lung total RNA (Stratagene, La Jolla, CA) and human testis total RNA (Stratagene) as a control. The total RNAs were reverse-transcribed using SuperScript III Reverse Transcriptase (Invitrogen, Carlsbad, CA) with oligo(dT) 12-18 , and then cDNA was amplified using Platinum Taq DNA Polymerase High Fidelity (Invitrogen) with primers ACE2exon 1s (5 0 -CAA AGG CTG ATA AGA GAG AA-3 0 ) and ACE2exon 18 as (5 0 -GAA CAG AAG TCA AAT CCA GA-3 0 ) to amplify the transcript of 2721 bp encompassing the original 18 exons of ACE2 gene on database. The First Choice RLM-RACE Kit (Ambion, Austin, TX) was used for 5 0 -and 3 0 -RACE procedures following the manufacturer's recommendation. Gene-specific primer sets for 5 0 -RACE were ACE2-5 0 Outer1 and ACE2-5 0 Inner1 (5 0 -GTG GAT ACA TTT GGG CAA GT-3 0 and 5 0 -CCT AGA CTA AAA CCT CCT CA-3 0 ), and ACE2-5 0 Outer2 and ACE2-5 0 Inner2 (5 0 -GAA GTA AGA AAG CCT CCA CA-3 0 and 5 0 -CTC CTG ATC CTC TGT AGC CA-3 0 ). Gene specific primer set for 3 0 -RACE was ACE2-3 0 Outer and ACE2-3 0 Inner (5 0 -CAA TGA TGC TTT CCG TCT GA-3 0 and 5 0 -ACA CTT GGA CCT CCT AAC CA-3 0 ). Nucleotide sequences of PCR products were directly determined by the automated DNA sequencer (PRISM 3100 Genetic Analyzer, Applied Biosystems, Foster City, CA). To investigate expression of the exons on the 5 0 side, RT/ PCR procedures were performed on the total RNAs of human lung, testis, trachea (Stratagene), primary-cultured bronchial epithelial cells [Lechner and LaVeck, 1985] , small intestine (Ambion), and on the human major organ cDNAs (Bio Chain Institute) with the sense primer New-exon (5 0 -TTC TTA CTT CCA CGT GAC CT-3 0 ) or Extended-exon 1 (5 0 -GCT CAG CAG ATT GTT TAC TG-3 0 ) and the antisense primer ACE2-5 0 Outer1. An association study between SARS patients and controls was reviewed and approved by local ethics committees. Of 62 cases fulfilling the World Health Organization case definition of probable SARS in Vietnam [WHO, 2003] , 5 fatal cases and 3 non-Vietnamese cases were excluded from this study. In the remaining 54 cases, 44 individuals agreed to participate in this study as cases. One hundred and three Vietnamese staff members, who did not develop SARS but may have come in contact with SARS patients in the hospital where nosocomial infection of SARS had arisen, were enrolled as contacts. Furthermore, 50 medical staff members who had been working in a separate building and those considered having no history of contact with SARS patients joined in this study as non-contacts, according to information obtained by questionnaire. Peripheral blood samples of all the subjects were collected and genomic DNA was extracted from the blood cells by a method described elsewhere [Wang et al., 1994] . To detect the antibody to the SARS-CoV in serum, all the blood samples were tested with SARS ELISA (Genelabs Diagnostics Pte. Ltd., Singapore Science Park, Singapore) in accordance with the manufacturer's recommendation . Within ACE2 Gene Of the 44 SARS cases and 103 contacts recruited, a half of the samples were randomly selected for searching polymorphisms within the ACE2 gene. PCR primers were designed to amplify 19 exons including the new alternative exon, exon-intron boundaries and approximately 1,000 bp of the 5 0 -flanking region of the new exon, reaching 2,000 bp upstream of the 5 0end of the original exon 1 (Table I) . Genomic DNA of each sample was subjected to PCR amplification followed by direct sequencing. Non-synonymous nucleotide substitutions and other variations with a minor allele frequency higher than 0.05 were subjected to genotyping in all SARS cases, contacts and noncontacts. Consequently, one novel non-synonymous substitution, two possible non-synonymous polymorphisms in the database (dbSNP identification nos. rs4646116 and rs11798104), and variations of 3 0 -UTR in exon 18 (position 39844) and of intron 3 (rs2285666, position 8789) were genotyped by the combination of direct sequencing method and single-strand conformation polymorphism (SSCP) analysis or PCR-based restriction fragment length polymorphism (RFLP) analysis. Disease associations were assessed by the chi-square test. The P values less than 0.05 were considered significant in all the tests and data analysis was carried out using JMP version 5 (SAS Institute, Inc., Cary, NC). By the use of the RT/PCR encompassing all known exons of ACE2 and 3 0 -RACE method, we could amplify ACE2 cDNA as PCR fragments completely corresponding to the published sequence of ACE2 cDNA (AF241254). The 5 0 -RACE procedure on the total RNA of the lung demonstrated the presence of a new alternative exon (registered as AB193259), which consisted of a segment between position À1141 and À942 and was connected to the 5 0 -end of the original exon 1. The 5 0 -end of transcripts was extended to position À1141 repeatedly by both sets of gene-specific primers. In addition, novel 65 nucleotides on the 5 0 -side (registered as AB193260), extending the 5 0 -end of the original exon 1 upstream, were amplified from the total RNA of testis. A schematic diagram of the exon-intron structure is shown in Figure 1 . RT-PCR revealed that the expression of the new alternative exon could be seen not only in the lung but also in the testis, trachea, bronchial epithelial cells, small intestine, and various major organs (data not shown). The new extended region was expressed not only in the testis but also in other organs including bronchial epithelial cells and the small intestine (data not shown). Basic characteristics and sub-grouping of subjects are shown in Table II . The 44 SARS cases, 103 contacts, and 50 noncontacts were analyzed in the present study. Based on anti-SARS-CoV antibody titer in serum, the contacts were further divided into two subgroups, antibody-positive contacts, and antibody-negative contacts (data not shown). Within ACE2 Gene All exons including the new exon, exon-intron boundaries and the corresponding 5 0 -flanking region of ACE2 were tested to identify variations of ACE2 among SARS cases and contacts. As shown in Table III , 19 single nucleotide polymorphisms (SNPs) were identified. Six of them have already registered on dbSNP database, and 13 SNPs including one non-synonymous substitution, from asparagine to serine at 638 (N638S) in the exon 15 (position 33205) were identified. All SNPs but one in intron 3 (rs2285666, position 8789) and another in exon 18 (position 39844) were found to be considerably rare among both SARS cases and contacts tested. In subsequent analysis, we therefore chose polymorphisms, and analyzed possible nonsynonymous substitution, excluding rare non-coding variants among SARS patients and contacts. Two SNPs in intron 3 and exon 18 with minor allele frequencies higher than 0.05 and a newly identified nonsynonymous SNP, N638S in exon 15 were analyzed in all samples (Table IV) . Relative positions of these SNPs are shown in Figure 1 . Genotyping results by direct sequencing method were confirmed by RFLP or SSCP methods. Because ACE2 is located to the X chromosome in humans, samples from both males and females were analyzed, respectively. Two possible non-synonymous SNPs that are shown in the dbSNP database (rs4646116 and rs11798104) were not found in our samples this time. When the antibody-negative contacts group was compared with antibody-positive group including SARS cases in either males or females, no difference was observed between the two groups both in regards to genotype and allele frequencies. Comparison between antibody-positive contacts and SARS cases, and comparison between contacts and noncontacts did not show any significant differences in genotype and allele frequencies of the tested polymorphisms. During the worldwide outbreak of SARS in 2003, a subset (about 20%-30%) of SARS patients required mechanical ventilation, having developed pneumonia. The fatality rate was 11%, although the majority of patients recovered without unfavorable outcome [Peiris et al., 2003b] . As a natural consequence, asymptomatic individuals produce antibodies against SARS-CoV in their sera [Ip et al., 2004; Woo et al., 2004] . In one of the studies, it was shown that 2.3% of contacts who did not develop clinical SARS had serum antibody titer over the threshold [Ip et al., 2004] , and this implies the presence of asymptomatic individuals. We hypothesized that the functional polymorphism of ACE2, which is considered as being a virus receptor of SARS-CoV, might influence the clinical history of SARS-CoV infection at least in part. This is because, a variation of the co-receptor to HIV, CCR5-D32 where allele frequency is approximately 10% in the European population [Martinson et al., 1997] , has been well known to resist HIV infection and alter its clinical course [Dean et al., 1996; Liu et al., 1996; Samson et al., 1996] . Minor allele frequencies of the SNPs shown in bold and italic were higher than 0.05. Using the PCR-based cloning procedure, we identified for the first time an alternative exon upstream of the original exon 1 of ACE2 that is expressed in various organs, including the lung and trachea, primary-cultured bronchial epithelial cells, and the small intestine. These are considered to be important replication sites of SARS-CoV [Haagmans et al., 2004] . Both 5 0and 3 0 -ends of the intron between the new alternative exon and the original exon 1 followed the GT/AG rule of Breathnach and Chambon [1981] . Although the organ specificity of the transcripts was not confirmed in this study due to the limitation of non-quantitative PCR amplification, implication of the new exon was definitely shown in the lung and small intestine. Also, we found the extended region of the original exon 1, 65 bp on the 5 0 side. Neither the new alternative exon nor the new extended region of exon 1 gave rise to a new coding region and they were considered as 5 0 -untranslated region. It was recently reported that genetic variations of ACE2 did not affect SARS susceptibility or outcome in Hong Kong [Chiu et al., 2004] . In that study, five intronic SNPs (rs2106809, rs2285666, rs4646142, rs714205, and rs2074192) were chosen and analyzed in a case-control manner, based on the previously known exon-intron structure and SNPs already registered in the database. By contrast, we attempted to analyze not only previously known SNPs but also variations newly identified among actual SARS patients and contacts. Based on the information from the exon-intron structure of ACE2 cloned by ourselves, we searched for nucleotide sequences in all the exons including the new alternative exon and the corresponding 5 0 -flanking region, which are thought to contain promoters of the new exon and the original exon 1. We found one novel non-synonymous substitution N638S and 18 non-coding SNPs including two relatively common SNPs with minor allele frequency higher than 5%. We selected these SNPs and analyzed them furthermore in a case-control manner, because, while they are rare occurrence, non-synonymous substitution may directly modulate the function of the protein, and because relatively common SNPs can often be used as markers to ascertain a causative variation. Of 19 SNPs found in this study, 13 were new polymorphisms, 3 of which were located in 3 0 -UTR. Two possible non-synonymous SNPs in dbSNP database were not found in the population tested. Judging from the results so far obtained in this case-control study, there was no statistical evidence that ACE2 polymorphisms affect SARS infection or alter its clinical course. However, type II error was not negligible because of a relatively small size of samples tested. Taking also into consideration, the results from a previous study of ACE2 polymorphisms by others [Chiu et al., 2004] , it is unlikely that the genetic defect of ACE2 is involved in the disease resistance that has been shown in CCR5-D32 in HIV-1 infection cases. Nevertheless, this newly identified alternative 5 0 -untranslated exon expressed in the lung, and also newly recognized polymorphisms in this study might be of great help concerning investigations into the regulation of ACE2 gene expression and the possible significance of the variations in further more in-depth studies. The authors thank Dr. Nguyen Le Hang, Pham Thi Phuong Thuy, and Nguyen Thi Thu Ha for their help in the management and coordination of this study in Vietnam and Dr. Shuzo Organization and expression of eucaryotic split genes coding for proteins ACE2 gene polymorphisms do not affect outcome of severe acute respiratory syndrome Genetic restriction of HIV-1 infection and progression to AIDS by a deletion allele of the CKR5 structural gene. Hemophilia Growth and Development Study A novel angiotensin-converting enzyme-related carboxypeptidase (ACE2) converts angiotensin I to angiotensin 1-9 The novel angiotensin-converting enzyme (ACE) homolog, ACE2, is selectively expressed by adult Leydig cells of the testis Identification of a novel coronavirus in patients with severe acute respiratory syndrome Recombinant protein-based enzyme-linked immunosorbent assay and immunochromatographic tests for detection of immunoglobulin G antibodies to severe acute respiratory syndrome (SARS) coronavirus in SARS patients Pegylated interferon-alpha protects type 1 pneumocytes against SARS coronavirus infection in macaques Tissue distribution of ACE2 protein, the functional receptor for SARS coronavirus. A first step in understanding SARS pathogenesis Quantitative mRNA expression profiling of ACE 2, a novel homologue of angiotensin converting enzyme Seroprevalence of antibody to severe acute respiratory syndrome (SARS)-associated coronavirus among health care workers in SARS and non-SARS medical wards CD209L (L-SIGN) is a receptor for severe acute respiratory syndrome coronavirus A novel coronavirus associated with severe acute respiratory syndrome Newly discovered coronavirus as the primary cause of severe acute respiratory syndrome A serum-free method for culturing normal human bronchial epithelial cells at clonal density Angiotensin-converting enzyme 2 is a functional receptor for the SARS coronavirus Homozygous defect in HIV-1 coreceptor accounts for resistance of some multiply-exposed individuals to HIV-1 infection Global distribution of the CCR5 gene 32-basepair deletion Coronavirus as a possible cause of severe acute respiratory syndrome The severe acute respiratory syndrome Resistance to HIV-1 infection in caucasian individuals bearing mutant alleles of the CCR-5 chemokine receptor gene A human homolog of angiotensin-converting enzyme. Cloning and functional expression as a captopril-insensitive carboxypeptidase Purification of genomic DNA from human whole blood by isopropanol-fractionation with concentrated Nal and SDS Global surveillance for severe acute respiratory syndrome (SARS) Relative rates of non-pneumonic SARS coronavirus infection and SARS coronavirus pneumonia