key: cord-0911740-nscy2c4z authors: Moore, Kristi M.; Bennett, Joyce D.; Seal, Bruce S.; Jackwood, Mark W. title: Sequence Comparison of Avian Infectious Bronchitis Virus S1 Glycoproteins of the Florida Serotype and Five Variant Isolates from Georgia and California date: 1998 journal: Virus Genes DOI: 10.1023/a:1008057118625 sha: ce20967db3da8871c330e084bf2c58a412a9feb1 doc_id: 911740 cord_uid: nscy2c4z The infectious bronchitis virus (IBV) spike glycoprotein S1 subunit is required to initiate infection and contains virus-neutralizing and serotype-specific epitope(s). Reported are the S1 gene nucleotide and predicted amino acid sequences for the Florida 18288 strain and isolates GA-92, CV-56b, CV-9437, CV-1686, and 1013. These sequences were compared with previously published gene sequences of IBV strains, and phylogenetic relationships are reported. The S1 amino acid sequence of Florida 18288 was 94.9% similar to the Connecticut strain, and GA-92 was 92.8% similar to the Arkansas 99 strain. S1 amino acid sequences of the California variants, CV-56b, CV-9437, and CV-1686, were 97.6–99.3% similar to one another and only 76.6%–76.8% similar to the Arkansas-type strains. Isolate 1013, also from California, was 84.0% similar to Ark DPI and 77.9% similar to CV-56b. When comparing 19 viruses isolated from the United States, sequence variations were observed between amino acids 55–96, 115–149, 255–309, and 378–395. Similar regions are reported to be involved in virus-neutralizing and/or serotype-specific epitopes. These data demonstrate that variant IBV strains continue to emerge, and unique variants may circulate among poultry in geographically isolated areas. Avian infectious bronchitis virus (IBV) causes an economically important upper respiratory tract disease in chickens that results in reduced weight gains and reduction of egg production. Infectious bronchitis is complicated by secondary bacterial infections such as Mycoplasma sp. and Escherichia coli that may increase mortality (1) . Vaccination for infectious bronchitis is hindered because monovalent IBV vaccines do not provide complete protection against heterologous serotypes (2, 3) . Infectious bronchitis virus, a member of the Coronaviridae family, contains four structural proteins. These are the small membrane protein, the integral membrane glycoprotein, the phosphorylated nucleocapsid protein, and the spike (S) glycoprotein (4, 5) . The S glycoprotein is cleaved posttranslationally by cellular proteases into the S1 and S2 subunits (6) . The globular S1 subunit forms the tip of the spike, extending outwardly, whereas S2 anchors S1 to the viral membrane (7) . The S1 subunit is involved with infectivity and contains virus-neutralizing epitopes, serotype-speci®c sequences, and hemagglutinin activity (7±11). Different serotypes and subtypes of IBV are thought to be generated by nucleotide insertions, deletions, or point mutations in the S1 subunit made by the viral polymerase (12) . Another mechanism for variability in the S1 subunit may be RNA recombination (13±16). Consequently, changes in the S1 subunit The nucleotide sequence data reported in this paper have been submitted to the GenBank nucleotide sequence database and have been assigned the accession numbers 1013 AF027508, CV-9437 AF027510, CV-1686 AF027511, CV56-b AF027509, FL18288 AF027512. can lead to emergence and proliferation of variant serotypes. Sequence variations in S1 subunits have been identi®ed among European IBV strains and within the Massachusetts (Mass) serotype. These are designated hypervariable region (HVR) I (residues 56±69) and HVR II (residues 117±131) (17, 18) . Isolates from chickens in the United States (US), other than the Mass serotype, have similar highly variable regions in the S1 subunit between residues 53±148 (15) . Other variable regions have been detected between residues 250±310 for the strains H120, D207, D1466, V1397, Mass 41, Mass 42, and residues 269±365 (including the signal sequence) in the strains 6/82, 123/82, and 167/82 (12, 19) . These variable regions (HVR I, HVR II, and residues 250±365) are similar to sequence regions designated I, II, and III associated with three virus-neutralizing epitopes determined by selection of monoclonal antibody (Mab) neutralization-resistant (NR) mutants (20) . HVR I was associated with a virus-neutralizing epitope and a hemagglutinin, whereas amino acids 250±386 are associated with a serotype-speci®c epitope (17, 21) . The objectives of this investigation were to sequence the S1 gene of the Florida 18288 strain and variant IBV isolates GA-92, CV-56b, CV-9437, CV-1686, and 1013. Sequence comparison and phylogenetic analysis were used to determine the relationship of these viruses with other USA isolates of IBV and to further identify variable regions that may be virus-neutralizing epitopes on the S1 glycoprotein subunit. The S1 nucleotide and predicted amino acids sequences were determined from these viruses isolated in the United States; Florida 18288 (FL-18288), Georgia isolate 1992 (GA-92), and isolates from California CV-56b, CV-9437, CV-1686, and 1013. FL-18288 strain, representing the Florida serotype, was isolated from layers in 1972, and was obtained from Dr. Pedro Villegas at the University of Georgia (22) . GA-92 (Arkansas serotype) was isolated in 1992 at the Poultry Diagnostic and Research Center (PDRC) from a broiler¯ock in Georgia exhibiting respiratory distress (23) . All isolates from California were from chickens showing respiratory distress. CV-56b (isolated in 1991 from broilers), CV-1686 (isolated in 1995 from broilers), and 1013 (isolated in 1995 from silkies) were obtained from the California Veterinary Diagnostic Laboratory System (Fresno, CA), whereas CV-9437 (isolated in 1995 from broilers) was submitted to PDRC. Isolation of IBV strains from California started in 1984 and are referred to as California variants. California variants in this study are represented by CV-56b. Data from serum neutralization studies suggest that these variant strains may compose a new serotype (23, 24) . All procedures for RNA isolation and reverse transcriptase-polymerase chain reaction (RT-PCR) were previously described (25) . Brie¯y, RNA was isolated from IBV infective allantoic¯uids after treatment with Proteinase K (250 mg/ml) and SDS (®nal 2% solution wt/wt), with incubation at 55 C for 5 min. Phenol-chloroform-isoamyl alcohol (25:24:1) and chloroform-isoamyl alcohol (24:1) extractions were completed, and RNA was isolated from infective allantoic¯uids using the RNAid Kit (Bio 101). Primers MIBVPCR and NIBVPCR were used to amplify a conserved region between the membrane and nucleocapsid genes for identi®cation of IBV (26, 27) . The S1oligo5 H (5 H -TGAAACTGAACAA-AAGAC-3 H ) and S1oligo3 H (5 H -CATAACTAACAT-AAGGGCAA-3 H ) primers were used to amplify S1 gene as previously described (25, 28) . The S1 gene ampli®cation product was used for subsequent restriction fragment length polymorphism (RFLP) or DNA sequencing. Ampli®cation products were excised from an agarose gel and puri®ed using Geneclean (Bio 101). The restriction endonucleases, Bst YI, Hae III, and Xcm I (New England Biolabs) were used to digest the S1 gene ampli®cation product as previously described (25) . RFLPS were determined following agarose gel electrophoresis (25) . The S1 gene polymerase chain reaction (PCR) products for FL-18288, GA-92, and CV-9437 were cloned into the TA cloning vector (Invitrogen Corp.), and ligated areas of the plasmids were sequenced with the M13 universal forward and reverse primers (Molecular Genetics Instrumentation Facility, University of Georgia). After initial sequence data was obtained from plasmid DNA, the S1 gene for each sample was ampli®ed by RT-PCR in six or more tubes then combined and used as template for subsequent sequencing. Sequencing primers to various regions within the S1 gene for FL-18288, GA-92, CV-56b, CV-9437, CV-1686, and 1013 were designed using OLIGO TM version 4.0 software (National Biosciences, Inc.) and are available upon request. Double-stranded DNA sequencing was conducted using automated sequencing with the Prism TM DyeDeoxy terminator cycle sequencing kit as recommended by the manufacturer (Perkin Elmer; 29,30). All samples were puri®ed for DNA sequencing using Microcon TM 30 columns (Amicon, Inc.). Automated sequencing was conducted at the USDA Southeastern Poultry Research Laboratory (Athens, GA). The nucleotide sequence of the S1 gene from the ATG start site to the cleavage site for the FL-18288, GA-92, CV-56b, CV-9437, CV-1686, and 1013 were compared with published sequences from Beaudette (accession # X02342), Mass 41 (accession # X04722), Connecticut (Conn) (accession # L18990), Arkansas Alignments and phylogenetic analysis were conducted using CLUSTAL V method in MegAlign software version 1.03 (DNAStar Inc., Madison, WI). This method uses a multiple alignment algorithm, and unweighted pair group method with arithmetic mean algorithm (UPGMA) to derive a preliminary phylogeny (31, 32) . The ®nal phylogeny is produced by applying the neighbor-joining method (33) . The CLUSTAL V method was compared with UPGMA with MacDNASIS ProV3.0 software (Hitachi Software Engineering Co., Ltd) and IntelliGenetics GeneWorks 2.45 TM (Intelligenetics, Inc. Mountain View, CA). Reverse-transcriptase PCR in conjunction with RFLP were conducted for initial characterization of IBV strain FL-18288 and isolates, GA-92, CV-56b, CV-9437, CV-1686, and 1013 (25) . All samples were identi®ed as IBV by the ampli®cation of a 1.02 kb PCR product with the MIBVPCR and NIBVPCR primers (data not shown). The S1 gene PCR product (1.7 kb) was subsequently ampli®ed with the S1oligo5 H and S1oligo3 H primers for all isolates (data not shown). Using the restriction enzymes Bst YI, Hae III, and Xcm I, the RFLP pattern for ampli®cation products from FL-18288 corresponded to a pattern reported by Kwon et al. (25) . The RFLP patterns for CV-9437 and CV-1686 ampli®cation products were identical to one another and to patterns for other California variants represented by CV-56b. However, the RFLP pattern for GA-92 and 1013 differed from one another and from other IBV strains using the restriction enzyme, Bst YI (Fig. 1 ). The S1 nucleotide sequence alignments and pair distances data are presented in Fig. 2 and Table 1 respectively. The S1 gene nucleotide sequence of FL-18288 was most similar to the Conn strain (97.3%). Isolates CV-9437, CV-1686, and CV-56b had S1 genes most similar (98.7%±99.7%) to one another, and 78.5% to 79.0% similar to Ark 99. The IBV isolate 1013 was most similar to Ark 99 (84.7%) and Ark DPI (85.1%), whereas GA-92 was most similar to Ark 99 (95.3%). All methods used to create phylogenetic trees produced similar dendrograms when comparing S1 genes and had similar topology to the trees based on S1 amino acid sequences as illustrated in Fig. 3 . A dendrogram was generated following alignment of the S1 glycoprotein sequences to determine phylogenetic relationships among IBV strains in the United States (Fig. 3) . Similar topologies were obtained when Fla. Strain and Variant IBV S1 Sequence generating phylogenetic trees using UPGMA, PAUP, and the neighbor-joining method. The S1 amino acid sequence alignments and pair distances data are presented in Fig. 4 and Table 2 respectively. The FL-18288 strain S1 protein sequence was 94.9% similar to that of the Conn strain. California variants, CV-56b, CV-9437, and CV-1686, were 97.6% to 99.3% similar to one another, and 76.6% to 76.8% similar to the S1 protein from Ark strains. Another isolate from California, 1013, had an S1 glycoprotein that was 84.0% similar to Ark DPI and only 77.9% similar to CV-56b. The GA-92 IBV isolates S1 protein was 92.8% similar to the Ark 99 glycoprotein. The predicted S1 glycoprotein amino acid sequences for IBV isolates FL-18288, GA-92, CV-56b, CV-9437, CV-1686, and 1013 were aligned with previously published S1 glycoprotein sequences of other USA strains in the GenBank database. When comparing the S1 sequences, the most variations were observed between residues 55±96, 115±149, 255±309, and 378±395 (numbering is in reference to Mass 41 S1 and includes the signal sequence). When comparing strains Mass 41, Conn, and FL-18288 representing three serotypes with closely related sequences, variations occurred in the S1 subunit of all three strains at residues 91, 125, and 392 as reported in Table 3 . The S precursor glycoprotein is cleaved posttranslationally marking the carboxyl terminus of the S1 subunit and the S2 subunit amino terminus (9) . Several IBV isolates reported here had amino acid sequence variations at the cleavage site (residues Because of variant RFLP patterns using the RT-PCR/ RFLP diagnostic test for IBV, GA-92, CV-56b, CV-9437, CV-1686, 1013 were selected for further characterization. The S1 gene nucleotide and predicted amino acid sequences of FL-18288 strain, GA-92, CV-56b, CV-9437, CV-1686, 1013 were determined, and compared with other IBV sequences. Although GA-92 has a unique RFLP pattern, it is serologically related to the Arkansas serotype (23) . Consistent with serologic data, we determined that the S1 glycoprotein sequence was 92.8% similar to the Ark 99 strain. Initial characterization of isolates from California obtained from poultry between 1984 and 1993 (including CV-56b) indicated these variant strains (designated California variants) were serologically distinct from other IBV serotypes (23, 24) . The epidemiology of California variants is unknown. It is unlikely that these California variants originated directly from a live vaccine virus because the S1 glycoprotein sequences are not closely related to other IBV strains. Recombination of genomic RNA also is unlikely because no crossover sites were observed in the S1 gene. We observed a genetic drift from CV-56b (1991) to CV-9437 (1995) and CV-1686 (1995) with 98.0% and 97.6% similarity, respectively. Based on RFLP patterns and sequence analysis, we predict that CV-9437 and CV-1686 are serologically similar to other California variants represented by CV-56b. Another isolate from California, 1013, has a unique RFLP pattern, and the S1 glycoprotein sequence was more similar to Ark DPI (84.0%) than CV-56b (77.9%). Isolate 1013 may represent a new serotype or may belong to an existing serotype, such as the Ark serotype. Following alignment of these newly reported S1 glycoprotein sequences with those previously published from other IBV strains, regions of sequence variation were detected between residues 59±96, 115± 149, 255±309, and 378±395. Amino acid variations between 59±96 and 115±149 reported herein were similar to HVR I (residues 56±69) and HVR II (residues 117±131) of Mass and European strains as well as residues 53±148 in strains isolated from the United States (15, 17) . These variable regions were also similar to regions I (residues 38±67) and II (residues 97±141) associated with two separate virus-neutralizing, conformationally dependent epitopes (20) . It appears that regions between residues 59±96 and 115±149 of the S1 glycoproteins reported in this study, may also be involved with epitopes that induce virus-neutralizing and serotype speci®c antibodies. Two other regions of S1 glycoprotein variation were also observed. The ®rst region between residues 255±309 corresponds to a previously reported hypervariable region (18, 19) , and the second domain between amino acids 378±395 has not been previously reported. Amino acid variation between residues 255± 309 and 378±395 are similar to region III (274±387) associated with one virus-neutralizing epitope in European strains using monoclonal antibody resistant (MAR) mutants (20) . In another study, MAR mutants of the Ark DPI strain had substitutions at Thr 304 and Arg 386 (Thr 306 and Arg 388 in this multiple alignment) of the S1 protein and were associated with one serotype-speci®c, virus-neutralizing epitope (21) . During this investigation, amino acids surrounding 306 and 388 were identi®ed as hypervariable. This suggests that sequences of the S1 glycoprotein between 255±309 and 378±395 may be important in forming serotype-speci®c, con®rmationally dependent epitopes. FL-18288 was 95.8% and 89.4% similar to Conn and Mass 41, respectively. This demonstrates that the S1 subunit may vary by a few amino acids giving rise to different serotypes. Amino acid variation in all three serotypes were observed at amino acids 91, 125, and 392. Amino acids 91 and 125 are within regions of variation, I and II, respectively. These regions are associated with two separate virus-neutralizing, conformationally dependent epitopes (20) . Amino acid 392 is within region III that is associated with one virus-neutralizing, serotype-speci®c epitope. Amino acid substitutions in the cleavage site among various IBV strains was detected during these studies. As reported previously, the cleavage site sequence for Beaudette and Mass 41 strains in Arg-Arg-Phe-Arg-Arg (6, 17) , whereas Gray and JMK strains have Arg-Arg-Ser-Arg-Arg (34) . In this study, differences were detected giving a cleavage site sequence of X1-Arg-X2-Arg-Arg. Herein X1 represents His or Arg and X2 represents Ser or Phe. The importance of these variations in the cleavage sequence to virulence is currently not known. In summary, we observed S1 glycoprotein variations similar to others (residues 55±96, 115±149, and 255±309). However, a newly de®ned region of variation between amino acids 378±395 may interact with residues between 255±309 to form one conformationally dependent epitope with virusneutralizing and serotype speci®city. In addition, the S1 subunit may vary by 8% and remain the same serotype, whereas, a variation of only 4.2% may also change the serotype because of critical amino acid substitutions in the S1 glycoprotein. These data also demonstrate that variant IBV strains can circulate among poultry in geographically isolated areas. Diseases of Poultry World Veterinary Poultry Association, 154±160 Arch of Virol 142 Numerical TaxonomyÐThe Principles and Practice of Numerical Taxonomy We thank Ellen Collisson for providing the sequence of Iowa 609. We acknowledge Jessica A. Mulhern and Deborah A. Hilt for their assistance. Funding for these investigations was provided by the United States Poultry and Egg Association.