key: cord-267014-3vi7pgvr authors: Vennema, H.; Rossen, J.W.A.; Wesseling, J.; Horzinek, M. C.; Rottier, P.J.M. title: Genomic organization and expression of the 3′ end of the canine and feline enteric coronaviruses date: 1992-11-30 journal: Virology DOI: 10.1016/0042-6822(92)90174-n sha: doc_id: 267014 cord_uid: 3vi7pgvr Abstract The genomic organization at the 3′ end of canine coronavirus (CCV) and feline enteric coronavirus (FECV) was determined by sequence analysis and compared to that of feline infectious peritonitis virus (FIPV) and transmissible gastroenteritis virus (TGEV) of swine. Comparison of the latter two has previously revealed an extra open reading frame (ORF) at the 3′ end of the FIPV genome, lacking in TGEV, which is currently designated ORF 6b. Both CCV and FECV possess 6b-related ORFs at the 3′ ends of their genomes. The presence of ORF 6b in three of four viruses in this antigenic cluster strongly suggests that TGEV has lost this ORF by deletion. The CCV ORF 6b is collinear with that of FIPV, but the predicted amino acid sequences are only 58% identical. The FECV ORF 6b contains a large deletion compared to that of FIPV, reducing the collinear part to 60%. The sequence homologies were highest between CCV and TGEV on the one hand and between FECV and FIPV on the other. Previously, we showed that the expression product of the FIPV ORF 6b can be detected in infected cells by immunoprecipitation (Vennema et al., 1992). In the present study we have performed similar experiments with CCV and FECV. In infected cells both viruses produced proteins related to but different from the FIPV 6b protein. Canine coronavirus (CCV), feline enteric coronavirus (FECV), feline infectious peritonitis virus (FIPV), and transmissible gastroenteritis virus (TGEV) of swine belong to one antigenic cluster within the family Coronaviridae (Siddel et al., 1983) . Sequence analysis revealed a close genetic relatedness between FIPV and TGEV (Jacobs et al., 1987; de Groot et al., 1988; Vennema et a/., 1991) . FIPV contains an extra open reading frame (ORF) in the 3'-terminal region of its genome (de Groot et a/., 1988) . It is the second ORF of mRNA 6 which is currently designated ORF 6b, according to the new nomenclature (Cavanagh et al., 1990) . The first ORF of mRNA 6, designated ORF 6a, is the counterpart of TGEV ORF 7, previously called ORF-X3 (Kapke and Brian, 1986; Rasschaert et al., 1987) or ORF-4 (Britton et a/., 1988) . Comparison of ORFs 6a and 7 revealed that the FIPV genome contained an in frame insertion of 69 nucleotides (de Groot et a/., 1988) . The protein product of TGEV ORF 7 was identified in TGEV-infected cells (Garwes et al., 1989) . Recently, the 6b gene was shown to be expressed in FIPV-infected cells (Vennema et a/., 1992) . It is a glycoprotein which is released into the extracellular medium and is not stably associated with virus particles. In cats it induces antibodies during natural and experimental FIPV infections. Therefore, the 6b protein provides an antigenic distinction between FIPV and TGEV. The aim of the ' To whom reprint requests should be addressed. present study was to examine whether this distinction could be extended to CCV and FECV. We determined the genomic organization of the 3' end of the viral genomes and investigated whether these viruses produced proteins related to the 6b protein of FIPV. FIPV strain 79-l 146, FECV strain 79-l 683 (McKeirnan et a/., 1981; obtained from Dr. J. Evermann), CCV strain K378 (Dutch field isolate, obtained from Dr. H. Flare), and CCV strain l-71 (Binn et a/., 1975) were grown in Fe/is catus whole fetus cells (fcwf-D; obtained from Dr. N. C. Pedersen) and Crandell feline kidney cells (CrFK). Recombinant vaccinia virus vTF7-3 (Fuerst eta/., 1986 ; obtained from Dr. B. Moss) infections were carried out in HeLa cells. Cells were maintained in Dulbecco's modified Eagle's medium (GIBCO Laboratories) containing 5% fetal bovine serum. Cloning and sequence analysis of the 3' end of CCV cDNA libraries were prepared of intracellular poly(A)containing RNA from CCV-infected fcwf-D cells as described elsewhere (Wesseling et al., manuscript in preparation) . Clones containing sequences derived from the 3'end of the genome were selected by colony hybridization with restriction fragments of FIPV cDNA clone B12 (de Groot eta/., 1988) as probes. Nucleotide sequencing was performed by the dideoxy chain termi-nation procedure (Sanger et a/., 1977) using doublestranded DNA and a bacteriophage T7 DNA polymerase based kit (Pharmacia, LKB). Sequence data were analyzed using the computer programs of Devereux et a/. (1984). cDNA synthesis and PCR amplification of the 3' end of FECV Total intracellular RNA was isolated from FECV-infected fcwf-D cells as described (Chomczynski and Sacchi, 1987) . Synthesis of cDNA on total RNA was pet-formed as described (Kawasaki and Wang, 1989) by priming specifically with synthetic oligonucleotide 179 (Y-CCAGlllTAGACATCGGG-3', reverse complement of nucleotides 1006-l 023; de Groot et al., 1988) which binds to a sequence in the 3' noncoding region of FIPV, downstream of ORF 6b. Oligonucleotide 185 (5'-GATCCAGACGTTAGCTC-3', reverse complement of nucleotides 1202-l 218; de Groot et a/., 1988), was used to prime cDNA synthesis from a position closer to the 3' end. Amplification of cDNA was performed by the polymerase chain reaction (PCR) as described (Kawasaki and Wang, 1989) , after the addition of synthetic oligonucleotide 178, 5'-GATGACACACAGGlTGAG-3', which is identical to the carboxyl-terminus of the nucleocapsid (N) protein gene of FIPV (nucleotides 1945-l 962; Vennema et a/., 1991) . The positions of the primers are indicated in Fig. 5 . PCR-amplified FECV cDNA fragments were isolated from agarose gel, oligo-dC tailed with terminal transferase and cloned after annealing with oligo-dG-tailed pUC9 (Pharmacia LKB). Sequence analysis was performed as described above for CCV cDNA clones, with pUC/M 13 primers and with the primers used for PCR amplification. The CCV 6b gene was isolated from cDNA clone Cl6 as a /-/pall-Pstl fragment and recloned in pBluescript SK-(Stratagene) digested with Accl and Pstl, yielding pBSC6b. The 6b gene fragment was recloned from this construct as a Xhol-Psrl fragment into the vector part of pTFGb, (Vennema et a/., 1992) digested with the same enzymes. The final construct was designated pTC6b. The FECV 6b gene was recloned from cDNA-PCR clone FE1 as a Spel-Pstl fragment in the vector part of pTFGb, prepared by digestion with Spel and Pstl. This construct was designated pTE6b. All constructs were used in the transient T7 expression system with recombinant vaccinia virus vTF7-3 producing the T7 RNA polymerase (Fuerst et a/., 1986 ; obtained from Dr. B. Moss). Lysates from coronavirus-or vaccinia virus-infected cells were prepared after metabolic labeling with L-[35S]cysteine (ICN Biomedicals, Inc.). Lysis, RIPA with ascites fluid from a field case of FIP, and endo-P-Nacetylglucosaminidase H (endo H; Boehringer-Mannheim Biochemicals) treatment were carried out as described (Vennema et a/., 1990) . The ascites fluid contains antibodies to all viral proteins of FIPV identified so far, which are cross-reactive with proteins from CCV, FECV, and TGEV. Endo-P-N-acetylglucosaminidase F (endo F; Boehringer-Mannheim Biochemicals) digestions were carried out for 16 hr at 30" in 50 mM potassium phosphate buffer, pH 6.8, 20 mll/l EDTA, 1% Triton X-l 00, 0.2% sodium dodecyl sulphate (SDS), 1% 2-mercaptoethanol. Analysis by SDS-polyacrylamide gel electrophoresis (SDS-PAGE) was performed as described (Laemmli, 1970) . Sequence analysis of the 3' end of the CCV genome CCV cDNA clones were prepared and selected as described under Materials and Methods. Most clones were derived from CCV strain K378, a field isolate from The Netherlands (Fig. 1) . A clone derived from the laboratory strain l-71 (Binn et al., 1975) was partially analyzed. The nucleotide sequences from these strains were more than 99% identical. We obtained a contiguous sequence of strain K378, extending 2.5 kb from the poly(A)-tail in the 5' direction ( Fig. 2) . Translation of the nucleotide sequence revealed three ORFs corresponding to the N protein and ORFs 6a and 6b of FIPV. The organization of the 3'end of the CCV genome was compared to that of FIPV and TGEV (Fig. 5 ). CCV contains an ORF 6b and an ORF 6a with 69 additional nucleotides which are lacking in ORF 7 of TGEV. The amino acid sequences were aligned with the corresponding sequences of FIPV and TGEV to determine the percentages of identical amino acid residues (Table 1). The N and 6a amino acid sequences have a higher level of identity when compared between CCV and TGEV than between CCV and FIPV. The same was found when the nucleotide sequences were compared. The 6b amino acid sequences of CCV and FIPV were only 58% identical and several small insertions in the CCV sequence were found. Nevertheless, the hydrophobicity plots of the putative 6b proteins were remarkably similar (data not shown), including a short hydrophobic amino-terminus, which may function as a signal sequence. In contrast to the FIPV ORF 6b, the CCV ORF 6b contains no potential N-glycosylation site. PCR amplification of cDNA derived from the 3' end of the FECV genome The nucleotide sequences flanking the ORFs 6a and 6b of FIPV and CCV and ORF 7 of TGEV were aligned to design primers for cDNA synthesis and polymerase chain reaction (PCR) amplification. The positions of the primers are indicated in Fig. 3 . Synthetic oligonucleotide 179 was used to prime cDNA synthesis on total RNA isolated from FECV and CCV-infected cells. The cDNA synthesis was followed by PCR amplification after addition of primer 178. The CCV product had the expected size of approximately 1 kbp (data not shown). The FECV product was considerably smaller, being approximately 750 bp. The controls with RNAfrom mockinfected cells and without RNA were both negative. The sequence analysis presented below revealed that the FECV ORF 6b extended into the sequences used to design the PCR primers. Therefore, cDNA synthesis and PCR amplification were repeated with synthetic oligonucleotides 185 and 178, resulting in a fragment of approximately 950 bp (data not shown). The PCR products of FECV were cloned and sequenced. Several independent clones were analyzed (Fig. 3) leading to a contiguous sequence of 957 nucleotides which ends approximately 36 nucleotides upstream of the poly(A)-tail (Fig. 4) . Comparison with the corresponding sequence of the FIPV genome showed a single deletion of 238 nucleotides and an overall sequence identity of 93.6%. Translation of the nucleotide sequence revealed the presence of two ORFs similar to the FIPV ORFs 6a and 6b. The 6a polypeptides are virtually identical, with only one amino acid difference. The deletion is located in ORF 6b. The FECV and FIPV 6b sequences are collinear for the amino-terminal 123 amino acid residues. In this part the identity is 89%. The deletion results in a shift to the -1 reading frame which extends 53 codons. The ORF specifies a polypeptide with a total length of 176 amino acid residues and a predicted mol wt of 20,300. The protein sequence predicts a short hydrophobic aminoterminus, probably acting as a signal sequence and one N-glycosylation site. The genomic organization of the FECV 3' end is similar to FIPV and CCV, which all contain an ORF 6b and the extra 69 nucleotides in ORF 6a as compared to TGEV ORF 7 (Fig. 5) . Paired alignments of the collinear parts of the amino acid sequences with the corresponding sequences of FIPV, CCV, and TGEV revealed that FECV is more closely related to FIPV than to CCV and TGEV (Table 1) . Similar homologies were found by comparing the nucleotide sequences (data not shown). The FIPV 6b protein was readily detected in lysates of FIPV-infected cells (Vennema et a/., 1992) . It comigrated in SDS-PAGE with an expression product of the cloned 6b gene which was prepared by using the recombinant vaccinia virus T7 RNA polymerase expres- sion system (Fuerst et a/., 1986) . To identify the CCV and FECV 6b proteins in a similar way, their 6b ORFs were recloned in a T7 expression vector. The resulting constructs pTC6b and pTEGb, respectively, and pTF6b containing the FIPV 6b gene (Vennema et a/., 1992) were used to transfect HeLa cells infected with recombinant vaccinia virus vTF7-3, which produces T7 RNA polymerase (Fuerst et a/., 1986) . The expression products were analyzed by metabolic labeling with [35S]cysteine, RIPA, and endo H treatment followed by SDS-PAGE (Fig. 6) . The CCV and FECV 6b proteins appeared to be slightly smaller than the FIPV 6b protein, the FECV 6b protein being the smallest. Digestion with endo H which cleaves high-mannose N-linked oligosaccharides, resulted in an approximately 2000 mol wt reduction of the FECV and FIPV 6b protein. The CCV 6b protein, however, was not affected and was also insensitive to digestion by endo F, which cleaves complex N-linked sugars (data not shown). These observations indicate that FECV and FIPV are glycoproteins while CCV 6b is not. The shift in molecular weight of the FECV and FIPV 6b proteins is consistent with the removal of one sugar side chain (Neuberger et a/., 1972) . This is in agreement with the predicted numbers of glycosylation sites in the amino acid sequences. The observed molecular weights of the CCV 6b protein and of the FECV 6b protein after deglycosylation are also in agreement with those predicted from the amino acid sequences. The recombinant expression products were compared to the proteins produced in CCV-, FECV-, and FIPV-infected cells, which were analyzed similarly (Fig. 6) . All three matched with a protein in the sample from the corresponding coronavirus-infected cell lysate. In the latter samples the three known structural proteins also appeared: the membrane (M), nucleocapsid (N), and spike (S) proteins. In all cases the M protein was partially resistant to endo H digestion. The same was observed for the 6b protein in FIPV-infected cells. The lanes of FECV were overexposed to reveal the 6b protein band. This indicates that the expression level of the FECV 6b protein was much lower than that of CCV and FIPV. The genomic organization of the 3' end of FIPV differs from that of TGEV in that it contains an additional ORF (de Groot et a/., 1988). Recently, we identified the expression product of this extra gene, designated 6b (Vennema et al., 1992) . These observations prompted us to study CCV and FECV of the same antigenie cluster. Sequence analysis showed that their genomic organization in the 3'terminal region is similar to that of FIPV. Therefore, the presence rather than the absence of ORF 6b appears to be the common theme, suggesting that TGEV has lost the corresponding ORF by deletion. The same inference probably holds true for the 69 nucleotides that are present in all 6a ORFs but not in the corresponding ORF 7 of TGEV. Deletions appear to happen frequently during coronavirus evolution. They occur, for example, in the S and HE protein genes of murine hepatitis virus (MHV; Parker et al., 1989; La Monica et a/., 1991) in the S protein gene of porcine respiratory coronavirus (PRCV; Rasschaert et al., 1990) and in the region between the S and M protein genes of MHV strain S (Yokomori and Lai, 1991 ) of a small plaque variant of TGEV (Wesley et al., 1990) and of PRCV (Rasschaert et a/., 1990) . Among the feline coronaviruses another deletion was found recently in FECV 79-1683 as compared to FIPV in the region between the S and M protein genes (E. Lewis and H. Vennema, unpublished data) . Alignment of nucleotide and amino acid sequences of TGEV, CCV, FIPV, and FECV revealed a close relationship among the strains of this cluster (Table 1) . The four strains could be divided into two pairs on the basis of their homologies, TGEV and CCV on the one hand and FIPV and FECV on the other. Both in CCV-and in FECV-infected cells 6b proteins are produced. Their characterization showed that the FECV 6b protein is glycosylated, like the FIPV 6b protein, while the CCV 6b protein is not. The observation that FIPV, CCV, and FECV induce the synthesis of 6b proteins appears to compromise their antigenic distinction. However, the differences between the 6b proteins may allow discrimination using, e.g., monoclonal antibodies. The deletion of 238 nucleotides in FECV 79-l 683 was revealed bycDNA-PCR, allowing discrimination from CCV (Fig. 4) and FIPV 79-l 146 (data not shown). It remains to be determined whether this is a universal distinguishing property of FIPV and FECV. The feline coronaviruses used in this study are almost the same with respect to growth in tissue culture, protein composition, and antigenicity (Boyle et a/., 1984; Fiscus and Teramoto, 1987) . Small differences were also reported; the N protein of FECV is slightly smaller than that of FIPV (Tupper et al., 1987) . Recently, a monoclonal antibody specific for the S protein of strain 79-1 146 was characterized (Hohdatsu et al., 1991) . The close resemblance and the low incidence of FIP despite the high proportion of feline coronavirus (FCV) seropositive cats have led to the hypothesis that FECV carriers are the source of FIPV which is generated de nova from FECV by minor mutations (Pedersen et a/., 1984; Pedersen 1989) . Our sequence comparisons show that FIPV 79-l 146 and FECV 79-l 683 are related more closely to each other than they are to CCV. Therefore, FIPV 79-l 146 did not originate from FECV 79-1683 by insertion of genetic information; it would be too much of a coincidence that an insertion with 77% identity is present in exactly the same position in CCV. An alternative mechanism for de nova generation of FIPV could be recombination, which is an established phenomenon for coronaviruses (Lai, 1992) . Avirulent FIPV-strains and/or FECV-strains are and SDS-PAGE analysis of lysates from CCV-, FECV-, FIPV-, and vTF7-3-infected cells, Recombinant vaccinia virus vTF7-3-infected cells were transfected with the plasmid DNAs pTCGb, pTEGb, and pTFGb, as indicated above the lanes. One-half of each sample was treated with endo H, the other half was mock treated (indicated with + and -, respectively). Structural proteins (S, N, and M) are indicated. In addition, the M protein bands are indicated with arrowheads. The region of the gel in which the 6b protein bands appear overlaps with that of M protein bands. Recovery and characterization of a coronavirus from military dogs with diarrhoea. froc Plaque assay, polypeptide composition and immunochemistry of feline infectious peritonitis virus and feline enteric coronavirus Sequence of the nucleoprotein gene from a virulent British field isolate of transmissible gastroenteritis virus and its expression in Saccharomyces cerevisiae Recommendations of the Coronavirus Study Group for the nomenclature of the structural proteins mRNAs, and genes of coronaviruses Single step method of RNA isolation by acid guanidium thiocyanate-phenol-chloroform extraction Sequence analysis of the B'end of the feline coronavirus FIPV 79-1146 genome: Comparison with the genome of porcine coronavirus TGEV reveals large insertions A comprehensive set of sequence analysis programs for the VAX Antigenic comparison of feline coronavirus isolates: Evidence for markedly different peplomer glycoproteins Eukaryotic transient-expression system based on recombinant vaccinia virus that synthesizes bacteriophage T7 RNA polymerase The polypeptide of M, 1400 of porcine transmissible gastroenteritis virus: Gene assignment and intracellular location Antigenic analysis of feline coronaviruses with monoclonal antibodies (Mabs): Preparation of Mabs which discriminate between FIPV strain 79-l 146 and FECV strain 79-1683 The nucleotide sequence of the peplomer gene of porcine transmissible gastroenteritis virus (TGEV): Comparison with the sequence of the peplomer protein of feline infectious peritonitis virus (FIPV) Sequence analysis of the porcine transmissible gastroenteritis corona virus nucleocapsid protein gene Detection of gene expression. ln "PCR Technology: Principles and Applications for DNA Amplification Cleavage of structural proteins during assembly of the head of bacteriophage T4 RNA recombination in animal and plant viruses Localization of extensive deletions in the structural genes of two neurotropic variants of murine coronvirus JHM Molecular Cloning. A Laboratory Manual Isolation of feline coronavirus from two cats with diverse disease manifestations Carbohydrate-peptide linkages in glycoproteins and methods for their elucidation Sequence analysis reveals extensive polymorphism and evidence of deletions within the E2 glycoprotein gene of several strains of murine hepatitis virus Animal infections that defy vaccination: Equine Infectious Anemia, Caprine Encephalitis, Maedi-Visna, and Feline Infectious Peritonitis Virus Pathogenicity studies of feline coronavirus isolates 79-1146 and 79-1683 Porcine respiratory coronavirus differs from transmissible gastroenteritis virus by a few genomic deletions Enteric coronavirus TGEV: Partial sequence of the genomic RNA, its organization and expression DNA sequencing with chain termination inhibitors The biology of coronaviruses Antigenic and biological diversity of feline coronaviruses: Feline infectious peritonitis and feline enteritis virus A novel glycoprotein of feline infectious peritonitis coronavirus contains a KDEL-like endoplasmic reticulum retention signal. f. Viral Intracellular transport of recombinant coronavirus spike proteins: Implications for virus assembly. 1. Viral Primary structure of the membrane and nucleocapsid protein genes of feline infectious peritonitis virus and immunogenicity of recombinant vaccinia viruses in kittens Genetic basis for the pathogenesis of transmissible gastroenteritis virus Mouse hepatitis virus S RNA sequence reveals that nonstructural proteins ns4 and ns5a are not essential for murine coronavirus replication. f. Viral The authors thank Raoul de Groot and Willy Spaan for critical reading of the manuscript. J.W. was supported by a grant from Solvay-Duphar B.V., Weesp, The Netherlands.