key: cord-293248-8vtd9e4n authors: Day, J. Michael; Zsak, Laszlo title: Determination and analysis of the full-length chicken parvovirus genome date: 2010-03-30 journal: Virology DOI: 10.1016/j.virol.2009.12.027 sha: doc_id: 293248 cord_uid: 8vtd9e4n Viral enteric disease in poultry is an ongoing problem in many parts of the world. Many enteric viruses have been identified in turkeys and chickens, including avian astroviruses, rotaviruses, reoviruses, and coronaviruses. Through the application of a molecular screening method targeting particle-associated nucleic acid (PAN), we recently described the detection and partial characterization of a novel enteric parvovirus in chickens. Subsequent surveys of intestinal homogenates from turkeys and chickens in the United States revealed widespread occurrence of parvovirus in poultry. Here we report the first full genome sequence of a novel chicken parvovirus, ChPV ABU-P1. ChPV ABU-P1 genome organization, predicted amino acid sequence, and phylogenetic relationships with other described parvoviruses are discussed. Enteric disease syndromes are a continual economic burden for the poultry industry throughout the world. The major recognized enteric syndromes in poultry are poult enteritis complex (PEC) and poult enteritis mortality syndrome (PEMS) in turkeys and runting-stunting syndrome (RSS) in broiler chickens (Barnes and Guy, 2003; Barnes et al., 2000; Goodwin et al., 1993) . Poultry enteric disease is marked by diarrhea, stunting, and increased time to market; the more severe forms of enteric disease are characterized by immune dysfunction and increased mortality. Numerous viruses have been detected in or isolated from the intestinal tract of poultry (both diseased and healthy), and have subsequently been implicated in poultry enteric disease. These viruses include the avian reoviruses, rotaviruses, astroviruses and coronaviruses; despite these diagnostic efforts and viral descriptions, the role these viruses play in the enteric syndromes are still poorly understood (Guy, 1998; Pantin-Jackwood et al., 2008a; Pantin-Jackwood et al., 2007a; Pantin-Jackwood et al., 2008b; Pantin-Jackwood et al., 2007b; Reynolds et al., 1987; Spackman et al., 2005) . We recently reported the use of a molecular screening protocol to amplify and identify novel enteric viruses in the intestinal tract of poultry showing characteristic enteric disease signs (Zsak et al., 2008) . This random amplification of viral particle-associated nucleic acid identified numerous parvovirus DNA sequences directly from the intestinal homogenates of chickens with signs of enteric disease. Initial analysis of this chicken-origin parvovirus (ChPV) revealed that it is a novel member of the Parvovirinae subfamily within the Parvoviridae, and led to the development of a specific molecular diagnostic test targeting the ChPV non-structural (NS) gene (Zsak et al., 2009) . Parvoviruses have been described that infect a wide range of invertebrates and vertebrates, and a thorough knowledge of the parvovirus genome and encoded proteins is essential to an understanding of the evolution of parvoviruses in various hosts, including tissue tropism during viral pathogenesis (Hueffer and Parrish, 2003) . Parvoviruses are well known for causing enteric disease in mammals, with the canine parvoviruses recently emerging as important pathogens in the 1970s (Decaro et al., 2009; Truyen, 1999) . Parvoviruses have been known for years as the causative agent of Derzsy's disease in young geese, and are also found in Muscovy ducks, where they cause multiple clinical signs ranging from enteritis to hepatitis and infectious myocarditis (Gough, 2008) . Parvoviruses have been previously described in chickens based upon their morphology via electron microscopy and upon their genome size Kisary et al., 1984) , and enteric disease signs have been attributed to parvovirus-like particles detected in turkey intestinal tracts (Trampel et al., 1983) . A recent diagnostic survey of enteric samples collected from United States turkey and chicken flocks revealed widespread presence of parvovirus in U.S. poultry (Zsak et al., 2009) . This is the first indepth molecular characterization and analysis of the full-length genome sequence of ChPV. The analysis includes comparisons to other members of the Parvovirinae that infect mammals and birds, including two novel turkey-origin parvoviruses (TuPV) recently sequenced using a similar molecular approach. Virology 399 (2010) 59-64 The ChPV ABU-P1 genome The fully assembled ChPV ABU-P1 genome is 5257 nucleotides (nt) in length and has a basic organization similar to previously described members of the Parvoviridae (Fig. 1) . The genome is flanked on the 5′ and 3′ ends by 206 nt direct repeat sequences, each of which contains 39 nt inverted repeats presumed to form a hairpin structure similar to the structures common in the parvoviruses (Fig. 2) (Farkas et al., 2004; Muzyczka and Berns, 2001; Sukhumsirichart et al., 2006) . The 206 nt direct repeats are identical and are found in the same orientation 5′ to 3′ in the genome. The genome contains 32.36% A, 21.95% G, 24.08% T, and 21.61% C, with an A + T content of 56.44% and a C + G content of 43.56%. The overall genomic organization of ChPV ABU-P1 is similar to other parvoviruses, with two major predicted open reading frames (ORFs). The 5′ ORF is 2085 nt long, and a blastx search using the putative amino acid sequence revealed a protein similar to the Parvoviridae non-structural (NS) proteins NS1, with the greatest similarity to human bocavirus NS1. The major 3′ ORF is 2028 nt long, and encodes a protein similar to the Parvoviridae capsid protein VP1, with the greatest similarity to avian adeno-associated virus, a member of the Dependovirus genus. The ChPV genome also contains a small (306 nt) ORF located between the major 5′ and 3′ ORFs. This ORF shows no homology to known protein coding regions in the databases. The ChPV NS1 ORF is preceded upstream by the 39 bp inverted repeat sequence. The NS1 start codon is in a strong Kozak context (AAGATGG) and is preceded directly upstream by a TATA box, an extended CAAT box, and a putative Inr-box (Hernandez, 1993; Rupp et al., 1990; Smale and Kadonaga, 2003) . NS1 also contains a wellconserved phosphate-binding loop (P-loop) motif (GXXXXGKT…EE), distinguishing it as a member of the ATP and GTP-binding superfamily of proteins (Saraste et al., 1990 ). At the amino acid level, the ChPV NS1 protein exhibited from 45.2 to 53.9% amino acid similarity (17.1 to 19.0% identity) with NS1 from other parvovirus isolates. The NS1 similarity increased to 89.3 and 100% (99.3 to 83.7% identity) when ChPV was compared with two turkey-origin parvoviruses (TuPV), TuPV 260 and TuPV 1078 respectively. The lower identity noted when comparing the ChPV and TuPV 1078 NS1 coding sequences can be partially explained by the presence of a stop codon beginning at position 1882 in the TuPV 1078 NS1 coding region. The predicted VP1 ORF begins at position 2998 in the ChPV genome. The predicted VP1 start codon (ATG) overlaps with the stop codon (AAT) of the predicted 306 nt ORF located between the ChPV NS1 and VP1 ORFs. VP1 is preceded upstream by a TATA box and a polyadenylation signal (AATAAA) follows the ORF; this poly-A signal immediately precedes the 39 nt inverted repeat at the 5′ end of the genome. One of two downstream ATG codons that begin 417 and 420 nt downstream to the start of the putative VP1 ORF may serve as the start codon for a VP2 capsid protein that shares a stop codon (TAA) with VP1. The second of these ATG codons has the stronger Kozak consensus sequence (ATGATGG) compared to the translation initiation context of the first codon (GAAATGA) and remains in frame (Kozak, 1987) . A third start codon with a favorable translation initiation context (GACATGG) is present at position 3067 in the ChPV genome, and may represent the beginning of a putative VP3 ORF as has been described in the goose and Muscovy duck parvoviruses (Zadori et al., 1995) . The ChPV VP1 protein shares from 46.6 to 59.0% amino acid similarity (10.1 to 24.5% identity) with VP1 from other parvoviruses. The VP1 similarity increases to 100 and 95.1% (99.7 and 79.2% identity) when compared to VP1 from TuPV 260 and TuPV 1078, respectively. ClustalW was used to align the complete ChPV genome coding region with complete parvovirus sequences from mammals and birds available in the databases, and with two recently sequenced TuPV genome coding regions. The Neighbor-Joining tree was produced using MEGA4. The turkey and chicken sequences clustered together, and were clearly separate from the other members of the Parvovirinae (Fig. 3) . A previous phylogenetic analysis using a conserved region of the ChPV and TuPV NS1 gene produced a similar tree (Zsak et al., 2008) , and a phylogenetic analysis of the ChPV and TuPV VP1 gene produced an identical tree when compared to the full-length parvovirus coding region (data not shown). This is the first analysis of the complete nucleotide sequence from the ChPV ABU-P1 strain. Small viral particles (15 to about 25 nm) resembling parvovirus have been identified for years in the intestinal tracts of poultry (Kisary et al., 1984; Trampel et al., 1983; Woolcock and Shivaprasad, 2008) . The use of a particle-associated nucleic acid (PAN) sequencing technique has recently allowed the partial molecular characterization of this novel ChPV (Zsak et al., 2008) , and led to the identification and sequencing of two additional avian parvoviruses from turkeys: TuPV 260, originally detected in the intestinal tract of a turkey from California, and TuPV 1078, originally detected in the pooled intestinal tracts from a North Carolina turkey farm. All three of these poultry isolates are very similar to each other, yet each differs significantly from other members of the Parvovirinae. The high amino acid identities noted among the turkey and chicken isolates suggests that they diverged at sometime in the recent past from a common ancestor. It is known from studies of another member of the Parvovirinae, canine parvovirus (CPV), that the parvoviruses, despite their singlestranded DNA genome, have a mutation rate that approaches that of RNA viruses, which allows rapid evolution and host adaptation (Hoelzer et al., 2008; Shackelton et al., 2005) . Interestingly, ChPV and TuPV do not group phylogenetically with the parvoviruses that infect geese (GoPV) and Muscovy ducks (MdPV), parvoviruses that are closely related to adeno-associated virus 2 (AAV-2), a member of the Dependovirus genus (Fig. 3) (Zadori et al., 1995; Zsak et al., 2008) . Further, the ChPV NS protein shares only 19.1% amino acid identity with the MdPV NS protein. The ITR sequences located at each end of the ChPV genome (Fig. 2 ) are each 206 nt long, somewhat shorter than those found in the prototypical human parvovirus B19 (383 nt) but longer than those found in some other autonomously replicating parvoviruses, namely the murine parvoviruses that have terminal palindromes of 115 and about 200 nt long in the same genome (Astell et al., 1979; Deiss et al., 1990) . The terminal palindromes found at the 5′ and 3′ ends of most autonomously replicating parvoviruses bear no nucleotide identity with one another. Interestingly, this is not the case with ChPV, which has identical palindrome sequences at its 5′ and 3′ end (Fig. 2) , a trait it shares with parvovirus B19 (Deiss et al., 1990; Muzyczka and Berns, 2001) . It would be interesting to determine if the 5′ end of the ChPV genome exists in an alternative orientation, as is the case with B19, MdPV and GoPV, in which the 5′ sequence can be found in one of two "flip" or "flop" orientations, with one orientation being the inverted complement of the other (Deiss et al., 1990; Zadori et al., 1995) . The ChPV NS1 deduced amino acid sequence contains highly conserved motifs important for the initiation of parvovirus replication (Zsak et al., 2008) , including a well-conserved phosphate-binding loop ("P-loop") motif specifically involved in the binding of nucleoside triphosphates by this protein (Saraste et al., 1990) . This motif is required for pathogenesis in the human parvovirus B19 and is found in many parvovirus isolates (Momoeda et al., 1994) . The 5′ major ORF of ChPV appears to encode the viral capsid proteins VP1, VP2, and VP3. These three proteins together comprise the parvovirus virion (Muzyczka and Berns, 2001) and are responsible for the production of neutralizing antibodies during an infection (Saikawa et al., 1993) . The present analysis does not include members of the Densovirinae subfamily, which infect insects and other invertebrates (Muzyczka and Berns, 2001) . ChPV and the closely related TuPVs were compared across their entire coding region with representative members of the Parvovirinae subfamily, which includes the Dependovirus, Bocavirus, Erythrovirus, Amdovirus and Parvovirus genera (Fig. 3) . It is clear from the multiple sequence alignments and phylogenetic analysis that ChPV, along with the closely related TuPVs, represents a distinct member of the Parvovirinae subfamily and should be recognized as the prototypical member of a novel genus within the Parvovirinae. Assembly of the prototypical chicken parvovirus (ChPV ABU-P1) genome The chicken parvovirus strain chosen for this analysis was originally isolated in Hungary from the intestines of chickens showing sings of a stunting syndrome (Kisary et al., 1984) , and this original cesium chloride gradient-purified parvovirus (provided by J. Kisary) was propagated in specific pathogen free (SPF) chickens as described previously (Kisary, 1985; Zsak et al., 2009) . A sequenceindependent polymerase chain reaction (PCR) protocol was employed to amplify particle-associated nucleic acid (PAN) present in ChPV ABU-P1-positive chicken intestinal homogenates, and has been described elsewhere in detail (Zsak et al., 2008) . The randomly amplified PAN was ligated into the TOPO-TA cloning vector (Invitrogen) and the ligation mix was used to transform competent TOP-10 E. coli cells (Invitrogen). Using this approach, a total of 768 clones were identified and sequenced using the M13 forward and reverse primers on an AB-3730 automated DNA sequencer. The sequenced clones were used as query sequences to search the GenBank non-redundant nucleotide (nr/nt) database using the blastn algorithm and the non-redundant (nr) protein database using the blastx algorithm. Four contigs with an average of eightfold coverage and lengths of 1417 nt, 933 nt, 1069 nt, and 739 nt were assembled from 27, 24, 19, and 21 clones, respectively. These contigs had no significant nucleotide similarity to database sequences, but the deduced amino acid sequence from each contig Fig. 3 . A phylogenetic tree prepared using the full coding sequence of the indicated parvoviruses. Representative genera within the Parvovirinae are indicated. The evolutionary relationships were inferred using the Neighbor-Joining method. Phylogenetic analyses were conducted using MEGA4. GenBank Accession numbers for ChPV, GU214704; TuPV 1078, GU214705; and TuPV 260, GU214706. revealed significant similarity to the members of the family Parvoviridae in the database. PCR primers were designed using the four assembled contigs and were subsequently used to close gaps of 58, 443, and 205 nt between the contigs and assemble the complete ChPV ABU-P1 genome. The right terminal region of contig 4 contained a portion of the right inverted terminal repeat (ITR). Primers designed using this partial ITR used in combination with forward and reverse primers within contig 1 (left terminal region of the genome) and contig 4 resulted in the cloning of the ITR regions at both the right and left genomic termini. The ChPV and TuPV genomes and ORFs were aligned with each other and with selected parvovirus sequences available in the databases using ClustalW (Thompson et al., 1994) . Searches for conserved domains and transcription factor binding sites within the ChPV genome were performed with GeneQuest (DNAStar/Laser-gene8) and the Conserved Domain Database (CDD) Search Service v2.17 at the National Center for Biotechnology Information (NCBI) website (Marchler-Bauer et al., 2009) . Phylogenetic analysis and tree construction were performed with MEGA4 using the Neighbor-Joining method (Tamura et al., 2007) . Structure of the 3′ hairpin termini of four rodent parvovirus genomes Poult enteritis mortality syndrome Genetic analysis of canine parvovirus type 2c Cloning of the human parvovirus B19 genome and structural analysis of its palindromic termini A parvovirus isolated from royal python (Python regius) is a member of the genus Dependovirus Enteritis (so-called runting stunting syndrome) in Georgia broiler chicks Parvovirus infections Virus infections of the gastrointestinal tract of poultry TBP, a universal eukaryotic transcription factor? Phylogenetic analysis reveals the emergence, evolution and dispersal of carnivore parvoviruses Parvovirus host range, cell tropism and evolution Experimental infection of chicken embryos and day-old chickens with parvovirus of chicken origin Presence of parvovirus in the intestine of chickens showing stunting syndrome The genome structure of a new chicken virus identifies it as a parvovirus An analysis of 5′-noncoding sequences from 699 vertebrate messenger RNAs CDD: specific functional annotation with the Conserved Domain Database A putative nucleoside triphosphate binding domain in the nonstructural protein of B19 parvovirus is required for cytotoxicity Parvoviridae: the viruses and their replication Pathology and virus tissue distribution of Turkey origin reoviruses in experimentally infected Turkey poults Periodic monitoring of commercial turkeys for enteric viruses indicates continuous presence of astrovirus and rotavirus on the farms Enteric viruses detected by molecular methods in commercial chicken and turkey flocks in the United States between Pathogenesis of type 2 turkey astroviruses with variant capsid genes in 2-day-old specific pathogen free poults Enteric viral infections of turkey poults: incidence of infection Chicken NFI/TGGCA proteins are encoded by at least three independent genes: NFI-A, NFI-B and NFI-C with homologues in mammalian genomes Neutralizing linear epitopes of B19 parvovirus cluster in the VP1 unique and VP1-VP2 junction regions The P-loop-a common motif in ATPand GTP-binding proteins High rate of viral evolution associated with the emergence of carnivore parvovirus The RNA polymerase II core promoter The pathogenesis of turkey origin reoviruses in turkeys and chickens Complete nucleotide sequence and genomic organization of hepatopancreatic parvovirus (HPV) of Penaeus monodon MEGA4: molecular evolutionary genetics analysis (MEGA) software version 4.0 CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, positionspecific gap penalties and weight matrix choice Parvovirus-like enteropathy in Missouri turkeys Emergence and recent evolution of canine parvovirus Electron microscopic identification of viruses associated with poult enteritis in turkeys grown in California Analysis of the complete nucleotide sequences of goose and muscovy duck parvoviruses indicates common ancestral origin with adeno-associated virus 2 Development of a polymerase chain reaction procedure for detection of chicken and turkey parvoviruses Partial genome sequence analysis of parvoviruses associated with enteric disease in poultry Thanks to Fenglan Li for excellent technical assistance and to the South Atlantic Area sequencing facility for outstanding support.