key: cord-0002409-m7129dcj authors: Du, Jiang; Lu, Liang; Liu, Feng; Su, Haoxiang; Dong, Jie; Sun, Lilian; Zhu, Yafang; Ren, Xianwen; Yang, Fan; Guo, Fei; Liu, Qiyong; Wu, Zhiqiang; Jin, Qi title: Distribution and characteristics of rodent picornaviruses in China date: 2016-09-29 journal: Sci Rep DOI: 10.1038/srep34381 sha: 182e49168024f82278796f47a253d64cb018f093 doc_id: 2409 cord_uid: m7129dcj Rodents are important reservoir hosts of many important zoonotic viruses. The family Picornaviridae contains clinically important pathogens that infect humans and animals, and increasing numbers of rodent picornaviruses have recently been associated with zoonoses. We collected 574 pharyngeal and anal swab specimens from 287 rodents of 10 different species from eight representative regions of China from October 2013 to July 2015. Seven representative sequences identified from six rodent species were amplified as full genomes and classified into four lineages. Three lineage 1 viruses belonged to a novel genus of picornaviruses and was more closely related to Hepatovirus than to others genera of picornaviruses based on aa homology. Lineage 2, lineage 3, and lineage 4 viruses belonged to the genera Rosavirus, Hunnivirus, and Enterovirus, respectively, representing new species. According to both phylogenetic and identity analyses, Lineage 2 viruses had a close relationship with rosavirus 2 which was recovered from the feces of a child in Gambia and Lineage 3 viruses had a close relationship with domestic animal Hunnivirus. Lineage 4 viruses provide the first evidence of these enteroviruses and their evolution in rodent hosts in China. Sample collection. A total of 574 pharyngeal and anal swabs from 287 rodents of 10 different species (Niviventer niviventer, Myodes rufocanus, Mus caroli, Dipus sagitta Pallas, Caryomys eva, Microtus clarkei, Rattus nitidus Hodgson, Cricetulus longicaudatus, Cricetulus kamensis, and Rattus losea) were collected from eight representative regions of China (Hubei, Jilin, Yunnan, Inner Mongolia, Tibet, Ningxia, Shanxi, and Fujiang) with different altitudes and climates, from October 2013 and July 2015 (Fig. 1 ). Pharyngeal and anal swab specimens (1 ml each) from the same rodent species and the same site were pooled into 10 samples and processed as described previously 9 . Metagenomic analysis. The amplified viral nucleic acid libraries were then sequenced using an Illumina HiSeq2500 (Illumina). In total, 13,928 reads of 81-bp in length showed the best matches with Picorniviridae viral proteins in the NCBI non-redundant database. These included 3179 and 2355 reads matching hepatitis A virus (HAV) and kobuviruses, respectively, and 562, 171, and 1132 reads matching aphthoviruses, enteroviruses, and cardioviruses, respectively. The remaining 6529 sequence reads were unclassified picornavirus sequences. The presence of picornaviruses was further confirmed by PCR amplification using specific primers. Pharyngeal and anal swab specimens were positive in 37 rodents of six different species from all eight provinces (Table 1) . Seven representative sequences from 37 positive samples were then amplified as full genomes. Genome organization. Seven novel rodent picornaviruses from diverse rodent species from different parts of China showed unique evolutionary characteristics and clustered into four distinct lineages. In addition to their phylogenetic clustering, the four lineages also displayed distinct genomic features, and the strains within each lineage possessed a high degree of aa and genomic feature conservation. Genome organization and size (Fig. 2) , the secondary structure of the 5′ UTR (Supplemental Figs 1-5), C + G content and ORFs (Table 2 ) of the seven Genome analyses. Lineage 1 viruses. The putative translation initiation site of lineage 1 viruses was contained in an optimal Kozak context (RNNAUGG) in Rodent/CK/PicoV/Tibet2014 (Tibet2014) (AAAAUGG), Rodent/Rn/PicoV/SX2015-1 (SX2015-1) (ACAAUGG), and Rodent/Ds/PicoV/IM2014 (IM2014) (GAAAUGG), respectively. The L protein was absent in lineage 1 viruses, as seen in the enteroviruses and hepatoviruses. The aa sequence identity of the P1 region between lineage 1 viruses and other picornaviruses ranged from 10.2-28.9% (Table 3) , and between the three Lineage1 viruses were 75.5-90.2%. The lineage 1 viruses had protein domains characteristic of HAV protein VP1-2A, with unknown function, located in the P1 region at aa 537-681, 536-681, and 536-680, respectively. The cleavages sites of VP4/VP2 in these were not obvious in the P1 region, and were consistent as VL/GN ( 15, 16 . The complete aa sequence of FJ2015 showed 51.3% and 51.4% identities with rosavirus M-7 (AEM05832) and rosavirus A2 (YP_009028557), respectively. The L protein of FJ2015 was 170 aa long, and lacked the catalytic dyad (Cys and His) of papain-like thiol proteases found in the foot-and-mouth disease virus L protein 17 . A presumed zinc-binding motif, Cys-His-Cys-Cys, found in Theiler's murine encephalomyelitis virus and quail picornavirus 18 . L protein was also lacking. The P1 region of FJ2015 shared 47-52% aa 19 , while the putative 2C protease region included the NTP-binding motif (GKPGCGKS) and helicase-activity motif (DDLGQ). The P3 region of FJ2015 virus shared 60-62% aa identity with rosaviruses and 21.1-33.6% with other picornaviruses. The GXCG cysteine active site was conserved in the putative 3C protease region (GFCG). Similar to cardioviruses and rosaviruses, the RNA-binding domain KFRDI motif was absent from the putative 3C protease region in FJ2015. The almost complete 5′ UTR of Rodent/Rn/PicoV/SX2015-2 (SX2015-2) showed 81% nt sequence identity with the corresponding region of OhuV-1 (HM153767). Compared with the 5′ UTR of OhuV-1, the conserved core-domain motifs I-J-K-L belonging to the type II IRES was also present at nt positions 141-458 (Supplemental Fig. 1 ). The predicted translation initiation site of the polyprotein CAUAUGG was set in a nearly optimal Kozak context. The polypyrimidine tract was located in the 18nt upstream of the AUG initiation codon. The Yn-Xm-AUG motif of SX2015-2 was Y9-X18-AUG. The L protein of SX2015-2 was 83 aa long. The L/VP4 predicted cleavage site E/G was present in both SX2015-2 and OHuV-1, and they shared 71% aa identity in their L proteins. The L protein of SX2015-2 had no GXCG motif and no putative zinc-finger motif (C-XH-X-(5)-C-X-(2)-C) 20, 21 . The P1 viral capsid of SX2015-2 had 85%, 84%, and 69% aa identities with BhuV-1 (JQ941880), OhuV-1, and Norway rat hunnivirus (YP009109563), respectively, and 17.4-30.1% identities with other genera of picornaviruses. The VP4 protein of SX2015-2 had a VP4 myristoylation motif GPGQSK 22 , and the predicted VP4/VP2 cleavage site was LA/DG. The P2 region of SX2015-2 (including 2A, which has high aa identity with other hunniviruses) had 68%, 70%, and 71% aa identities with BhuV-1, OhuV-1, and Norway rat hunnivirus, respectively, and 15-32.9% identity with other genera of picornaviruses. The P-loop NTPase fold was predicted in the 2C aa sequence from positions 1182-1374. The 2C protein of SX2015-2 possessed the highly conserved NTP-binding Phylogenetic analysis. Base on pairwise aa identities, phylogenetic analyses of the seven full genomes were conducted, which identified four distinct lineages based on 3D (Fig. 3) , P1 (Fig. 4) , and P2 (Fig. 5) . IM2014 (Dipus sagitta Pallas), SX2015-1 (Niviventer niviventer), and Tibet2014 (Microtus clarkei) were clustered into lineage 1. IM2014 was most closely related to SX2015-1 for all three regions, consistent with their aa identities. They formed a unique lineage just above the genera Tremovirus and Hepatovirus with 96 bootstrap values, and clearly separate from other genera, suggesting that they comprise a novel genus of picornaviruses (Supplemental Fig. 6 ). In a phylogenetic tree of representative picornaviruses and cripaviruses based on the aa of VP2 region, lineage 1 viruses clustered with HAV to form the sole independent branch between insect and mammalian picornaviruses (Supplemental Fig. 7 ). FJ2015 in lineage 2, from Rattus losea, together with rosavirus 2 (YP009028557) and rosavirus A (JF973686), formed a unique lineage in all three phylogenetic analyses of P1, P2, and 3D regions with 100 bootstrap values, in the genus Rosavirus, family Picornaviridae (Figs 3-5) . Phylogenetic analysis of lineage 3, including SX2015-2 from Niviventer niviventer, indicated that it was more closely related to BhuV-1 and OhuV-1 in the 3D and P1 phylogenetic trees, consistent with the aa identities, than to the Norway rat hunnivirus within the genus Hunnivirus. However, the P2 phylogenetic tree placed SX2015-2 closer to the Norway rat hunnivirus than to BhuV-1 and OhuV-1. The lineage 4 rodent-associated viruses Tibet2015 and NX2015, from Niviventer niviventer and Caryomys eva, respectively, possessed higher aa identities with homologous P3 regions in others members of the genus Enterovirus. Phylogenetic analyses showed that these two viruses were located in the genus Enterovirus between rhinovirus A (FJ445111) and enterovirus A (AY421760) in terms of their 3D regions (Fig. 3) , and formed an independent lineage adjacent to the other enteroviruses in phylogenetic trees based on the P1 and P2 regions (Figs 4 and 5 ). Increasing attention focused on rodents as the natural hosts of many important zoonotic virus. Firth et al. 25 identified a wide range of known and novel viruses from groups that include important human pathogens, including sapoviruses, cardioviruses, kobuviruses, parechoviruses, rotaviruses, and hepaciviruses carried by commensal Rattus norvegicus in New York city 25 . The role of rodent picornaviruses in the evolution, transmission, and biology of picornaviruses remains unclear. Drexler et al. 11 conducted a targeted search for hepatoviruses in 15,987 specimens of 209 small mammal species, and ancestral-state reconstructions suggested a hepatovirus origin in small insectivorous mammals, and a rodent origin of human HAV 11 . Zoll et al. 8 reported the first isolation, full-length sequence, characterization, and epidemiology of a Saffold virus SAFV-3 isolate 8 . Evidence suggests that this SAFV is an ubiquitous human virus causing infections early in life. It is genetically related to Theiler's virus in rodents, and was classified as a new species in the genus Cardiovirus 8 . A novel picornavirus, provisionally named rosavirus 2, was recovered from the feces of a child in The Gambia. The complete genome of rosavirus 2 demonstrated 71.9% nt identity with its closest relative rosavirus M-7, an unclassified picornavirus identified from rodent fecal material 16 . Those findings indicated that rodents are reservoirs for a diversity of picornaviruses, and that may contribute to the disease burden of these viruses in humans. New and improved surveillance and prevention strategies for disease control should concentrate on rodent infection and disease 25 . The seven novel rodent picornaviruses were proposed as new species or genera in the family Picornaviridae. According to the ICTV, virus taxology usually refer to their host species (or group of host species) and their defined geographic distribution. Like rodent arenaviruses and hantaviruses, a particular virus species generally only infects rodents in one subfamily or genus 26, 27 , such as Hantaan virus in Apodemus agrarius and Seoul virus in Rattus norvegicus and Rattus rattus [28] [29] [30] . In contrast, the rodent picornaviruses identified in the current study showed less specificity, and a particular picornavirus species was able to infect different rodents. Lineage 1 viruses were found in Dipus sagitta Pallas, Niviventer niviventer, and Microtus clarkei, lineage 4 viruses were found in Niviventer niviventer and Caryomys eva, and FJ2015 was found in Rattus losea. The situation of similar strains of rodent picornaviruses occurring in different rodent genera/species is also seen in other wild or domestic animals 11, [31] [32] [33] . The host range of picornaviruses appears to be reasonably broad and includes humans, canines, deer mice, bats and other rodents 1, 34 . Rodents are the most widely distributed mammals globally and many of them live in urban or suburban environments, giving them closer human contact than most other mammals [35] [36] [37] . The abilities of these rodent-related picornaviruses to cross-infect other rodent species or genera leading to their spread and emergence in their closest neighbor. HAV is an ancient and ubiquitous human pathogen, existing in both enveloped and non-enveloped forms, with a capsid structure intermediate between that of insect viruses and mammalian picornaviruses. According to the unrooted phylogenetic trees, lineage 1 viruses clustered with HAV to form the sole independent branch between insect viruses and mammalian picornaviruses. The origins of HAV were enigmatic prior to the recent discoveries of rodent-related and seal-related HAV 11, 38 . Lineage 1 viruses possessed relatively high aa homology with hepatovirus and tremoviruses. Similar to hepatovirus, the three lineage 1 viruses lacked the L protein and possessed an HAV viral protein VP1-2A domain of unknown function, which was also found in tremoviruses and phopivirus. Non-primate hepatoviruses "late domain" (YPX3L) motifs have been identified in lineage 1 viruses, and in seal-, bat-, hedgehog-, and rodent-related hepatoviruses, but not in tremoviruses. Lineage 1 viruses still have some genomic characteristics of HAV and have only low homology with other genera of picornaviruses. Lineage 1 viruses clustered with tremoviruses, hepatovirus and formed an independent branch that separated them from other genera of picornaviruses near the root of the phylogenetic tree. This indicates that the tremoviruses, hepatovirus and lineage 1 viruses have a common ancestor and evolved in their natural hosts, independently. Their co-evolution within their respective hosts contributed to their unique genomic characteristics. HAV originates in small insectivorous mammals and has evolved for interspecies transmission to rodents, given this evolution and the contact between humans and rodents HAV eventually evolved to infect humans 11 . BHuV-1 and OHuV-1 were isolated from cattle and sheep in Hungary in 2008 and 2009. Their complete aa sequences showed 78.8-78.9% identity with lineage 3 SX2015-2, and 72.5% identity with Norway rat hunnivirus. This indicates that SX2015-2 may be a new species in the genus Hunnivirus. In addition, based on the complete aa sequence identity and phylogenetic tree, our sequence was more closely related to that of domestic animal viruses(BHuV-1, OHuV-1) than to Norway rat hunnivirus, despite the distant geographic distribution and the long time intervals. This implies that picornaviruses have a worldwide geographic distribution, and a characteristic ability of cross-species transmission in humans, and wild and domestic animals 8, 16, 32 . FJ2015 represents a new species of the genus Rosavirus, based on aa identity and phylogenetic analysis. The complete aa sequence of rosavirus2 showed 79.7% identity with rosavirusM-7 (rodent feces) and 51.4% identity with FJ2015. Our results provide the first evidence for the presence and evolution of this rosavirus in rodent hosts in China. Although there was no statistical evidence that rosavirus 2 was associated with diarrhea 16 , many rodent-borne pathogens cause only mild or asymptomatic infections in the human population, and these illnesses Scientific RepoRts | 6:34381 | DOI: 10.1038/srep34381 are often misdiagnosed and under reported 25, [39] [40] [41] . It is a risk that asymptomatic infections in healthy people may evolve to become severe infectious disease in the future and in the case of the pathogens identified here this may already be occurring. Lineage 4 viruses appeared to be new species of rodent-related enteroviruses. 3D-based phylogenetic analysis placed lineage 4 viruses adjacent to rhinovirus A and enterovirus A. Enterovirus A is the pathogen of human hand-foot-and-mouthdisease, and rhinovirus A is the human rhinovirus. These results also provide the first evidence for these enteroviruses and their evolution in rodent hosts in China, and indicate the important role of rodents in picornavirus evolution and diversity. This study increases our understanding of picornavirus diversity in wild rodents and highlights the potentially large number of still-uncharacterized rodent picornaviruses [42] [43] [44] . The close relationships between picornaviruses (phylogenetic and genetic similarities) in rodents and those in domestic animals and humans have been noted previously 1, 8, 11, 16 . However, further epidemiological and molecular studies are also required to investigate the geographic distribution, diversity, and clinical importance of these novel rodent picornaviruses and their hosts. A better understanding of this virus family is essential in light of its potential to contribute new human pathogens in the future 38 . Ethics statement. Rodents Viral DNA and RNA library construction and next-generation sequencing. Samples from each species were pooled by combining 1 mL from each sample in maintenance medium in a fresh sample tube. The pooled samples, classified by species, were then processed using a viral particle-protected, nucleic acid purification method, as described previously 9 . The extracted RNA and DNA were amplified by sequence-independent PCR and the amplified viral nucleic acid libraries for each rodent species were sequenced using an Illumina HiSeq2500 (Illumina, San Diego, CA, USA), for single reads of 81 bp long. The raw sequence reads were then filtered using previously described criteria 9 to obtain valid sequences. Taxonomic assignment. Sequence-similarity-based taxonomic assignments were done as described previously 9 . The valid sequence reads were aligned to sequences in the NCBI non-redundant nucleotide (nt) and protein databases using BLASTn and BLASTx, respectively. The taxonomies of the aligned reads with the best BLAST scores (E score < 10 −5 ) were parsed using the MEGAN 4 -MetaGenome Analyzer 4. Genome sequencing. Sequence reads classified into the same virus family or genus by MEGAN 4 were extracted. The accurate locations of the reads and the relative distances between reads of the same virus were determined based on the alignment results exported with MEGAN 4. The located reads were then used for read-based PCR to identify partial genomes. The primers used to amplify the fragments of each virus are available upon request. Based on the partial genomic sequences of the viruses, the remaining genomic sequences were determined using inverse PCR, genome walking, and 5′ -and 3′ -rapid amplification of cDNA end 9 . Distribution map. The distribution map of specimens collected from eight representative regions of China was generated by online software SuperMap (http://www.supermapol.com/). Prepared from an Excel file which contained the name of each province, the total number of samples collected and the positive samples detected. This file was then uploaded to the SuperMap online tool to generate the distribution map. The size of the pie chart is in proportion with the total number of specimens collected from each region. Deep blue represents samples positive for picornavirus, and light blue represents negative samples. Phylogenetic analysis. Picornaviruses containing internal ribosomal entry site (IRES)-like sequences in their 5′ untranslated regions (UTRs) were identified by BLAST searches (http://www.ncbi.nlm.nih.gov/BLAST/) of viral sequences in the GenBank database. Secondary/tertiary structural elements in picornavirus 5′ UTRs were modeled using the aligned RNAalifold server of the ViennaRNA Web Services (http://rna.tbi.univie.ac.at/). MEGA5.0 (www.megasoftware.net) was used to align nt and deduced aa sequences using the MUSCLE package and default parameters. Phylogenetic trees showing the relationships among picornaviruses based on the aa sequences of P1 (capsid), P2, and P3 were generated using maximum likelihood mtREV with Freqs (+ F) model, gamma distributed with invariant sites (G + I), with 1000 bootstrap replicates. Routine sequence alignments were performed using Clustal Omega, Needle (available at: http://www.ebi.ac.uk/Tools/), MegAlign (Lasergene, DNAstar, Madison, Wisconsin), and T-coffee with manual curation. The conserved protein families and domains were predicted using Pfam and InterProScan 5 (available at: http://www.ebi.ac.uk/services/proteins). Amino acid identities and genetic distances were calculated using the ML method and were performed using a Pairwise evolutionary distance (PED) calculation as the distance metric. All genome sequences have been submitted to GenBank, with accession numbers for the seven rodent picornaviruses of KX156153-KX156159. The fecal viral flora of wild rodents Rodent-borne diseases and their risks for public health Wildlife, exotic pets, and emerging zoonoses The role of wildlife in emerging and re-emerging zoonoses Rodent-borne emerging viral zoonosis. Hemorrhagic fevers and hantavirus infections in South America Identification and genetic characterization of a novel picornavirus from chickens Clinical features and complete genome characterization of a distinct human rhinovirus (HRV) genetic cluster, probably representing a previously undetected HRV species, HRV-C, associated with acute respiratory illness in children Saffold virus, a human Theiler's-like cardiovirus, is ubiquitous and causes infection early in life Virome analysis for identification of novel mammalian viruses in bat species from Chinese provinces Primary structural comparison of RNA-dependent polymerases from plant, animal and bacterial viruses Evolutionary origins of hepatitis A virus in small mammals Cysteine proteases of positive strand RNA viruses and chymotrypsin-like serine proteases. A distinct protein superfamily with a common structural fold A new superfamily of putative NTP-binding domains encoded by genomes of small DNA and RNA viruses Complete nucleotide sequence and genetic organization of Aichi virus, a distinct member of the Picornaviridae associated with acute gastroenteritis in humans The impact of RNA structure on picornavirus IRES activity Discovery of rosavirus 2, a novel variant of a rodent-associated picornavirus, in children from The Gambia. Virology 454-455 Putative papain-related thiol proteases of positive-strand RNA viruses. Identification of rubi-and aphthovirus proteases and delineation of a novel conserved domain associated with proteases of rubi-, alpha-and coronaviruses Novel picornavirus in domesticated common quail (Coturnix coturnix) in Hungary The 2A proteins of three diverse picornaviruses are related to each other and to the H-rev107 family of proteins involved in the control of cell proliferation Sequence analysis of a duck picornavirus isolate indicates that it together with porcine enterovirus type 8 and simian picornavirus type 2 should be assigned to a new picornavirus genus The leader peptide of Theiler's murine encephalomyelitis virus is a zinc-binding protein Myristylation of picornavirus capsid protein VP4 and its structural significance Two related superfamilies of putative helicases involved in replication, recombination, repair and expression of DNA and RNA genomes Mutational analysis of the proposed FG loop of poliovirus proteinase 3C identifies amino acids that are necessary for 3CD cleavage and might be determinants of a function distinct from proteolytic activity Detection of zoonotic pathogens and characterization of novel viruses carried by commensal Rattus norvegicus Genetic identification of a hantavirus associated with an outbreak of acute respiratory illness Sin Nombre virus infection of deer mice in Montana: characteristics of newly infected mice, incidence, and temporal pattern of infection Phylogeny of the genus Arenavirus New ecological aspects of hantavirus infection: a change of a paradigm and a challenge of prevention-a review Antigenic and molecular characterization of hantavirus isolates from China Complete genome analysis of three novel picornaviruses from diverse bat species Two closely related novel picornaviruses in cattle and sheep in Hungary from 2008 to 2009, proposed as members of a new genus in the family Picornaviridae Deciphering the bat virome catalog to better understand the ecological diversity of bat viruses and the bat origin of emerging infectious diseases Kobuviruses -a comprehensive review Commensal ecology, urban landscapes, and their influence on the genetic characteristics of citydwelling Norway rats (Rattus norvegicus) Zoonotic aspects of arenavirus infections Public health threat of new, reemerging, and neglected zoonoses in the industrialized world Discovery of a Novel Hepatovirus (Phopivirus of Seals) Related to Human Hepatitis A Virus Clinical infectious diseases: an official publication of the Infectious Diseases Society of America Rat bite fever and Streptobacillus moniliformis Human-rodent contact and infection with lymphocytic choriomeningitis and Seoul viruses in an inner-city population Pathogenic simian immunodeficiency virus infection is associated with expansion of the enteric virome Genome-virome interactions: examining the role of common viral infections in complex disease Metagenomics and future perspectives in virus discovery Supplementary information accompanies this paper at http://www.nature.com/srep Competing financial interests: The authors declare no competing financial interests.