key: cord-0006381-mcy43i6a
authors: Bojko, Jamie
title: The mitochondrial genome of UK (non-native) Dikerogammarus haemobaphes (Amphipoda: Gammaridae) informs upon Dikerogammarus evolution, invasions and associated microparasites
date: 2019-10-14
journal: Hydrobiologia
DOI: 10.1007/s10750-019-04084-1
sha: ca06445bcd59d7eb2c04248714503fd1df833a86
doc_id: 6381
cord_uid: mcy43i6a
The amphipod Dikerogammarus haemobaphes is a high-risk carrier of parasites that impact wildlife in its non-native range. Studies using the mitochondrial genes Cytochrome Oxidase Sub-Unit 1 (cox1) and the large-subunit ribosomal RNA gene (16S) provide some nucleotide detail for understanding the evolution and phylogeography of this species. Despite this, the origins of the invasion remain unknown, as do the origins of its parasites. This study provides the full annotated mitochondrial genome (15,460 bp) of D. haemobaphes, consisting of 2 rRNAs, 24 tRNAs and 14 protein coding genes. Mitochondrial genes from the UK isolate are compared to existing data on NCBI and are used in a concatenated phylogenetic approach, which identifies D. haemobaphes as an early member of the Gammaridae (Amphipoda). Viral, bacterial, protistan and microsporidian parasites are present across the Gammaridae, including D. haemobaphes, suggesting that the ancestor of the Gammaridae harboured related diseases and that further screening of amphipods is likely to reveal further microparasite diversity. This correlation suggests that other gammarid invaders have the potential to harbour a range of microparasites. The mitochondrial genome of this species will act as a resource to facilitate our understanding of gene flow, disease epidemiology and evolutionary history in this invasion-disease model.
The demon shrimp, Dikerogammarus haemobaphes (Eichwald 1841), is a non-native freshwater amphipod in the UK that exerts low levels of ecological damage and inter-species competition (Bovy et al., 2015).
The species hosts multiple mortality-inducing and behaviour-altering pathogens that have been carried alongside the invasion into the UK (Bojko et al., 2018a). Infection with the microsporidian pathogen Cucumispora ornata Bojko, Dunn, Stebbing, Ross, Kerr, Stentiford 2015 was noted to reduce activity in heavily infected hosts and was associated with mortality in both D. haemobaphes and non-target Gammarus pulex (L.), which also harbours the infection in wild populations (Bojko et al., 2018a). 'Dikerogammarus haemobaphes bi-facies-like Virus' (DhbflV) was also identified as a mortality-inducing virus at low prevalence within the D. haemobaphes population in the UK (Bojko et al., 2018a). Finally, a likely novel member of the Nudiviridae, 'Dikerogammarus haemobaphes Bacilliform Virus' (DhBV), was found to increase the activity of its host and potentially alter the rate of invasion spread (Bojko et al., 2018a). This species, and specifically its parasites, are now considered a high-risk invasion system that requires the development of diagnostic methods to track the invasion, associated diseases and their effects. To date, mitochondrial data for this species are restricted to short ~600 bp sequence tags of the Cytochrome Oxidase Sub-Unit 1 gene (cox1) (Grabner et al., 2015) and partial 16S. Next generation sequencing platforms and bioinformatic tools make it possible to rapidly generate data on the genomic composition of the demon shrimp and aid the development of diagnostic tools. Recent advances in the sequencing of mitochondrial genomes from amphipods have also allowed for increased phylogenetic information, with more than 50 mitochondrial genomes being available for Amphipoda (Romanova et al., 2016; Macher et al., 2017; Cormier et al., 2018). Herein, the mitochondrial genome of the demon shrimp is presented.
The mitochondrial genome of this UK-based individual will act as a resource to develop additional PCR diagnostics for population genetics studies to determine the genetic diversity and likely origins of invasive populations. Furthermore, this genome provides detailed information on the evolution of Dikerogammarus sp. and can be used in tandem with disease screening data to identify the potential origins of its parasites.
In 2016, a single animal was collected by hand from Carlton Brook, UK (British National Grid [BNG] ref: SK3870004400). The urosome of this individual underwent phenol:chloroform DNA extraction after an overnight digestion with Proteinase K. This extract was prepared into a DNA library using a NEXTERA-XT library preparation method for MiSeq sequencing (Illumina; www.illumina.com) and an Illumina TruSeq® DNA PCR-Free library preparation kit for HiSeq (Illumina; www.illumina.com). Raw data were trimmed (Illuminatrim-TRIMMOMATIC) and then assembled using SPAdes v.3.13.0 (default settings with k-mer sizes: 21, 33, 55, 77, 99, 127) (Bankevich et al., 2012; Bolger et al., 2014). This resulted in a 15,460 bp circular contig with 243.97X coverage. Trimmed reads were re-aligned to the sequence to confirm even coverage across the circular sequence. This sequence was submitted to MITOS (invertebrate) to provide detailed annotation of protein coding (PCG) and non-coding RNA (ncRNA) genetic regions (Bernt et al., 2013), which were further edited and confirmed using data available on NCBI. Individual ncRNAs and PCGs were compared to available sequence data from alternative D. haemobaphes and other Amphipoda using NCBI, BLASTp and BLASTn. Circa (www.omgenomics.com/circa) and CLC (www.qiagenbioinformatics.com) were used to develop diagrammatic representations of the genetic data. Sequence data for the D. haemobaphes mitochondrial genome can be acquired from NCBI (accession number: MK644228).
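The coverage check described above (re-aligning trimmed reads to the circular contig) can be illustrated with a minimal Python sketch. This is not the pipeline used in the study: the function names `circular_depth` and `coverage_summary`, the `(start, length)` alignment format and the toy reads are all assumptions made for illustration. The sketch shows only that reads spanning the artificial break in a linearised circular contig wrap round to position 0, and that mean depth is the sum of per-base depths divided by contig length.

```python
from typing import Iterable, List, Tuple


def circular_depth(genome_length: int,
                   alignments: Iterable[Tuple[int, int]]) -> List[int]:
    """Per-base read depth on a circular sequence.

    Each alignment is a hypothetical (start, length) pair with a 0-based
    start. Reads that run past the end of the linearised contig wrap
    around to position 0.
    """
    depth = [0] * genome_length
    for start, length in alignments:
        for offset in range(length):
            depth[(start + offset) % genome_length] += 1
    return depth


def coverage_summary(depth: List[int]) -> Tuple[float, int, int]:
    """Return (mean depth, minimum depth, maximum depth)."""
    mean = sum(depth) / len(depth)
    return mean, min(depth), max(depth)


if __name__ == "__main__":
    # Toy data: a 10 bp circle and three made-up reads, one of which
    # (start 8, length 4) crosses the end of the linear representation.
    toy_depth = circular_depth(10, [(0, 5), (3, 5), (8, 4)])
    print(toy_depth)
    print(coverage_summary(toy_depth))
```

On real data, a large gap between minimum and mean depth, or a drop in depth at the contig ends, would be one indication that coverage is uneven or that the sequence has not been circularised correctly.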
Three maximum likelihood phylogenetic trees were calculated using the mitochondrial genome of D. haemobaphes. The first two used the 16S (276 positions) or cox1 (614 positions) gene to compare Dikerogammarus sp. from NCBI (n = 16 sequences and 39 sequences, respectively) (evolutionary model: HKY + F + I). The final tree used individually aligned and subsequently concatenated amino acid (AA) sequences (13 genes: atp6, atp8-0, cob, cox1, cox2, cox3, nad1, nad2, nad3, nad4, nad4L, nad5, nad6) (n = 38 Amphipoda and 1 Isopoda outgroup) (evolutionary model: mtInv + F + I + G4). In all cases the sequences were trimmed and aligned using MAFFT in Geneious v.10.0.2 (gap: 1.53, cost: 0.123) before phylogenetic analysis and model selection according to the Bayesian Information Criterion (BIC). IQ-TREE was used to calculate the phylogenetic trees (Nguyen et al., 2015) and included the use of ultrafast approximated bootstraps (n = 1000) (Minh et al., 2013). '+F' refers to empirical base frequencies, counted directly from the alignment; '+I' allows for a proportion of invariable sites; finally, '+G4' refers to the addition of the discrete Gamma model with four rate categories.
Multiple sources of literature were used to compare known microparasites [Nudiviridae, 'Candidatus Aquirickettsiella', Microsporidia (Cucumispora and Dictyocoela) and gregarines (Apicomplexa)] of each amphipod with a known mitochondrial genome to the phylogenetic information determined by this study (Madyarova et al., 2015; Bacela-Spychalska et al., 2018; Dimova et al., 2018; Ironside & Wilkinson, 2018; Bojko & Ovcharenko, 2019).
The mitochondrial genome of D. haemobaphes is 15,460 bp in length (coverage = 243.97X) and encodes 24 tRNA, 2 rRNA and 14 protein coding genes (including a duplication of atp8) (Table 1; Fig. 1). The closest associated genome is that of Gammarus duebeni (NC017760), which shares two closely related tRNAs and six protein coding genes, primarily linked with the cytochrome complex.
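The concatenation step described for the 13-gene amino acid tree (each gene aligned on its own, then joined end to end per taxon) can be sketched in Python as follows. This is an illustrative sketch, not the Geneious/IQ-TREE workflow used in the study. The function name `concatenate_alignments`, the dictionary-based alignment format and the choice to pad taxa lacking a gene with gap characters are assumptions introduced here.

```python
from typing import Dict, List


def concatenate_alignments(gene_alignments: List[Dict[str, str]]) -> Dict[str, str]:
    """Join per-gene alignments into one supermatrix.

    Each element of gene_alignments maps taxon name -> aligned sequence
    for one gene. A taxon missing from a gene is padded with '-' so that
    every taxon ends up with a sequence of the same total length.
    """
    taxa = sorted({taxon for aln in gene_alignments for taxon in aln})
    parts: Dict[str, List[str]] = {taxon: [] for taxon in taxa}
    for aln in gene_alignments:
        lengths = {len(seq) for seq in aln.values()}
        if len(lengths) != 1:
            raise ValueError("sequences within one gene alignment must have equal length")
        width = lengths.pop()
        for taxon in taxa:
            parts[taxon].append(aln.get(taxon, "-" * width))
    return {taxon: "".join(chunks) for taxon, chunks in parts.items()}


if __name__ == "__main__":
    # Toy alignments with made-up residues; taxon names are placeholders.
    cox1 = {"taxon_A": "MK-L", "taxon_B": "MKTL"}
    nad1 = {"taxon_A": "FF", "taxon_C": "FY"}
    for name, seq in concatenate_alignments([cox1, nad1]).items():
        print(name, seq)
```

Aligning each gene separately before joining keeps homologous positions in register within each gene; the resulting matrix can then be passed to a tree-building program under a single model or with one partition per gene.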
The cox1 and 16S (rrnL) genes of the D. haemobaphes mitochondrial genome showed closest similarity to D. haemobaphes haplotypes from Germany (Main River and North Rhine-Westphalia) (Table 1). Structurally, the mitochondrial genome is A+T rich, with 33.8% GC content across the circular genome. The closest relatives with full mitochondrial genome availability were Gammarus duebeni Lilljeborg, 1852 (NC017760) and Eulimnogammarus cyaneus (Dybowsky, 1874) (NC033360), which show a highly similar gene organisation along the circular mitochondrial genome but with some small reorganisation of tRNAs. The trnR and trnE genes are present in that order instead of trnE-trnR as seen in the genomes of G. duebeni and E. cyaneus (Fig. 2). Dikerogammarus haemobaphes also has a duplication of trnQ. A duplication of atp8, which is part of the Adenosine Tri-Phosphate synthesis pathway (genes: atp8-0 and atp8-1), is present.
(Fig. 3), but the UK isolate is genetically dissimilar by 3.27%, suggesting that although this is the closest isolate, they are not genetically identical. Greater genetic variation is visible among the D. haemobaphes 16S sequences than among the D. villosus 16S sequences (Fig. 3). The tree based on the cox1 gene also results in two distinct clades of D. villosus and D. haemobaphes, both with high (95-100) bootstrap support. The UK isolate of D. haemobaphes shows closest nucleotide similarity to D. haemobaphes haplotype 1 (KY075268) sampled in Germany (sim. = 100%, cov. = 100%, e-value = 0.0). In Fig. 3 all the D. haemobaphes isolates branch together apart from one individual (AY529049), which originates from the North Caspian Sea, the species' native range. A concatenated phylogeny of all available mitochondrial protein sequences (n = 13 genes) from available amphipod mitochondrial genomes confirms that D.
haemobaphes sits within the Gammaridae, but also identifies the species as an early branching member (bootstrap support = 100) relative to the Gammarus genus and other species from Europe and the Ponto-Caspian region (Fig. 4). Other Amphipoda show predicted branching throughout the tree, with all genera represented by multiple species (Eulimnogammarus, Gammarus, Platorchestia, Hyalella, Epimeria, Pseudoniphargus, Stygobromus, Metacrangonyx and Caprella) branching together (Fig. 4). The tree shows low bootstrap support close to the root (55 or 38), suggesting that further sequencing of highly derived amphipods may help to add detail to the tree and increase the accuracy of its topology.
Using available literature (Madyarova et al., 2015; Bacela-Spychalska et al., 2018; Dimova et al., 2018; Ironside & Wilkinson, 2018; Bojko & Ovcharenko, 2019), the known microparasites of those amphipods from Europe and the Ponto-Caspian region are presented alongside the phylogenetics conducted by this study to identify possible points of parasite evolution and as yet undetermined hosts that may harbour infection (Fig. 5). Bacilliform viruses (Nudiviridae), intracellular bacteria ('Candidatus Aquirickettsiella'), species from two genera of Microsporidia (Cucumispora and Dictyocoela) and the presence of gregarines (Apicomplexa) are presented on the tree alongside known hosts (Fig. 5). Bacilliform viruses are present in two Gammarus sp. and D. haemobaphes, which sit at the base of the Gammaridae in Fig. 4. These same individuals also host intracellular bacteria ('Candidatus Aquirickettsiella'). Systematically identified Cucumispora sp. are restricted to two host species with mitochondrial genome data, D. haemobaphes and Gammarus roeselii (L.); however, multiple SSU sequences from Ponto-Caspian amphipod hosts place Cucumispora candidates across Fig. 5. Gregarines have been observed in D. haemobaphes, all Gammarus sp.
and in members of the Eulimnogammarus genus (Fig. 5). Two species, Pallaseopsis kessleri (Dyb.) and Crypturopus tuberculatus (Dyb.), do not yet have any identified microparasites from the groups explored herein.
Fig. 1 A map of the circular mitochondrial genome of Dikerogammarus haemobaphes. The genome is represented as a single circular black line. Protein coding genes are present on the outside of the black circle, with positive strand sequences in red and negative strand sequences in blue. Non-coding RNA sequences are represented internal to the black circle, with positive strand coding regions in red and negative strand sequences in blue. The labels for each protein coding gene or ncRNA gene are listed around the outside of the diagram before the genome size markers. Please refer to the NCBI accession MK644228 for electronic annotation.
The mitochondrial genomes of eukaryotic organisms have been used to infer phylogenetic relationships (Cormier et al., 2018), to understand energy and metabolism (Abele et al., 2007) and to better inform upon the genetic diversity of a population (Ma et al., 2015). Increased availability of mitochondrial data associated with biological invasions can provide a valuable resource to better understand invasion dynamics through population genetics. This information can be used to determine potential entry points and locate source populations of invasive species (Lallias et al., 2015), to determine the rates of evolution in invaders (Cormier et al., 2018) and cumulatively provide information on the potency of biosecurity and management efforts (Anderson et al., 2015). This study provides the first complete mitochondrial genome for a Dikerogammarus sp., identifying a total of 40 predicted coding regions for ncRNAs and PCGs that can be used to gain greater genetic-level data for understanding demon shrimp invasions, origins and evolution. These data are used to explore the position of Dikerogammarus sp.
within the Amphipoda and to identify the UK population as similar to populations on mainland Europe. The largest mitochondrial phylogeny for the Amphipoda is presented herein and is correlated with known amphipod diseases (Bojko & Ovcharenko, 2019) to explore potential evolutionary origins in addition to host species that may harbour interesting infections.
Biological invasions that introduce disease tend to be understudied, with the majority of studies focussing on the host introduction pathway and host-associated impact in novel environments (Roy et al., 2017). Dikerogammarus haemobaphes is a high-risk species for the introduction of disease; therefore it is important to note that the use of molecular resources in combination with disease screening efforts may be able to define the invasion pathway of the host and its parasites. This mitochondrial genome scaffold has already indicated that the UK population is tentatively related to non-native populations of demon shrimp collected from the Rhine and Main rivers in Germany (Grabner et al., 2015) and the Vistula, Poland. In these locations, disease caused by the lethal microsporidian parasite C. ornata (aka: Microsporidium sp. G) has also been observed (Bojko et al., 2015; Grabner et al., 2015), including in the related D. villosus (Bacela-Spychalska et al., 2012). There has been high success in the tracking of populations of invasive amphipods through Europe to their native range(s) using population genetics (Rewicz et al., 2015, 2017). Increased availability of molecular tools may allow a phylogeographic understanding of the origins of D. haemobaphes and potentially its parasites. This capability also extends to future invasions, such as the impending threat of invasion to the Great Lakes (USA), where multiple Ponto-Caspian species (e.g. Dreissena polymorpha Boettger, 1913) have already successfully invaded (Ricciardi & MacIsaac, 2000). Whether these species
The mitochondrial data provided herein for D.
haemobaphes represent only a single specimen, but based on the 16S data, it constituted a unique haplotype. The 16S gene showed closest similarity to D. haemobaphes (AJ440888) from Poland (97%). The cox1 gene of the UK individual is 100% identical over a 658 bp region to D. haemobaphes 'haplotype 1' from Germany (North Rhine-Westphalia). In conclusion, it appears that the D. haemobaphes in the UK (Carlton Brook) likely arrived from invasive populations in central Europe and not from the native range.
The genus Dikerogammarus (Gammaridea) contains freshwater and brackish amphipods and was first described by Stebbing (1899). The genus contains nine species to date: D. aralychensis, D. bispinosus, D. caspius, D. fluvitalis, D. gruberi, D. istanbulensis, D. oskari, D. villosus and D. haemobaphes (Özbek & Özkan, 2011). These species are naturally distributed around the Ponto-Caspian region (Black Sea, Caspian Sea and Sea of Azov) and several have become invasive throughout Europe and in the UK. Dikerogammarus villosus and D. haemobaphes have both invaded the UK and continue to impact freshwater systems, both directly and through the introduction of pathogens (Bojko et al., 2013, 2018b).
The mitochondrial genome of D. haemobaphes shares synteny and gene similarity with closely related amphipods, excluding the presence of some duplicate gene regions and a tRNA rearrangement (Figs. 1, 2). Specifically, a duplicated atp8 gene (atp8-1) on the opposite coding strand (Fig. 1) is absent from other Gammaridae. This gene shows no genetic similarity to other Gammaridae and may be a motif specific to this species, or possibly the Dikerogammarus genus pending further research. If this is the case it could be a clear molecular tag for use in future systematics of the Dikerogammarus group.
Fig. 5 Identification of known pathogens for those amphipods with mitochondrial sequence data. The phylogenetic tree is a portion of that presented in Fig. 4. Bacilliform viruses (Nudiviridae), intracellular bacteria ('Candidatus Aquirickettsiella'), Cucumispora sp., Dictyocoela sp. and gregarines are presented next to known host species. Dikerogammarus haemobaphes is known to harbour all these different pathogen groups, and its early presence on the tree suggests that all these parasite groups have been infecting this group prior to the evolutionary divergence of the most recent common ancestor of the Gammaridae.
Phylogenetically and morphologically, Dikerogammarus sp. have been identified as members of the Gammaridae and this study supports their inclusion using mitochondrial data from a D. haemobaphes representative (Müller et al., 2002). The phylogenetic data in Fig. 4 suggest that D. haemobaphes is likely an early member of the Gammaridae [sensu Hou and Sket (2016)], branching with strong support before the other members. Eurythenes maldoror d'Udekem d'Acoz & Havermans, 2015 and Onisimus nanseni (Sars, 1900) both branch at the node separating the Gammaridae and Eurytheneidae/Uristidae. Greater numbers of sequenced amphipods in both Dikerogammarus and other related genera would greatly increase the evolutionary detail of the early formation of the Gammaridae.
Dikerogammarus sp. are thought to be high invasion risks to the UK, including their co-invasive diseases [summarised in Bojko et al. (2018b)], which are of importance to freshwater ecosystem health (Roy et al., 2017, 2019). The data presented herein have identified that the D. haemobaphes population seeding the UK invasion likely originated in central Europe and not the native range, which means that the diseases identified from extensive screening in the UK are likely also present in the European range (Bojko et al., 2015, 2018b).
Many of these diseases can impact the activity of their host, but also cause mortality in non-target amphipod hosts, such as G. pulex, potentially threatening biodiversity (Bojko et al., 2018b). Disease screening in the Gammaridae has received little research effort; however, some studies have provided insight into the parasite diversity of some species that also have mitochondrial genomes available (Madyarova et al., 2015; Dimova et al., 2018; Bacela-Spychalska et al., 2018; Ironside & Wilkinson, 2018; Bojko & Ovcharenko, 2019). Combining data from any disease screening efforts with the phylogenetics conducted herein indicated that Microsporidia [Cucumispora sp. (including candidatus species) and Dictyocoela sp.] are present across the Gammaridae, with the exception of P. kessleri and C. tuberculatus, likely due to a lack of screening effort. Gregarine parasites (Apicomplexa) are present in Eulimnogammarus sp., Gammarus sp. and D. haemobaphes, suggesting that this group is also likely present across the Gammaridae. The recent discovery of 'Candidatus Aquirickettsiella gammari' Bojko, Dunn, Stebbing, van Aerle, Bacela-Spychalska, Bean, Urrutia, Stentiford 2018b and similar pathologies in D. haemobaphes and G. roeselii suggest that increased screening will also discover related pathogens across the Gammaridae. Finally, bacilliform viruses in the hepatopancreas of crustaceans (now thought to be part of the Nudiviridae) have been found in multiple Gammaridae, including Dikerogammarus sp. (Bojko et al., 2013, 2018b) and Gammarus sp. The presence of this virus in this early branching gammarid host suggests that the viral group is also likely present in the other Gammaridae (Fig. 5).
Dikerogammarus haemobaphes is the earliest member of the Gammaridae identified to date using concatenated mitochondrial phylogenetics.
Knowledge of its diseases suggests that many of the other Gammaridae likely also co-evolved with microsporidian, protistan, bacterial and viral diseases, many of which are yet to be discovered. The mitochondrial genome of this host will provide further insight into the development of genetic identification tools and the ability to track this species and its diseases, perhaps in combination with eDNA tools to explore invasion presence (Mauvisseau et al., 2019). Knowledge of the mitochondrial genome will help to differentiate host haplotypes to explore disease susceptibility and identify regions of similarity and difference between Dikerogammarus populations. Finally, this study has determined that the population in the UK seems to have been seeded by populations in Europe and not the native range, suggesting that the diseases in the UK are likely to be present in continental Europe and may pose a risk to native gammarids and the related freshwater ecology.
Marine invertebrate mitochondria and oxidative stress
Invaders in hot water: a simple decontamination method to prevent the accidental spread of aquatic invasive non-native species
Microsporidian disease of the invasive amphipod Dikerogammarus villosus and the potential for its transfer to local invertebrate fauna
Europe-wide reassessment of Dictyocoela (Microsporidia) infecting native and invasive amphipods (Crustacea): molecular versus ultrastructural traits
SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing
MITOS: improved de novo metazoan mitochondrial genome annotation
Parasites of invasive Crustacea: risks and opportunities for control (Doctoral dissertation)
Pathogens and other symbionts of the Amphipoda: taxonomic diversity and pathological significance. Diseases of Aquatic Organisms
Baseline histopathological survey of a recently invading island population of 'killer shrimp', Dikerogammarus villosus
Cucumispora ornata n. sp. (Fungi: Microsporidia) infecting invasive 'demon shrimp' (Dikerogammarus haemobaphes) in the United Kingdom
Parasites, pathogens and commensals in the 'low-impact' non-native amphipod host Gammarus roeselii
Pathogens of Dikerogammarus haemobaphes regulate host activity and survival, but also threaten native amphipod populations in the UK. Diseases of Aquatic Organisms
'Candidatus Aquirickettsiella gammari' (Gammaproteobacteria: Legionellales: Coxiellaceae): a bacterial pathogen of the freshwater crustacean Gammarus fossarum (Malacostraca: Amphipoda)
Trimmomatic: a flexible trimmer for Illumina sequence data
Predicting the predatory impacts of the 'demon shrimp' Dikerogammarus haemobaphes, on native and previously introduced species
The complete mitochondrial genome of Gammarus roeselii (Crustacea, Amphipoda): insights into mitogenome plasticity and evolution
Genetic diversity of Microsporidia in the circulatory system of endemic amphipods from different locations and depths of ancient Lake Baikal
Invaders, natives and their enemies: distribution patterns of amphipods and their microsporidian parasites in the Ruhr Metropolis
A review of Gammaridae (Crustacea: Amphipoda): the family extent, its evolutionary history, and taxonomic redefinition of genera
Accumulation and exchange of parasites during adaptive radiation in an ancient lake
Invasion genetics of the Pacific oyster Crassostrea gigas in the British Isles inferred from microsatellite and mitochondrial markers
First mitochondrial genome for the red crab (Charybdis feriata) with implication of phylogenomics and population genetics
The complete mitochondrial genome of a cryptic amphipod species from the Gammarus fossarum complex. Mitochondrial
Microsporidian parasites found in the hemolymph of four baikalian endemic amphipods
The development of an eDNA based detection method for the invasive shrimp Dikerogammarus haemobaphes
Ultrafast approximation for phylogenetic bootstrap
Genetic and morphological differentiation of Dikerogammarus invaders and their invasion history in Central Europe
IQ-TREE: A fast and effective stochastic algorithm for estimating maximum likelihood phylogenies
Dikerogammarus istanbulensis sp
Out of the Black Sea: phylogeography of the invasive killer shrimp Dikerogammarus villosus across Europe
The killer shrimp, Dikerogammarus villosus, invading European Alpine Lakes: a single main source but independent founder events with an overall loss of genetic diversity
Recent mass invasion of the North American Great Lakes by Ponto-Caspian species
Evolution of mitochondrial genomes in Baikalian amphipods
Alien pathogens on the horizon: Opportunities for predicting their threat to wildlife
Developing a list of invasive alien species likely to threaten biodiversity and ecosystems in the European Union
Amphipoda from the Copenhagen Museum and other sources, part II. The Transactions of the Linnean Society of London
Acknowledgements I would like to thank Dr Donald Behringer for his contribution to the collection of HiSeq data for this study and for providing me some time to construct this manuscript. Additionally, I would like to acknowledge NERC funding (#:1368300) to JB, Dr Grant Stentiford, Dr Alison Dunn and Dr Paul Stebbing, which contributed to the initial collection of Illumina MiSeq data.
Data availability The mitochondrial genome is submitted under accession number: MK644228, in the NCBI database.