id author title date pages extension mime words sentences flesch summary cache txt cord-268224-5tbb8df1 Di Gioacchino, Andrea The heterogeneous landscape and early evolution of pathogen-associated CpG dinucleotides in SARS-CoV-2 2020-08-27 .txt text/plain 7109 412 62 Using a model of the viral gene evolution under human host pressure, we find that synonymous mutations seem driven, in the N protein coding region, both by the viral codon bias and by the high value of the CpG content, leading to a loss in CpG. Finally we use a model of the viral gene evolution under human host pressure, characterized by the CpG force, to study synonymous mutations, and in particular those which change CpG content, observed since the SARS-CoV-2 entered the human population (Sec. 2.3). We first compute the global force on CpG dinucleotides for SARS-Cov-2 and a variety of other viruses from the Coronaviridae family affecting humans or other mammals (bat, pangolin), see Fig. 1a , using as null model the nucleotide usage calculated from human genome [22] (see Methods Sec. 4.2) 1 . ./cache/cord-268224-5tbb8df1.txt ./txt/cord-268224-5tbb8df1.txt