id author title date pages extension mime words sentences flesch summary cache txt cord-193489-u6ewlh16 Wang, Rui Decoding SARS-CoV-2 transmission, evolution and ramification on COVID-19 diagnosis, vaccine, and medicine 2020-04-29 .txt text/plain 6066 419 62 Based on the genotyping of 6156 genome samples collected up to April 24, 2020, we report that SARS-CoV-2 has had 4459 alarmingly mutations which can be clustered into five subtypes. Genetic identification and characterization of the geographic distribution, intercontinental evolution, and global trends of SARS-CoV-2 is the most efficient approach for studying COVID-19 genomic epidemiology and offer the molecular foundation for region-specific SARS-CoV-2 vaccine design, drug discovery, and diagnostic development [10] . We use K-means methods to cluster SARS-CoV-2 mutations, which provides the updated molecular information for the region-specific design of vaccines, drugs, and diagnoses. Table 5 presents the statistics of single mutations on various SARS-CoV-2 proteins that occurred in the recorded genomes between January 5, 2020, and April 24, 2020. Specifically, nucleocapsid protein has both the highest number of mutations per residues of 0.56 and the highest h-index of 27, suggesting that it is the most non-conservative protein in SARS-CoV-2 genomes. ./cache/cord-193489-u6ewlh16.txt ./txt/cord-193489-u6ewlh16.txt