id: cord-266288-buc4dd5y
author: Dong, Rui
title: A Novel Approach to Clustering Genome Sequences Using Inter-nucleotide Covariance
date: 2019-04-09
pages:
extension: .txt
mime: text/plain
words: 5247
sentences: 300
flesch: 61
summary: Classification of DNA sequences is an important issue in bioinformatics, yet most existing methods for phylogenetic analysis, including Multiple Sequence Alignment (MSA), are time-consuming and computationally expensive. Here we propose a new Accumulated Natural Vector (ANV) method, which represents each DNA sequence by a point in ℝ^18. The natural vector method performs well on many datasets (Deng et al., 2011; Yu et al., 2013b; Hoang et al., 2016; Li et al., 2016); however, it considers only the number, average position, and dispersion of positions of each nucleotide. The ANV method considers not only these basic properties of each nucleotide but also the covariance between nucleotides: it projects each sequence into a point in ℝ^18, where the additional six dimensions describe the covariance between nucleotides.
cache: ./cache/cord-266288-buc4dd5y.txt
txt: ./txt/cord-266288-buc4dd5y.txt