id author title date pages extension mime words sentences flesch summary cache txt cord-341029-49360l2a Nasir, Arshan A phylogenomic data-driven exploration of viral origins and evolution 2015-09-25 .txt text/plain 14410 794 49 Viruses harboring different replicon types and infecting distantly related hosts shared many metabolic and informational protein structural domains of ancient origin that were also widespread in cellular proteomes. Here, we analyzed a total of 5080 completely sequenced proteomes from cells and viruses and assigned FSF domains to their proteins using structure-based hidden Markov models (HMMs) defined by the SUPERFAMILY database (version 1.75) (20). Viral supergroup behaves similarly to cellular superkingdoms in terms of FSF sharing patterns. A total of 1995 significant FSF domains (E < 0.0001) were detected in 11 million proteins of 5080 proteomes sampled from cells and viruses. It also suggests that viruses are very ancient and most likely infected the last common ancestor of each superkingdom because viral FSFs were present in a diverse array of cellular organisms ranging from small microbes to large eukaryotes. ./cache/cord-341029-49360l2a.txt ./txt/cord-341029-49360l2a.txt