Skip to main content
Journal cover image

Virus-like insertions with sequence signatures similar to those of endogenous nonretroviral RNA viruses in the human genome.

Publication ,  Journal Article
Kojima, S; Yoshikawa, K; Ito, J; Nakagawa, S; Parrish, NF; Horie, M; Kawano, S; Tomonaga, K
Published in: Proceedings of the National Academy of Sciences of the United States of America
February 2021

Understanding the genetics and taxonomy of ancient viruses will give us great insights into not only the origin and evolution of viruses but also how viral infections played roles in our evolution. Endogenous viruses are remnants of ancient viral infections and are thought to retain the genetic characteristics of viruses from ancient times. In this study, we used machine learning of endogenous RNA virus sequence signatures to identify viruses in the human genome that have not been detected or are already extinct. Here, we show that the k-mer occurrence of ancient RNA viral sequences remains similar to that of extant RNA viral sequences and can be differentiated from that of other human genome sequences. Furthermore, using this characteristic, we screened RNA viral insertions in the human reference genome and found virus-like insertions with phylogenetic and evolutionary features indicative of an exogenous origin but lacking homology to previously identified sequences. Our analysis indicates that animal genomes still contain unknown virus-derived sequences and provides a glimpse into the diversity of the ancient virosphere.

Duke Scholars

Published In

Proceedings of the National Academy of Sciences of the United States of America

DOI

EISSN

1091-6490

ISSN

0027-8424

Publication Date

February 2021

Volume

118

Issue

5

Start / End Page

e2010758118

Related Subject Headings

  • Retroviridae
  • Nucleoproteins
  • Mutagenesis, Insertional
  • Mammals
  • Machine Learning
  • Humans
  • Genome, Human
  • Base Sequence
  • Animals
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Kojima, S., Yoshikawa, K., Ito, J., Nakagawa, S., Parrish, N. F., Horie, M., … Tomonaga, K. (2021). Virus-like insertions with sequence signatures similar to those of endogenous nonretroviral RNA viruses in the human genome. Proceedings of the National Academy of Sciences of the United States of America, 118(5), e2010758118. https://doi.org/10.1073/pnas.2010758118
Kojima, Shohei, Kohei Yoshikawa, Jumpei Ito, So Nakagawa, Nicholas F. Parrish, Masayuki Horie, Shuichi Kawano, and Keizo Tomonaga. “Virus-like insertions with sequence signatures similar to those of endogenous nonretroviral RNA viruses in the human genome.Proceedings of the National Academy of Sciences of the United States of America 118, no. 5 (February 2021): e2010758118. https://doi.org/10.1073/pnas.2010758118.
Kojima S, Yoshikawa K, Ito J, Nakagawa S, Parrish NF, Horie M, et al. Virus-like insertions with sequence signatures similar to those of endogenous nonretroviral RNA viruses in the human genome. Proceedings of the National Academy of Sciences of the United States of America. 2021 Feb;118(5):e2010758118.
Kojima, Shohei, et al. “Virus-like insertions with sequence signatures similar to those of endogenous nonretroviral RNA viruses in the human genome.Proceedings of the National Academy of Sciences of the United States of America, vol. 118, no. 5, Feb. 2021, p. e2010758118. Epmc, doi:10.1073/pnas.2010758118.
Kojima S, Yoshikawa K, Ito J, Nakagawa S, Parrish NF, Horie M, Kawano S, Tomonaga K. Virus-like insertions with sequence signatures similar to those of endogenous nonretroviral RNA viruses in the human genome. Proceedings of the National Academy of Sciences of the United States of America. 2021 Feb;118(5):e2010758118.
Journal cover image

Published In

Proceedings of the National Academy of Sciences of the United States of America

DOI

EISSN

1091-6490

ISSN

0027-8424

Publication Date

February 2021

Volume

118

Issue

5

Start / End Page

e2010758118

Related Subject Headings

  • Retroviridae
  • Nucleoproteins
  • Mutagenesis, Insertional
  • Mammals
  • Machine Learning
  • Humans
  • Genome, Human
  • Base Sequence
  • Animals