TY - UNPB
T1 - Repeats Mimic Pathogen-Associated Patterns Across a Vast Evolutionary Landscape
AU - Šulc, Petr
AU - Di Gioacchino, Andrea
AU - Solovyov, Alexander
AU - Marhon, Sajid A.
AU - Sun, Siyu
AU - Lindholm, Håvard T
AU - Chen, Raymond
AU - Hosseini, Amir
AU - Jiang, Hua
AU - Ly, Bao-Han
AU - Mehdipour, Parinaz
AU - Abdel-Wahab, Omar
AU - Vabret, Nicolas
AU - LaCava, John
AU - De Carvalho, Daniel D.
AU - Monasson, Rémi
AU - Cocco, Simona
AU - Greenbaum, Benjamin D.
PY - 2023/3/10
Y1 - 2023/3/10
N2 - An emerging hallmark across human diseases – such as cancer, autoimmune and neurodegenerative disorders – is the aberrant transcription of typically silenced repetitive elements. Once active, a subset of repeats may be capable of “viral mimicry”: the display of pathogen-associated molecular patterns (PAMPs) that can, in principle, bind pattern recognition receptors (PRRs) of the innate immune system and trigger inflammation. Yet how to quantify the landscape of viral mimicry and how it is shaped by natural selection remains a critical gap in our understanding of both genome evolution and the immunological basis of disease. We propose a theoretical framework to quantify selective forces on virus-like features as the entropic cost a sequence pays to hold a non-self PAMP and show our approach can predict classes of viral-mimicry within the human genome and across eukaryotes. We quantify the breadth and conservation of viral mimicry across multiple species for the first time and integrate selective forces into predictive evolutionary models. We show HSATII and intact LINE-1 (L1) are under selection to maintain CpG motifs, and specific Alu families likewise maintain the proximal presence of inverted copies to form double-stranded RNA (dsRNA). We validate our approach by predicting high CpG L1 ligands of L1 proteins and the innate receptor ZCCHC3, and dsRNA present both intracellularly and as MDA5 ligands. We conclude viral mimicry is a general evolutionary mechanism whereby genomes co-opt pathogen-associated features generated by prone repetitive sequences, likely offering an advantage as a quality control system against transcriptional dysregulation.
AB - An emerging hallmark across human diseases – such as cancer, autoimmune and neurodegenerative disorders – is the aberrant transcription of typically silenced repetitive elements. Once active, a subset of repeats may be capable of “viral mimicry”: the display of pathogen-associated molecular patterns (PAMPs) that can, in principle, bind pattern recognition receptors (PRRs) of the innate immune system and trigger inflammation. Yet how to quantify the landscape of viral mimicry and how it is shaped by natural selection remains a critical gap in our understanding of both genome evolution and the immunological basis of disease. We propose a theoretical framework to quantify selective forces on virus-like features as the entropic cost a sequence pays to hold a non-self PAMP and show our approach can predict classes of viral-mimicry within the human genome and across eukaryotes. We quantify the breadth and conservation of viral mimicry across multiple species for the first time and integrate selective forces into predictive evolutionary models. We show HSATII and intact LINE-1 (L1) are under selection to maintain CpG motifs, and specific Alu families likewise maintain the proximal presence of inverted copies to form double-stranded RNA (dsRNA). We validate our approach by predicting high CpG L1 ligands of L1 proteins and the innate receptor ZCCHC3, and dsRNA present both intracellularly and as MDA5 ligands. We conclude viral mimicry is a general evolutionary mechanism whereby genomes co-opt pathogen-associated features generated by prone repetitive sequences, likely offering an advantage as a quality control system against transcriptional dysregulation.
U2 - 10.1101/2021.11.04.467016
DO - 10.1101/2021.11.04.467016
M3 - Preprint
T3 - BioRxiv
BT - Repeats Mimic Pathogen-Associated Patterns Across a Vast Evolutionary Landscape
PB - BioRxiv
ER -