Skip to main navigation Skip to search Skip to main content

Protein Language Models Expose Viral Immune Mimicry

Research output: Contribution to journalArticlepeer-review

Abstract

Viruses have evolved sophisticated solutions to evade host immunity. One of the most pervasive strategies is molecular mimicry, whereby viruses imitate the molecular and biophysical features of their hosts. This mimicry poses significant challenges for immune recognition, therapeutic targeting, and vaccine development. In this study, we leverage pretrained protein language models (PLMs) to distinguish between viral and human proteins. Our model enables the identification and interpretation of viral proteins that most frequently elude classification. We characterize these by integrating PLMs with explainable models. Our approach achieves state-of-the-art performance with ROC-AUC of 99.7%. The 3.9% of misclassified sequences are signified by viral proteins with low immunogenicity. These errors disproportionately involve human-specific viral families associated with chronic infections and immune evasion, suggesting that both the immune system and machine learning models are confounded by overlapping biophysical signals. By coupling PLMs with explainable AI techniques, our work advances computational virology and offers mechanistic insights into viral immune escape. These findings carry implications for the rational design of vaccines, and improved strategies to counteract viral persistence and pathogenicity.

Original languageEnglish
Article number1199
JournalViruses
Volume17
Issue number9
DOIs
StatePublished - Sep 2025

Bibliographical note

Publisher Copyright:
© 2025 by the authors.

Keywords

  • IL-10
  • PLM
  • ProteinBERT
  • adaptive immune system
  • autoimmune diseases
  • deep learning
  • epitope
  • feature selection
  • immunological tolerance

Fingerprint

Dive into the research topics of 'Protein Language Models Expose Viral Immune Mimicry'. Together they form a unique fingerprint.

Cite this