Affordable Access

Mapping the drivers of within-host pathogen evolution using massive data sets

Authors
  • Palmer, DS
  • Turner, I
  • Fidler, S
  • Frater, J
  • Goedhals, D
  • Goulder, P
  • Huang, K-HG
  • Oxenius, A
  • Phillips, R
  • Shapiro, R
  • Van Vuuren, C
  • McLean, AR
  • McVean, G
Publication Date
May 20, 2019
Source
Spiral - Imperial College Digital Repository
Keywords
License
Unknown

Abstract

Differences among hosts, resulting from genetic variation in the immune system or heterogeneity in drug treatment, can impact within-host pathogen evolution. Genetic association studies can potentially identify such interactions. However, extensive and correlated genetic population structure in hosts and pathogens presents a substantial risk of confounding analyses. Moreover, the multiple testing burden of interaction scanning can potentially limit power. We present a Bayesian approach for detecting host influences on pathogen evolution that exploits vast existing data sets of pathogen diversity to improve power and control for stratification. The approach models key processes, including recombination and selection, and identifies regions of the pathogen genome affected by host factors. Our simulations and empirical analysis of drug-induced selection on the HIV-1 genome show that the method recovers known associations and has superior precision-recall characteristics compared to other approaches. We build a high-resolution map of HLA-induced selection in the HIV-1 genome, identifying novel epitope-allele combinations.

Report this publication

Statistics

Seen <100 times