Affordable Access

Publisher Website

Utilizing Genotype Imputation for the Augmentation of Sequence Data

Authors
Journal
PLoS ONE
1932-6203
Publisher
Public Library of Science
Publication Date
Volume
5
Issue
6
Identifiers
DOI: 10.1371/journal.pone.0011018
Keywords
  • Research Article
  • Genetics And Genomics/Bioinformatics
  • Genetics And Genomics/Genome Projects
  • Genetics And Genomics/Population Genetics
Disciplines
  • Biology
  • Design
  • Medicine

Abstract

Background In recent years, capabilities for genotyping large sets of single nucleotide polymorphisms (SNPs) has increased considerably with the ability to genotype over 1 million SNP markers across the genome. This advancement in technology has led to an increase in the number of genome-wide association studies (GWAS) for various complex traits. These GWAS have resulted in the implication of over 1500 SNPs associated with disease traits. However, the SNPs identified from these GWAS are not necessarily the functional variants. Therefore, the next phase in GWAS will involve the refining of these putative loci. Methodology A next step for GWAS would be to catalog all variants, especially rarer variants, within the detected loci, followed by the association analysis of the detected variants with the disease trait. However, sequencing a locus in a large number of subjects is still relatively expensive. A more cost effective approach would be to sequence a portion of the individuals, followed by the application of genotype imputation methods for imputing markers in the remaining individuals. A potentially attractive alternative option would be to impute based on the 1000 Genomes Project; however, this has the drawbacks of using a reference population that does not necessarily match the disease status and LD pattern of the study population. We explored a variety of approaches for carrying out the imputation using a reference panel consisting of sequence data for a fraction of the study participants using data from both a candidate gene sequencing study and the 1000 Genomes Project. Conclusions Imputation of genetic variation based on a proportion of sequenced samples is feasible. Our results indicate the following sequencing study design guidelines which take advantage of the recent advances in genotype imputation methodology: Select the largest and most diverse reference panel for sequencing and genotype as many “anchor” markers as possible.

There are no comments yet on this publication. Be the first to share your thoughts.

Statistics

Seen <100 times
0 Comments

More articles like this

Model, properties and imputation method of missing...

on Journal of Computational and A... Jan 01, 2009

Acceleration Genotype Imputation for Large Dataset...

on Procedia Environmental Science... Jan 01, 2011

Genotype imputation.

on Annual Review of Genomics and... 2009
More articles like this..