Rare variant analysis in large-scale association and sequencing studies

Duration: 11 mins 42 secs
Share this media item:
Embed this media item:


About this item
Rare variant analysis in large-scale association and sequencing studies's image
Description: Zeggini, E (Sanger Institute)
Monday 26 September 2011, 10:00-10:10
 
Created: 2011-09-30 15:15
Collection: Design and Analysis of Experiments
Cambridge Statistics Initiative 2011
Publisher: Isaac Newton Institute
Copyright: Zeggini, E
Language: eng (English)
 
Abstract: Recent advances in whole-genome genotyping technologies, the availability of large, well-defined sample sets, and a better understanding of common human sequence variation, coupled with the development of appropriate quality control and analysis pipelines, have led to the identification of many novel common genetic determinants of complex traits. However, despite these successes, much of the genetic component of these traits remains unaccounted for. One largely unexplored paradigm which may contribute to this missing heritability is a model of multiple rare causal variants, each of modest effect and residing
within the same functional unit, for example, a gene. Joint analysis of rare variants, searching for accumulations of minor alleles in individuals, for a dichotomous or quantitative trait, may thus provide signals of association with complex phenotypes that could not have been identified through traditional association analysis of single nucleotide polymorphisms (SNPs). However, statistical methods to perform such joint analyses of rare variants have not yet been fully developed or evaluated. We have implemented rare variant analysis methods in user-friendly software and have extended approaches to collapsing rare allele tables and allele-matching tests by incorporating variant-specific quality scores (for example arising from next generation sequencing studies in which different positions have been covered at different depths) and genotype-specific probabilities (for example arising from 1000 genomes project-imputed data. We evaluated these methods and find increases in power to detect association under varying allelic architectures and parameters. We make recommendations for the analysis of rare variants in large-scale association and next generation sequencing studies.
Available Formats
Format Quality Bitrate Size
MPEG-4 Video 640x360    1.85 Mbits/sec 162.36 MB View Download
WebM 640x360    620.99 kbits/sec 52.91 MB View Download
Flash Video 484x272    569.3 kbits/sec 48.79 MB View Download
iPod Video 480x270    506.8 kbits/sec 43.43 MB View Download
MP3 44100 Hz 125.14 kbits/sec 10.53 MB Listen Download
Auto * (Allows browser to choose a format it supports)