10/08/2009 - 16:10
10/08/2009 - 17:30
Short Title: 
290 Seminar: Nancy R. Zhang (Stanford U)
Short Desc: 
Simultaneous Change-point Models with Applications to Cross-sample and Cross-platform Analysis of DNA Copy Number
STATISTICS COLLOQUIUM

THURSDAY, October 8th, 2009 at 4.10pm, MSB 1147 (Colloquium Room)

Refreshments: 3.30pm, MSB 4110 (Statistics Lounge)

Speaker:            Nancy R. Zhang (Stanford University)

Title:                Simultaneous Change-point Models with Applications to Cross-sample and Cross-platform Analysis of DNA Copy Number

Abstract:        DNA copy number analysis involves the detection of chromosomal gains and losses using high-density microarray platforms. Change-point methods have been applied successfully to detecting signals in single data sequences derived from one biological sample. However, it is common to have data sets involving hundreds to thousands of biological samples. How should information be combined across samples to detect population level common polymorphisms?

Also, how should the samples be summarized to give a sparse signature of variation across the cohort? It is also now common to have the same biological sample assayed using multiple experimental platforms. For example, in the Cancer Genome Atlas project, each biological sample is processed using Illumina, Affymetrix and Agilent chips. How should data be integrated across platforms to achieve higher accuracy?

I will discuss the statistical issues underlying these problems and formulate a class of simultaneous change-point models for cross-sample and cross-platform data integration. These models lead to interpretable scan statistics whose significance level can be theoretically analyzed. I will also discuss model selection approaches for this class of models. The insights gained from this study can be applied to integrative analysis of data from other types of genome-wide profiling experiments, such as methylation or RNA expression.