Seminar: Bayesian nonparametric dynamic-clustering and genetic imputation

SpeakerLloyd Elliott
AffiliationUCL, Gatsby
DateFriday, 21 Mar 2014
Time13:00 - 14:00
LocationMalet Place Engineering 1.03
Event seriesMicrosoft Research CSML Seminar Series

I will describe new approaches to dynamic-clustering based on Bayesian nonparametric (BNP) hidden Markov models (HMMs). I will apply these approaches to genotype imputation problems and illustrate the practical benefits of BNP. Genetic similarity within a population is a function of chromosome position and dynamic-clustering based on parametric HMMs are popular models of genetic structure. BNP priors are well suited as extensions of, or as competitors to, these HMMs because many aspects of genetic processes (such as allele sampling) arise naturally from BNP models. In addition, BNP priors provide several practical benefits over parametric HMMs. First, by defining probability distributions on the set of partitions, BNP priors avoid label switching problems. Second, costly model selection and ad-hoc methods to determine the number of latent clusters are also avoided. Finally, the flexibility of BNP often provides state-of-the-art imputation accuracy. I will conclude with directions of future work including the abstraction of auxiliary Gibbs schemes (used for inference in these models) to probabilistic programming for BNP models.

iCalendar csml_id_162.ics