Conférence Internationale de Statistique Appliquée pour le Developpement en Afrique / International Conference on Applied Statistics for Development in Africa

Statistique Appliquée pour le Développement en Afrique

4-8 mars 2013 Cotonou (Bénin)

Vendredi 8

Conférence plénière
Professor Mahlet G. Tadesse, Georgetown University, Washington, DC, USA
› 10:30 - 11:30 (1h)

› Amphithéatre

sciencesconf.org:sada2013:15935

In analyzing high-dimensional datasets, there is often interest in uncovering cluster structures and identifying variables associated with the clusters. I will present some Bayesian methods we have proposed to address such questions in a unified manner. The first problem I will discuss is concerned with discovering homogeneous subgroups of samples and identifying variables that discriminate across the subgroups. We use mixture models with an unknown number of components to uncover the cluster structures and build a stochastic search variable selection method into the model to identify discriminating variables. The second problem is concerned with relating two high-dimensional data sets by uncovering cluster structures in the data and identifying groups of associated variables across the data sets. We use a stochastic partitioning method that combines ideas of mixtures of regression models and variable selection methods to search for sets of covariates associated with sets of correlated outcomes. I will illustrate the methods with applications to genomic data sets.

Type :	:	oral
Thématiques	:	Conférence plénière
Mots-Clés	:	Bayesian inference ; genomic studies ; high ; dimensional data ; Markov chain Monte Carlo ; mixture models ; variable selection

Présentation

Personnes connectées : 1