For high-dimensional hierarchical models, consider exchangeability of effects across covariates instead of across datasets

Trippe, Brian L.; Finucane, Hilary K.; Broderick, Tamara

Statistics > Methodology

arXiv:2107.06428 (stat)

[Submitted on 13 Jul 2021]

Title:For high-dimensional hierarchical models, consider exchangeability of effects across covariates instead of across datasets

Authors:Brian L. Trippe, Hilary K. Finucane, Tamara Broderick

View PDF

Abstract:Hierarchical Bayesian methods enable information sharing across multiple related regression problems. While standard practice is to model regression parameters (effects) as (1) exchangeable across datasets and (2) correlated to differing degrees across covariates, we show that this approach exhibits poor statistical performance when the number of covariates exceeds the number of datasets. For instance, in statistical genetics, we might regress dozens of traits (defining datasets) for thousands of individuals (responses) on up to millions of genetic variants (covariates). When an analyst has more covariates than datasets, we argue that it is often more natural to instead model effects as (1) exchangeable across covariates and (2) correlated to differing degrees across datasets. To this end, we propose a hierarchical model expressing our alternative perspective. We devise an empirical Bayes estimator for learning the degree of correlation between datasets. We develop theory that demonstrates that our method outperforms the classic approach when the number of covariates dominates the number of datasets, and corroborate this result empirically on several high-dimensional multiple regression and classification problems.

Comments:	10 pages plus supplementary material
Subjects:	Methodology (stat.ME); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2107.06428 [stat.ME]
	(or arXiv:2107.06428v1 [stat.ME] for this version)
	https://doi.org/10.48550/arXiv.2107.06428

Submission history

From: Brian Trippe [view email]
[v1] Tue, 13 Jul 2021 23:23:06 UTC (1,820 KB)

Statistics > Methodology

Title:For high-dimensional hierarchical models, consider exchangeability of effects across covariates instead of across datasets

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Methodology

Title:For high-dimensional hierarchical models, consider exchangeability of effects across covariates instead of across datasets

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators