GitHub - mobeets/group-ARD: code for performing Bayesian ARD regression, where covariates have groups

Summary

Suppose we have covariates $X \in \mathbb{R}^{N \times K}$ and observations $y \in \mathbb{R}^N$, where our observation model is $y_i \sim \mathcal{N}(x_i^\top w, \sigma^2)$. Here, $w \in \mathbb{R}^K$ are unknown weights.

In linear regression we want to find the best estimate of the weights given $X$ and $y$. For example, standard linear regression finds the weights $\widehat{w}$ minimizing the sum of the squared residuals:

$$ || y - X \widehat{w} ||_2^2 $$
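As a point of reference, here is a minimal sketch of that least-squares estimate in NumPy. The function name `ols_weights` and the simulated data are purely illustrative and are not taken from this repository's code.

```python
import numpy as np

def ols_weights(X, y):
    """Return the weights minimizing ||y - X w||_2^2."""
    # lstsq solves the least-squares problem without forming (X^T X)^{-1} explicitly
    w_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
    return w_hat

# Simulated data (illustrative only)
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))
w_true = np.array([1.0, -2.0, 0.5])
y = X @ w_true + 0.1 * rng.normal(size=50)

w_hat = ols_weights(X, y)
print(w_hat)
```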

When $N$ is small or $K$ is large, it's often useful to do Bayesian linear regression. This involves choosing a prior on our weights (see [1] for more details). Some common choices of prior are:

Ridge: $w_i \sim \mathcal{N}(0, \alpha^{-1})$, where $\alpha \in \mathbb{R}$ is called our "inverse prior variance".
Automatic Relevance Determination (ARD): $w_i \sim \mathcal{N}(0, \alpha_i^{-1})$. Note that now each covariate has its own inverse prior variance.
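With a Gaussian prior and Gaussian noise, the posterior over $w$ is also Gaussian, with covariance $\Sigma = (\sigma^{-2} X^\top X + A)^{-1}$ and mean $\mu = \sigma^{-2} \Sigma X^\top y$, where $A$ is the diagonal matrix of inverse prior variances. The sketch below assumes $\sigma^2$ and the $\alpha$ values are already known; the function name `posterior` and the example values are hypothetical and not taken from `model.py`.

```python
import numpy as np

def posterior(X, y, alpha, sigma2):
    """Posterior mean and covariance of w for a zero-mean Gaussian prior.

    alpha is a scalar (Ridge) or a length-K vector (ARD) of inverse prior variances.
    """
    K = X.shape[1]
    alpha = np.broadcast_to(np.asarray(alpha, dtype=float), (K,))
    precision = X.T @ X / sigma2 + np.diag(alpha)
    cov = np.linalg.inv(precision)
    mean = cov @ X.T @ y / sigma2
    return mean, cov

# Simulated data (illustrative only)
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=50)
sigma2 = 0.01

# Ridge: one shared inverse prior variance
mean_ridge, _ = posterior(X, y, alpha=1.0, sigma2=sigma2)
# ARD: a very large alpha on the third covariate shrinks its weight toward zero
mean_ard, _ = posterior(X, y, alpha=np.array([1.0, 1.0, 1e8]), sigma2=sigma2)
```

A large $\alpha_i$ means a tight prior around zero for $w_i$, which is how ARD effectively switches off irrelevant covariates.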

Here we consider a third option in between these two, which I will call "Group ARD" (in analogy to Group Lasso [2]). This prior is relevant when our covariates can be grouped. Specifically, we assume the $i^{th}$ covariate has a known group label $c_i \in \{ 1, 2, \ldots, G\}$, where $G$ is the total number of groups. The idea is that every covariate in the same group has the same inverse prior variance. In other words:

Group ARD: $w_i \sim \mathcal{N}(0, \alpha_{c_i}^{-1})$
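In code, the Group ARD prior amounts to indexing a length-$G$ vector of inverse prior variances by the group labels. The sketch below assumes 0-based labels (the text above uses $1, \ldots, G$), and the helper name `expand_group_precisions` is hypothetical.

```python
import numpy as np

def expand_group_precisions(alpha_group, groups):
    """Map per-group inverse prior variances to one value per covariate."""
    alpha_group = np.asarray(alpha_group, dtype=float)
    groups = np.asarray(groups)  # assumed 0-based integer labels c_i
    return alpha_group[groups]

groups = np.array([0, 0, 1, 1, 1])
alpha_per_weight = expand_group_precisions([2.0, 5.0], groups)
print(alpha_per_weight)
```

With a single group ($G = 1$) this reduces to Ridge, and with one group per covariate ($G = K$) it reduces to ARD.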

We can estimate the inverse prior variances using methods similar to those proposed in Appendix 1 of [1].
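One way this could look is sketched below. For standard ARD, [1] gives the fixed-point updates $\alpha_i \leftarrow \gamma_i / \mu_i^2$ with $\gamma_i = 1 - \alpha_i \Sigma_{ii}$, and $\sigma^2 \leftarrow ||y - X\mu||_2^2 / (N - \sum_i \gamma_i)$. The grouped update used here, $\alpha_g \leftarrow \sum_{i: c_i = g} \gamma_i / \sum_{i: c_i = g} \mu_i^2$, is an assumed generalization of that rule and is not necessarily what `model.py` does. The function name `fit_group_ard`, the 0-based group labels, the clipping bounds, and the simulated data are all illustrative choices.

```python
import numpy as np

def fit_group_ard(X, y, groups, n_iter=200, tol=1e-6):
    """Estimate group-level inverse prior variances by fixed-point iteration."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    groups = np.asarray(groups)  # assumed 0-based integer labels
    N, K = X.shape
    G = groups.max() + 1
    alpha = np.ones(G)
    sigma2 = np.var(y)
    XtX, Xty = X.T @ X, X.T @ y

    def posterior(alpha, sigma2):
        cov = np.linalg.inv(XtX / sigma2 + np.diag(alpha[groups]))
        return cov @ Xty / sigma2, cov

    for _ in range(n_iter):
        mu, cov = posterior(alpha, sigma2)
        # gamma_i: how well-determined each weight is by the data (between 0 and 1)
        gamma = 1.0 - alpha[groups] * np.diag(cov)
        gamma_g = np.bincount(groups, weights=gamma, minlength=G)
        mu2_g = np.bincount(groups, weights=mu**2, minlength=G)
        # Clip to keep the iteration numerically stable (bounds are arbitrary)
        alpha_new = np.clip(gamma_g / np.maximum(mu2_g, 1e-12), 1e-8, 1e8)
        resid = y - X @ mu
        sigma2 = resid @ resid / max(N - gamma.sum(), 1e-8)
        converged = np.max(np.abs(np.log(alpha_new) - np.log(alpha))) < tol
        alpha = alpha_new
        if converged:
            break

    mu, _ = posterior(alpha, sigma2)
    return mu, alpha, sigma2

# Simulated data: the first group is relevant, the second is not
rng = np.random.default_rng(0)
groups = np.array([0, 0, 0, 1, 1, 1])
w_true = np.array([1.0, -2.0, 0.5, 0.0, 0.0, 0.0])
X = rng.normal(size=(100, 6))
y = X @ w_true + 0.1 * rng.normal(size=100)

mu, alpha, sigma2 = fit_group_ard(X, y, groups)
print(mu, alpha, sigma2)
```

In this simulated example the second group's $\alpha$ should come out much larger than the first group's, so the weights in that group are shrunk toward zero together.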

For what it's worth, this model was implemented in [3] but called "Group-sparse Bayesian linear discriminant analysis" (?).

References

[1] Tipping, Michael E. "Sparse Bayesian learning and the relevance vector machine." Journal of machine learning research 1.Jun (2001): 211-244.

[2] Yuan, Ming, and Yi Lin. "Model selection and estimation in regression with grouped variables." Journal of the Royal Statistical Society Series B: Statistical Methodology 68.1 (2006): 49-67.

[3] Yu, Tianyou, et al. "Grouped automatic relevance determination and its application in channel selection for P300 BCIs." IEEE Transactions on Neural Systems and Rehabilitation Engineering 23.6 (2015): 1068-1077.
