Regression of a Target Function on the Sphere

Code for the paper Learning sparse features can lead to overfitting in neural networks, which appeared at NeurIPS 2022. Details for running the experiments can be found in experiments.md.

Neural Network Training

Run main.py

Architecture. Train a one-hidden-layer fully-connected neural network with the mean square error loss.
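
A minimal sketch of such a setup, assuming a PyTorch implementation (the width, the absence of biases, the learning rate and the number of steps below are illustrative choices, not the defaults of main.py):

```python
import torch

d, h = 3, 512                               # input dimension and hidden width (illustrative)
model = torch.nn.Sequential(
    torch.nn.Linear(d, h, bias=False),      # hidden layer (bias usage is an assumption)
    torch.nn.ReLU(),
    torch.nn.Linear(h, 1, bias=False),      # scalar output
)
opt = torch.optim.SGD(model.parameters(), lr=1e-2)

x = torch.randn(1024, d)                    # placeholder inputs
y = x.norm(dim=1, keepdim=True)             # e.g. the norm target described below
for _ in range(1000):                       # full-batch gradient descent on the MSE
    opt.zero_grad()
    ((model(x) - y) ** 2).mean().backward()
    opt.step()
```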

Data-set. Data samples x can be drawn from one of three distributions (sketched in code after this list):

  • the d-dimensional normal distribution, args.dataset = normal;
  • the uniform distribution inside the sphere, uniform;
  • or the uniform distribution on the spherical surface, sphere.
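
A minimal sketch of these three samplers, assuming PyTorch (the function name sample and the exact normalizations are illustrative and not taken from main.py):

```python
import torch

def sample(dataset: str, n: int, d: int) -> torch.Tensor:
    if dataset == "normal":                     # d-dimensional normal distribution
        return torch.randn(n, d)
    if dataset == "uniform":                    # uniform inside the sphere (unit ball)
        x = torch.randn(n, d)
        x = x / x.norm(dim=1, keepdim=True)     # uniform random direction
        r = torch.rand(n, 1) ** (1.0 / d)       # radius with density proportional to r^(d-1)
        return r * x
    if dataset == "sphere":                     # uniform on the spherical surface
        x = torch.randn(n, d)
        return x / x.norm(dim=1, keepdim=True)
    raise ValueError(f"unknown dataset: {dataset}")

x = sample("sphere", n=1024, d=3)
```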

The target is either

  • the sample norm ||x||, args.target = norm;
  • or a Gaussian random field computed through an (approximately) infinite-width teacher network (args.target = teacher) with a relu or abs activation function raised to some power a; a sketch of such a teacher follows the figure below. Here are examples of the Gaussian random field defined on the sphere in d=3:

[Figure: samples of a Gaussian random field on the sphere in d=3]
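
A minimal sketch of such a teacher target, assuming PyTorch (the width, the Gaussian weights and the 1/sqrt(width) normalization are assumptions; main.py may normalize differently):

```python
import torch

def teacher_target(x: torch.Tensor, act: str = "relu", a: float = 1.0,
                   width: int = 100_000) -> torch.Tensor:
    d = x.shape[1]
    w = torch.randn(width, d)                   # random first-layer weights
    c = torch.randn(width)                      # random second-layer weights
    pre = x @ w.t()                             # pre-activations, shape (n, width)
    phi = torch.relu(pre) if act == "relu" else pre.abs()
    # summing over a very wide random layer gives an approximately Gaussian random field
    return (phi ** a) @ c / width ** 0.5

x = torch.randn(8, 3)
x = x / x.norm(dim=1, keepdim=True)             # points on the sphere, d = 3
y = teacher_target(x, act="abs", a=1.0)
```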

Algorithm. Full-batch gradient descent can be performed with the following options (a sketch of the resulting loss is given after the list):

  • the alpha-trick, setting args.alpha larger (lazy regime) or smaller (feature regime) than one;
  • a regularization of strength args.l on the l2 norm of the parameters (args.reg = 'l2'), on the path norm ||w1|| * |w2| (args.reg = 'l1'), or on the l1 norm |w2| obtained by fixing the first-layer weights on the unit sphere ||w1|| = 1 (args.reg = 'l1' and args.w1_norm1 = 1).
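
A minimal sketch of how the corresponding loss could be assembled, assuming the two-layer Sequential model of the sketch above (the centering by a frozen copy model0 of the network at initialization and the 1/alpha^2 rescaling of the MSE are assumptions; the conventions in main.py may differ):

```python
import torch

def training_loss(model, model0, x, y, alpha, reg, l, w1_norm1=False):
    # alpha-trick: rescale the (centered) network output by alpha and the MSE by 1/alpha^2;
    # large alpha pushes training toward the lazy regime, small alpha toward the feature regime
    out = alpha * (model(x) - model0(x)).squeeze(-1)
    mse = ((out - y) ** 2).mean() / alpha ** 2

    w1 = model[0].weight                        # first layer, shape (h, d)
    w2 = model[2].weight                        # second layer, shape (1, h)
    if reg == "l2":                             # l2 norm of all parameters
        penalty = (w1 ** 2).sum() + (w2 ** 2).sum()
    elif reg == "l1" and not w1_norm1:          # path norm ||w1|| * |w2|
        penalty = (w1.norm(dim=1) * w2.abs().squeeze(0)).sum()
    else:                                       # l1 norm of w2, with ||w1|| fixed to 1
        penalty = w2.abs().sum()
    return mse + l * penalty
```

Here model0 would be a frozen copy of the network at initialization (e.g. copy.deepcopy(model)), and l plays the role of args.l.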

Additionally, conic gradient descent [Chizat and Bach, 2018] can be performed by setting args.conic_gd = 1.

Kernel Ridge Regression (KRR)

Run main_krr.py

Student kernel. The analytical NTK of an infinite-width one-hidden-layer neural network.

Data-set. Same options as above.

Ridge. The regularization (ridge) parameter can be set with args.l (default: 0).
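
A minimal sketch of the KRR pipeline, assuming a ReLU student so that the NTK has the standard arc-cosine closed form (the kernel normalization and the scaling of the ridge term are assumptions and may differ from main_krr.py):

```python
import numpy as np

def ntk(x1, x2):
    """Analytic NTK of an infinite-width one-hidden-layer ReLU network (both layers trained)."""
    n1 = np.linalg.norm(x1, axis=1, keepdims=True)
    n2 = np.linalg.norm(x2, axis=1, keepdims=True)
    u = np.clip((x1 @ x2.T) / (n1 * n2.T), -1.0, 1.0)          # cosine of the angle
    theta = np.arccos(u)
    k_deriv = (np.pi - theta) / (2 * np.pi)                     # term coming from sigma'
    k_act = (n1 * n2.T) * (np.sin(theta) + (np.pi - theta) * np.cos(theta)) / (2 * np.pi)
    return k_act + (x1 @ x2.T) * k_deriv

# toy data: points uniform on the sphere in d = 3, with a placeholder target
rng = np.random.default_rng(0)
xtr = rng.standard_normal((200, 3))
xtr /= np.linalg.norm(xtr, axis=1, keepdims=True)
xte = rng.standard_normal((50, 3))
xte /= np.linalg.norm(xte, axis=1, keepdims=True)
ytr = np.abs(xtr[:, 0])                                         # placeholder target function

ridge = 1e-6                                 # plays the role of args.l (0 = interpolation)
coef = np.linalg.solve(ntk(xtr, xtr) + ridge * np.eye(len(xtr)), ytr)
ypred = ntk(xte, xtr) @ coef
```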
