GitHub - gradhep/neos: Upstream optimisation for downstream inference

neural end-to-end-optimised summary statistics
arxiv.org/abs/2203.05570

About

Leverages the shoulders of giants (jax and pyhf) to differentiate through a high-energy physics analysis workflow, including the construction of the frequentist profile likelihood.

If you're more of a video person, see this talk given by Nathan on the broader topic of differentiable programming in high-energy physics, which also covers neos.

You want to apply this to your analysis?

Some things need to happen first. Click here for more info -- I wrote them up!

Have questions?

Do you want to chat about neos? Join us in Mattermost:

Cite

Please cite our newly released paper:

@article{neos,
    Author = {Nathan Simpson and Lukas Heinrich},
    Title = {neos: End-to-End-Optimised Summary Statistics for High Energy Physics},
    Year = {2022},
    Eprint = {arXiv:2203.05570},
    doi = {10.48550/arXiv.2203.05570},
    url = {https://doi.org/10.48550/arXiv.2203.05570}
}

Example usage -- train a neural network to optimize an expected p-value

setup

In a python 3 environment, run the following:

pip install --upgrade pip setuptools wheel
pip install neos
pip install git+http://github.com/scikit-hep/pyhf.git@make_difffable_model_ctor

With this, you should be able to run the demo notebook demo.ipynb on your pc :)

This workflow is as follows:

From a set of normal distributions with different means, we'll generate four blobs of (x,y) points, corresponding to a signal process, a nominal background process, and two variations of the background from varying the background distribution's mean up and down.
We'll then feed these points into the previously defined neural network for each blob, and construct a histogram of the output using kernel density estimation. The difference between the two background variations is used as a systematic uncertainty on the nominal background.
We can then leverage the magic of pyhf to construct an event-counting statistical model from the histogram yields.
Finally, we calculate the p-value of a test between the nominal signal and background-only hypotheses. This uses the familiar profile likelihood-based test statistic.

This counts as one forward pass of the workflow -- we then optimize the neural network by gradient descent, backpropagating through the whole analysis!

Thanks

A big thanks to the teams behind jax, fax, jaxopt and pyhf for their software and support.

Name		Name	Last commit message	Last commit date
Latest commit History 203 Commits
.github		.github
binder		binder
docs		docs
examples		examples
nbs		nbs
src/neos		src/neos
tests		tests
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yml		.readthedocs.yml
CITATION.cff		CITATION.cff
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
animation.gif		animation.gif
applying_neos.md		applying_neos.md
demo.ipynb		demo.ipynb
noxfile.py		noxfile.py
pyproject.toml		pyproject.toml
random.pdf		random.pdf
setup.cfg		setup.cfg
setup.py		setup.py

License

gradhep/neos

Folders and files

Latest commit

History

Repository files navigation

About

You want to apply this to your analysis?

Have questions?

Cite

Example usage -- train a neural network to optimize an expected p-value

setup

Thanks

About

Resources

License

Stars

Watchers

Forks

Languages