RSCA

Measuring the chemical similarity of stars through applying metric-learning to open-clusters with experiments using the APOGEE DR16 stellar survey

Summary

This is a companion repository for the paper "Measuring chemical likeness of stars with RSCA". It contains the code necessary for reproducing experiments. Some limited effort has been made towards ensuring the code is well-documented and easily installable. However the code has not been tested on external machines. Users are encouraged to open a git issue if they run into any problems.

Requirements

apogee: We access the APOGEE survey data using the apogee package. Detailed instructions for installing apogee can be found in its associated repository. Our repository was designed to work with DR16 and so the associated environment variable within apogee should be appropriately set.
Other dependencies are :astropy,matplotlib,mpl_scatter_density,numpy,scikit-learn,scipy. These must be manually installed.

Use the package manager pip to install

pip install setup.py -e .

Structure

RSCA/apoNN/src contains the core code for the algorithm. More precisely...

/apoNN/src/occam.py contains code for cross-matching the Occam value-added catalogue with an APOGEE dataset cut. It will return a filtered down Apogee AllStar Fits file containing only those OCCAM object within an APOGEE style catalogue.

/apoNN/src/data.py contains code for downloading and pre-processing the AspcapStar spectra in an APOGEE dataset. While it can download spectra in AllStar not locally available, it does so extremelly inefficiently.

/apoNN/src/vectors.pycontains data wrappers. Our codebase, rather than directly manipulating numpy arrays, wraps these into a Vector class allowing easier handling of open-clusters. This allows for keeping track of cluster member stars and provides some additional useful utility functions.

/apoNN/src/fitters.py contains the source code for the RSCA algorithm(as well as a few unpublished variants). These take the form of Fitter classes.

/apoNN/src/evaluators.py contains Evaluator classes. These are wrappers around Fitter that naturally handle cross-validation, doppelganger rate calculations and visualizations of RSCA runs for quantitative evaluation.

RSCA/apoNN/scripts contains Python scripts for downloading and saving the dataset with the same dataset cuts as used in the paper. These must be run preliminarily to the code for generating figures.

RSCA/apoNN/figures contains Python scripts for reproducing figures. These are a good starting point for understanding how to run the codebase.

RSCA/outputs contains any and all outputs. This includes intermediary pickled datasets as well as generated figures.

Citation

Please cite as:

TODO

Name		Name	Last commit message	Last commit date
Latest commit History 156 Commits
apoNN		apoNN
outputs/figures		outputs/figures
.gitignore		.gitignore
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RSCA

Summary

Requirements

Structure

Citation

About

Uh oh!

Releases

Packages

Languages

drd13/RSCA

Folders and files

Latest commit

History

Repository files navigation

RSCA

Summary

Requirements

Structure

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages