fastSparse

This repository contains source code to our AISTATS 2022 paper:

Fast Sparse Classification for Generalized Linear and Additive Models

1. Installation

To install this R package, please go to the installation folder and follow the installation instructions.

2. Application and Usage

We provide a toolkit for producing sparse and interpretable generalized linear and additive models for the binary classiciation task by solving the L0-regularized problems. The classiciation loss can be either the logistic loss or the exponential loss. The algorithms can produce high quality (swap 1-OPT) solutions and are generally 2 to 5 times faster than previous approaches.

2.1 R Interface

To understand how to use this R package, please go to the application_and_usage_R_interface_folder.

2.2 Python Interface

To understand how to use this package in a python environment, we provide a python wrapper to acheive this. Please go to the application_and_usage_python_interface_folder.

3. Experiment Replication

To replicate our experimental results shown in the paper, please go to the experiments folder.

4. Step Function Visualization

To reproduce the step function plots shown in the paper, please go to the step_function_visualization folder.

5. Citing Our Work

If you find our work useful in your research, please consider citing the following paper:

@inproceedings{liu2022fast,
  title={Fast Sparse Classification for Generalized Linear and Additive Models},
  author={Liu, Jiachang and Zhong, Chudi and Seltzer, Margo and Rudin, Cynthia},
  booktitle={International Conference on Artificial Intelligence and Statistics},
  pages={9304--9333},
  year={2022},
  organization={PMLR}
}

Acknowledgement: For this AISTATS camera ready code repository, we build our method based on L0Learn’s codebase, so that we could use its preprocessing steps, and the pipeline for running the full regularization path of $λ_0$ values. Therefore, the computational speedup shown in our paper solely comes from our new proposed algorithms.

We plan to build our own pipeline and further extend this work before pushing the project to CRAN. Right now you can install the project from GitHub directly. Our repository will be actively maintained, and the most updated version can be found at this current GitHub page.

Package Development ToDo List

Fix windows installation issues.
Add language specification to codeblock in README.
Add binarization preprocessing function and provide usage in jupyter notebook.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
FastSparse		FastSparse
Unified_API		Unified_API
application_and_usage_R_interface		application_and_usage_R_interface
application_and_usage_python_interface		application_and_usage_python_interface
experiments		experiments
installation		installation
step_function_visualization		step_function_visualization
.gitignore		.gitignore
GOVERNANCE.md		GOVERNANCE.md
LICENSE		LICENSE
MAINTAINERS.md		MAINTAINERS.md
README.md		README.md
environment.yml		environment.yml

License

interpretml/fastSparse

Folders and files

Latest commit

History

Repository files navigation

fastSparse

1. Installation

2. Application and Usage

2.1 R Interface

2.2 Python Interface

3. Experiment Replication

4. Step Function Visualization

5. Citing Our Work

Package Development ToDo List

About

Resources

License

Stars

Watchers

Forks

Languages