Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ADBench with custom data #9

Open
Sreyashi-Bhattacharjee opened this issue Sep 22, 2022 · 1 comment
Open

ADBench with custom data #9

Sreyashi-Bhattacharjee opened this issue Sep 22, 2022 · 1 comment

Comments

@Sreyashi-Bhattacharjee
Copy link

Hello, I am trying to replicate the demo notebook but using a different open source data. I am getting keyerror. I have tried changing the data as well and have reduced the number of rows as well. Please help.

Attaching screenshots

ad`b_2

adb_1

@Minqi824
Copy link
Owner

I sincerely apologize for my late reply. We have released the latest version of ADBench, which now support the implementation of either customized dataset or algorithm. After installing ADBench and upgrading to its latest version, you can run AD algorithms (or your customized algorithm) on the customized dataset. See the following codes:

# customized model on ADBench's datasets
from adbench.run import RunPipeline
from adbench.baseline.Customized.run import Customized

# notice that you should specify the corresponding category of your customized AD algorithm
# for example, here we use Logistic Regression as customized clf, which belongs to the supervised algorithm
# for your own algorithm, you can realize the same usage as other baselines by modifying the fit.py, model.py, and run.py files in the adbench/baseline/Customized
pipeline = RunPipeline(suffix='ADBench', parallel='supervise', realistic_synthetic_mode=None, noise_type=None)
results = pipeline.run(clf=Customized)

# customized model on customized dataset
import numpy as np
dataset = {}
dataset['X'] = np.random.randn(1000, 20)
dataset['y'] = np.random.choice([0, 1], 1000)
results = pipeline.run(dataset=dataset, clf=Customized)

Please see the guidance for further details.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants