Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

200万数据,500维特征,卡方分箱很慢,有没有好办法? #111

Open
dinglei8908 opened this issue Nov 30, 2022 · 1 comment

Comments

@dinglei8908
Copy link

RT

@Secbone
Copy link
Member

Secbone commented Dec 27, 2022

@dinglei8908 如果可以接受误差的话,可以尝试先等频分成m箱(如1000),然后再使用卡方分箱分成n箱,n<<m

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants