Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NaN data #67

Open
GoogleCodeExporter opened this issue Mar 7, 2016 · 4 comments
Open

NaN data #67

GoogleCodeExporter opened this issue Mar 7, 2016 · 4 comments

Comments

@GoogleCodeExporter
Copy link

I would just like to know how the model handles or needs to handle NaN data?

I have NaN observations in my X_train set. I am performing a regression 
analysis.

Regards

Original issue reported on code.google.com by nikhil.h...@gmail.com on 3 Jan 2015 at 7:26

@GoogleCodeExporter
Copy link
Author

I was getting the following error:

Warning: Do you want regression? there are just 5 or less unique values 
> In regRF_train at 163 
Error using regRF_train (line 176)
NaNs in X

Original comment by nikhil.h...@gmail.com on 3 Jan 2015 at 7:29

@GoogleCodeExporter
Copy link
Author

this implementation cannot handle NaN data. you can try imputing the values

https://code.google.com/p/randomforest-matlab/wiki/Finding_Missing_Values

regards

Original comment by abhirana on 3 Jan 2015 at 11:38

@GoogleCodeExporter
Copy link
Author

Ok. Thank you.

Could you please point me in the direction of how and where it is stated in 
literature that 500 trees are the most stable for random forests, and that for 
regression the minimum leaf size is 5?

Original comment by nikhil.h...@gmail.com on 5 Jan 2015 at 3:05

@GoogleCodeExporter
Copy link
Author

500 trees are not the most stable. they are a good enough number of trees after 
which you might find the oob error rate stabilizing

if i remember correctly, ntree and the minimum leaf size=5 for regression is 
suggested by Breiman in his paper on random forests.

Original comment by abhirana on 5 Jan 2015 at 6:23

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant