You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Can we add a feature in LinearRegression that could remove collinearity (exact collinearity) in the data?.
Describe your proposed solution
My proposal is to add an extra argument like remove_collinearity if it is set by the user then we can remove exact collinear variables using the rank of the matrix or collinear variables using VIF. This can save some time instead of going for Ridge regression.
The text was updated successfully, but these errors were encountered:
It might be better to have this as a prepreprocessor in sklearn.feature_selection, that way it could be applied to multiple estimators. I'm not sure that exact collinearity is a frequent issue though. Maybe an estimator with a user defined feature correlation threshold?
I'm not sure if it's something that is often done, as opposed to say feature clustering? The latter can be done in scikit-learn with cluster.FeatureAgglomeration though maybe the interface with a required n_clusters is not ideal.
Describe the workflow you want to enable
Can we add a feature in LinearRegression that could remove collinearity (exact collinearity) in the data?.
Describe your proposed solution
My proposal is to add an extra argument like remove_collinearity if it is set by the user then we can remove exact collinear variables using the rank of the matrix or collinear variables using VIF. This can save some time instead of going for Ridge regression.
The text was updated successfully, but these errors were encountered: