New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Misspelled function parameter for MosesTokenizer: agressive_dash_splits #1955
Comments
Thanks @somnathrakshit for the quick PR. Note that altering the parameter name breaks the API, so it might be better to first provide it as an option with a DeprecationWarning when the old parameter name is used, then it can be fully removed at the next major version. Maybe a regular NLTK dev can comment on the procedure here, as I didn't see it mentioned explicitly in the developer guidelines or CONTRIBUTING.md doc. @alvations, are there any guidelines or precedents for changing function/parameter names? |
@goodmami @somnathrakshit no worries about breaking API in this case. Most people would be more stymied by the typo argument instead of the correct one =) Regarding deprecation and breaking user space, in this case it's our fault and it's easier for users to update to new NLTK version. But in other cases, esp. when it comes to more major changes that's not just typo, we'll can use |
Thanks @alvations for letting us know. As a beginner in open source, nltk has been nice to tinker with. Are you taking part in GSoC 2018? |
Resolved in #1956 @somnathrakshit Thanks for the contribution! Unfortunately, we're not taking part in GSoC 2018. Perhaps another year when we have more volunteers =) |
Just a minor issue. The
agressive_dash_splits
is misspelled. It should beaggressive_dash_splits
. Or maybe usehyphen
instead ofdash
to be consistent with both theAGGRESSIVE_HYPHEN_SPLIT
class member and withtokenizer.perl
.http://www.nltk.org/api/nltk.tokenize.html#nltk.tokenize.moses.MosesTokenizer.tokenize
Also this functionality does not appear to be tested.
The text was updated successfully, but these errors were encountered: