Ignoring operations belonging to certain toolboxes #23

Open
SirSteel opened this issue Nov 28, 2017 · 2 comments

Comments

@SirSteel

Hello,

This might be considered a feature request, or a request for advice on how to hack it.

The gist of the problem is that I do not have all the toolboxes available. Calling functions that then produce errors seems to add unnecessary overhead to my computation time, which I would like to minimize.

What I would like is the option to exclude operations that belong to certain toolboxes, so as to avoid even calling functions I know will fail.

TS_ops.txt does not seem to contain information on which toolbox an operation belongs to.

So, if anyone has any ideas on how I could do this to speed up my computations, I would be very happy.

And thank you for a nice software suite! :)

PS: I have many more questions, and I have a feeling this is more of a bug-reporting place, so where would be a more appropriate place to ask them?

@benfulcher
Owner

Hi @SirSteel! Thanks for your interest in the package :)
I've just added a list of toolbox dependencies that I generated a couple of years ago, in this commit: c6e6aa4

I don't think the dependencies will have changed much since, but think of this as a starting point; do feel free to modify it. In the past I have filtered features using keywords assigned to them according to their toolbox dependencies (e.g., as specified in INP_ops.txt). Alternatively, you could do a simple filtering of INP_ops.txt: remove the operations whose master operations (listed in INP_mops.txt) match functions listed in ToolboxDependencies.txt. That is, generate new files, INP_ops_toolboxFiltered.txt and INP_mops_toolboxFiltered.txt, that specify the filtered lists; you can then run e.g. TS_init with these as the input files specifying the list of operations.
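
The filtering of master operations and their dependent operations could be sketched roughly as follows, here as a standalone Python script rather than anything shipped with the package. Several things are assumptions, not confirmed in this thread: that each non-comment line of INP_mops.txt is a function call followed by a master label, that each non-comment line of INP_ops.txt starts with that master label (optionally followed by `.output`), and that the function names to block have already been extracted from ToolboxDependencies.txt by hand. The entries `XX_needsToolbox` and `YY_plain` are made up for illustration.

```python
import re


def function_name(master_code):
    """Return the leading function name of a call such as "XX_f(y,1)"."""
    match = re.match(r"\s*([A-Za-z_]\w*)", master_code)
    return match.group(1) if match else None


def filter_operations(mops_lines, ops_lines, blocked_functions):
    """Drop master operations that call a blocked function, and every
    operation whose master label belongs to a dropped master operation.

    Assumed layouts (not confirmed by the thread):
      mops line: "<function call><whitespace><master label>"
      ops line:  "<master label>[.<output>]<whitespace>..."
    Comment lines (starting with '#') and blank lines are kept as-is.
    """
    kept_mops, blocked_labels = [], set()
    for line in mops_lines:
        stripped = line.strip()
        parts = stripped.rsplit(None, 1)  # last token is the master label
        if stripped.startswith("#") or len(parts) != 2:
            kept_mops.append(line)
            continue
        code, label = parts
        if function_name(code) in blocked_functions:
            blocked_labels.add(label)
        else:
            kept_mops.append(line)

    kept_ops = []
    for line in ops_lines:
        stripped = line.strip()
        if stripped and not stripped.startswith("#"):
            master_label = stripped.split()[0].split(".")[0]
            if master_label in blocked_labels:
                continue
        kept_ops.append(line)
    return kept_mops, kept_ops


# Hypothetical example entries, not real lines from the package.
mops = [
    "# master operations",
    "YY_plain(y,1)\tYY_plain_1",
    "XX_needsToolbox(y,2)\tXX_needsToolbox_2",
]
ops = [
    "YY_plain_1\tplain_1\tcorrelation",
    "XX_needsToolbox_2.err\ttoolbox_err\tmodel",
]
filtered_mops, filtered_ops = filter_operations(mops, ops, {"XX_needsToolbox"})
```

Writing `filtered_mops` and `filtered_ops` back out, one entry per line, would give the two `_toolboxFiltered.txt` files described above.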

Note also that this filtering may be overly harsh (e.g., only a single output of a function may rely on a particular toolbox, while the others compute fine). If you really wanted to get every number you could, you could instead run the full set on your dataset, list any features that consistently produce missing data, and use that list as the basis for filtering to a reduced set for future calculations.
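
The data-driven alternative could look something like the sketch below. It assumes the computed results have been exported to a plain table (rows are time series, columns are features) with missing values stored as NaN; the export step, the function name `consistently_missing`, and the `min_fraction` threshold are all hypothetical and not part of the package.

```python
import math


def consistently_missing(data, names, min_fraction=1.0):
    """Return the names of features that are missing (NaN) in at least
    `min_fraction` of the rows of `data`.

    data  : list of rows, one per time series, each a list of floats
    names : feature names, one per column of `data`
    """
    n_rows = len(data)
    flagged = []
    for j, name in enumerate(names):
        n_missing = sum(1 for row in data if math.isnan(row[j]))
        if n_rows and n_missing / n_rows >= min_fraction:
            flagged.append(name)
    return flagged


# Hypothetical results for two time series and three features.
nan = float("nan")
results = [
    [1.0, nan, nan],
    [2.0, nan, 3.0],
]
feature_names = ["feat_a", "feat_b", "feat_c"]
always_missing = consistently_missing(results, feature_names)
often_missing = consistently_missing(results, feature_names, min_fraction=0.5)
```

With the default threshold only features that fail on every time series are flagged, which is the conservative choice if the goal is to keep every number that can be computed.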

You kind of need the Statistics/Machine Learning toolbox, but you could get away with not having the others. Let me know how you get on.

Happy to help with any other questions -- please send any non-issue-like questions to my email :)

Best,

Ben

@SirSteel
Author

SirSteel commented Dec 3, 2017 via email
