-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is "multi-gpus training support" in next branch only for linux? #1437
Comments
What errors occur? |
This is the error message when using net_to_model J:\mtraining\tf>python net_to_model.py a.txt During handling of the above exception, another exception occurred: Traceback (most recent call last): J:\mtraining\tf> |
Seems an error of split, I will check that. UPDATE: When using net_to_model, GPU number should be set to 1 : ) |
Thanks @godmoves. I dowonloaded your new branch tf, and I got this result. J:\mtrainingfix\tf>python net_to_model.py b.txt During handling of the above exception, another exception occurred: Traceback (most recent call last): Caused by op 'tower_0/get_regularization_penalty/l2_regularizer_45/L2Loss', defined at: InvalidArgumentError (see above for traceback): Cannot assign a device for operation 'to J:\mtrainingfix\tf> Upper blue text is so strange. T=DT_FLOAT, _device="/device:GPU:0"](w_fc_3/read)]] is original text. |
I heard that it can be solved by compiling TensorFlow. You may try it. |
I'll try comiling and rebuild TF 1.4 with .whl file. Thanks @bjiyxo |
@trainewbie This issue is something about the TensorFlow's variable assignment mechanism, and maybe soft placement will solve that. I have updated the code, can you check it? EDIT: something similar here |
@godmoves Solved! Great! Thanks a lot. :-) |
Thanks for your feedback, I will open a PR for this 😄 |
My OS is windows7(with python 3.5 or 3.6 or ananconda, tensorflow 1.4, cuda 8.0 and cudnn 6.0).
There was no problem to use net_to_model and train a network in a single gpu with the master branch tf.
When using python net_to_model.py and python parse.py command with multi-gpus tf, errors occur.
Am I missing something?
The text was updated successfully, but these errors were encountered: