Issues: horovod/horovod
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Milestones
Assignee
Sort
Issues list
Replace tf.train.SessionRunHook by tf.compat.v1.train.SessionRunHook ?
bug
#4040
opened May 1, 2024 by
whatdhack
v0.28.1 Version Mismatch with TF 2.12.0. Works with v0.28.0
bug
#4039
opened Apr 16, 2024 by
liamaltarac
Tensorflow Saved model not portable with latest tf.keras.optimizers
bug
#4028
opened Mar 11, 2024 by
supercharleszhu
Unexpected Worker Failure when using Elastic Horovod + Process Sets
bug
#4021
opened Feb 7, 2024 by
Pranavug
Can I call horovod training process in proc = subprocess.Popen(command, shell=True, cwd=cwd) using command
bug
#4017
opened Jan 15, 2024 by
bit-pku-zdf
Error install horovod with python 3.11.5 on macOS 11.3.1
bug
#4013
opened Dec 22, 2023 by
DriverSong
AttributeError: module 'horovod.torch' has no attribute 'init'
bug
#4009
opened Dec 13, 2023 by
Cow-Kite
Getting error while running multi node machine learning training on H100 servers
enhancement
#3989
opened Oct 2, 2023 by
PurvagLapsiwala
Test test.integration.test_spark.SparkTests.test_dbfs_local_store broken for tensorflow>=2.13
bug
#3988
opened Sep 27, 2023 by
EnricoMi
How to write tensorflow custom training loop with using horovod.
#3987
opened Sep 21, 2023 by
PurvangL
Horovod on spark>=2.4 Barrier Execution Mode supporting
enhancement
#3982
opened Sep 13, 2023 by
max-509
Missing ranks deadlock: imbalanced data (like rank 0 has more batches than rank 1)
question
wontfix
#3980
opened Sep 6, 2023 by
fuhailin
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-04-07.