Skip to content

How to run colossalAI in kubernetes? #2542

Discussion options

You must be logged in to vote
root@test-colossal:/workspace# colossalai benchmark -g 2
_meta_registrations seems to be incompatible with PyTorch 1.11.0.
=== Benchmarking Parameters ===
gpus: 2
batch_size: 8
seq_len: 512
dimension: 1024
warmup_steps: 10
profile_steps: 50
layers: 2
model: mlp

_meta_registrations seems to be incompatible with PyTorch 1.11.0.
_meta_registrations seems to be incompatible with PyTorch 1.11.0.
[W socket.cpp:558] [c10d] The client socket has failed to connect to [localhost]:43404 (errno: 99 - Cannot assign requested address).
Traceback (most recent call last):
  File "/opt/conda/envs/pytorch/bin/colossalai", line 8, in <module>
    sys.exit(cli())
  File "/opt/conda/envs/pytorch/lib/python3.…

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
1 reply
@tswangdi
Comment options

Answer selected by tswangdi
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants