Skip to content

Worker 21 terminated.Unhandled Task ERROR: EOFError: read end of file #241

Answered by mloubout
tjk9501 asked this question in Q&A
Discussion options

You must be logged in to vote

addprocs add julia workers so it will process shots in parallel. However, If you are on a single node now, the resources are shared by all these workers, including memory and CPU cores, and will compete for it. So it will get quite slower because now the process spends time waiting for an available core, and you will run out of memory much quicker than if you were using x32 or x64 more memory.

So if you use addprocs you need to make sure that:

  • each julia worker is using a separate set of cores (with i.e GOMP_CPU_AFFINITY , you can see an example here in section 3
  • there is enough memory for all these workers
  • the slow down (less cpus per worker) is worth the shot parallelization

Hope this…

Replies: 2 comments 18 replies

Comment options

You must be logged in to vote
7 replies
@tjk9501
Comment options

@mloubout
Comment options

@tjk9501
Comment options

@mloubout
Comment options

@mloubout
Comment options

Comment options

You must be logged in to vote
11 replies
@tjk9501
Comment options

@mloubout
Comment options

@tjk9501
Comment options

@mloubout
Comment options

Answer selected by tjk9501
@tjk9501
Comment options

@mloubout
Comment options

@tjk9501
Comment options

@mloubout
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants