
[BUG]: runtime grows to infinity #460

Open
raf-antwerpen opened this issue Nov 9, 2023 · 5 comments
@raf-antwerpen

What happened?

Hi,

PySR runs great for me when I use a relatively small dataset (k<7000), a low number of populations (<60), and low niterations (<60). For these parameters, the model converges within a couple of minutes. However, if I increase these parameters, the runtime seems to grow to infinity. For these 'too large' runs, CPU usage in the first couple of minutes is very high (the same as for the 'smaller' runs) and then drops to <1%, which makes the runtime extremely long.

I would like to run PySR on larger datasets and for longer times, but that seems impossible right now. Do you have any idea what is going on and how to avoid this happening?

Thanks so much!

Cheers,
Raf

Version

0.16.0

Operating System

macOS

Package Manager

None

Interface

Jupyter Notebook

Relevant log output

There is no log output since I do not get an error; the model just keeps running forever.

Extra Info

No response

@raf-antwerpen raf-antwerpen added the bug Something isn't working label Nov 9, 2023
@MilesCranmer (Owner)

Hi @raf-antwerpen,

Thanks for posting this bug report. Would it be possible for you to share your exact PySRRegressor settings, and maybe more info about how you installed it, how you are running it (IPython, Jupyter, or a script), and any info about CPU/memory usage during the fit? Also, is it completely frozen, or is the hall of fame csv file still being updated?

Cheers,
Miles

@MilesCranmer (Owner)

If you are running it in a Jupyter notebook, could you also try running it from the command line (i.e., save it in a .py file and execute it)? Sometimes Jupyter messes with parallelism, so it would be good to have this info.

@raf-antwerpen (Author)

Hi,

Thank you for your quick response.

Executing the script as a .py file solved the issue!

But for documentation and reporting purposes, here is some info on the bug in the Jupyter notebook.

I installed PySR using mamba and run it in a Jupyter notebook with Python 3.9.16. I tried many configurations of PySRRegressor settings, but the bug occurs even with the most minimal settings:

    from pysr import PySRRegressor

    model = PySRRegressor(
        populations=80,
        niterations=80,
    )

I am using datasets of size X=(7000,2) and y=(7000,).
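
For reference, a minimal end-to-end sketch of such a run; the synthetic data and toy target below are illustrative assumptions, not the actual dataset from this report:

    import numpy as np
    from pysr import PySRRegressor

    # Hypothetical stand-in matching the reported shapes X=(7000, 2), y=(7000,)
    rng = np.random.default_rng(0)
    X = rng.normal(size=(7000, 2))
    y = X[:, 0] ** 2 + np.cos(X[:, 1])  # assumed toy target, for illustration only

    model = PySRRegressor(
        populations=80,
        niterations=80,
    )
    model.fit(X, y)  # the call that appears to hang when run inside Jupyter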

During the first couple of minutes of running PySRRegressor, CPU usage ranges erratically between 100-400%. After a couple of minutes, it drops to 1-1.5%. This is also when the hall of fame csv stops updating. When running the script as a .py file, the CPU usage is much more consistent, staying between 500-600% instead of following the erratic pattern I see in the notebook.

Cheers,
Raf

@MilesCranmer (Owner)

Thanks for this info. Yeah, Jupyter has some weird interactions with multiprocessing, even in popular Python libraries like PyTorch, but it could be something else. It's good you found a workaround, but I'll leave this open as the original issue is still there.

I wonder if it could also be due to how text streams work differently under Jupyter. Maybe PyJulia is trying to write to stdout or read stdin, but Jupyter isn't letting it, and so it's just stuck waiting on the stream. Perhaps the following line:

https://github.com/MilesCranmer/SymbolicRegression.jl/blob/141987a3587978039712f010172b142ad09d4757/src/SymbolicRegression.jl#L555

    stdin_reader = watch_stream(stdin)

could be commented out (along with the other lines interacting with stdin_reader), so that the search, and therefore PyJulia, stops watching stdin?

If that turns out to solve it, it would certainly explain this issue: the search only starts checking stdin a few steps in, which would match the CPU going up to 400% and then dropping to 1%, the 1% just being Julia waiting to read stdin...
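
One quick way to see why blocking on stdin behaves differently under Jupyter is to compare the stream objects in both environments. A diagnostic sketch (not part of PySR):

    import sys

    # In a plain terminal session, sys.stdin is a real TTY-backed file.
    # Under a Jupyter kernel it is replaced by an ipykernel stream object
    # (the exact class varies by ipykernel version), so code that blocks
    # reading stdin can end up waiting indefinitely.
    print(type(sys.stdin))     # e.g. <class '_io.TextIOWrapper'> in a terminal
    print(sys.stdin.isatty())  # True in a terminal, False under Jupyter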

If you have some time to try this fix out, you can follow the instructions here: https://astroautomata.com/PySR/backend/ for modifying the PySR backend (which is SymbolicRegression.jl), and implement those changes. It would be interesting to hear if that stdin issue solves it.

@MilesCranmer (Owner)

One other thing to try is multithreading=False, which will switch to the multiprocessing mode.
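
A sketch of that suggestion, reusing the minimal settings from above (the procs value is illustrative, not a recommendation):

    from pysr import PySRRegressor

    model = PySRRegressor(
        populations=80,
        niterations=80,
        multithreading=False,  # switch from the default threads to multiprocessing
        procs=4,               # illustrative worker count
    )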
