RuntimeError! #219

SomnusQue · 2024-01-21T11:16:12Z

I ran auto_100weight_inherit_100to75.sh and hit this error. I think I have set up everything for this project, but there are still problems I can't solve. Could somebody please help?

SomnusQue · 2024-01-21T11:20:18Z

wkcn · 2024-01-21T12:13:18Z

Hi @SomnusQue , thanks for your attention to our work!

Are you using the latest TinyCLIP code?

This is a bug that is triggered on PyTorch 2.x.
We fixed it by adding this line: https://github.com/microsoft/Cream/blob/main/TinyCLIP/src/open_clip/model.py#L28

checkpoint = functools.partial(checkpoint, use_reentrant=False)
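For readers who want to see what that one-liner does, here is a minimal sketch. It assumes `checkpoint` is `torch.utils.checkpoint.checkpoint`; the fallback function is a hypothetical stand-in (not PyTorch code) so the snippet also runs on a machine without PyTorch.

```python
import functools

try:
    # Real PyTorch checkpoint, if PyTorch is installed.
    from torch.utils.checkpoint import checkpoint
except ImportError:
    # Hypothetical stand-in so the sketch runs without PyTorch:
    # it simply calls the function and ignores use_reentrant.
    def checkpoint(function, *args, use_reentrant=True, **kwargs):
        return function(*args, **kwargs)

# Same one-line fix as above: bind use_reentrant=False once, so every
# existing checkpoint(...) call site picks it up without being edited.
checkpoint = functools.partial(checkpoint, use_reentrant=False)

# The bound keyword is visible on the partial object.
bound_keywords = checkpoint.keywords
print(bound_keywords)
```

Because the name `checkpoint` is rebound at module level, the rest of the module does not need to change.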

SomnusQue · 2024-01-21T12:44:47Z

OMG! The author answered my question! The code I have really doesn't contain these lines! Thanks for your patience!
But I'm wondering, when was the code updated?

SomnusQue · 2024-01-21T12:48:29Z

Furthermore... Is this LOSS normal?

wkcn · 2024-01-21T12:57:59Z

@SomnusQue I fixed the bug on Jan. 11, 2024 (https://github.com/microsoft/Cream/pull/218/files#diff-2c756c8b8b99609dee1b59ce4dcfaf773aa9afbc84e093e03e3e0de653fa0124R28).

You can visualize the loss curve in wandb. The loss is normal if it is decreasing : )

SomnusQue · 2024-01-21T13:48:09Z

Thanks for your patience! Because of my cluster, I can't use wandb (I think it needs network access), so I changed '--report-to wandb' to '--report-to tensorboard' in the .sh file. Does anything else in the code need to change?

wkcn · 2024-01-21T15:54:22Z

@SomnusQue
No code change is required. Alternatively, you can set the environment variable WANDB_MODE=offline. The wandb log will then be saved to a file, and you can run wandb sync <file path> to upload it.
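As a rough sketch of the offline option, the variable can be set for the training process from a small Python launcher. The way the script is invoked here (`bash` plus the script name from this thread, run from the current directory) is an assumption; adjust it to your setup.

```python
import os
import shlex

# Copy the current environment and switch wandb to offline mode for the child process.
env = dict(os.environ, WANDB_MODE="offline")

# Script name taken from this thread; its location is an assumption.
train_cmd = ["bash", "auto_100weight_inherit_100to75.sh"]

print("WANDB_MODE=" + env["WANDB_MODE"], shlex.join(train_cmd))

# To actually launch training, uncomment:
# import subprocess
# subprocess.run(train_cmd, env=env, check=True)
```

Once the run finishes, the saved log can be uploaded from a machine with network access using wandb sync <file path>, as described above.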

SomnusQue · 2024-01-22T03:04:18Z

Sorry to bother you again...

The results in TensorBoard look like something went wrong...

This is the final epoch of my training run.

SomnusQue · 2024-01-22T08:05:44Z

This is our bash file. Is something wrong with it?

wkcn · 2024-01-22T12:19:50Z

Sorry, I have not tested TensorBoard yet.

The training data in the provided script is synthetic.
It should be replaced with real data by using the following arguments:

 --train-data <your yfcc_path or laion_path/> \
 --dataset-type webdataset \
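To make the shape of these two arguments concrete, here is a small sketch that assembles them. The shard path is hypothetical: it assumes the dataset is stored as numbered .tar shards addressed with a brace pattern, as in the open_clip README examples. Substitute your own YFCC or LAION location.

```python
import shlex

# Hypothetical shard pattern; replace it with wherever your .tar shards live.
train_data = "/path/to/laion/{00000..00099}.tar"

# The two flags from the comment above, as an argument list.
data_args = ["--train-data", train_data, "--dataset-type", "webdataset"]

# shlex.join quotes the pattern so the shell passes the braces through unexpanded.
print(shlex.join(data_args))
```

Quoting matters here because the brace pattern should reach the data loader as a single argument rather than being expanded by the shell.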

SomnusQue · 2024-01-22T12:36:49Z

I downloaded the LAION files and put them in the path '/.cache/clip/'.
Is this the path I need to write?

wkcn · 2024-01-23T01:53:23Z

@SomnusQue
Please refer to the document https://github.com/mlfoundations/open_clip?tab=readme-ov-file#data
