
Pretrained embedders #41

Open
yangsenwxy opened this issue Jun 1, 2022 · 6 comments
@yangsenwxy
I have a question about your SimCLR pre-training: does it include all of the Camelyon16 data (both the training set and the test set)? I believe your feature extractor leaks test-set information. When I pre-trained only on the training set, I could not reproduce such high results. Please check this problem carefully; it may be why your results are so high.

@binli123
Owner

binli123 commented Jun 2, 2022

https://drive.google.com/drive/folders/1_mumfTU3GJRtjfcJK_M0fWm048sYYFqi
There are several model weights trained using only the training data. I also tested SimCLR using both the training set and the testing set; the difference in the results is minor. What batch size did you use? Please make sure the batch size is at least 512 and train for enough iterations to get a genuinely useful embedder from SimCLR, as pointed out in their paper. A bigger batch size and a longer training time lead to a better embedder, and they have quite a big impact on the performance of the downstream task. The best embedder we obtained was trained for 2 months because of the large number of patches.
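As context for the batch-size point: SimCLR's NT-Xent objective contrasts each positive pair against the other 2N − 2 examples in the batch, so a larger batch supplies more negatives per step. Below is a minimal NumPy sketch of the loss for illustration only; it is not the training code used in this repository.

```python
import numpy as np

def nt_xent_loss(z, temperature=0.5):
    """NT-Xent (normalized temperature-scaled cross-entropy) loss.

    z: array of shape (2N, d) holding two augmented views of N patches,
       arranged so that rows i and i+N form a positive pair.
    """
    z = z / np.linalg.norm(z, axis=1, keepdims=True)   # L2-normalize embeddings
    sim = z @ z.T / temperature                        # pairwise cosine similarities
    np.fill_diagonal(sim, -np.inf)                     # exclude self-similarity
    n = z.shape[0] // 2
    # index of the positive partner for each row
    pos = np.concatenate([np.arange(n, 2 * n), np.arange(n)])
    # cross-entropy of the positive pair against all 2N-1 other rows;
    # a bigger batch means more negatives in this sum
    logsumexp = np.log(np.exp(sim).sum(axis=1))
    return (logsumexp - sim[np.arange(2 * n), pos]).mean()
```

With well-aligned positive pairs the loss approaches its lower bound of 0, while mismatched pairs are penalized against every other patch in the batch.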

Plus, we are not the only ones who have had success with self-supervised learning on Camelyon16; see https://arxiv.org/pdf/2012.03583.pdf, where very high results were also reported.

@yangsenwxy
Author

Thank you very much. I found that with the features you extracted, training directly with the CLAM method only reaches 0.86.

@raycaohmu

Hi, are those weights trained using TCGA data?

@GeorgeBatch
Contributor

GeorgeBatch commented Feb 18, 2023

@raycaohmu

Camelyon16 weights: https://drive.google.com/drive/folders/1_mumfTU3GJRtjfcJK_M0fWm048sYYFqi

  • see folder names for magnifications

TCGA-lung weights: https://drive.google.com/drive/folders/1Rn_VpgM82VEfnjiVjDbObbBFHvs0V1OE

  • magnification: low=2.5x, high=10x
  • pre-training: v0 for 3 days, v1 for 2 weeks (better results)

@PiumiDS

PiumiDS commented Feb 25, 2023

> Camelyon16 weights: https://drive.google.com/drive/folders/1_mumfTU3GJRtjfcJK_M0fWm048sYYFqi
>
>   • see folder names for magnifications
>
> TCGA-lung weights: https://drive.google.com/drive/folders/1Rn_VpgM82VEfnjiVjDbObbBFHvs0V1OE
>
>   • magnification: low=2.5x, high=10x
>   • pre-training: v0 for 3 days, v1 for 2 weeks (better results)

Hi @GeorgeBatch,

I have seen the previous discussion on the magnification change for TCGA-lung patches. Could I please verify: when the above pre-trained model is specified as

  • magnification: low=2.5x, high=10x

does this apply only to the 20x portion of the dataset? (That is, is the pre-trained model trained on 20x/5x patches for 40x images and 10x/2.5x patches for 20x images?)

Many thanks in advance.
Piumi.

@GeorgeBatch
Contributor

GeorgeBatch commented Feb 25, 2023

Hi @PiumiDS,

> this is only for 20x patches of the whole dataset? (so the pre-trained model is trained on 20x,5x (for 40x images) and 10x,2.5x (for 20x images))

I am afraid I do not know the answer to your question myself, so we will both need to wait for @binli123's answer.

George
