Is the training procedure result normal? Masked regions do not improve and appear to be random noise. #190

junzhin · 2024-02-08T14:45:30Z

Hello,

thank you for the great work and the great repo.

I attempted to reproduce the pre-training of MAE-ViT-Large and trained for 68 epochs on a chest X-ray dataset of about 300k medical images. Without pixlossnorm, the loss stopped improving once it reached around 0.0045. Additionally, the reconstructions fail to predict the masked regions correctly.

Could you suggest why this might be happening?

CristoJV · 2024-03-25T10:16:44Z

Hi @junzhin,

I'm facing the same problem.
I pre-trained on 500k face images for 20 epochs, resuming from the authors' pretrained MAE-ViT-Base model.

Did you manage to solve it?

EDIT: I fixed it by disabling pixlossnorm.

Thank you!
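
A possible reading of the fix above, not confirmed anywhere in this thread: assuming "pixlossnorm" refers to a per-patch normalized pixel loss, the model is trained to predict each patch after subtracting that patch's mean and dividing by its standard deviation. Predictions then live in that normalized space and can look like noise when pasted directly into an image, unless they are mapped back using the statistics of the original patch. The plain-Python sketch below only illustrates that idea; the helper names `normalize_patch` and `denormalize_patch`, the `EPS` constant, and the pixel values are all hypothetical and are not taken from the repo.

```python
from statistics import fmean, variance

EPS = 1e-6  # assumed small constant to avoid division by zero


def normalize_patch(pixels):
    """Normalize one flattened patch by its own mean and sample variance."""
    mean = fmean(pixels)
    var = variance(pixels)
    scale = (var + EPS) ** 0.5
    return [(p - mean) / scale for p in pixels], mean, var


def denormalize_patch(pred, mean, var):
    """Map a prediction in normalized space back to pixel space."""
    scale = (var + EPS) ** 0.5
    return [p * scale + mean for p in pred]


# A dark, low-contrast patch (made-up values in the [0, 1] pixel range).
patch = [0.10, 0.11, 0.12, 0.13]

# Training target under the normalized loss: zero mean, roughly unit variance.
target, mean, var = normalize_patch(patch)

# Even a perfect prediction equals the normalized target, not the pixels.
pred = list(target)

# Viewing `pred` directly shows values outside the pixel range;
# de-normalizing with the patch statistics recovers the original patch.
restored = denormalize_patch(pred, mean, var)

print("normalized target:", [round(t, 3) for t in target])
print("restored pixels:  ", [round(r, 3) for r in restored])
```

If this assumption holds, it would explain why reconstructions look reasonable once the normalization is turned off: the prediction is then already in pixel space and needs no per-patch statistics to be displayed.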

ats4869 · 2024-05-18T09:33:14Z

How do you fine-tune the reconstruction model on your own dataset? As far as I can see, main_finetune.py can only produce models for classification tasks.
