Getting NaN values when trying to finetune

by imerad7 - opened Apr 21

Apr 21

Hello,
I'm trying to finetune this pretrained model on a classification task by adding a classification head. However, the training quickly starts giving NaN values after a few iterations. Do you know how this could be fixed ?
Thanks

imerad7

Apr 23

I found the issue. The classification head parameters were not properly initialized and had absurd values from the start which threw the training off. Manually initializing them solved it.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment