forked from karpathy/llm.c
-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
NaN #4
Comments
@anthonix bump. |
Will try and reproduce -- on the list of things to do when I have some spare cycles |
Oh, I'm blind (and probably dumb). val_loss must be 0, hence the nan. so it must be something in that gpt2_validate() returning all zeros. |
In the mean time, can you verify some other training works, like AMD's tinyllama code they recently released? Or their JAX GPT2 training? |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Just doing a bit of debugging.
"val loss" output nan, so I figured start there...
But digging higher up, val_num_batches is set to 20, so I'm not sure how this is turning into nan so easily. Feels like something else is up...
The text was updated successfully, but these errors were encountered: