GitHub - codeboy5/transformers_grokking

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
results		results
.gitignore		.gitignore
Grokking.ipynb		Grokking.ipynb
README.md		README.md
gen_data.py		gen_data.py
model.py		model.py

Repository files navigation

Pytorch (RE)-Implementation of Grokking Phenomenon

This is a pytorch re-implementation of Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets.

I thought this would be a good paper to reproduce since this would allow me to code and train a GPT style model from scratch.

References used for the Code :-

MinGPT by Karpathy

Accuracy Loss Curves for Adam (with any weight decay)

Accuracy Loss Curves for AdamW ( λ = 1 )

About

No description, website, or topics provided.

Report repository

Releases

No releases published

Packages

No packages published

Languages