Skip to content
/ gpt2 Public

Reproduction of GPT2 124M parameter model. Trained with FineWeb 10B.

Notifications You must be signed in to change notification settings

nusret35/gpt2

About

Reproduction of GPT2 124M parameter model. Trained with FineWeb 10B.

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages