diff --git a/README.md b/README.md index db77277..bc3f88d 100644 --- a/README.md +++ b/README.md @@ -1,4 +1,7 @@ # PicoGPT +Accompanying blog post: [GPT in 60 Lines of Numpy](https://jaykmody.com/blog/gpt-from-scratch/) + +--- You've seen [openai/gpt-2](https://github.com/openai/gpt-2). @@ -8,7 +11,7 @@ You've even seen [karpathy/nanoGPT](https://github.com/karpathy/nanogpt)! But have you seen [picoGPT](https://github.com/jaymody/picoGPT)??!? -`picoGPT` is an unnecessarily tiny and minimal implementation of [GPT-2](https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf) in plain [NumPy](https://numpy.org). The entire forward pass code is [40 lines of code](https://github.com/jaymody/picoGPT/blob/main/gpt2_pico.py#L3-L41). I wrote a related [blog post](https://jaykmody.com/blog/gpt-from-scratch/) for picoGPT. +`picoGPT` is an unnecessarily tiny and minimal implementation of [GPT-2](https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf) in plain [NumPy](https://numpy.org). The entire forward pass code is [40 lines of code](https://github.com/jaymody/picoGPT/blob/main/gpt2_pico.py#L3-L41). picoGPT features: * Fast? ❌ Nah, picoGPT is megaSLOW 🐌