main
jaymody 2023-04-24 16:05:45 -04:00
parent 56e8d60c26
commit 817292baea
1 changed file with 4 additions and 1 deletion

@@ -1,4 +1,7 @@
# PicoGPT
Accompanying blog post: [GPT in 60 Lines of Numpy](https://jaykmody.com/blog/gpt-from-scratch/)
---
You've seen [openai/gpt-2](https://github.com/openai/gpt-2).
@@ -8,7 +11,7 @@ You've even seen [karpathy/nanoGPT](https://github.com/karpathy/nanogpt)!
But have you seen [picoGPT](https://github.com/jaymody/picoGPT)??!?
`picoGPT` is an unnecessarily tiny and minimal implementation of [GPT-2](https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf) in plain [NumPy](https://numpy.org). The entire forward pass code is [40 lines of code](https://github.com/jaymody/picoGPT/blob/main/gpt2_pico.py#L3-L41). I wrote a related [blog post](https://jaykmody.com/blog/gpt-from-scratch/) for picoGPT.
`picoGPT` is an unnecessarily tiny and minimal implementation of [GPT-2](https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf) in plain [NumPy](https://numpy.org). The entire forward pass code is [40 lines of code](https://github.com/jaymody/picoGPT/blob/main/gpt2_pico.py#L3-L41).
picoGPT features:
* Fast? ❌ Nah, picoGPT is megaSLOW 🐌