kopia lustrzana https://github.com/jaymody/picoGPT
Update README.
rodzic
56e8d60c26
commit
817292baea
|
@ -1,4 +1,7 @@
|
||||||
# PicoGPT
|
# PicoGPT
|
||||||
|
Accompanying blog post: [GPT in 60 Lines of Numpy](https://jaykmody.com/blog/gpt-from-scratch/)
|
||||||
|
|
||||||
|
---
|
||||||
|
|
||||||
You've seen [openai/gpt-2](https://github.com/openai/gpt-2).
|
You've seen [openai/gpt-2](https://github.com/openai/gpt-2).
|
||||||
|
|
||||||
|
@ -8,7 +11,7 @@ You've even seen [karpathy/nanoGPT](https://github.com/karpathy/nanogpt)!
|
||||||
|
|
||||||
But have you seen [picoGPT](https://github.com/jaymody/picoGPT)??!?
|
But have you seen [picoGPT](https://github.com/jaymody/picoGPT)??!?
|
||||||
|
|
||||||
`picoGPT` is an unnecessarily tiny and minimal implementation of [GPT-2](https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf) in plain [NumPy](https://numpy.org). The entire forward pass code is [40 lines of code](https://github.com/jaymody/picoGPT/blob/main/gpt2_pico.py#L3-L41). I wrote a related [blog post](https://jaykmody.com/blog/gpt-from-scratch/) for picoGPT.
|
`picoGPT` is an unnecessarily tiny and minimal implementation of [GPT-2](https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf) in plain [NumPy](https://numpy.org). The entire forward pass code is [40 lines of code](https://github.com/jaymody/picoGPT/blob/main/gpt2_pico.py#L3-L41).
|
||||||
|
|
||||||
picoGPT features:
|
picoGPT features:
|
||||||
* Fast? ❌ Nah, picoGPT is megaSLOW 🐌
|
* Fast? ❌ Nah, picoGPT is megaSLOW 🐌
|
||||||
|
|
Ładowanie…
Reference in New Issue