Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Attention is all you need and GPT-2 were well known at that point. Many might doubt whether this approach leads to "general" intelligence – depends on the definition.

BTW, Karpathy has a nice video tutorial about building an LLM: https://www.youtube.com/watch?v=kCc8FmEb1nY



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: