
The ones I had in mind were the newer 'Chinchilla' scaling laws (https://arxiv.org/pdf/2203.15556.pdf) vs. the older 'Kaplan' scaling laws (https://arxiv.org/pdf/2001.08361.pdf).



