Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

FYI: Codeforces competitive programming scores (basically only) by time needed until valid solutions are posted

https://codeforces.com/blog/entry/133094

That means.. this benchmark is just saying o3 can write code faster than must humans (in a very time-limited contest, like 2 hours for 6 tasks). Beauty, readability or creativity is not rated. It’s essentially a "how fast can you make the unit tests pass" kind of competition.



Creativity is inherently rated because it's codeforces... most 2700 problems have unique, creative solutions.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: