Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

“Objective benchmarks are useless, let’s argue about which one works better for me personally.”


Yes. My benchmarks and their benchmarks means AGI. Their benchmarks only means over-fitted.


Ok so what if we get different results for our own personal benchmarks/use cases.

(See why objective benchmarks exist?)


Yes, "objective" benchmarks can be gamed, real-life tasks cannot.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: