
So I'm thinking of something like a locked-room mystery, where the idea is that it's solvable and the reader is given a chance to solve it.

The reason it seems like an interesting benchmark is that it's a puzzle presented in a long context. It's like testing whether an LLM is at a Sherlock Holmes level of world and motivation modelling.


