
This is not ideal. If someone wants to fact-check a controversial claim, filtering it only makes things worse.


Controversial != malicious. It sounds like they never intended the filter to trigger on the former, and they already said they'll look into why it did for this prompt.

Filtering malicious (not controversial) usage is ideal, since letting users flood all of the AI services with jailbreak/against-ToS query attempts can be bad news for your API keys (and is likely a waste of money, given the failure rate of such queries).


This is just a dude's hobby project, chill.


The issue is not about this project. This is a fundamental problem with LLMs. Leaving the decision of what is malicious to LLMs is not ideal.



