I gave it a photo of Tianamen Square in the present day and promoted it with: > ...

martin-t · on Dec 25, 2024

How does this even work?

Is hugging face hosting just the weights or some custom code?

If it's just weights then I don't see how it could error out, it's just math. Do these chinese models have extra code checking the output for anti-totalitarian content? Can it be turned off?

nextworddev · on Dec 25, 2024

That shows that the censorship mechanism happens at the token sampling level and not post generation, at the least

int_19h · on Dec 25, 2024

On the contrary, it shows that the censorship mechanism is post-generation and stops it once it deems the output accumulated so far "improper". It just runs after every token.