Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

fp16 is overkill though. 8-bit is the sweet spot before perf degradation starts getting noticeable.


I haven't yet seen any evals comparing the original Qwen3-30B-A22B with https://ollama.com/library/qwen3:30b-a3b-q8_0




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: