Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

.


And is also not using an LLM. It's neural machine translation.


NMT is a category containing both transformers and deep RNN. The Mozilla translation models are transformer LLM NMTs trained via Marian https://marian-nmt.github.io/ (ref: https://github.com/mozilla/translations/blob/main/docs/READM...)


NMT doesn't "contain" tranformers and deep RNNs, it can use them. LLMs use a transformer architecture, not everything using a transformer architecture is an LLM. NMT can actually use an LLM, but that's not the case according to the documentation you linked, they use a parallel dataset to train their models.


> they use a parallel dataset to train their models

If you want to be pedantic you should look up the LLM definition.


Care to explain why?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: