Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's actually not the largest. https://huggingface.co/google/switch-c-2048 is 1.6T parameters.


but is switch c even usable? iirc the training set was nowhere near enough for a model of that size to be coherent in a conversation




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: