
To be precise, the claim that ChatGPT 3.5 Turbo is a 20B model is officially a mistake by a Microsoft researcher, who quoted an incorrect source published before ChatGPT 3.5 Turbo was released. It's up to you whether to believe that, but I wouldn't claim it's a 20B model on the authority of Microsoft researchers.

The withdrawn paper: https://arxiv.org/abs/2310.17680

The wrong source: https://www.forbes.com/sites/forbestechcouncil/2023/02/17/is...

The discussion: https://www.reddit.com/r/LocalLLaMA/comments/17jrj82/new_mic...



It's interesting how the paper was completely retracted instead of just being corrected.


Yep. It feels like a 20B parameter model.



