Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I wouldn't trust LMArena results much. They measure user preference and users are highly skewed by style, tone etc.

You can litteraly "improve" your model on LMArena by just adding a bunch of emojis.





Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: