Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Wow. Close to a Qwen3 distill with 75% the size. That's great!

I've been using the smollm base models for my own finetunes just because they're so high quality, it looks like I might be using them to drive local agents/code completion in the near future too.

Their RL algorithm looks interesting. I'm still using OpenAI's algorithm for my stuff, I've been meaning to check on the SoTA since I know my code is pretty outdated (It's crazy how fast that happens with this stuff.)



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: