Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That's the final, fine-tuned model. The base model (pretraining only, no instruction SFT, RLHF, RLVR etc) is this one: https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp-Base It's apparently not offered at any inference provider, nor are older DeepSeek base models.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: