There's nothing new here in terms of architecture. Whatever secret sauce is in t...

BoorishBears · 2025-08-05T21:24:45 1754429085

Part of the secret sauce since O1 has been accesss the real reasoning traces, not the summaries.

If you even glance at the model card you'll see this was trained on the same CoT RL pipeline as O3, and it shows in using the model: this is the most coherent and structured CoT of any open model so far.

Having full access to a model trained on that pipeline is valuable to anyone doing post-training, even if it's just to observe, but especially if you use it as cold start data for your own training.

anticensor · 2025-08-06T05:18:13 1754457493

Its CoT is sadly closer to that sanitised o3 summaries than to R1 style traces.

BoorishBears · 2025-08-06T09:33:03 1754472783

It has both raw and summarized traces.

anticensor · 2025-08-06T12:20:18 1754482818

I mean raw GPT-OSS is close to summarised o3.