Wow really? The agentic coding work that has come out in the last year are super impressive to me.
And before it didn’t seem to understand the fundamentals of Torch well, not well enough to do novel work. Now with Codex in high it absolutely does, and MLE bench reflects that
And before it didn’t seem to understand the fundamentals of Torch well, not well enough to do novel work. Now with Codex in high it absolutely does, and MLE bench reflects that