Hacker Newsnew | past | comments | ask | show | jobs | submit | bazlightyear's commentslogin

Current frontier models are extraordinarily good at performing the surface appearance of competent software engineering.


BTW it looks like Kimi won the subsequent challenge too https://aicc.rayonnant.ai/challenges/hexquerques/


Are you tests and results open source?


Test result summaries are openly available, test environments are not.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: