
Improve itself through experimentation, using reinforcement learning. Humans improve this way too, and AlphaZero already does it through self-play.


There is a substantial amount of research in that area. You will see world-shattering results in a few years.

Current SOTA: https://openai.com/blog/vpt/



