Hacker News | monadicmonad's submissions
1. Experimenting with policy gradient methods in Jax (github.com/elliotvilhelm)
2 points by monadicmonad 9 months ago
2. Policy Evaluation in Grid World (github.com/elliotvilhelm)
1 point by monadicmonad on Nov 18, 2024
