Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The intention for this playground was to let people try the model. We actually have auto moderation on the user facing platform (https://play.ht/) and malicious text gets blocked and the user get flagged.


Except this post is 8 hours old and I'm still able to view this link.


17 hours old, still there.


Another 8 hours and I can still see it too.


How would your auto moderation detect that example is malicious?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: