
Am I reading this wrong, or does this only support FP16 inputs and compare its performance against an FP32 solver?


They compare HGEMM implementations. cuBLAS, at least, has HGEMM functions.

HGEMM means half-precision (i.e. FP16) general matrix multiplication.
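
To make the naming concrete, here is a rough NumPy sketch of the same GEMM operation run on FP32 inputs (what BLAS calls SGEMM) and on FP16 inputs (HGEMM). The `gemm` helper, the matrix sizes, and the random inputs are made up for illustration; this is not how cuBLAS or the project under discussion implements it, only the arithmetic the names refer to.

```python
import numpy as np

def gemm(alpha, a, b, beta, c):
    # Hypothetical helper: C <- alpha * (A @ B) + beta * C,
    # computed in whatever dtype the input arrays carry.
    return alpha * (a @ b) + beta * c

rng = np.random.default_rng(0)
a32 = rng.standard_normal((64, 64)).astype(np.float32)
b32 = rng.standard_normal((64, 64)).astype(np.float32)
c32 = np.zeros((64, 64), dtype=np.float32)

# "SGEMM": single-precision (FP32) inputs and output.
sgemm_out = gemm(1.0, a32, b32, 0.0, c32)

# "HGEMM": the same operation on half-precision (FP16) inputs and output.
a16, b16, c16 = (x.astype(np.float16) for x in (a32, b32, c32))
hgemm_out = gemm(1.0, a16, b16, 0.0, c16)

print(sgemm_out.dtype, hgemm_out.dtype)
```

So a fair benchmark compares an FP16 routine against another FP16 routine, not against the FP32 one.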



