Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Compiler optimizations for 5.8ms GPT-OSS-120B inference (not on GPUs) (furiosa.ai)
9 points by olibaw 84 days ago | hide | past | favorite


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: