
I think there's a lot of difference between sounding like someone and being someone. The models are excellent at pretending indeed.



I don't think that sama was arguing that ChatGPT actually passed a PhD thesis defense. But arguably, it could make for an interesting benchmark.

Please do not get swayed by, or defend, the words vomited by a snake oil salesman.

Also, what benchmark? How will you design it?


exactly. this is what the whole RL thing is optimizing for, even if that is not the intent.


