Hacker News
AI Agent Reliability Tracker (princeton.edu)
1 point by smartmic 52 days ago | 1 comment


> recent capability gains have yielded only small improvements in reliability.

Have I missed something? Why would one expect capability gains to improve reliability at all?



