Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

But then, if an agent picks the best response, how would you know that that is reliable?


Obviously you have multiple agents justify why they picked a certain response and then create another agent that picks the solution with the best justification.


touché


You could get the agents to output something structured and then use a deterministic test if you're worried about that.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: