Hacker News

Hallucinations have been more or less a solved problem for me ever since I made a simple harness that has Codex/Claude check its own work with static typechecking.
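The commenter doesn't share the harness, but the shape is easy to sketch: loop the model against a checker until the checker reports no errors. A minimal sketch — `repair_loop`, `fake_model`, and the `compile()`-based `syntax_check` are all hypothetical stand-ins (a real harness would call an actual model and shell out to a real typechecker like `mypy` or `tsc`):

```python
from typing import Callable

def repair_loop(generate: Callable[[str], str],
                check: Callable[[str], str],
                max_rounds: int = 3) -> str:
    """Generate code, feed checker diagnostics back, stop when clean."""
    feedback = ""
    for _ in range(max_rounds):
        code = generate(feedback)   # model call; stubbed in the demo below
        feedback = check(code)      # empty string means the checker passed
        if not feedback:
            return code
    raise RuntimeError("checker still failing: " + feedback)

def syntax_check(code: str) -> str:
    """Stand-in verifier: compile() catches syntax errors only."""
    try:
        compile(code, "<candidate>", "exec")
        return ""
    except SyntaxError as exc:
        return str(exc)

# Demo: a fake "model" that produces broken code on the first try and a
# fixed version once it sees any diagnostic feedback.
def fake_model(feedback: str) -> str:
    return "def f(x): return x + 1" if feedback else "def f(x) return x"

print(repair_loop(fake_model, syntax_check))
```

The point of the loop is that the checker, not the model, decides when to stop — the model only ever sees diagnostics, never a green light it can hallucinate.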




But there aren’t very many domains where this type of verification is even possible.

Then you apply LLMs in the domains where things can be checked.

Indeed, I expect to see a huge push into formally verified software, precisely because sound mathematical proofs provide an excellent verifier to put into an LLM harness. Just look at how successful Aristotle has been at math; the same approach could be applied to coding.

Maybe Lean will become the new Python
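What makes Lean attractive as a verifier is that the kernel either accepts a proof or rejects it, with no middle ground for a model to talk its way around. A toy illustration (assuming Lean 4, where `Nat.add_comm` ships with the core library):

```lean
-- The kernel checks this mechanically; a hallucinated proof term
-- would simply fail to elaborate.
theorem add_comm_example (a b : Nat) : a + b = b + a :=
  Nat.add_comm a b
```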

https://harmonic.fun/news#blog-post-verina-bench-sota


  "LLMs reliably fail at abstraction."
  "This limitation will go away soon."
  "Hallucinations haven't."
  "I found a workaround for that."
  "That doesn't work for most things."
  "Then don't use LLMs for most things."

Um, yes? Except 'most things' doesn't amount to much by volume.


