It's not ironic. PDF is a container format that can hold scanned page images as well as text. Scanned documents need OCR and layout analysis. This is not a failing of the PDF format, but a problem inherent to working with print scans.
I don't claim PDF is a good format. It is inscrutable to me.