Use this framework to avoid expensive remediation work on documents that should be HTML in the first place.
If the content is living, frequently updated, and primarily consumed online, default to HTML. If the format is legally fixed or archival, remediate the PDF with strict QA controls.