Parses a PDF document and returns its structured representation.
Produces the following sections:
%PDF-X.Y
obj … endobj
xref
startxref
%%EOF
Object scanning is capped at 200 entries and tree nodes at 100 to avoid O(n²) performance on large files.
Raw PDF document bytes.
A ParsedStructure with format set to 'PDF' and entryPoint set to undefined.
format
'PDF'
entryPoint
undefined
Parses a PDF document and returns its structured representation.
Produces the following sections:
%PDF-X.Yversion line and optional binary-hint comment.obj … endobjblocks (100 added as tree nodes).xrefsection, if present.startxrefkeyword through%%EOF.Object scanning is capped at 200 entries and tree nodes at 100 to avoid O(n²) performance on large files.