SA Bugzilla – Bug 7960
PDFInfo misses valid metadata
Last modified: 2022-03-03 08:41:29 UTC
As reported on the Users' mailing list by Ricky Boone on 2022-03-02, the PDFInfo plugin fails to parse out much of the metadata from a sizable fraction of today's PDFs. I've fixed this in r1898546 by removing the optimization (no longer valid) of skipping lines in the PDF with high-bit-set characters.
Committed some more cleanups. The PDF "parsing" is unbelievably naive, so I did the same for UTF-16 decoding.. I guess it does the job for now ¯\_(ツ)_/¯ Sending trunk/lib/Mail/SpamAssassin/Plugin/PDFInfo.pm Transmitting file data .done Committing transaction... Committed revision 1898557.