PdfReader.Open fails with error "Token '60' was not expected #211

mikethea1 · 2024-11-19T14:10:26Z

Reporting an Issue Here

Expected Behavior

The PDF opens without error, or with an exception that indicates that this is something PdfSharp does not support.

The PDF

Actual Behavior

PdfSharp.Pdf.IO.PdfReaderException: Token '60' was not expected.
   at PdfSharp.Internal.ParserDiagnostics.ThrowParserException(String message) in D:\THHO\Repos\PDFsharp\src\foundation\src\PDFsharp\src\PdfSharp\Internal\Diagnostics.cs:line 61
   at PdfSharp.Pdf.IO.Parser.ReadObjectInternal(PdfObject pdfObject, PdfObjectID objectID, Boolean includeReferences, Boolean fromObjectStream, SuppressExceptions suppressObjectOrderExceptions) in D:\THHO\Repos\PDFsharp\src\foundation\src\PDFsharp\src\PdfSharp\Pdf.IO\Parser.cs:line 337
   at PdfSharp.Pdf.IO.Parser.ReadIndirectObject(PdfReference pdfReference, SuppressExceptions suppressObjectOrderExceptions, Boolean withoutDecrypting) in D:\THHO\Repos\PDFsharp\src\foundation\src\PDFsharp\src\PdfSharp\Pdf.IO\Parser.cs:line 932
   at PdfSharp.Pdf.IO.Parser.ReadAllIndirectObjects() in D:\THHO\Repos\PDFsharp\src\foundation\src\PDFsharp\src\PdfSharp\Pdf.IO\Parser.cs:line 1020
   at PdfSharp.Pdf.IO.PdfReader.OpenFromStream(Stream stream, String password, PdfDocumentOpenMode openMode, PdfPasswordProvider passwordProvider, PdfReaderOptions options) in D:\THHO\Repos\PDFsharp\src\foundation\src\PDFsharp\src\PdfSharp\Pdf.IO\PdfReader.cs:line 379
   at PdfSharp.Pdf.IO.PdfReader.OpenFromFile(String path, String password, PdfDocumentOpenMode openMode, PdfPasswordProvider passwordProvider) in D:\THHO\Repos\PDFsharp\src\foundation\src\PDFsharp\src\PdfSharp\Pdf.IO\PdfReader.cs:line 251
   at PdfSharp.Pdf.IO.PdfReader.Open(String path, String password, PdfDocumentOpenMode openMode, PdfPasswordProvider passwordProvider, PdfReaderOptions options) in D:\THHO\Repos\PDFsharp\src\foundation\src\PDFsharp\src\PdfSharp\Pdf.IO\PdfReader.cs:line 189
   at PdfSharp.Pdf.IO.PdfReader.Open(String path, PdfDocumentOpenMode openMode, PdfReaderOptions options) in D:\THHO\Repos\PDFsharp\src\foundation\src\PDFsharp\src\PdfSharp\Pdf.IO\PdfReader.cs:line 166

Steps to Reproduce the Behavior

If there's interest in looking into this I can share the file privately via the mechanism described here: https://github.com/empira/PDFsharp.IssueSubmissionTemplate

Based on #207 it isn't clear to me what is considered a potential bug vs something PdfSharp deliberately doesn't support, so apologies in advance if this behavior is expected. Hopefully this can be a quick close-won't fix in that case and the issue can serve as documentation for others who encounter this.

The text was updated successfully, but these errors were encountered:

TH-Soft · 2024-11-19T14:17:11Z

The file is corrupted and this is not a bug in PDFsharp. In issue 207, the file is also corrupted, but PDFsharp has been updated to correct the wrong information in the PDF and read the file nevertheless.

Without PDF file, we cannot investigate what's going on. Depending on what is wrong with the PDF, there may be a way to modify PDFsharp to read it anyway.

mikethea1 · 2024-11-19T16:30:10Z

Thanks @TH-Soft I hadn't realized that #207 resulted in an update!

I've emailed the PDF behind this issue as well as another error I encountered to the email address mentioned on https://github.com/empira/PDFsharp.IssueSubmissionTemplate. Hopefully that helps.

For my understanding, what is the best way to engage on these parsing issues in a helpful way?

I completely understand the perspective that not being able to handle a corrupted file doesn't indicate a library bug, but given the absolute lawlessness of PDFs encountered out in the wild having libraries that are similarly robust to the PDF viewers customers are used to (e.g. in Chrome) is certainly handy.

I'm happy to send over weird files I encounter to help the library improve, but at the same time I don't want to bother the maintainers with yet more instances of errors you've seen before and perhaps decided explicitly not to accommodate.

TH-Soft added the Cannot Reproduce https://xkcd.com/583/ label Nov 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PdfReader.Open fails with error "Token '60' was not expected #211

PdfReader.Open fails with error "Token '60' was not expected #211

mikethea1 commented Nov 19, 2024

TH-Soft commented Nov 19, 2024

mikethea1 commented Nov 19, 2024

PdfReader.Open fails with error "Token '60' was not expected #211

PdfReader.Open fails with error "Token '60' was not expected #211

Comments

mikethea1 commented Nov 19, 2024

Reporting an Issue Here

Expected Behavior

Actual Behavior

Steps to Reproduce the Behavior

TH-Soft commented Nov 19, 2024

mikethea1 commented Nov 19, 2024