Fixes a bug that caused the binary reader not to fail cleanly when parsing incomplete containers in certain cases. #710

tgregg · 2024-01-27T02:52:15Z

Issue #, if available:
FasterXML/jackson-dataformats-binary#473

Description of changes:
Before this fix, the added test expectIncompleteContainerToFailCleanlyAfterFieldSid would fail with ArrayIndexOutOfBoundsException. The other two tests succeeded before and after the fix; I just wanted to be sure these cases were covered.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

codecov · 2024-01-27T03:00:59Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (3c1b6b1) 67.23% compared to head (8b5c43e) 67.25%.
Report is 5 commits behind head on master.

❗ Current head 8b5c43e differs from pull request most recent head 08752c2. Consider uploading reports for the commit 08752c2 to get more accurate results

Additional details and impacted files

@@             Coverage Diff              @@
##             master     #710      +/-   ##
============================================
+ Coverage     67.23%   67.25%   +0.01%     
- Complexity     5484     5487       +3     
============================================
  Files           159      159              
  Lines         23025    23027       +2     
  Branches       4126     4127       +1     
============================================
+ Hits          15481    15486       +5     
+ Misses         6262     6259       -3     
  Partials       1282     1282


zslayton · 2024-01-29T21:26:08Z

src/main/java/com/amazon/ion/impl/IonCursorBinary.java

@@ -1568,6 +1568,9 @@ private boolean uncheckedNextToken() {
            if (uncheckedNextContainedToken()) {
                return false;
            }
+            if (peekIndex >= limit) {
+                throw new IonException("Malformed data: declared length exceeds the number of bytes remaining in the stream.");


Would

...exceeds the number of bytes remaining in the container.

be more accurate?

jobarr-amzn · 2024-01-30

It took me a while to understand why you made this suggestion. It's because we're in an unchecked method, which means that we already believe we have enough data to finish the container. Therefore, if peekIndex >= limit, we're definitely past the container bounds, because implicitly parent.endIndex < limit?

Another thing strikes me about this, though: we check the same condition peekIndex >= limit on line 1551, and from my read the only thing that can change that condition in between there and here is this clause of uncheckedNextContainedToken():

    } else if (parent.typeId.type == IonType.STRUCT) {
        if (minorVersion == 0) {
            byte b = buffer[(int) peekIndex++];
            if (b < 0) {
                fieldSid = (b & LOWER_SEVEN_BITS_BITMASK);
            } else {
                fieldSid = (int) uncheckedReadVarUInt_1_0(b);
            }
        } else {
            uncheckedReadFieldName_1_1();
        }
    }

We must be in minorVersion == 0, and I note that uncheckedReadVarUInt_1_0(b) already contains a malformed data check. Does that mean that this error condition is discovered only when we've just read a 1-byte VarUInt field name, in the if (b < 0) { case above? In that case, should this error check go there, in uncheckedNextContainedToken()?
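
For readers unfamiliar with the encoding being discussed, here is a minimal sketch of Ion 1.0 VarUInt decoding with a bounds check before each byte. It is an illustration only, not the code in IonCursorBinary: the class VarUIntSketch, the Result holder, the readVarUInt signature, and the use of IllegalStateException in place of IonException are all assumptions made for this example. It shows why a byte with b < 0 (high bit set, since Java bytes are signed) ends the value, which is the one-byte case asked about above.

```java
// Hypothetical, simplified model of VarUInt decoding; not the ion-java implementation.
final class VarUIntSketch {
    private static final int LOWER_SEVEN_BITS_BITMASK = 0x7F;

    /** Holds the decoded value and the index of the first byte after it. */
    static final class Result {
        final long value;
        final int nextIndex;

        Result(long value, int nextIndex) {
            this.value = value;
            this.nextIndex = nextIndex;
        }
    }

    /**
     * Decodes a VarUInt starting at index. Each byte contributes seven bits,
     * most significant group first; the final byte has its high bit set.
     * limit is the exclusive end of the bytes that may be read.
     */
    static Result readVarUInt(byte[] buffer, int index, int limit) {
        long value = 0;
        while (true) {
            // Check before every byte, so truncated input fails with a clear
            // message instead of an ArrayIndexOutOfBoundsException.
            if (index >= limit) {
                throw new IllegalStateException(
                    "Malformed data: VarUInt runs past the available bytes.");
            }
            byte b = buffer[index++];
            value = (value << 7) | (b & LOWER_SEVEN_BITS_BITMASK);
            if (b < 0) { // High bit set: this was the final byte.
                return new Result(value, index);
            }
        }
    }
}
```

With this sketch, the single byte 0x84 decodes to 4, and the two bytes 0x01 0x80 decode to 128. A lone 0x01 with nothing after it raises the IllegalStateException rather than indexing past the array.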

zslayton · 2024-01-30

This code appears in the else branch which handles values below the top level:

    if (parent == null) { // Depth 0
        // ...
    } else {
        // the new code
    }

I wasn't certain whether limit was the end of the stream's data or the end of the container's data in the buffer. Based on the location of the code, the latter seemed plausible.

> Another thing strikes me about this, though: we check the same condition peekIndex >= limit on line 1551, and from my read the only thing that can change that condition in between there and here is this clause of uncheckedNextContainedToken():

I wondered if reset() might affect peekIndex or limit, but didn't investigate further.

tgregg · 2024-02-02

> Would
>
> ...exceeds the number of bytes remaining in the container.
>
> be more accurate?

Yes, this is more accurate.

> We must be in minorVersion == 0, and I note that uncheckedReadVarUInt_1_0(b) already contains a malformed data check. Does that mean that this error condition is discovered only when we've just read a 1-byte VarUInt field name, in the if (b < 0) { case above? In that case, should this error check go there, in uncheckedNextContainedToken()?

I preferred to put this check in the proposed location, rather than in uncheckedNextContainedToken(), because the check protects the line that immediately follows (the access to the buffer at peekIndex). Note: uncheckedReadVarUInt_1_0 performs the check before each byte it consumes, but the added check protects access to the first byte after the field name.
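
The placement argument can be illustrated with a small, self-contained model. This is a sketch under stated assumptions, not the actual IonCursorBinary logic: StructCursorSketch and nextFieldAndTypeId are hypothetical names, only one-byte field SIDs are modeled, and IllegalStateException stands in for IonException. The point it shows is that the check sits immediately before the buffer access it protects, namely the read of the first byte after the field name.

```java
// Hypothetical, simplified model of the check discussed above; not ion-java code.
final class StructCursorSketch {
    private final byte[] buffer;
    private final int limit; // Exclusive end of the available bytes.
    private int peekIndex;

    int fieldSid = -1;
    int typeId = -1;

    StructCursorSketch(byte[] buffer, int start, int limit) {
        this.buffer = buffer;
        this.peekIndex = start;
        this.limit = limit;
    }

    /** Reads a one-byte field SID, then the type ID byte of the field's value. */
    void nextFieldAndTypeId() {
        if (peekIndex >= limit) {
            throw new IllegalStateException("No bytes remain in the container.");
        }
        byte b = buffer[peekIndex++];
        if (b >= 0) {
            // Multi-byte SIDs are outside the scope of this sketch.
            throw new UnsupportedOperationException("Multi-byte field SIDs are not modeled.");
        }
        fieldSid = b & 0x7F;

        // The analogue of the added check: the field SID consumed the last
        // available byte, so there is no type ID byte left to read.
        if (peekIndex >= limit) {
            throw new IllegalStateException(
                "Malformed data: declared length exceeds the number of bytes remaining in the container.");
        }
        typeId = buffer[peekIndex++] & 0xFF;
    }
}
```

Given the bytes 0x84 0x20, the sketch reports field SID 4 and type ID 0x20. Given only 0x84, it throws the IllegalStateException. Without the second check, the same input would fail with ArrayIndexOutOfBoundsException on the following buffer access.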


tgregg mentioned this pull request Jan 27, 2024

Update IonFuzz_473_66131_AIOOBE_Test to provoke an ArrayIndexOutOfBoundsException FasterXML/jackson-dataformats-binary#477

Merged

zslayton approved these changes Jan 29, 2024


tgregg force-pushed the fix-aioobe branch 2 times, most recently from 08af8cb to 154663e Compare February 2, 2024 01:24

Fixes a bug that caused the binary reader not to fail cleanly when parsing incomplete containers in certain cases. (08752c2)

tgregg force-pushed the fix-aioobe branch from 154663e to 08752c2 Compare February 2, 2024 01:30

tgregg merged commit d0a4a4a into master Feb 2, 2024
23 of 32 checks passed

tgregg deleted the fix-aioobe branch February 2, 2024 01:40


tgregg commented Jan 27, 2024

codecov bot commented Jan 27, 2024

zslayton Jan 29, 2024

jobarr-amzn Jan 30, 2024

zslayton Jan 30, 2024

tgregg Feb 2, 2024
