http: Skip req and beresp trailers #4125
base: master
Conversation
All failing platforms appear to overflow the workspace used in the h2 test case, except the sanitizers job that fails on something else. I will have a closer look later.
As it turned out, the failing tests were exposing a race revolving around the request body status. I took care of that and added a couple of patches next to the ones picked from #3798 at the beginning of this patch series. I'm no longer able to stress test cases and cause failures, and I believe the race is gone. I have not yet studied the remaining CI failure, triggering a panic from the workspace emulator. I suspect either a logic error in the workspace pipelining (expanding from req keep-alive only to req and beresp trailers) or an error in how it translates in the workspace emulator. I also started an effort to improve feature parity between HTTP/1 and h2.
As I suspected, the emulated workspace was at fault.
bugwash: The following commits can be merged: 23ec1fc...9875258. The workspace refactoring commits can be submitted independently.
I'm fine with you committing those first six commits while we wait for @nigoroll
As per bugwash:
Until now, we read the (CR)?LF at the end of a chunk as part of the next chunk header (see: /* Skip leading whitespace */). For a follow up commit, we are going to want to know if the next chunk header is available for read, so we now consume the chunk end as part of the chunk itself. This also fixes a corner case: We previously accepted chunks with a missing end-of-chunk (see fix of r01729.vtc). Ref: https://datatracker.ietf.org/doc/html/rfc7230#section-4.1
... which we are going to need in a follow up commit. No functional changes, diff best viewed with -b
While working on request body caching improvements, it was noticed that, for chunked encoding, we did not know the request body size before we attempted to read at least one additional byte, because the chunked fetch did not return VFP_END with the last chunk, but rather only as a zero-length additional chunk. This commit changes chunked encoding processing to return VFP_END opportunistically if the next chunk header is available. Implementation: Unless there is pipeline readahead data available, the test for available data implies a poll(2) call. Relative to the existing cost of the current implementation, this cost is not considered relevant. To improve efficiency, we should consider a generic readahead for chunked encoding, probably by making the file descriptor non-blocking and adding a readahead buffer.
It seems this test now more frequently shows no data loss, which I hope is fine for the purpose of the test?
Now that WS_Pipeline() can be used on backend side too, we may have situations where we run out of workspace when copying pipelined data. We should gracefully handle these failures by failing the task instead of panicking. For existing fail-safe call sites, we check the result.
This will be the method responsible for reading and parsing http trailers. This commit only prepares the structure, the method is implemented in follow up commits.
This will allow us to check that we have received a complete end of chunked body, possibly including HTTP trailers. Co-authored-by: Dridi Boukelmoune <[email protected]>
It will be used to convey that the body was fully fetched and trailers are expected or were already encountered.
For now, we simply do nothing about them.
It cannot be reused for req headers, but it will apply to beresp and req trailers.
At this point trailer fields are just checked for correctness and discarded. Co-authored-by: Walid Boudebouda <[email protected]>
It is no longer the responsibility of the VFP to read the final (CR)?LF in chunked bodies, because trailers are not part of the request or response body. For backend responses, trailers are handled separately. The only reason why this does not break keep-alive for the client is that (CR)?LF sequences are legitimately ignored between requests, so an empty trailer list just turns into a no-op (CR)?LF for the next client request. This should also be the case for backend responses, but having extra unread empty lines breaks connection pooling instead. As part of recycling connections, we could probably clear remaining (CR)?LF sequences and close the connection with SC_JUNK if something remains.
This is not allowed for HEADERS frames containing trailers.
This is currently the only state where HEADERS frames are processed, but only a subset of the operations will be shared by trailers. Better diff with the --ignore-all-space option.
It will not necessarily be a new stream once trailers are involved.
It is now possible to receive a HEADERS frame on an OPEN stream. The HPACK block is processed but trailers are skipped. Therefore a workspace overflow is not a failure. Trailers are processed from the h2 session, and the request might be processed concurrently in a different worker, so we can't use the request workspace, even as a temporary buffer. Instead of relying on scratch space on the stack, we can use the h2 session's req workspace. This will make things easier once we collect trailers instead of merely skipping them.
Force-pushed to rebase against trunk after merging #4130.
This is the first step of the following plan:

- The first step, this pull request, should allow applications producing trailers to work with Varnish when the trailers are not strictly required.
- The second step should allow protocols like gRPC to pass through Varnish, behind an `experimental::pass_trailers` flag, with the caveat that body filters could break the meaning of trailers (for example checksums), hence the `experimental` flag. It will require internal changes, and from the look of it we may not be able to directly use the existing data structures; we may need a dedicated `OA_TRAILERS` object attribute.
- The last step will likely require changes to the VCL state machines in addition to new symbols.
Most of the work on trailers was done by @walid-git under my supervision.
This pull request includes work from other pull requests too:

- `BS_CACHED` into a request flag
- `BS_TRAILERS` body status for coordination
- `gettrls()` callback for directors (missing docs update)
- `beresp` trailers
- `req` trailers

This is a lot of commits, but they should all have a reasonable size for reviewers. I already reviewed Walid's work, and already addressed my own review items while Walid is away. I added Walid or myself as a co-author when I made significant changes (at the scale of individual commits).