Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DO NOT MERGE YET - HTTP Caching API #1018

Closed
wants to merge 1 commit into from
Closed

Conversation

harmony7
Copy link
Member

@harmony7 harmony7 commented Oct 22, 2024

This PR tracks the implementation of the HTTP caching API.

Implements #991

@harmony7
Copy link
Member Author

harmony7 commented Oct 22, 2024

Starting by committing proposed TypeScript types for the API surface. The design and behavior are very similar to what's in the Rust SDK, described in the developer documentation: Customizing cache interaction with the backend

@harmony7
Copy link
Member Author

harmony7 commented Oct 22, 2024

Examples

Inject headers before sending

Sometimes it is useful to perform modifications to the incoming Request before invoking the origin through the readthrough cache, for example to add an authorization header. If this is expensive to generate, then it makes sense to add it only if the request would make it to the backend. Specify onBeforeSend on CacheOverride to define a before-send callback function, an operation to be performed just before the readthrough cache would invoke the backend.

// Example: inject headers before sending.
const request = event.request;
const response = await fetch(request, {
  backend: 'example',
  cacheOverride: new CacheOverride('override', {
    onBeforeSend(req) {
      // Assume buildAuthorizationHeader() may be expensive, so
      // we only want to call this when the request would actually reach the backend.
      req.headers.set('Authorization', buildAuthorizationHeader());
    },
  }),
});

Customize caching based on content type

Sometimes it is useful to modify caching policy based on the backend response. Specify onAfterSend on CacheOverride to define an after-send callback function, an operation that runs only when the readthrough cache has received a response from the backend, before it is (potentially) stored into the cache.

The CandidateResponse object passed to the callback represents the response from the backend and contains interfaces to read and manipulate headers and cache policy. It intentionally does not allow reading or writing directory the response body (more on that later).

This example shows usages that utilize these members of CandidateResponse.

  • the ttl property, which is used to override the Time to Live (TTL) of the object in the cache
  • the setUncacheable() property, which is used to specify that this object is not to be stored in the cache
// Example: customizing caching based on content type.
const request = event.request;
const response = await fetch(request, {
  backend: 'example',
  cacheOverride: new CacheOverride('override', {
    onAfterSend(resp) {
      const contentType = resp.headers.get('Content-Type') ?? '';
      switch(true) {
        case contentType.startsWith('image/'):
          resp.ttl = 67;
          break;
        case contentType === 'text/html':
          resp.ttl = 321;
          break;
        case contentType === 'application/json':
          // setUncacheable() with no param (default false) marks this object as uncacheable
          // without disabling request collapsing
          resp.setUncacheable();
          break;
        default:
          resp.ttl = 2;
      }
    },
  }),
});

Creating a hit-for-pass object

By specifying true when calling the setUncacheable() method of CandidateResponse, you mark the request as hit-for-pass, which is a marker to disable request collapsing until a cacheable response is returned.

// Example: creating a hit-for-pass object.
const request = event.request;
const response = await fetch(request, {
  backend: 'example',
  cacheOverride: new CacheOverride('override', {
    onAfterSend(resp) {
      if (resp.headers.has('my-private-header')) {
        // setUncacheable() with true param marks this object as uncacheable
        // and marks it as hit-for-pass, resulting in disabling request collapsing
        resp.setUncacheable(true);
      }
    },
  }),
});

Manipulating the response body that is stored to the cache

In an after-send callback, optionally set the bodyTransform property of the CandidateResponse object to an instance of TransformStream to define a body-transform to be applied to the backend response body before it is stored into the cache.

Employing TransformStream allows working with streamed chunks of the backend body, rather than necessarily reading it entirely into memory (though the code example below does not take advantage of this, as it attempts to parse JSON, which requires the entire body to be present).

The transformation is specified by setting a property (resp.bodyTransform =) rather than directly working with the body during the after-send callback function. This is because not every response contains a fresh body. Specifically, 304 Not Modified responses, which are used to revalidate a stale cached response, are valuable precisely because they do not retransmit the body; in this case, the backend and (if specified) your after-send callback function update the headers and cache policy of the existing response object "in-place", without applying the body-transform or changing the cached response body.

This design enables the readthrough cache to internally manage the complexities of revalidation, allowing the developer to provide a single code path without needing to think about revalidation at all.

// Example: expanding a template before caching.
const request = event.request;
const response = await fetch(request, {
  backend: 'example',
  cacheOverride: new CacheOverride('override', {
    onAfterSend(resp) {
      resp.headers.set('Content-Type', 'text/html');
      resp.bodyTransform = new TransformStream({
        bytes: null,
        start() {
          this.bytes = new Uint8Array(0);
        },
        transform(chunk) {
          // The ideal transform would transform bytes as it goes over the wire,
          // but for a transformation whose input is JSON, we need to buffer the
          // bytes to memory, because we need the whole body before we can
          // deserialize it.
          const newBytes = new Uint8Array(this.bytes.length + chunk.length);
          newBytes.set(this.bytes, 0);
          newBytes.set(chunk, this.bytes.length);
          this.bytes = newBytes;
        },
        flush(controller) {
          const str = new TextDecoder().decode(this.bytes);
          // jsonToHtml applies a template to generate HTML from JSON
          const html = jsonToHtml(str);
          controller.enqueue(new TextEncoder().encode(html));
        },
      });
    },
  }),
});

// The resulting cached object will have a body that is HTML
// despite that the backend returned a JSON response:
return response;

Notes

  • The HTTP readthrough cache interface performs a number of transformations related to range collapsing, client revalidation, and backend revalidation automatically. To gain an understanding of why this is useful, and to make best use of the interface, it is worth reading about these automatic request transformations.

  • The new onBeforeSend() and onAfterSend() callback functions either return nothing or return a Promise that resolves to nothing. This should be doable since fetch() is a function that returns a Promise. Additionally, if these callback functions throw an exception or the Promise returned from them rejects, then the exception, or the rejected value of the Promise, is used to reject the Promise returned from fetch(), and nothing is stored to the cache.

    • In a request-collapsing case, one request out of the pooled requests will be selected to perform the before-send callback functionbackend fetchafter-send callback function sequence, while the other pooled requests will wait. If the sequence fails at any point, then any response received from the backend is discarded, the Promise returned by fetch() in just that instance will reject with the error value, and another request out of the pooled requests will be selected to attempt the before-send callback functionbackend fetchafter-send callback function sequence again.
  • When using the new before-send and after-send callback functions, as well as the body-transform, there are some additional considerations that help you understand that behavior.

  • The Fetch API defines a cache property on RequestInit that specifies "cache modes". However, these cache modes call for behavior that is not possible under the existing set of host calls, so at this time let's not touch that property, and maybe revisit them if there is real demand or use-cases for them.

    cache value check cache? fresh stale uncached store? (*1) notes
    "default" yes return cached revalidate (*2) fetch yes this is the exiting behavior in Compute
    "no-store" no no readthrough cache has no way to skip checking the cache
    "reload" no yes readthrough cache has no way to skip checking the cache
    "no-cache" yes revalidate revalidate fetch yes readthrough cache has no way to force a revalidation for fresh objects
    "force-cache" yes return cached return cached fetch yes readthrough cache has no way to get the existing stale value
    "only-if-cache" yes return cached return cached error no readthrough cache has no way to get the existing stale value

    *1 - if allowed by object's cache policy specified by backend or set in after-send callback
    *2 - if the object is in the stale-while-revalidate window, the stale object is returned immediately, and "revalidate and store" are done async

@harmony7 harmony7 force-pushed the http-caching-api branch 2 times, most recently from 676f982 to 7e4fd83 Compare October 23, 2024 03:40
@guybedford guybedford force-pushed the main branch 4 times, most recently from 6aff377 to 206a60e Compare November 7, 2024 20:35
@guybedford guybedford mentioned this pull request Nov 22, 2024
@guybedford
Copy link
Member

guybedford commented Nov 23, 2024

I've started to put together a variation of this in #1051. Using Response mutations instead of having a separate CandidateResponse, and also using a CacheOptions return.

Here are the same examples updated to this API.

Depending on how the implementation goes I will update this.

// Example: inject headers before sending.
const request = event.request;
const response = await fetch(request, {
  backend: 'example',
  cacheOverride: new CacheOverride('override', {
    beforeSend(req) {
      // Assume buildAuthorizationHeader() may be expensive, so
      // we only want to call this when the request would actually reach the backend.
      req.headers.set('Authorization', buildAuthorizationHeader());
    },
  }),
});
// Example: customizing caching based on content type.
const request = event.request;
const response = await fetch(request, {
  backend: 'example',
  cacheOverride: new CacheOverride('override', {
    beforeCache(resp) {
      const contentType = resp.headers.get('Content-Type') ?? '';
      switch(true) {
        case contentType.startsWith('image/'):
          resp.ttl = 67;
          break;
        case contentType === 'text/html':
          resp.ttl = 321;
          break;
        case contentType === 'application/json':
          // setUncacheable becomes returning { cache: false }
          return { cache: false };
        default:
          resp.ttl = 2;
      }
    },
  }),
});
// Example: creating a hit-for-pass object.
const request = event.request;
const response = await fetch(request, {
  backend: 'example',
  cacheOverride: new CacheOverride('override', {
    beforeCache(resp) {
      if (resp.headers.has('my-private-header')) {
        // setUncacheable() becomes 'uncacheable'
        return { cache: 'uncacheable' };
      }
    },
  }),
});
// Example: expanding a template before caching.
const request = event.request;
const response = await fetch(request, {
  backend: 'example',
  cacheOverride: new CacheOverride('override', {
    beforeCache(resp) {
      resp.headers.set('Content-Type', 'text/html');
      const bodyTransform = new TransformStream({
        bytes: null,
        start() {
          this.bytes = new Uint8Array(0);
        },
        transform(chunk) {
          // The ideal transform would transform bytes as it goes over the wire,
          // but for a transformation whose input is JSON, we need to buffer the
          // bytes to memory, because we need the whole body before we can
          // deserialize it.
          const newBytes = new Uint8Array(this.bytes.length + chunk.length);
          newBytes.set(this.bytes, 0);
          newBytes.set(chunk, this.bytes.length);
          this.bytes = newBytes;
        },
        flush(controller) {
          const str = new TextDecoder().decode(this.bytes);
          // jsonToHtml applies a template to generate HTML from JSON
          const html = jsonToHtml(str);
          controller.enqueue(new TextEncoder().encode(html));
        },
      });
      return { bodyTransform };
    },
  }),
});

// The resulting cached object will have a body that is HTML
// despite that the backend returned a JSON response:
return response;

@harmony7
Copy link
Member Author

harmony7 commented Nov 23, 2024

I am liking the shape of this.

I'd like to point out one thing, one of the reasons my previous suggested API surface has a CandidateResponse type was to provide an interface that specifically does not provide a way to access certain parts of the Response (mainly the Body) during after-send. For example, if you consume Body here then the caller of fetch() will be in trouble. Also, revalidation responses will have an empty Body during after-send, to be filled in by the cached body before returning from fetch(). This is why we set a body-transform, this is only run on a full response and then that transform is applied before storing in the cache.

If you use a normal Response here, will you be able to enforce this?

@guybedford
Copy link
Member

Thanks for bringing that up. We could add a state bit to the response that treats it as a locked response somehow I'm sure, and to give an error on access to the body or something similar.

@guybedford
Copy link
Member

Closing as the final design is now fully tracking in #1051. Any final comments with regards to the API surface area very much welcome.

@guybedford guybedford closed this Jan 18, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants