
ChatCompletionCache should support component config #5141

Open · jackgerrits opened this issue on Jan 22, 2025 · 2 comments

jackgerrits (Member):
The new cache should support declarative configuration
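A minimal sketch of what that could look like, assuming the dump_component()/load_component() pattern other autogen 0.4 components follow; these calls on ChatCompletionCache are the feature being requested here, not API that exists at the time of this issue:

    # Sketch only: assumes ChatCompletionCache gains the component-config
    # methods used elsewhere in autogen 0.4 (dump_component/load_component).
    from autogen_ext.models.cache import ChatCompletionCache
    from autogen_ext.models.openai import OpenAIChatCompletionClient

    # Store argument omitted, per the default-store idea discussed below.
    cached_client = ChatCompletionCache(OpenAIChatCompletionClient(model="gpt-4o"))

    config = cached_client.dump_component()               # -> ComponentModel
    restored = ChatCompletionCache.load_component(config) # rehydrate elsewhere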

ekzhu added this to the 0.4.4 milestone on Jan 23, 2025
victordibia (Collaborator):
Small side comment, @jackgerrits: would it be possible to have a simpler interface for enabling caching in the chat completion client?

cached_model_client = OpenAIChatCompletionClient(model="gpt-4o", cache=True)

This way ChatCompletionClient caching can be reflected in the config for the client.

jackgerrits (Member, Author) commented on Jan 23, 2025:

That would mean each model client has to implement caching itself. The point of the current design is separation of concerns and generalizability.

> This way ChatCompletionClient caching can be reflected in the config for the client.

This is already the case: the model client's config is nested inside the cache client's config.
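A sketch of what that nesting could look like, using the provider/config shape of autogen's ComponentModel; the concrete field names for the cache are illustrative, not the shipped schema:

    # Hypothetical nested component config for a cached model client.
    cache_component_config = {
        "provider": "autogen_ext.models.cache.ChatCompletionCache",
        "config": {
            # the cache store component's own config would nest here
            "cache_store": {"provider": "...", "config": {}},
            # ...as does the wrapped model client's component config
            "client": {
                "provider": "autogen_ext.models.openai.OpenAIChatCompletionClient",
                "config": {"model": "gpt-4o"},
            },
        },
    }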

If we allow a default cache store, then the current usage can be as simple as:

cached_model_client = ChatCompletionCache(OpenAIChatCompletionClient(model="gpt-4o"))
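For contrast, a sketch of the explicit-store form, assuming the diskcache-backed store shipped in autogen_ext (requires `pip install diskcache`; exact module paths may differ across 0.4.x releases):

    # Explicit cache store; with a default store, the second argument
    # could be dropped, giving the one-liner above.
    from diskcache import Cache
    from autogen_ext.cache_store.diskcache import DiskCacheStore
    from autogen_ext.models.cache import CHAT_CACHE_VALUE_TYPE, ChatCompletionCache
    from autogen_ext.models.openai import OpenAIChatCompletionClient

    openai_client = OpenAIChatCompletionClient(model="gpt-4o")
    store = DiskCacheStore[CHAT_CACHE_VALUE_TYPE](Cache("/tmp/chat_cache"))
    cached_client = ChatCompletionCache(openai_client, store)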
