OpenAI: Support audio modality in chat completion #1560

ThomasVitale · 2024-10-18T21:49:32Z

OpenAI has finally launched support for audio modality in its chat completion API, both as input and as output.

We should start by supporting the input audio modality, in line with the existing APIs in Spring AI for multimodality.
The output audio modality can be supported at a lower level (OpenAiApi), but it will need some design discussions to surface it through the Spring AI abstractions.

More information: https://platform.openai.com/docs/guides/audio
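
For context, here is a minimal sketch of the request payload that the input audio modality implies at the HTTP level, using only the JDK. The `input_audio` content part with base64 `data` and a `format` field follows the OpenAI audio guide linked above. The class `AudioInputRequestSketch`, its method names, and the model name `gpt-4o-audio-preview` used in `main` are assumptions for illustration; this is not the Spring AI or `OpenAiApi` implementation.

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;

// Hypothetical helper, not part of Spring AI: builds the JSON body for a
// chat completion request that carries one audio clip as user input.
class AudioInputRequestSketch {

    // Escapes the characters that would break a JSON string literal.
    static String escapeJson(String text) {
        return text.replace("\\", "\\\\").replace("\"", "\\\"").replace("\n", "\\n");
    }

    // audioBytes: raw audio file content; format: for example "wav" or "mp3".
    static String buildRequestBody(String model, String prompt, byte[] audioBytes, String format) {
        // The API expects the audio as a base64 string, not raw bytes.
        String encodedAudio = Base64.getEncoder().encodeToString(audioBytes);
        return "{"
            + "\"model\":\"" + escapeJson(model) + "\","
            + "\"messages\":[{\"role\":\"user\",\"content\":["
            + "{\"type\":\"text\",\"text\":\"" + escapeJson(prompt) + "\"},"
            + "{\"type\":\"input_audio\",\"input_audio\":{"
            + "\"data\":\"" + encodedAudio + "\",\"format\":\"" + escapeJson(format) + "\"}}"
            + "]}]}";
    }

    public static void main(String[] args) {
        // Placeholder bytes; a real caller would read a WAV or MP3 file here.
        byte[] fakeAudio = "not real audio".getBytes(StandardCharsets.UTF_8);
        // Model name is an assumption taken from the audio guide at the time of writing.
        System.out.println(buildRequestBody("gpt-4o-audio-preview", "What is in this recording?", fakeAudio, "wav"));
    }
}
```

In Spring AI the same information would presumably travel as a media attachment on the user message, with the model implementation translating it into this content part; that mapping is what the proposal above is about.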

* Extend OpenAiApi to support the latest version of the Chat Completion API, including input and output audio modality.
* Support input audio modality in OpenAiChatModel via the existing multimodality support in Spring AI.

Fixes spring-projectsgh-1560

ThomasVitale · 2024-10-18T21:51:43Z

Here's a PR: #1561

ThomasVitale linked a pull request Oct 18, 2024 that will close this issue

OpenAI - Support audio input modality #1561

Open

This was referenced Oct 18, 2024

Review the Media APIs for multimodality #1562

Open

Support multimodality in chat completion output #1563

Open

How to expose vendor specific usage information #1407

Open
