OpenAI: Support audio modality in chat completion #1560

Open
ThomasVitale opened this issue Oct 18, 2024 · 1 comment · May be fixed by #1561

@ThomasVitale
Contributor

OpenAI has finally launched support for audio modality in its chat completion API, both as input and as output.

We should start by supporting the input audio modality, in line with the existing APIs in Spring AI for multimodality.
The output audio modality can be supported at a lower level (OpenAiApi), but it will need some design discussions to surface it through the Spring AI abstractions.

More information: https://platform.openai.com/docs/guides/audio
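
For illustration, here is a minimal sketch of the request shape that the input audio modality implies, based on the OpenAI audio guide linked above: a user message whose content has a text part and an `input_audio` part carrying Base64-encoded audio. This is not Spring AI code. The class name `AudioChatRequestSketch`, the helper `buildRequest`, and the placeholder bytes are hypothetical, and the model name `gpt-4o-audio-preview` is an assumption about which model accepts audio.

```java
import java.util.Base64;

class AudioChatRequestSketch {

    // Minimal JSON string escaping (backslash and double quote only);
    // a real client would use a JSON library instead.
    static String escape(String s) {
        return s.replace("\\", "\\\\").replace("\"", "\\\"");
    }

    // Builds a chat completion request body whose single user message
    // has two content parts: the text prompt and the Base64-encoded audio.
    static String buildRequest(String model, String prompt, byte[] audio, String format) {
        String data = Base64.getEncoder().encodeToString(audio);
        return "{"
            + "\"model\":\"" + escape(model) + "\","
            + "\"messages\":[{"
            + "\"role\":\"user\","
            + "\"content\":["
            + "{\"type\":\"text\",\"text\":\"" + escape(prompt) + "\"},"
            + "{\"type\":\"input_audio\",\"input_audio\":{"
            + "\"data\":\"" + data + "\",\"format\":\"" + escape(format) + "\"}}"
            + "]}]}";
    }

    public static void main(String[] args) {
        // Placeholder bytes standing in for a real WAV recording.
        byte[] placeholderAudio = {1, 2, 3};
        System.out.println(buildRequest(
            "gpt-4o-audio-preview", "What is said in this recording?", placeholderAudio, "wav"));
    }
}
```

In Spring AI, the audio part would presumably be supplied through the existing multimodality abstractions rather than built by hand. Per the same guide, the output audio modality is requested with additional top-level `modalities` and `audio` fields, which is the part that still needs design discussion.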

ThomasVitale added a commit to ThomasVitale/spring-ai that referenced this issue Oct 18, 2024
* Extend OpenAiApi to support the latest version of the Chat Completion API, including input and output audio modality.
* Support input audio modality in OpenAiChatModel via the existing multimodality support in Spring AI.

Fixes gh-1560
ThomasVitale linked a pull request Oct 18, 2024 that will close this issue
@ThomasVitale
Contributor Author

Here's a PR: #1561
