Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unified Driver for platform. #336

Open
willwade opened this issue Oct 17, 2024 · 6 comments
Open

Unified Driver for platform. #336

willwade opened this issue Oct 17, 2024 · 6 comments
Labels

Comments

@willwade
Copy link
Collaborator

willwade commented Oct 17, 2024

Here's a suggestion. There is a need to have a AVSpeechSynth driver for MacOS as Nss is depreciated but also it would allow this tts wrapper to support ssml across the board.

And on windows we also need uwp and sapi4 driver. To make things easier I'm suggesting the default would be a unified driver for each platform. Linux would be still just espeak (unless we can nicely integrate pico / piper or Sherpa onnx*)

This is thrown together code but you can see what I'm thinking https://gist.github.com/willwade/86ef5c0655badc90c2f3fec811c06cde

  • I'm not keen on any of these as they all a) are dependency heavy b) rely on a download step. And piper is the only one with ssml on its roadmap so it would then be feature matched across engine. Let's have a think.
@willwade
Copy link
Collaborator Author

#121 #287 #104

willwade added a commit to willwade/pyttsx3 that referenced this issue Oct 26, 2024
nateshmbhat#347 nateshmbhat#336

I think this is passing all the tests. Its far simpler than nsss and we can do thngs like ssml nateshmbhat#121 nateshmbhat#287
This was referenced Oct 26, 2024
@cclauss
Copy link
Contributor

cclauss commented Nov 2, 2024

@willwade
Copy link
Collaborator Author

willwade commented Nov 2, 2024

SSML: https://cloud.google.com/text-to-speech/docs/ssml

Sapi. AVSynyh & espeak all support ssml. It's on my radar to do a pr for supporting this as soon as espeak and avsynth done

@willwade
Copy link
Collaborator Author

willwade commented Nov 6, 2024

So I’m actually wondering if windows should have a second driver using the more modern .net api.

Here’s a quick thrown together attempt https://gist.github.com/willwade/cc268ef0b0b0ec106e644eec6835b228

it Would remove problems like #263

@willwade
Copy link
Collaborator Author

willwade commented Nov 6, 2024

That sound very promising!

Please name the file system_speech.py instead of SystemSpeech.py to align with Python naming.

Yeah. I’m not sure I like the name of that engine. It’s technically the .NET interface. Dotnetspeech synth” I don’t know 🤷

@cclauss
Copy link
Contributor

cclauss commented Nov 6, 2024

DotNetSpeech, DotNetSpeechEngine, DotNetSpeechDriver

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants