For better transcription in more languages, implement Massively Multilingual Speech (MMS), Meta's open-source model with less than half of Whisper's error rate
#4
Open
menelic opened this issue
May 26, 2023
· 2 comments
Because of the error-rate visualization and, above all, the speaker detection, your Whisper UI is better for research use than all the others I have tried. Please consider implementing Meta's MMS, which supports speech recognition and generation for over 1,000 languages at a drastically reduced error rate compared to Whisper:
https://github.com/facebookresearch/fairseq/tree/main/examples/mms
https://ai.facebook.com/blog/multilingual-model-speech-recognition/