Added no_mp3_support argument and added a check for ffmpeg installation #517
Hi @ramalamadingdong, thank you for contributing a fix for #446. If we are going to distribute a sample mp3 with the repo, I'd prefer the reference audios from the SV2TTS audio samples page, since they can also be used for trying out the toolbox and for benchmarking. We can get all 6 "speaker adaptation for unseen speakers" reference audios for 100k total: samples.zip
Looks pretty good @ramalamadingdong, please consider the suggestions that I have made.
Co-authored-by: blue-fish <[email protected]>
Did the updates and uploaded the correct samples :D
In the future please make sure to test changes (the two that I found by testing escaped me during review).
- Please add a file called `samples/README.md` with the following text. It is needed to comply with the requirements of the CC-BY-4.0 license.
The audio files in this folder are provided for toolbox testing and
benchmarking purposes. These are the same reference utterances
used by the SV2TTS authors to generate the audio samples located at:
https://google.github.io/tacotron/publications/speaker_adaptation/index.html
The `p240_00000.mp3` and `p260_00000.mp3` files are compressed
versions of audios from the VCTK corpus available at:
https://datashare.is.ed.ac.uk/handle/10283/3443
VCTK.txt contains the copyright notices and licensing information.
The `1320_00000.mp3`, `3575_00000.mp3`, `6829_00000.mp3`
and `8230_00000.mp3` files are compressed versions of audios
from the LibriSpeech dataset available at: https://openslr.org/12
For these files, the following notice applies:
```
LibriSpeech (c) 2014 by Vassil Panayotov
LibriSpeech ASR corpus is licensed under a
Creative Commons Attribution 4.0 International License.
See <http://creativecommons.org/licenses/by/4.0/>.
```
- Please download README.txt from https://datashare.is.ed.ac.uk/handle/10283/3443 and import it into the repo as `samples/VCTK.txt`.
This is a fix for this issue. Please let me know if anything else needs to be done.
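For readers following the thread, here is a rough sketch of what a flag plus an ffmpeg check could look like. It is not the code from this PR: the function names, the error message, and the use of `shutil.which` to detect ffmpeg are all assumptions made for illustration. Only the `no_mp3_support` argument name comes from the PR title.

```python
import argparse
import shutil


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical parser; only the flag name is taken from the PR title.
    parser = argparse.ArgumentParser(description="mp3 support check (illustrative)")
    parser.add_argument(
        "--no_mp3_support",
        action="store_true",
        help="Skip the ffmpeg check and disallow mp3 files as input.",
    )
    return parser


def ffmpeg_available(which=shutil.which) -> bool:
    # Assumption: ffmpeg counts as installed if an `ffmpeg` executable is on PATH.
    # `which` is injectable so the check can be exercised without ffmpeg present.
    return which("ffmpeg") is not None


def check_mp3_support(no_mp3_support: bool, which=shutil.which) -> bool:
    """Return True if mp3 input may be used, False if the user opted out.

    Raises RuntimeError when mp3 support is wanted but ffmpeg is not found.
    """
    if no_mp3_support:
        return False
    if not ffmpeg_available(which):
        raise RuntimeError(
            "ffmpeg was not found on PATH. Install ffmpeg, or pass "
            "--no_mp3_support to continue without mp3 input."
        )
    return True
```

A caller would parse the arguments with `build_parser().parse_args()` and pass `args.no_mp3_support` to `check_mp3_support` before loading any audio. Checking PATH only shows that an executable exists, not that it can decode mp3, so actually loading one of the bundled sample files would be a stricter test.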