Added no_mp3_support argument and added a check for ffmpg installation #517

Merged on Sep 3, 2020 (15 commits)

Conversation

ramalamadingdong
Contributor

This is a fix for this issue. Please let me know if anything else needs to be done.
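For readers skimming the thread, the shape of the change can be sketched roughly as follows. This is only an illustration, not the code merged in this PR: the `--no_mp3_support` flag name comes from the PR title, while the helper names (`build_parser`, `ffmpeg_available`, `check_mp3_support`) are hypothetical, and looking up `ffmpeg` on `PATH` with `shutil.which` is a stand-in for whatever check the PR actually performs.

```python
import argparse
import shutil


def build_parser():
    # Hypothetical parser; only the flag name is taken from the PR title.
    parser = argparse.ArgumentParser(description="mp3 support check (sketch)")
    parser.add_argument(
        "--no_mp3_support", action="store_true",
        help="Skip the ffmpeg check and run without mp3 support.")
    return parser


def ffmpeg_available(which=shutil.which):
    # Assumption: ffmpeg counts as installed if an executable is on PATH.
    # `which` is injectable so the check can be exercised without ffmpeg.
    return which("ffmpeg") is not None


def check_mp3_support(args, which=shutil.which):
    """Return an error message if mp3 support is wanted but ffmpeg is missing."""
    if args.no_mp3_support or ffmpeg_available(which):
        return None
    return ("ffmpeg was not found on PATH. Install ffmpeg or pass "
            "--no_mp3_support to proceed without mp3 support.")
```

A caller would parse the arguments once at startup, call `check_mp3_support(args)`, and exit with the returned message if it is not `None`.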

ghost commented Sep 2, 2020

Hi @ramalamadingdong, thank you for contributing a fix for #446.

If we are going to distribute a sample mp3 with the repo, I'd prefer the reference audios from the SV2TTS audio samples page since they can also be used for trying out the toolbox and benchmarking. We can get all 6 "speaker adaptation for unseen speakers" reference audios for 100k total: samples.zip

ghost left a comment

Looks pretty good @ramalamadingdong, please consider the suggestions that I have made.

Review comments on demo_cli.py (5, outdated and resolved)
Review comments on toolbox/__init__.py (5, outdated and resolved)

ramalamadingdong
Contributor Author

Did the updates and uploaded the correct samples :D

ghost left a comment

In the future, please make sure to test your changes (the two issues that I found by testing escaped me during review).

  1. Please add a file called samples/README.md with the following text. It is needed to comply with the requirements of the CC-BY-4.0 license.
The audio files in this folder are provided for toolbox testing and
benchmarking purposes. These are the same reference utterances
used by the SV2TTS authors to generate the audio samples located at:
https://google.github.io/tacotron/publications/speaker_adaptation/index.html

The `p240_00000.mp3` and `p260_00000.mp3` files are compressed
versions of audios from the VCTK corpus available at:
https://datashare.is.ed.ac.uk/handle/10283/3443
VCTK.txt contains the copyright notices and licensing information.

The `1320_00000.mp3`, `3575_00000.mp3`, `6829_00000.mp3`
and `8230_00000.mp3` files are compressed versions of audios
from the LibriSpeech dataset available at: https://openslr.org/12
For these files, the following notice applies:
```
LibriSpeech (c) 2014 by Vassil Panayotov

LibriSpeech ASR corpus is licensed under a
Creative Commons Attribution 4.0 International License.

See <http://creativecommons.org/licenses/by/4.0/>.
```
  2. Please download README.txt from https://datashare.is.ed.ac.uk/handle/10283/3443 and import that to the repo as samples/VCTK.txt
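
One way to confirm that both requests above were carried out is a small check over the samples folder. The sketch below is not part of the PR. The file names are the ones listed in the README text and in the two requests, and `missing_sample_files` is a hypothetical helper that assumes all files sit directly inside the folder.

```python
from pathlib import Path

# File names taken from the review comment: the two licensing files
# plus the six reference audios mentioned in the README text.
EXPECTED_FILES = (
    "README.md", "VCTK.txt",
    "p240_00000.mp3", "p260_00000.mp3",
    "1320_00000.mp3", "3575_00000.mp3", "6829_00000.mp3", "8230_00000.mp3",
)


def missing_sample_files(samples_dir):
    """Return the expected file names that are absent from samples_dir."""
    root = Path(samples_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]
```

Calling `missing_sample_files("samples")` from the repository root returns an empty list once every file is in place.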

Review comment on demo_cli.py (outdated and resolved)
Review comment on toolbox/__init__.py (outdated and resolved)
ramalamadingdong requested a review from a user on September 3, 2020 17:06
ghost requested a review from CorentinJ on September 3, 2020 17:15
This pull request was closed.
2 participants