Replies: 1 comment 11 replies
-
Thanks for sharing your results. Really interesting. One question. What did you use to measure speaker similarity? |
Beta Was this translation helpful? Give feedback.
11 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi According to my recent experiments because (figure 0), I have something experience about select the dataset on ZS-TTS.
figure 0
First, I think the speaker have wav files count and the speaker's TTS similarity it not have a direct relationship(see figure 1)
Second, base the first, so I try to add Mozilla dataset in my training data, Then I got good result in unseendata. (See figure 2 data is my classmate. figure3 is the official YourTTS and use data is VoxCeleb1)
Finally, I found that different accents need to be trained separately, I have tried training in Chinese (China and TW)
Thanks for reading and hope to get a reply 😀
Beta Was this translation helpful? Give feedback.
All reactions