The paper mentions that the authors modified certain aspects of VITS, but the repository does not include any training scripts to replicate that procedure.
Hey @blaisewf! 👋 I'm here to help you with any bugs, questions, or contributions while you wait for a human maintainer. Let's tackle this together!
I wasn't able to find specific information in the repository about the training scripts or procedures used for the base models, including the modifications to the VITS model for emotion, language, and speaker ID embeddings. However, the codebase references a paper that details the training procedure or the modifications made to the VITS model. You can access the paper at https://arxiv.org/abs/2312.01479 [1].