You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Dear Pyannote-team, we are using pyannote speaker segmentation 3.1.1 for distinguishing speakers for further analysis in a dialogue of 2-3 people. However, if people have similar pitch (e.g. two men with deep voices), pyannote oftentimes misclassifies the speakers. For the human ear, distinguishing the two speakers is easy, so there must be only slight differences in pitch. Thus, I would like to ask whether you have tip for preferences or preprocessing for optimising the classification. Thanks a lot!
Minimal reproduction example (MRE)
Can be provided if necessary
The text was updated successfully, but these errors were encountered:
ChristianNSchmitz
changed the title
Speakers with similar voices are difficult to distinguish
Speakers with similar pitch are difficult to distinguish
May 17, 2024
Tested versions
3.1
System information
Ubuntu 24.04, pyannote 3.1.1
Issue description
Dear Pyannote-team, we are using pyannote speaker segmentation 3.1.1 for distinguishing speakers for further analysis in a dialogue of 2-3 people. However, if people have similar pitch (e.g. two men with deep voices), pyannote oftentimes misclassifies the speakers. For the human ear, distinguishing the two speakers is easy, so there must be only slight differences in pitch. Thus, I would like to ask whether you have tip for preferences or preprocessing for optimising the classification. Thanks a lot!
Minimal reproduction example (MRE)
Can be provided if necessary
The text was updated successfully, but these errors were encountered: