Open Voice Cloning Leaderboard

The Open Voice Cloning Leaderboard evaluates and ranks voice cloning models across diverse datasets, including emotional speech.
It also provides an in-depth analysis of how different acoustic features affect the results.

Each score is the cosine similarity between the speaker embeddings of the original and cloned samples. The embeddings are extracted with the WavLM model.
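
The similarity score can be illustrated with a minimal sketch. It assumes the two speaker embeddings have already been extracted and are available as plain lists of floats. The function name `cosine_similarity` and the 4-dimensional vectors are hypothetical, chosen for illustration only; they are not the leaderboard's own code or data, and real speaker embeddings have many more dimensions.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine similarity of two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


# Hypothetical embeddings (illustration only, not real WavLM output).
original_embedding = [0.12, -0.45, 0.83, 0.10]
cloned_embedding = [0.10, -0.40, 0.90, 0.05]

score = cosine_similarity(original_embedding, cloned_embedding)
print(f"similarity: {score:.4f}")
```

The value lies in [-1, 1]. A score closer to 1 means the cloned sample's embedding points in nearly the same direction as the original's, which is read as a closer match in speaker identity.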

1    0.8356    0.8881    0.7618    0.8539    0.8135    0.8167