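The results are cosine similarities between the speaker embeddings of the original and cloned samples, where the embeddings are extracted with the WavLM model. Values closer to 1 indicate that the cloned voice is closer to the original speaker. The values can be filtered by dataset or emotional state.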
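
As a rough illustration of the metric described above, the sketch below computes the cosine similarity between two embedding vectors and filters a list of results. It assumes the embeddings have already been extracted (the WavLM extraction step is not shown), and the record layout with `dataset`, `emotion`, and `similarity` keys, as well as the function names, are hypothetical choices made for this example rather than the actual format used here.

```python
import math
from typing import Dict, List, Optional, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def filter_results(
    results: List[Dict[str, object]],
    dataset: Optional[str] = None,
    emotion: Optional[str] = None,
) -> List[Dict[str, object]]:
    """Keep records matching the given dataset and/or emotion.

    The keys "dataset" and "emotion" are assumed for illustration only.
    A filter left as None is not applied.
    """
    return [
        r
        for r in results
        if (dataset is None or r["dataset"] == dataset)
        and (emotion is None or r["emotion"] == emotion)
    ]
```

A usage sketch with made-up placeholder values: `cosine_similarity(original_embedding, cloned_embedding)` yields one score per sample pair, and `filter_results(records, emotion="angry")` would keep only the records labelled with that emotional state.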