"For voice conversion (VC), we used two subjective metrics, naturalness mean opinion score (nMOS) and voice similarity MOS (sMOS), both reported with 95% confidence intervals (CI), and three objective metrics for naturalness: UTMOS, character error rate (CER), and word error rate (WER)."
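The excerpt names CER, WER, and MOS with a 95% CI but does not say how they were computed. The following is a minimal sketch under common conventions, not the authors' pipeline: error rates are taken as Levenshtein distance divided by reference length, and the confidence interval uses a normal approximation (z = 1.96). The function names, the choice to count whitespace as a character in CER, and the absence of text normalization are all assumptions made for illustration.

```python
import math
import statistics


def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or token lists)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance over reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate. Assumption: spaces count as characters."""
    if not reference:
        raise ValueError("reference must be non-empty")
    return edit_distance(reference, hypothesis) / len(reference)


def mos_with_ci(ratings, z=1.96):
    """Mean opinion score and the half-width of its CI.

    Assumption: normal approximation with z = 1.96 for a 95% interval;
    needs at least two ratings for the sample standard deviation.
    """
    mean = statistics.mean(ratings)
    half_width = z * statistics.stdev(ratings) / math.sqrt(len(ratings))
    return mean, half_width
```

In practice, the hypothesis passed to `wer` and `cer` would be an ASR transcript of the converted speech, and `ratings` would be listener scores on the usual 1 to 5 scale. A MOS is then reported as `mean ± half_width`.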
"We used seven objective metrics for the reconstruction and resynthesis tasks, including a log-scale Mel error distance and perceptual evaluation of speech quality (PESQ)."