📄 Optimality of FSQ tokens for continuous diffusion for categorical data with application to text-to-speech

7.0/10 | 前50% | arxiv


← 返回 2026-05-23 语音/音乐/音频论文速递