📄 T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation ✅ 6.5/10 | 前50% | arxiv ← 返回 2026-05-23 语音/音乐/音频论文速递