📄 LALM-as-a-Judge: Benchmarking Large Audio-Language Models for Safety Evaluation in Multi-Turn Spoken Dialogues

📝 3.5/10 | Bottom 50% | arXiv

