📄 AVI-Bench: Toward Human-like Audio-Visual Intelligence of Omni-MLLMs ✅ 7.5/10 | 前25% | arxiv ← 返回 2026-05-23 语音/音乐/音频论文速递