📄 EchoingPixels: Aliasing-Resistant Joint Token Reduction for Audio-Visual LLMs

7.5/10 | 前25% | arxiv


← 返回 2026-05-23 语音/音乐/音频论文速递