Language Model Augmented Semi-Supervised Statistical Inference

📄 Language Model Augmented Semi-Supervised Statistical Inference 🔥 8.2/10 | 前25% | arxiv ← 返回 2026-05-23 语音/音乐/音频论文速递

2026-05-23 · 更新于 2026-06-19 · 1 min · 17 words

Learning Tight Rejection Boundaries without Negatives for Strict One-Class Audio Deepfake Detection

📄 Learning Tight Rejection Boundaries without Negatives for Strict One-Class Audio Deepfake Detection ✅ 7.0/10 | 前50% | arxiv ← 返回 2026-05-23 语音/音乐/音频论文速递

2026-05-23 · 更新于 2026-06-19 · 1 min · 23 words

LightAVSeg: Lightweight Audio-Visual Segmentation

📄 LightAVSeg: Lightweight Audio-Visual Segmentation ✅ 7.5/10 | 前25% | arxiv ← 返回 2026-05-23 语音/音乐/音频论文速递

2026-05-23 · 更新于 2026-06-19 · 1 min · 15 words

Listening Through the Noise: Cauchy-Driven Diffusion Bridges for Robust Gastrointestinal Auscultation and Clinical Benchmarking

📄 Listening Through the Noise: Cauchy-Driven Diffusion Bridges for Robust Gastrointestinal Auscultation and Clinical Benchmarking ✅ 7.5/10 | 前25% | arxiv ← 返回 2026-05-23 语音/音乐/音频论文速递

2026-05-23 · 更新于 2026-06-19 · 1 min · 25 words

Long Grounded Thoughts: Synthesizing Grounded Visual Problems and Distilling Reasoning Chains at Scale

📄 Long Grounded Thoughts: Synthesizing Grounded Visual Problems and Distilling Reasoning Chains at Scale ✅ 7.5/10 | 前25% | arxiv ← 返回 2026-05-23 语音/音乐/音频论文速递

2026-05-23 · 更新于 2026-06-19 · 1 min · 24 words

LynX: Token Interface Alignment for Video+X LLMs

📄 LynX: Token Interface Alignment for Video+X LLMs #** #Video #LLMs #Token #Interface #Alignment #多模态整合 #流形对齐 #单模态数据 ✅ 7.5/10 | 前25% | #** | #Video | #LLMs #Token | arxiv ← 返回 2026-05-23 语音/音乐/音频论文速递

2026-05-23 · 更新于 2026-06-19 · 1 min · 34 words

MECAT: A Multi-Experts Constructed Benchmark for Fine-Grained Audio Understanding Tasks

📄 MECAT: A Multi-Experts Constructed Benchmark for Fine-Grained Audio Understanding Tasks ✅ 7.2/10 | 前50% | arxiv ← 返回 2026-05-23 语音/音乐/音频论文速递

2026-05-23 · 更新于 2026-06-19 · 1 min · 21 words

MedMosaic: A Challenging Large Scale Benchmark of Diverse Medical Audio

📄 MedMosaic: A Challenging Large Scale Benchmark of Diverse Medical Audio ✅ 7.5/10 | 前25% | arxiv ← 返回 2026-05-23 语音/音乐/音频论文速递

2026-05-23 · 更新于 2026-06-19 · 1 min · 21 words

MetaBio: Learning from metadata for bioacoustics foundation models

📄 MetaBio: Learning from metadata for bioacoustics foundation models ✅ 6.5/10 | 前50% | arxiv ← 返回 2026-05-23 语音/音乐/音频论文速递

2026-05-23 · 更新于 2026-06-19 · 1 min · 19 words

MFCL Audio: An Audio Function Calling Evaluation for Large Language Models

📄 MFCL Audio: An Audio Function Calling Evaluation for Large Language Models 📝 3.0/10 | 后50% | arxiv ← 返回 2026-05-23 语音/音乐/音频论文速递

2026-05-23 · 更新于 2026-06-19 · 1 min · 22 words