Audio Processing - 2026-04

| Publish Date | Title | Authors | PDF | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2026-04-01 | Diff-VS: Efficient Audio-Aware Diffusion U-Net for Vocals Separation | Yun-Ning et al. | 2604.01120 | translate | read | null |
| 2026-04-01 | VisG AV-HuBERT: Viseme-Guided AV-HuBERT | Aristeidis Papadopoulos et al. | 2604.00982 | translate | read | null |
| 2026-04-01 | Speech LLMs are Contextual Reasoning Transcribers | Keqi Deng et al. | 2604.00610 | translate | read | null |
| 2026-04-01 | Adapting Text LLMs to Speech via Multimodal Depth Up-Scaling | Kazuki Yano et al. | 2604.00489 | translate | read | null |

(<a href=../Audio_Processing.md>back to Audio Processing</a>)