Audio Processing - 2026-04
Audio Processing - 2026-04
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2026-04-01 | Diff-VS: Efficient Audio-Aware Diffusion U-Net for Vocals Separation | Yun-Ning et.al. | 2604.01120 | translate | read | null |
| 2026-04-01 | VisG AV-HuBERT: Viseme-Guided AV-HuBERT | Aristeidis Papadopoulos et.al. | 2604.00982 | translate | read | null |
| 2026-04-01 | Speech LLMs are Contextual Reasoning Transcribers | Keqi Deng et.al. | 2604.00610 | translate | read | null |
| 2026-04-01 | Adapting Text LLMs to Speech via Multimodal Depth Up-Scaling | Kazuki Yano et.al. | 2604.00489 | translate | read | null |
(<a href=../Audio_Processing.md>back to Audio Processing</a>)