Audio Processing - 2025-08
Audio Processing - 2025-08
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2025-08-26 | A Framework for Robust Speaker Verification in Highly Noisy Environments Leveraging Both Noisy and Enhanced Audio | Adam Katav et.al. | 2508.18913 | translate | read | null |
| 2025-08-20 | Improving Resource-Efficient Speech Enhancement via Neural Differentiable DSP Vocoder Refinement | Heitor R. Guimarães et.al. | 2508.14709 | translate | read | null |
| 2025-08-18 | Integrating Feedback Loss from Bi-modal Sarcasm Detector for Sarcastic Speech Synthesis | Zhu Li et.al. | 2508.13028 | translate | read | null |
| 2025-08-15 | EmoSSLSphere: Multilingual Emotional Speech Synthesis with Spherical Vectors and Discrete Speech Tokens | Joonyong Park et.al. | 2508.11273 | translate | read | null |
| 2025-08-12 | Multi-Target Backdoor Attacks Against Speaker Recognition | Alexandrine Fortier et.al. | 2508.08559 | translate | read | null |
(<a href=../Audio_Processing.md>back to Audio Processing</a>)