Multimodal - 2024-09
Multimodal - 2024-09
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-09-30 | Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning | Weitai Kang et.al. | 2410.00255 | translate | read | link |
| 2024-09-30 | Towards Robust Multimodal Sentiment Analysis with Incomplete Data | Haoyu Zhang et.al. | 2409.20012 | translate | read | link |
| 2024-09-26 | Infer Human’s Intentions Before Following Natural Language Instructions | Yanming Wan et.al. | 2409.18073 | translate | read | link |
| 2024-09-26 | A Multimodal Single-Branch Embedding Network for Recommendation in Cold-Start and Missing Modality Scenarios | Christian Ganhör et.al. | 2409.17864 | translate | read | null |
| 2024-09-26 | Harnessing Shared Relations via Multimodal Mixup Contrastive Learning for Multimodal Classification | Raja Kumar et.al. | 2409.17777 | translate | read | null |
| 2024-09-25 | Language Grounded Multi-agent Communication for Ad-hoc Teamwork | Huao Li et.al. | 2409.17348 | translate | read | null |
| 2024-09-24 | CLSP: High-Fidelity Contrastive Language-State Pre-training for Agent State Representation | Fuxian Huang et.al. | 2409.15806 | translate | read | null |
| 2024-09-18 | All-in-one foundational models learning across quantum chemical levels | Yuxinxin Chen et.al. | 2409.12015 | translate | read | link |
| 2024-09-13 | Hierarchical Hypercomplex Network for Multimodal Emotion Recognition | Eleonora Lopez et.al. | 2409.09194 | translate | read | link |
| 2024-09-13 | Interactive Masked Image Modeling for Multimodal Object Detection in Remote Sensing | Minh-Duc Vu et.al. | 2409.08885 | translate | read | null |
| 2024-09-13 | A Multimodal Approach for Fluid Overload Prediction: Integrating Lung Ultrasound and Clinical Data | Tianqi Yang et.al. | 2409.08790 | translate | read | null |
| 2024-09-13 | A Comprehensive Survey on Deep Multimodal Learning with Missing Modality | Renjie Wu et.al. | 2409.07825 | translate | read | null |
| 2024-09-11 | What to align in multimodal contrastive learning? | Benoit Dufumier et.al. | 2409.07402 | translate | read | null |
| 2024-09-11 | Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective | Guimin Hu et.al. | 2409.07388 | translate | read | link |
| 2024-09-11 | Multimodal Emotion Recognition with Vision-language Prompting and Modality Dropout | Anbin QI et.al. | 2409.07078 | translate | read | null |
| 2024-09-11 | A Survey of Multimodal Composite Editing and Retrieval | Suyan Li et.al. | 2409.05405 | translate | read | link |
| 2024-09-09 | Diagnostic Reasoning in Natural Language: Computational Model and Application | Nils Dycke et.al. | 2409.05367 | translate | read | null |
| 2024-09-10 | Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment | Zhixian Zhao et.al. | 2409.05015 | translate | read | null |
| 2024-09-03 | Foundation Models for Music: A Survey | Yinghao Ma et.al. | 2408.14340 | translate | read | link |
| 2024-09-06 | Quantum Multimodal Contrastive Learning Framework | Chi-Sheng Chen et.al. | 2408.13919 | translate | read | null |
(<a href=../Multimodal.md>back to Multimodal</a>)