Multimodal - 2024-07

Publish Date Title Authors PDF Translate Read Code
2024-07-31 Tracing Intricate Cues in Dialogue: Joint Graph Structure and Sentiment Dynamics for Multimodal Emotion Recognition Jiang Li et.al. 2407.21536 translate read null
2024-07-31 DEF-oriCORN: efficient 3D scene understanding for robust language-directed manipulation without demonstrations Dongwon Son et.al. 2407.21267 translate read null
2024-07-30 HyperMM : Robust Multimodal Learning with Varying-sized Inputs Hava Chaptoukaev et.al. 2407.20768 translate read null
2024-07-29 ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2 Wenjun Huang et.al. 2407.19832 translate read null
2024-07-28 Detached and Interactive Multimodal Learning Yunfeng Fan et.al. 2407.19514 translate read link
2024-07-26 Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment Yuze Zheng et.al. 2407.18854 translate read null
2024-07-26 Multimodal Emotion Recognition using Audio-Video Transformer Fusion with Cross Attention Joe Dhanith P R et.al. 2407.18552 translate read null
2024-07-25 $\mathbb{X}$ -Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs Vlad Sobal et.al. 2407.18134 translate read null
2024-07-25 Cross-Vendor Reproducibility of Radiomics-based Machine Learning Models for Computer-aided Diagnosis Jatin Chaudhary et.al. 2407.18060 translate read null
2024-07-23 Masked Graph Learning with Recurrent Alignment for Multimodal Emotion Recognition in Conversation Tao Meng et.al. 2407.16714 translate read null
2024-07-24 MicroEmo: Time-Sensitive Multimodal Emotion Recognition with Micro-Expression Dynamics in Video Dialogues Liyun Zhang et.al. 2407.16552 translate read null
2024-07-23 Chameleon: Images Are What You Need For Multimodal Learning Robust To Missing Modalities Muhammad Irzam Liaqat et.al. 2407.16243 translate read null
2024-07-22 Resource-Efficient Federated Multimodal Learning via Layer-wise and Progressive Training Ye Lin Tun et.al. 2407.15426 translate read null
2024-07-17 Text- and Feature-based Models for Compound Multimodal Emotion Recognition in the Wild Nicolas Richet et.al. 2407.12927 translate read link
2024-07-17 Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models Donggeun Kim et.al. 2407.12616 translate read null
2024-07-12 Diagnosing and Re-learning for Balanced Multimodal Learning Yake Wei et.al. 2407.09705 translate read link
2024-07-12 Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework Haoqin Sun et.al. 2407.09029 translate read null
2024-07-10 AffectGPT: Dataset and Framework for Explainable Multimodal Emotion Recognition Zheng Lian et.al. 2407.07653 translate read link
2024-07-06 Completed Feature Disentanglement Learning for Multimodal MRIs Analysis Tianling Liu et.al. 2407.04916 translate read null
2024-07-05 Multimodal Classification via Modal-Aware Interactive Enhancement Qing-Yuan Jiang et.al. 2407.04587 translate read null
2024-07-05 Robust Multimodal Learning via Representation Decoupling Shicai Wei et.al. 2407.04458 translate read null
2024-07-05 Smart Vision-Language Reasoners Denisa Roberts et.al. 2407.04212 translate read link
2024-07-04 ADAPT: Multimodal Learning for Detecting Physiological Changes under Missing Modalities Julie Mordacq et.al. 2407.03836 translate read link
2024-07-02 Multi-Peptide: Multimodality Leveraged Language-Graph Learning of Peptide Properties Srivathsan Badrinarayanan et.al. 2407.03380 translate read link
2024-07-05 Multi-Task Domain Adaptation for Language Grounding with 3D Objects Penglei Sun et.al. 2407.02846 translate read null
2024-07-01 Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation Sirui Xia et.al. 2407.01796 translate read null
2024-07-01 Multimodal Learning With Intraoperative CBCT & Variably Aligned Preoperative CT Data To Improve Segmentation Maximilian E. Tschuchnig et.al. 2406.11650 translate read null

(<a href=../Multimodal.md>back to Multimodal</a>)