Action Recognition - 2024-05

| Publish Date | Title | Authors | PDF | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2024-05-31 | Action-OOD: An End-to-End Skeleton-Based Model for Robust Out-of-Distribution Human Action Detection | Jing Xu et al. | 2405.20633 | translate | read | link |
| 2024-05-31 | Vision-Language Meets the Skeleton: Progressively Distillation with Cross-Modal Knowledge for 3D Action Representation Learning | Yang Chen et al. | 2405.20606 | translate | read | null |
| 2024-05-30 | ENTIRe-ID: An Extensive and Diverse Dataset for Person Re-Identification | Serdar Yildiz et al. | 2405.20465 | translate | read | null |
| 2024-05-30 | From Forest to Zoo: Great Ape Behavior Recognition with ChimpBehave | Michael Fuchs et al. | 2405.20025 | translate | read | null |
| 2024-05-31 | Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition | Masashi Hatano et al. | 2405.19917 | translate | read | null |
| 2024-05-30 | EgoSurgery-Phase: A Dataset of Surgical Phase Recognition from Egocentric Open Surgery Videos | Ryo Fujii et al. | 2405.19644 | translate | read | link |
| 2024-05-30 | SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation | Junjie Zhang et al. | 2405.19586 | translate | read | null |
| 2024-05-29 | Matrix Manifold Neural Networks++ | Xuan Son Nguyen et al. | 2405.19206 | translate | read | null |
| 2024-05-29 | Exploring AI-based Anonymization of Industrial Image and Video Data in the Context of Feature Preservation | Sabrina Cynthia Triess et al. | 2405.19173 | translate | read | null |
| 2024-05-28 | Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition | Muhammad Adi Nugroho et al. | 2405.18012 | translate | read | null |
| 2024-05-30 | Benchmarking Skeleton-based Motion Encoder Models for Clinical Applications: Estimating Parkinson’s Disease Severity in Walking Sequences | Vida Adeli et al. | 2405.17817 | translate | read | link |
| 2024-05-28 | Hierarchical Action Recognition: A Contrastive Video-Language Approach with Hierarchical Interactions | Rui Zhang et al. | 2405.17729 | translate | read | null |
| 2024-05-28 | EgoNCE++: Do Egocentric Video-Language Models Really Understand Hand-Object Interactions? | Boshen Xu et al. | 2405.17719 | translate | read | link |
| 2024-05-27 | Advancements in Tactile Hand Gesture Recognition for Enhanced Human-Machine Interaction | Chiara Fumelli et al. | 2405.17038 | translate | read | null |
| 2024-05-27 | A Cross-Dataset Study for Text-based 3D Human Motion Retrieval | Léore Bensabath et al. | 2405.16909 | translate | read | null |
| 2024-05-26 | Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion Perception | Shuangpeng Han et al. | 2405.16493 | translate | read | null |
| 2024-05-25 | Application of Artificial Intelligence in Hand Gesture Recognition with Virtual Reality: Survey and Analysis of Hand Gesture Hardware Selection | Jindi Wang et al. | 2405.16264 | translate | read | null |
| 2024-05-22 | From CNNs to Transformers in Multimodal Human Action Recognition: A Survey | Muhammad Bilal Shaikh et al. | 2405.15813 | translate | read | null |
| 2024-05-24 | V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM | Abdur Rahman et al. | 2405.15341 | translate | read | link |
| 2024-05-23 | Enhanced Spatiotemporal Prediction Using Physical-guided And Frequency-enhanced Recurrent Neural Networks | Xuanle Zhao et al. | 2405.14504 | translate | read | null |
| 2024-05-23 | SpGesture: Source-Free Domain-adaptive sEMG-based Gesture Recognition with Jaccard Attentive Spiking Neural Network | Weiyu Guo et al. | 2405.14398 | translate | read | null |
| 2024-05-23 | MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models | Jiuming Liu et al. | 2405.14338 | translate | read | null |
| 2024-05-22 | Counterfactual Gradients-based Quantification of Prediction Trust in Neural Networks | Mohit Prabhushankar et al. | 2405.13758 | translate | read | null |
| 2024-05-21 | Identity-free Artificial Emotional Intelligence via Micro-Gesture Understanding | Rong Gao et al. | 2405.13206 | translate | read | null |
| 2024-05-22 | Building Temporal Kernels with Orthogonal Polynomials | Yan Ru Pei et al. | 2405.12179 | translate | read | link |
| 2024-05-18 | GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition | Mallika Garg et al. | 2405.11180 | translate | read | link |
| 2024-05-17 | Air Signing and Privacy-Preserving Signature Verification for Digital Documents | P. Sarveswarasarma et al. | 2405.10868 | translate | read | null |
| 2024-05-17 | MC-GPT: Empowering Vision-and-Language Navigation with Memory Map and Reasoning Chains | Zhaohuan Zhan et al. | 2405.10620 | translate | read | null |
| 2024-05-06 | MEET: Mixture of Experts Extra Tree-Based sEMG Hand Gesture Identification | Naveen Gehlot et al. | 2405.09562 | translate | read | null |
| 2024-05-14 | Wearable Sensor-Based Few-Shot Continual Learning on Hand Gestures for Motor-Impaired Individuals via Latent Embedding Exploitation | Riyad Bin Rafiq et al. | 2405.08969 | translate | read | link |
| 2024-05-14 | The impact of Compositionality in Zero-shot Multi-label action recognition for Object-based tasks | Carmela Calabrese et al. | 2405.08695 | translate | read | null |
| 2024-05-15 | POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning | Chang Huang et al. | 2405.08036 | translate | read | null |
| 2024-05-13 | Coarse or Fine? Recognising Action End States without Labels | Davide Moltisanti et al. | 2405.07723 | translate | read | link |
| 2024-05-11 | PRENet: A Plane-Fit Redundancy Encoding Point Cloud Sequence Network for Real-Time 3D Action Recognition | Shenglin He et al. | 2405.06929 | translate | read | null |
| 2024-05-10 | CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras | James Tang et al. | 2405.06845 | translate | read | link |
| 2024-05-09 | A Survey on Backbones for Deep Video Action Recognition | Zixuan Tang et al. | 2405.05584 | translate | read | null |
| 2024-05-06 | OmniActions: Predicting Digital Actions in Response to Real-World Multimodal Sensory Inputs with LLMs | Jiahao Nick Li et al. | 2405.03901 | translate | read | null |
| 2024-05-05 | JOSENet: A Joint Stream Embedding Network for Violence Detection in Surveillance Videos | Pietro Nardelli et al. | 2405.02961 | translate | read | null |
| 2024-05-03 | On the Utility of External Agent Intention Predictor for Human-AI Coordination | Chenxu Wang et al. | 2405.02229 | translate | read | null |
| 2024-05-11 | MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition | Hongyu Qu et al. | 2405.02077 | translate | read | null |
| 2024-05-03 | Enhancing Micro Gesture Recognition for Emotion Understanding via Context-aware Visual-Text Contrastive Learning | Deng Li et al. | 2405.01885 | translate | read | link |
| 2024-05-02 | Multi-view Action Recognition via Directed Gromov-Wasserstein Discrepancy | Hoang-Quan Nguyen et al. | 2405.01337 | translate | read | null |
| 2024-05-07 | Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration | Praveen Kumar Chandaliya et al. | 2405.01273 | translate | read | null |
