Action Recognition - 2024-05
Action Recognition - 2024-05
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-05-31 | Action-OOD: An End-to-End Skeleton-Based Model for Robust Out-of-Distribution Human Action Detection | Jing Xu et.al. | 2405.20633 | translate | read | link |
| 2024-05-31 | Vision-Language Meets the Skeleton: Progressively Distillation with Cross-Modal Knowledge for 3D Action Representation Learning | Yang Chen et.al. | 2405.20606 | translate | read | null |
| 2024-05-30 | ENTIRe-ID: An Extensive and Diverse Dataset for Person Re-Identification | Serdar Yildiz et.al. | 2405.20465 | translate | read | null |
| 2024-05-30 | From Forest to Zoo: Great Ape Behavior Recognition with ChimpBehave | Michael Fuchs et.al. | 2405.20025 | translate | read | null |
| 2024-05-31 | Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition | Masashi Hatano et.al. | 2405.19917 | translate | read | null |
| 2024-05-30 | EgoSurgery-Phase: A Dataset of Surgical Phase Recognition from Egocentric Open Surgery Videos | Ryo Fujii et.al. | 2405.19644 | translate | read | link |
| 2024-05-30 | SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation | Junjie Zhang et.al. | 2405.19586 | translate | read | null |
| 2024-05-29 | Matrix Manifold Neural Networks++ | Xuan Son Nguyen et.al. | 2405.19206 | translate | read | null |
| 2024-05-29 | Exploring AI-based Anonymization of Industrial Image and Video Data in the Context of Feature Preservation | Sabrina Cynthia Triess et.al. | 2405.19173 | translate | read | null |
| 2024-05-28 | Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition | Muhammad Adi Nugroho et.al. | 2405.18012 | translate | read | null |
| 2024-05-30 | Benchmarking Skeleton-based Motion Encoder Models for Clinical Applications: Estimating Parkinson’s Disease Severity in Walking Sequences | Vida Adeli et.al. | 2405.17817 | translate | read | link |
| 2024-05-28 | Hierarchical Action Recognition: A Contrastive Video-Language Approach with Hierarchical Interactions | Rui Zhang et.al. | 2405.17729 | translate | read | null |
| 2024-05-28 | EgoNCE++: Do Egocentric Video-Language Models Really Understand Hand-Object Interactions? | Boshen Xu et.al. | 2405.17719 | translate | read | link |
| 2024-05-27 | Advancements in Tactile Hand Gesture Recognition for Enhanced Human-Machine Interaction | Chiara Fumelli et.al. | 2405.17038 | translate | read | null |
| 2024-05-27 | A Cross-Dataset Study for Text-based 3D Human Motion Retrieval | Léore Bensabath et.al. | 2405.16909 | translate | read | null |
| 2024-05-26 | Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion Perception | Shuangpeng Han et.al. | 2405.16493 | translate | read | null |
| 2024-05-25 | Application of Artificial Intelligence in Hand Gesture Recognition with Virtual Reality: Survey and Analysis of Hand Gesture Hardware Selection | Jindi Wang et.al. | 2405.16264 | translate | read | null |
| 2024-05-22 | From CNNs to Transformers in Multimodal Human Action Recognition: A Survey | Muhammad Bilal Shaikh et.al. | 2405.15813 | translate | read | null |
| 2024-05-24 | V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM | Abdur Rahman et.al. | 2405.15341 | translate | read | link |
| 2024-05-23 | Enhanced Spatiotemporal Prediction Using Physical-guided And Frequency-enhanced Recurrent Neural Networks | Xuanle Zhao et.al. | 2405.14504 | translate | read | null |
| 2024-05-23 | SpGesture: Source-Free Domain-adaptive sEMG-based Gesture Recognition with Jaccard Attentive Spiking Neural Network | Weiyu Guo et.al. | 2405.14398 | translate | read | null |
| 2024-05-23 | MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models | Jiuming Liu et.al. | 2405.14338 | translate | read | null |
| 2024-05-22 | Counterfactual Gradients-based Quantification of Prediction Trust in Neural Networks | Mohit Prabhushankar et.al. | 2405.13758 | translate | read | null |
| 2024-05-21 | Identity-free Artificial Emotional Intelligence via Micro-Gesture Understanding | Rong Gao et.al. | 2405.13206 | translate | read | null |
| 2024-05-22 | Building Temporal Kernels with Orthogonal Polynomials | Yan Ru Pei et.al. | 2405.12179 | translate | read | link |
| 2024-05-18 | GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition | Mallika Garg et.al. | 2405.11180 | translate | read | link |
| 2024-05-17 | Air Signing and Privacy-Preserving Signature Verification for Digital Documents | P. Sarveswarasarma et.al. | 2405.10868 | translate | read | null |
| 2024-05-17 | MC-GPT: Empowering Vision-and-Language Navigation with Memory Map and Reasoning Chains | Zhaohuan Zhan et.al. | 2405.10620 | translate | read | null |
| 2024-05-06 | MEET: Mixture of Experts Extra Tree-Based sEMG Hand Gesture Identification | Naveen Gehlot et.al. | 2405.09562 | translate | read | null |
| 2024-05-14 | Wearable Sensor-Based Few-Shot Continual Learning on Hand Gestures for Motor-Impaired Individuals via Latent Embedding Exploitation | Riyad Bin Rafiq et.al. | 2405.08969 | translate | read | link |
| 2024-05-14 | The impact of Compositionality in Zero-shot Multi-label action recognition for Object-based tasks | Carmela Calabrese et.al. | 2405.08695 | translate | read | null |
| 2024-05-15 | POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning | Chang Huang et.al. | 2405.08036 | translate | read | null |
| 2024-05-13 | Coarse or Fine? Recognising Action End States without Labels | Davide Moltisanti et.al. | 2405.07723 | translate | read | link |
| 2024-05-11 | PRENet: A Plane-Fit Redundancy Encoding Point Cloud Sequence Network for Real-Time 3D Action Recognition | Shenglin He et.al. | 2405.06929 | translate | read | null |
| 2024-05-10 | CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras | James Tang et.al. | 2405.06845 | translate | read | link |
| 2024-05-09 | A Survey on Backbones for Deep Video Action Recognition | Zixuan Tang et.al. | 2405.05584 | translate | read | null |
| 2024-05-06 | OmniActions: Predicting Digital Actions in Response to Real-World Multimodal Sensory Inputs with LLMs | Jiahao Nick Li et.al. | 2405.03901 | translate | read | null |
| 2024-05-05 | JOSENet: A Joint Stream Embedding Network for Violence Detection in Surveillance Videos | Pietro Nardelli et.al. | 2405.02961 | translate | read | null |
| 2024-05-03 | On the Utility of External Agent Intention Predictor for Human-AI Coordination | Chenxu Wang et.al. | 2405.02229 | translate | read | null |
| 2024-05-11 | MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition | Hongyu Qu et.al. | 2405.02077 | translate | read | null |
| 2024-05-03 | Enhancing Micro Gesture Recognition for Emotion Understanding via Context-aware Visual-Text Contrastive Learning | Deng Li et.al. | 2405.01885 | translate | read | link |
| 2024-05-02 | Multi-view Action Recognition via Directed Gromov-Wasserstein Discrepancy | Hoang-Quan Nguyen et.al. | 2405.01337 | translate | read | null |
| 2024-05-07 | Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration | Praveen Kumar Chandaliya et.al. | 2405.01273 | translate | read | null |
(<a href=../Action_Recognition.md>back to Action Recognition</a>)