Action Recognition - 2026-03

Publish Date Title Authors PDF Translate Read Code
2026-03-31 SkeletonContext: Skeleton-side Context Prompt Learning for Zero-Shot Skeleton-based Action Recognition Ning Wang et.al. 2603.29692 translate read null
2026-03-25 B-MoE: A Body-Part-Aware Mixture-of-Experts “All Parts Matter” Approach to Micro-Action Recognition Nishit Poddar et.al. 2603.24245 translate read null
2026-03-24 VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs Haoran Yuan et.al. 2603.23481 translate read null
2026-03-23 mmFHE: mmWave Sensing with End-to-End Fully Homomorphic Encryption Tanvir Ahmed et.al. 2603.22437 translate read null
2026-03-23 WiRD-Gest: Gesture Recognition In The Real World Using Range-Doppler Wi-Fi Sensing on COTS Hardware Jessica Sanson et.al. 2603.22131 translate read null
2026-03-22 Privacy-Preserving Federated Action Recognition via Differentially Private Selective Tuning and Efficient Communication Idris Zakariyya et.al. 2603.21305 translate read null
2026-03-22 A Two-stage Transformer Framework for Temporal Localization of Distracted Driver Behaviors Gia-Bao Doan et.al. 2603.21048 translate read null
2026-03-20 Learning Dynamic Belief Graphs for Theory-of-mind Reasoning Ruxiao Chen et.al. 2603.20170 translate read null
2026-03-20 Subspace Kernel Learning on Tensor Sequences Lei Wang et.al. 2603.19546 translate read null
2026-03-19 SurfaceXR: Fusing Smartwatch IMUs and Egocentric Hand Pose for Seamless Surface Interactions Vasco Xu et.al. 2603.19529 translate read null
2026-03-18 S3T-Former: A Purely Spike-Driven State-Space Topology Transformer for Skeleton Action Recognition Naichuan Zheng et.al. 2603.18062 translate read null
2026-03-18 Unified Spatio-Temporal Token Scoring for Efficient Video VLMs Jianrui Zhang et.al. 2603.18004 translate read null
2026-03-18 GigaWorld-Policy: An Efficient Action-Centered World–Action Model Angen Ye et.al. 2603.17240 translate read null
2026-03-16 KGS-GCN: Enhancing Sparse Skeleton Sensing via Kinematics-Driven Gaussian Splatting and Probabilistic Topology for Action Recognition Yuhan Chen et.al. 2603.16943 translate read null
2026-03-17 FG-SGL: Fine-Grained Semantic Guidance Learning via Motion Process Decomposition for Micro-Gesture Recognition Jinsheng Wei et.al. 2603.16269 translate read null
2026-03-17 S-VAM: Shortcut Video-Action Model by Self-Distilling Geometric and Semantic Foresight Haodong Yan et.al. 2603.16195 translate read null
2026-03-16 Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models Yulin Luo et.al. 2603.15618 translate read null
2026-03-15 Wi-Spike: A Low-power WiFi Human Multi-action Recognition Model with Spiking Neural Networks Nengbo Zhang et.al. 2603.14475 translate read null
2026-03-15 eNavi: Event-based Imitation Policies for Low-Light Indoor Mobile Robot Navigation Prithvi Jai Ramesh et.al. 2603.14397 translate read null
2026-03-12 UniMotion: Self-Supervised Learning for Cross-Domain IMU Motion Recognition Prerna Khanna et.al. 2603.12218 translate read null
2026-03-11 DiT4DiT: Jointly Modeling Video Dynamics and Actions for Generalizable Robot Control Teli Ma et.al. 2603.10448 translate read null
2026-03-09 Decision-Aware Uncertainty Evaluation of Vision-Language Model-Based Early Action Anticipation for Human-Robot Interaction Zhaoda Du et.al. 2603.10061 translate read null
2026-03-10 M3GCLR: Multi-View Mini-Max Infinite Skeleton-Data Game Contrastive Learning For Skeleton-Based Action Recognition Yanshan Li et.al. 2603.09367 translate read null
2026-03-09 mmGAT: Pose Estimation by Graph Attention with Mutual Features from mmWave Radar Point Cloud Abdullah Al Masud et.al. 2603.08551 translate read null
2026-03-09 Human-AI Divergence in Ego-centric Action Recognition under Spatial and Spatiotemporal Manipulations Sadegh Rahmaniboldaji et.al. 2603.08317 translate read null
2026-03-09 Novel Semantic Prompting for Zero-Shot Action Recognition Salman Iqbal et.al. 2603.08289 translate read null
2026-03-09 DualTurn: Learning Turn-Taking from Dual-Channel Generative Speech Pretraining Shangeth Rajaa et.al. 2603.08216 translate read null
2026-03-08 Active Inference for Micro-Gesture Recognition: EFE-Guided Temporal Sampling and Adaptive Learning Weijia Feng et.al. 2603.07559 translate read null
2026-03-08 ICLR: In-Context Imitation Learning with Visual Reasoning Toan Nguyen et.al. 2603.07530 translate read null
2026-03-07 Dreamer-CDP: Improving Reconstruction-free World Models Via Continuous Deterministic Representation Prediction Michael Hauri et.al. 2603.07083 translate read null
2026-03-06 Skeleton-to-Image Encoding: Enabling Skeleton Representation Learning via Vision-Pretrained Models Siyuan Yang et.al. 2603.05963 translate read null
2026-03-06 Learning Next Action Predictors from Human-Computer Interaction Omar Shaikh et.al. 2603.05923 translate read null
2026-03-04 A Baseline Study and Benchmark for Few-Shot Open-Set Action Recognition with Feature Residual Discrimination Stefano Berti et.al. 2603.04125 translate read null
2026-03-03 Chain of World: World Model Thinking in Latent Motion Fuxiang Yang et.al. 2603.03195 translate read null
2026-03-02 Neural Implicit Action Fields: From Discrete Waypoints to Continuous Functions for Vision-Language-Action Models Haoyun Liu et.al. 2603.01766 translate read null
2026-03-02 ATA: Bridging Implicit Reasoning with Attention-Guided and Action-Guided Inference for Vision-Language Action Models Cheng Yang et.al. 2603.01490 translate read null

(<a href=../Action_Recognition.md>back to Action Recognition</a>)