Action Recognition - 2026-03 | Paper Arxiv Daily

Action Recognition - 2026-03

Publish Date	Title	Authors	PDF	Translate	Read	Code
2026-03-31	SkeletonContext: Skeleton-side Context Prompt Learning for Zero-Shot Skeleton-based Action Recognition	Ning Wang et.al.	2603.29692	translate	read	null
2026-03-25	B-MoE: A Body-Part-Aware Mixture-of-Experts “All Parts Matter” Approach to Micro-Action Recognition	Nishit Poddar et.al.	2603.24245	translate	read	null
2026-03-24	VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs	Haoran Yuan et.al.	2603.23481	translate	read	null
2026-03-23	mmFHE: mmWave Sensing with End-to-End Fully Homomorphic Encryption	Tanvir Ahmed et.al.	2603.22437	translate	read	null
2026-03-23	WiRD-Gest: Gesture Recognition In The Real World Using Range-Doppler Wi-Fi Sensing on COTS Hardware	Jessica Sanson et.al.	2603.22131	translate	read	null
2026-03-22	Privacy-Preserving Federated Action Recognition via Differentially Private Selective Tuning and Efficient Communication	Idris Zakariyya et.al.	2603.21305	translate	read	null
2026-03-22	A Two-stage Transformer Framework for Temporal Localization of Distracted Driver Behaviors	Gia-Bao Doan et.al.	2603.21048	translate	read	null
2026-03-20	Learning Dynamic Belief Graphs for Theory-of-mind Reasoning	Ruxiao Chen et.al.	2603.20170	translate	read	null
2026-03-20	Subspace Kernel Learning on Tensor Sequences	Lei Wang et.al.	2603.19546	translate	read	null
2026-03-19	SurfaceXR: Fusing Smartwatch IMUs and Egocentric Hand Pose for Seamless Surface Interactions	Vasco Xu et.al.	2603.19529	translate	read	null
2026-03-18	S3T-Former: A Purely Spike-Driven State-Space Topology Transformer for Skeleton Action Recognition	Naichuan Zheng et.al.	2603.18062	translate	read	null
2026-03-18	Unified Spatio-Temporal Token Scoring for Efficient Video VLMs	Jianrui Zhang et.al.	2603.18004	translate	read	null
2026-03-18	GigaWorld-Policy: An Efficient Action-Centered World–Action Model	Angen Ye et.al.	2603.17240	translate	read	null
2026-03-16	KGS-GCN: Enhancing Sparse Skeleton Sensing via Kinematics-Driven Gaussian Splatting and Probabilistic Topology for Action Recognition	Yuhan Chen et.al.	2603.16943	translate	read	null
2026-03-17	FG-SGL: Fine-Grained Semantic Guidance Learning via Motion Process Decomposition for Micro-Gesture Recognition	Jinsheng Wei et.al.	2603.16269	translate	read	null
2026-03-17	S-VAM: Shortcut Video-Action Model by Self-Distilling Geometric and Semantic Foresight	Haodong Yan et.al.	2603.16195	translate	read	null
2026-03-16	Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models	Yulin Luo et.al.	2603.15618	translate	read	null
2026-03-15	Wi-Spike: A Low-power WiFi Human Multi-action Recognition Model with Spiking Neural Networks	Nengbo Zhang et.al.	2603.14475	translate	read	null
2026-03-15	eNavi: Event-based Imitation Policies for Low-Light Indoor Mobile Robot Navigation	Prithvi Jai Ramesh et.al.	2603.14397	translate	read	null
2026-03-12	UniMotion: Self-Supervised Learning for Cross-Domain IMU Motion Recognition	Prerna Khanna et.al.	2603.12218	translate	read	null
2026-03-11	DiT4DiT: Jointly Modeling Video Dynamics and Actions for Generalizable Robot Control	Teli Ma et.al.	2603.10448	translate	read	null
2026-03-09	Decision-Aware Uncertainty Evaluation of Vision-Language Model-Based Early Action Anticipation for Human-Robot Interaction	Zhaoda Du et.al.	2603.10061	translate	read	null
2026-03-10	M3GCLR: Multi-View Mini-Max Infinite Skeleton-Data Game Contrastive Learning For Skeleton-Based Action Recognition	Yanshan Li et.al.	2603.09367	translate	read	null
2026-03-09	mmGAT: Pose Estimation by Graph Attention with Mutual Features from mmWave Radar Point Cloud	Abdullah Al Masud et.al.	2603.08551	translate	read	null
2026-03-09	Human-AI Divergence in Ego-centric Action Recognition under Spatial and Spatiotemporal Manipulations	Sadegh Rahmaniboldaji et.al.	2603.08317	translate	read	null
2026-03-09	Novel Semantic Prompting for Zero-Shot Action Recognition	Salman Iqbal et.al.	2603.08289	translate	read	null
2026-03-09	DualTurn: Learning Turn-Taking from Dual-Channel Generative Speech Pretraining	Shangeth Rajaa et.al.	2603.08216	translate	read	null
2026-03-08	Active Inference for Micro-Gesture Recognition: EFE-Guided Temporal Sampling and Adaptive Learning	Weijia Feng et.al.	2603.07559	translate	read	null
2026-03-08	ICLR: In-Context Imitation Learning with Visual Reasoning	Toan Nguyen et.al.	2603.07530	translate	read	null
2026-03-07	Dreamer-CDP: Improving Reconstruction-free World Models Via Continuous Deterministic Representation Prediction	Michael Hauri et.al.	2603.07083	translate	read	null
2026-03-06	Skeleton-to-Image Encoding: Enabling Skeleton Representation Learning via Vision-Pretrained Models	Siyuan Yang et.al.	2603.05963	translate	read	null
2026-03-06	Learning Next Action Predictors from Human-Computer Interaction	Omar Shaikh et.al.	2603.05923	translate	read	null
2026-03-04	A Baseline Study and Benchmark for Few-Shot Open-Set Action Recognition with Feature Residual Discrimination	Stefano Berti et.al.	2603.04125	translate	read	null
2026-03-03	Chain of World: World Model Thinking in Latent Motion	Fuxiang Yang et.al.	2603.03195	translate	read	null
2026-03-02	Neural Implicit Action Fields: From Discrete Waypoints to Continuous Functions for Vision-Language-Action Models	Haoyun Liu et.al.	2603.01766	translate	read	null
2026-03-02	ATA: Bridging Implicit Reasoning with Attention-Guided and Action-Guided Inference for Vision-Language Action Models	Cheng Yang et.al.	2603.01490	translate	read	null

(<a href=../Action_Recognition.md>back to Action Recognition</a>)