Action Recognition - 2026-03
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2026-03-31 | SkeletonContext: Skeleton-side Context Prompt Learning for Zero-Shot Skeleton-based Action Recognition | Ning Wang et al. | 2603.29692 | translate | read | null |
| 2026-03-25 | B-MoE: A Body-Part-Aware Mixture-of-Experts “All Parts Matter” Approach to Micro-Action Recognition | Nishit Poddar et al. | 2603.24245 | translate | read | null |
| 2026-03-24 | VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs | Haoran Yuan et al. | 2603.23481 | translate | read | null |
| 2026-03-23 | mmFHE: mmWave Sensing with End-to-End Fully Homomorphic Encryption | Tanvir Ahmed et al. | 2603.22437 | translate | read | null |
| 2026-03-23 | WiRD-Gest: Gesture Recognition In The Real World Using Range-Doppler Wi-Fi Sensing on COTS Hardware | Jessica Sanson et al. | 2603.22131 | translate | read | null |
| 2026-03-22 | Privacy-Preserving Federated Action Recognition via Differentially Private Selective Tuning and Efficient Communication | Idris Zakariyya et al. | 2603.21305 | translate | read | null |
| 2026-03-22 | A Two-stage Transformer Framework for Temporal Localization of Distracted Driver Behaviors | Gia-Bao Doan et al. | 2603.21048 | translate | read | null |
| 2026-03-20 | Learning Dynamic Belief Graphs for Theory-of-mind Reasoning | Ruxiao Chen et al. | 2603.20170 | translate | read | null |
| 2026-03-20 | Subspace Kernel Learning on Tensor Sequences | Lei Wang et al. | 2603.19546 | translate | read | null |
| 2026-03-19 | SurfaceXR: Fusing Smartwatch IMUs and Egocentric Hand Pose for Seamless Surface Interactions | Vasco Xu et al. | 2603.19529 | translate | read | null |
| 2026-03-18 | S3T-Former: A Purely Spike-Driven State-Space Topology Transformer for Skeleton Action Recognition | Naichuan Zheng et al. | 2603.18062 | translate | read | null |
| 2026-03-18 | Unified Spatio-Temporal Token Scoring for Efficient Video VLMs | Jianrui Zhang et al. | 2603.18004 | translate | read | null |
| 2026-03-18 | GigaWorld-Policy: An Efficient Action-Centered World–Action Model | Angen Ye et al. | 2603.17240 | translate | read | null |
| 2026-03-16 | KGS-GCN: Enhancing Sparse Skeleton Sensing via Kinematics-Driven Gaussian Splatting and Probabilistic Topology for Action Recognition | Yuhan Chen et al. | 2603.16943 | translate | read | null |
| 2026-03-17 | FG-SGL: Fine-Grained Semantic Guidance Learning via Motion Process Decomposition for Micro-Gesture Recognition | Jinsheng Wei et al. | 2603.16269 | translate | read | null |
| 2026-03-17 | S-VAM: Shortcut Video-Action Model by Self-Distilling Geometric and Semantic Foresight | Haodong Yan et al. | 2603.16195 | translate | read | null |
| 2026-03-16 | Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models | Yulin Luo et al. | 2603.15618 | translate | read | null |
| 2026-03-15 | Wi-Spike: A Low-power WiFi Human Multi-action Recognition Model with Spiking Neural Networks | Nengbo Zhang et al. | 2603.14475 | translate | read | null |
| 2026-03-15 | eNavi: Event-based Imitation Policies for Low-Light Indoor Mobile Robot Navigation | Prithvi Jai Ramesh et al. | 2603.14397 | translate | read | null |
| 2026-03-12 | UniMotion: Self-Supervised Learning for Cross-Domain IMU Motion Recognition | Prerna Khanna et al. | 2603.12218 | translate | read | null |
| 2026-03-11 | DiT4DiT: Jointly Modeling Video Dynamics and Actions for Generalizable Robot Control | Teli Ma et al. | 2603.10448 | translate | read | null |
| 2026-03-09 | Decision-Aware Uncertainty Evaluation of Vision-Language Model-Based Early Action Anticipation for Human-Robot Interaction | Zhaoda Du et al. | 2603.10061 | translate | read | null |
| 2026-03-10 | M3GCLR: Multi-View Mini-Max Infinite Skeleton-Data Game Contrastive Learning For Skeleton-Based Action Recognition | Yanshan Li et al. | 2603.09367 | translate | read | null |
| 2026-03-09 | mmGAT: Pose Estimation by Graph Attention with Mutual Features from mmWave Radar Point Cloud | Abdullah Al Masud et al. | 2603.08551 | translate | read | null |
| 2026-03-09 | Human-AI Divergence in Ego-centric Action Recognition under Spatial and Spatiotemporal Manipulations | Sadegh Rahmaniboldaji et al. | 2603.08317 | translate | read | null |
| 2026-03-09 | Novel Semantic Prompting for Zero-Shot Action Recognition | Salman Iqbal et al. | 2603.08289 | translate | read | null |
| 2026-03-09 | DualTurn: Learning Turn-Taking from Dual-Channel Generative Speech Pretraining | Shangeth Rajaa et al. | 2603.08216 | translate | read | null |
| 2026-03-08 | Active Inference for Micro-Gesture Recognition: EFE-Guided Temporal Sampling and Adaptive Learning | Weijia Feng et al. | 2603.07559 | translate | read | null |
| 2026-03-08 | ICLR: In-Context Imitation Learning with Visual Reasoning | Toan Nguyen et al. | 2603.07530 | translate | read | null |
| 2026-03-07 | Dreamer-CDP: Improving Reconstruction-free World Models Via Continuous Deterministic Representation Prediction | Michael Hauri et al. | 2603.07083 | translate | read | null |
| 2026-03-06 | Skeleton-to-Image Encoding: Enabling Skeleton Representation Learning via Vision-Pretrained Models | Siyuan Yang et al. | 2603.05963 | translate | read | null |
| 2026-03-06 | Learning Next Action Predictors from Human-Computer Interaction | Omar Shaikh et al. | 2603.05923 | translate | read | null |
| 2026-03-04 | A Baseline Study and Benchmark for Few-Shot Open-Set Action Recognition with Feature Residual Discrimination | Stefano Berti et al. | 2603.04125 | translate | read | null |
| 2026-03-03 | Chain of World: World Model Thinking in Latent Motion | Fuxiang Yang et al. | 2603.03195 | translate | read | null |
| 2026-03-02 | Neural Implicit Action Fields: From Discrete Waypoints to Continuous Functions for Vision-Language-Action Models | Haoyun Liu et al. | 2603.01766 | translate | read | null |
| 2026-03-02 | ATA: Bridging Implicit Reasoning with Attention-Guided and Action-Guided Inference for Vision-Language Action Models | Cheng Yang et al. | 2603.01490 | translate | read | null |