Action Recognition - 2025-09

Publish Date Title Authors PDF Translate Read Code
2025-09-27 $\texttt{BluePrint}$ : A Social Media User Dataset for LLM Persona Evaluation and Training Aurélien Bück-Kaeffer et.al. 2510.02343 translate read null
2025-09-30 Towards Intuitive Human-Robot Interaction through Embodied Gesture-Driven Control with Woven Tactile Skins ChunPing Lam et.al. 2509.25951 translate read null
2025-09-22 Six Sigma For Neural Networks: Taguchi-based optimization Sai Varun Kodathala et.al. 2509.25213 translate read null
2025-09-29 Fast Real-Time Pipeline for Robust Arm Gesture Recognition Milán Zsolt Bagladi et.al. 2509.25042 translate read null
2025-09-28 AssemblyHands-X: Modeling 3D Hand-Body Coordination for Understanding Bimanual Human Activities Tatsuro Banno et.al. 2509.23888 translate read null
2025-09-27 New Synthetic Goldmine: Hand Joint Angle-Driven EMG Data Generation Framework for Micro-Gesture Recognition Nana Wang et.al. 2509.23359 translate read null
2025-09-27 Spatiotemporal Radar Gesture Recognition with Hybrid Spiking Neural Networks: Balancing Accuracy and Efficiency Riccardo Mazzieri et.al. 2509.23303 translate read null
2025-09-27 MMeViT: Multi-Modal ensemble ViT for Post-Stroke Rehabilitation Action Recognition Ye-eun Kim et.al. 2509.23044 translate read null
2025-09-27 Disentangling Static and Dynamic Information for Reducing Static Bias in Action Recognition Masato Kobayashi et.al. 2509.23009 translate read null
2025-09-26 See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation Chih Yao Hu et.al. 2509.22653 translate read null
2025-09-26 Prompt-guided Disentangled Representation for Action Recognition Tianci Wu et.al. 2509.21783 translate read null
2025-09-25 SlotFM: A Motion Foundation Model with Slot Attention for Diverse Downstream Tasks Junyong Park et.al. 2509.21673 translate read null
2025-09-25 Temporal vs. Spatial: Comparing DINOv3 and V-JEPA2 Feature Representations for Video Action Analysis Sai Varun Kodathala et.al. 2509.21595 translate read null
2025-09-25 EMG-UP: Unsupervised Personalization in Cross-User EMG Gesture Recognition Nana Wang et.al. 2509.21589 translate read null
2025-09-24 mmHSense: Multi-Modal and Distributed mmWave ISAC Datasets for Human Sensing Nabeel Nisar Bhat et.al. 2509.21396 translate read null
2025-09-25 Every Subtlety Counts: Fine-grained Person Independence Micro-Action Recognition via Distributionally Robust Optimization Feng-Qi Cui et.al. 2509.21261 translate read null
2025-09-25 Autoregressive End-to-End Planning with Time-Invariant Spatial Alignment and Multi-Objective Policy Refinement Jianbo Zhao et.al. 2509.20938 translate read null
2025-09-25 GenFacts-Generative Counterfactual Explanations for Multi-Variate Time Series Sarah Seifi et.al. 2509.20936 translate read null
2025-09-25 Causal Time Series Generation via Diffusion Models Yutong Xia et.al. 2509.20846 translate read null
2025-09-23 A Bimanual Gesture Interface for ROS-Based Mobile Manipulators Using TinyML and Sensor Fusion Najeeb Ahmed Bhuiyan et.al. 2509.19521 translate read null
2025-09-23 FERA: Foil Fencing Referee Assistant Using Pose-Based Multi-Label Move Recognition and Rule Reasoning Ziwen Chen et.al. 2509.18527 translate read null
2025-09-22 MoCrop: Training Free Motion Guided Cropping for Efficient Video Action Recognition Binhua Huang et.al. 2509.18473 translate read null
2025-09-22 Orcust: Stepwise-Feedback Reinforcement Learning for GUI Agent Junyu Lu et.al. 2509.17917 translate read null
2025-09-22 Trainee Action Recognition through Interaction Analysis in CCATT Mixed-Reality Training Divya Mereddy et.al. 2509.17888 translate read null
2025-09-22 A $^2$M$^2$ -Net: Adaptively Aligned Multi-Scale Moment for Few-Shot Action Recognition Zilin Gao et.al. 2509.17638 translate read null
2025-09-22 UIPro: Unleashing Superior Interaction Capability For GUI Agents Hongxin Li et.al. 2509.17328 translate read null
2025-09-21 Imagine2Act: Leveraging Object-Action Motion Consistency from Imagined Goals for Robotic Manipulation Liang Heng et.al. 2509.17125 translate read null
2025-09-21 MoCLIP-Lite: Efficient Video Recognition by Fusing CLIP with Motion Vectors Binhua Huang et.al. 2509.17084 translate read null
2025-09-20 Automated Procedural Analysis via Video-Language Models for AI-assisted Nursing Skills Assessment Shen Chang et.al. 2509.16810 translate read null
2025-09-19 KRAST: Knowledge-Augmented Robotic Action Recognition with Structured Text for Vision-Language Models Son Hai Nguyen et.al. 2509.16452 translate read null
2025-09-18 RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation Yuming Jiang et.al. 2509.15212 translate read null
2025-09-18 Doppler Radiance Field-Guided Antenna Selection for Improved Generalization in Multi-Antenna Wi-Fi-based Human Activity Recognition Navid Hasanzadeh et.al. 2509.15129 translate read null
2025-09-18 LSTC-MDA: A Unified Framework for Long-Short Term Temporal Convolution and Mixed Data Augmentation in Skeleton-Based Action Recognition Feng Ding et.al. 2509.14619 translate read null
2025-09-18 ClearFairy: Capturing Creative Workflows through Decision Structuring, In-Situ Questioning, and Rationale Inference Kihoon Son et.al. 2509.14537 translate read null
2025-09-15 Domain-Adaptive Pretraining Improves Primate Behavior Recognition Felix B. Mueller et.al. 2509.12193 translate read null
2025-09-15 Open-ended Hierarchical Streaming Video Understanding with Vision Language Models Hyolim Kang et.al. 2509.12145 translate read null
2025-09-15 Gesture-Based Robot Control Integrating Mm-wave Radar and Behavior Trees Yuqing Song et.al. 2509.12008 translate read null
2025-09-15 Learning Representations in Video Game Agents with Supervised Contrastive Imitation Learning Carlos Celemin et.al. 2509.11880 translate read null
2025-09-11 Improvement of Human-Object Interaction Action Recognition Using Scene Information and Multi-Task Learning Approach Hesham M. Shehata et.al. 2509.09067 translate read null
2025-09-10 A Contextual Bandits Approach for Personalization of Hand Gesture Recognition Duke Lin et.al. 2509.08915 translate read null
2025-09-10 Diffusion-Based Action Recognition Generalizes to Untrained Domains Rogerio Guimaraes et.al. 2509.08908 translate read null
2025-09-10 Chirality in Action: Time-Aware Video Representation Learning by Latent Straightening Piyush Bagad et.al. 2509.08502 translate read null
2025-09-10 LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations Payal Varshney et.al. 2509.08422 translate read null
2025-09-09 EHWGesture – A dataset for multimodal understanding of clinical gestures Gianluca Amprimo et.al. 2509.07525 translate read null
2025-09-09 G3CN: Gaussian Topology Refinement Gated Graph Convolutional Network for Skeleton-Based Action Recognition Haiqing Ren et.al. 2509.07335 translate read null
2025-09-08 Video-based Generalized Category Discovery via Memory-Guided Consistency-Aware Contrastive Learning Zhang Jing et.al. 2509.06306 translate read null
2025-09-06 Leveraging Vision-Language Large Models for Interpretable Video Action Recognition with Semantic Tokenization Jingwei Peng et.al. 2509.05695 translate read null
2025-09-05 DuoCLR: Dual-Surrogate Contrastive Learning for Skeleton-based Human Action Segmentation Haitao Tian et.al. 2509.05543 translate read null
2025-09-03 Towards Efficient General Feature Prediction in Masked Skeleton Modeling Shengkai Sun et.al. 2509.03609 translate read null

(<a href=../Action_Recognition.md>back to Action Recognition</a>)