Action Recognition - 2025-09
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2025-09-27 | $\texttt{BluePrint}$: A Social Media User Dataset for LLM Persona Evaluation and Training | Aurélien Bück-Kaeffer et.al. | 2510.02343 | translate | read | null |
| 2025-09-30 | Towards Intuitive Human-Robot Interaction through Embodied Gesture-Driven Control with Woven Tactile Skins | ChunPing Lam et.al. | 2509.25951 | translate | read | null |
| 2025-09-22 | Six Sigma For Neural Networks: Taguchi-based optimization | Sai Varun Kodathala et.al. | 2509.25213 | translate | read | null |
| 2025-09-29 | Fast Real-Time Pipeline for Robust Arm Gesture Recognition | Milán Zsolt Bagladi et.al. | 2509.25042 | translate | read | null |
| 2025-09-28 | AssemblyHands-X: Modeling 3D Hand-Body Coordination for Understanding Bimanual Human Activities | Tatsuro Banno et.al. | 2509.23888 | translate | read | null |
| 2025-09-27 | New Synthetic Goldmine: Hand Joint Angle-Driven EMG Data Generation Framework for Micro-Gesture Recognition | Nana Wang et.al. | 2509.23359 | translate | read | null |
| 2025-09-27 | Spatiotemporal Radar Gesture Recognition with Hybrid Spiking Neural Networks: Balancing Accuracy and Efficiency | Riccardo Mazzieri et.al. | 2509.23303 | translate | read | null |
| 2025-09-27 | MMeViT: Multi-Modal ensemble ViT for Post-Stroke Rehabilitation Action Recognition | Ye-eun Kim et.al. | 2509.23044 | translate | read | null |
| 2025-09-27 | Disentangling Static and Dynamic Information for Reducing Static Bias in Action Recognition | Masato Kobayashi et.al. | 2509.23009 | translate | read | null |
| 2025-09-26 | See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation | Chih Yao Hu et.al. | 2509.22653 | translate | read | null |
| 2025-09-26 | Prompt-guided Disentangled Representation for Action Recognition | Tianci Wu et.al. | 2509.21783 | translate | read | null |
| 2025-09-25 | SlotFM: A Motion Foundation Model with Slot Attention for Diverse Downstream Tasks | Junyong Park et.al. | 2509.21673 | translate | read | null |
| 2025-09-25 | Temporal vs. Spatial: Comparing DINOv3 and V-JEPA2 Feature Representations for Video Action Analysis | Sai Varun Kodathala et.al. | 2509.21595 | translate | read | null |
| 2025-09-25 | EMG-UP: Unsupervised Personalization in Cross-User EMG Gesture Recognition | Nana Wang et.al. | 2509.21589 | translate | read | null |
| 2025-09-24 | mmHSense: Multi-Modal and Distributed mmWave ISAC Datasets for Human Sensing | Nabeel Nisar Bhat et.al. | 2509.21396 | translate | read | null |
| 2025-09-25 | Every Subtlety Counts: Fine-grained Person Independence Micro-Action Recognition via Distributionally Robust Optimization | Feng-Qi Cui et.al. | 2509.21261 | translate | read | null |
| 2025-09-25 | Autoregressive End-to-End Planning with Time-Invariant Spatial Alignment and Multi-Objective Policy Refinement | Jianbo Zhao et.al. | 2509.20938 | translate | read | null |
| 2025-09-25 | GenFacts-Generative Counterfactual Explanations for Multi-Variate Time Series | Sarah Seifi et.al. | 2509.20936 | translate | read | null |
| 2025-09-25 | Causal Time Series Generation via Diffusion Models | Yutong Xia et.al. | 2509.20846 | translate | read | null |
| 2025-09-23 | A Bimanual Gesture Interface for ROS-Based Mobile Manipulators Using TinyML and Sensor Fusion | Najeeb Ahmed Bhuiyan et.al. | 2509.19521 | translate | read | null |
| 2025-09-23 | FERA: Foil Fencing Referee Assistant Using Pose-Based Multi-Label Move Recognition and Rule Reasoning | Ziwen Chen et.al. | 2509.18527 | translate | read | null |
| 2025-09-22 | MoCrop: Training Free Motion Guided Cropping for Efficient Video Action Recognition | Binhua Huang et.al. | 2509.18473 | translate | read | null |
| 2025-09-22 | Orcust: Stepwise-Feedback Reinforcement Learning for GUI Agent | Junyu Lu et.al. | 2509.17917 | translate | read | null |
| 2025-09-22 | Trainee Action Recognition through Interaction Analysis in CCATT Mixed-Reality Training | Divya Mereddy et.al. | 2509.17888 | translate | read | null |
| 2025-09-22 | A$^2$M$^2$-Net: Adaptively Aligned Multi-Scale Moment for Few-Shot Action Recognition | Zilin Gao et.al. | 2509.17638 | translate | read | null |
| 2025-09-22 | UIPro: Unleashing Superior Interaction Capability For GUI Agents | Hongxin Li et.al. | 2509.17328 | translate | read | null |
| 2025-09-21 | Imagine2Act: Leveraging Object-Action Motion Consistency from Imagined Goals for Robotic Manipulation | Liang Heng et.al. | 2509.17125 | translate | read | null |
| 2025-09-21 | MoCLIP-Lite: Efficient Video Recognition by Fusing CLIP with Motion Vectors | Binhua Huang et.al. | 2509.17084 | translate | read | null |
| 2025-09-20 | Automated Procedural Analysis via Video-Language Models for AI-assisted Nursing Skills Assessment | Shen Chang et.al. | 2509.16810 | translate | read | null |
| 2025-09-19 | KRAST: Knowledge-Augmented Robotic Action Recognition with Structured Text for Vision-Language Models | Son Hai Nguyen et.al. | 2509.16452 | translate | read | null |
| 2025-09-18 | RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation | Yuming Jiang et.al. | 2509.15212 | translate | read | null |
| 2025-09-18 | Doppler Radiance Field-Guided Antenna Selection for Improved Generalization in Multi-Antenna Wi-Fi-based Human Activity Recognition | Navid Hasanzadeh et.al. | 2509.15129 | translate | read | null |
| 2025-09-18 | LSTC-MDA: A Unified Framework for Long-Short Term Temporal Convolution and Mixed Data Augmentation in Skeleton-Based Action Recognition | Feng Ding et.al. | 2509.14619 | translate | read | null |
| 2025-09-18 | ClearFairy: Capturing Creative Workflows through Decision Structuring, In-Situ Questioning, and Rationale Inference | Kihoon Son et.al. | 2509.14537 | translate | read | null |
| 2025-09-15 | Domain-Adaptive Pretraining Improves Primate Behavior Recognition | Felix B. Mueller et.al. | 2509.12193 | translate | read | null |
| 2025-09-15 | Open-ended Hierarchical Streaming Video Understanding with Vision Language Models | Hyolim Kang et.al. | 2509.12145 | translate | read | null |
| 2025-09-15 | Gesture-Based Robot Control Integrating Mm-wave Radar and Behavior Trees | Yuqing Song et.al. | 2509.12008 | translate | read | null |
| 2025-09-15 | Learning Representations in Video Game Agents with Supervised Contrastive Imitation Learning | Carlos Celemin et.al. | 2509.11880 | translate | read | null |
| 2025-09-11 | Improvement of Human-Object Interaction Action Recognition Using Scene Information and Multi-Task Learning Approach | Hesham M. Shehata et.al. | 2509.09067 | translate | read | null |
| 2025-09-10 | A Contextual Bandits Approach for Personalization of Hand Gesture Recognition | Duke Lin et.al. | 2509.08915 | translate | read | null |
| 2025-09-10 | Diffusion-Based Action Recognition Generalizes to Untrained Domains | Rogerio Guimaraes et.al. | 2509.08908 | translate | read | null |
| 2025-09-10 | Chirality in Action: Time-Aware Video Representation Learning by Latent Straightening | Piyush Bagad et.al. | 2509.08502 | translate | read | null |
| 2025-09-10 | LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations | Payal Varshney et.al. | 2509.08422 | translate | read | null |
| 2025-09-09 | EHWGesture – A dataset for multimodal understanding of clinical gestures | Gianluca Amprimo et.al. | 2509.07525 | translate | read | null |
| 2025-09-09 | G3CN: Gaussian Topology Refinement Gated Graph Convolutional Network for Skeleton-Based Action Recognition | Haiqing Ren et.al. | 2509.07335 | translate | read | null |
| 2025-09-08 | Video-based Generalized Category Discovery via Memory-Guided Consistency-Aware Contrastive Learning | Zhang Jing et.al. | 2509.06306 | translate | read | null |
| 2025-09-06 | Leveraging Vision-Language Large Models for Interpretable Video Action Recognition with Semantic Tokenization | Jingwei Peng et.al. | 2509.05695 | translate | read | null |
| 2025-09-05 | DuoCLR: Dual-Surrogate Contrastive Learning for Skeleton-based Human Action Segmentation | Haitao Tian et.al. | 2509.05543 | translate | read | null |
| 2025-09-03 | Towards Efficient General Feature Prediction in Masked Skeleton Modeling | Shengkai Sun et.al. | 2509.03609 | translate | read | null |