Action Recognition - 2025-09 | Paper Arxiv Daily

Action Recognition - 2025-09

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-09-27	$\texttt{BluePrint}$ : A Social Media User Dataset for LLM Persona Evaluation and Training	Aurélien Bück-Kaeffer et.al.	2510.02343	translate	read	null
2025-09-30	Towards Intuitive Human-Robot Interaction through Embodied Gesture-Driven Control with Woven Tactile Skins	ChunPing Lam et.al.	2509.25951	translate	read	null
2025-09-22	Six Sigma For Neural Networks: Taguchi-based optimization	Sai Varun Kodathala et.al.	2509.25213	translate	read	null
2025-09-29	Fast Real-Time Pipeline for Robust Arm Gesture Recognition	Milán Zsolt Bagladi et.al.	2509.25042	translate	read	null
2025-09-28	AssemblyHands-X: Modeling 3D Hand-Body Coordination for Understanding Bimanual Human Activities	Tatsuro Banno et.al.	2509.23888	translate	read	null
2025-09-27	New Synthetic Goldmine: Hand Joint Angle-Driven EMG Data Generation Framework for Micro-Gesture Recognition	Nana Wang et.al.	2509.23359	translate	read	null
2025-09-27	Spatiotemporal Radar Gesture Recognition with Hybrid Spiking Neural Networks: Balancing Accuracy and Efficiency	Riccardo Mazzieri et.al.	2509.23303	translate	read	null
2025-09-27	MMeViT: Multi-Modal ensemble ViT for Post-Stroke Rehabilitation Action Recognition	Ye-eun Kim et.al.	2509.23044	translate	read	null
2025-09-27	Disentangling Static and Dynamic Information for Reducing Static Bias in Action Recognition	Masato Kobayashi et.al.	2509.23009	translate	read	null
2025-09-26	See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation	Chih Yao Hu et.al.	2509.22653	translate	read	null
2025-09-26	Prompt-guided Disentangled Representation for Action Recognition	Tianci Wu et.al.	2509.21783	translate	read	null
2025-09-25	SlotFM: A Motion Foundation Model with Slot Attention for Diverse Downstream Tasks	Junyong Park et.al.	2509.21673	translate	read	null
2025-09-25	Temporal vs. Spatial: Comparing DINOv3 and V-JEPA2 Feature Representations for Video Action Analysis	Sai Varun Kodathala et.al.	2509.21595	translate	read	null
2025-09-25	EMG-UP: Unsupervised Personalization in Cross-User EMG Gesture Recognition	Nana Wang et.al.	2509.21589	translate	read	null
2025-09-24	mmHSense: Multi-Modal and Distributed mmWave ISAC Datasets for Human Sensing	Nabeel Nisar Bhat et.al.	2509.21396	translate	read	null
2025-09-25	Every Subtlety Counts: Fine-grained Person Independence Micro-Action Recognition via Distributionally Robust Optimization	Feng-Qi Cui et.al.	2509.21261	translate	read	null
2025-09-25	Autoregressive End-to-End Planning with Time-Invariant Spatial Alignment and Multi-Objective Policy Refinement	Jianbo Zhao et.al.	2509.20938	translate	read	null
2025-09-25	GenFacts-Generative Counterfactual Explanations for Multi-Variate Time Series	Sarah Seifi et.al.	2509.20936	translate	read	null
2025-09-25	Causal Time Series Generation via Diffusion Models	Yutong Xia et.al.	2509.20846	translate	read	null
2025-09-23	A Bimanual Gesture Interface for ROS-Based Mobile Manipulators Using TinyML and Sensor Fusion	Najeeb Ahmed Bhuiyan et.al.	2509.19521	translate	read	null
2025-09-23	FERA: Foil Fencing Referee Assistant Using Pose-Based Multi-Label Move Recognition and Rule Reasoning	Ziwen Chen et.al.	2509.18527	translate	read	null
2025-09-22	MoCrop: Training Free Motion Guided Cropping for Efficient Video Action Recognition	Binhua Huang et.al.	2509.18473	translate	read	null
2025-09-22	Orcust: Stepwise-Feedback Reinforcement Learning for GUI Agent	Junyu Lu et.al.	2509.17917	translate	read	null
2025-09-22	Trainee Action Recognition through Interaction Analysis in CCATT Mixed-Reality Training	Divya Mereddy et.al.	2509.17888	translate	read	null
2025-09-22	A $^2$M$^2$ -Net: Adaptively Aligned Multi-Scale Moment for Few-Shot Action Recognition	Zilin Gao et.al.	2509.17638	translate	read	null
2025-09-22	UIPro: Unleashing Superior Interaction Capability For GUI Agents	Hongxin Li et.al.	2509.17328	translate	read	null
2025-09-21	Imagine2Act: Leveraging Object-Action Motion Consistency from Imagined Goals for Robotic Manipulation	Liang Heng et.al.	2509.17125	translate	read	null
2025-09-21	MoCLIP-Lite: Efficient Video Recognition by Fusing CLIP with Motion Vectors	Binhua Huang et.al.	2509.17084	translate	read	null
2025-09-20	Automated Procedural Analysis via Video-Language Models for AI-assisted Nursing Skills Assessment	Shen Chang et.al.	2509.16810	translate	read	null
2025-09-19	KRAST: Knowledge-Augmented Robotic Action Recognition with Structured Text for Vision-Language Models	Son Hai Nguyen et.al.	2509.16452	translate	read	null
2025-09-18	RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation	Yuming Jiang et.al.	2509.15212	translate	read	null
2025-09-18	Doppler Radiance Field-Guided Antenna Selection for Improved Generalization in Multi-Antenna Wi-Fi-based Human Activity Recognition	Navid Hasanzadeh et.al.	2509.15129	translate	read	null
2025-09-18	LSTC-MDA: A Unified Framework for Long-Short Term Temporal Convolution and Mixed Data Augmentation in Skeleton-Based Action Recognition	Feng Ding et.al.	2509.14619	translate	read	null
2025-09-18	ClearFairy: Capturing Creative Workflows through Decision Structuring, In-Situ Questioning, and Rationale Inference	Kihoon Son et.al.	2509.14537	translate	read	null
2025-09-15	Domain-Adaptive Pretraining Improves Primate Behavior Recognition	Felix B. Mueller et.al.	2509.12193	translate	read	null
2025-09-15	Open-ended Hierarchical Streaming Video Understanding with Vision Language Models	Hyolim Kang et.al.	2509.12145	translate	read	null
2025-09-15	Gesture-Based Robot Control Integrating Mm-wave Radar and Behavior Trees	Yuqing Song et.al.	2509.12008	translate	read	null
2025-09-15	Learning Representations in Video Game Agents with Supervised Contrastive Imitation Learning	Carlos Celemin et.al.	2509.11880	translate	read	null
2025-09-11	Improvement of Human-Object Interaction Action Recognition Using Scene Information and Multi-Task Learning Approach	Hesham M. Shehata et.al.	2509.09067	translate	read	null
2025-09-10	A Contextual Bandits Approach for Personalization of Hand Gesture Recognition	Duke Lin et.al.	2509.08915	translate	read	null
2025-09-10	Diffusion-Based Action Recognition Generalizes to Untrained Domains	Rogerio Guimaraes et.al.	2509.08908	translate	read	null
2025-09-10	Chirality in Action: Time-Aware Video Representation Learning by Latent Straightening	Piyush Bagad et.al.	2509.08502	translate	read	null
2025-09-10	LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations	Payal Varshney et.al.	2509.08422	translate	read	null
2025-09-09	EHWGesture – A dataset for multimodal understanding of clinical gestures	Gianluca Amprimo et.al.	2509.07525	translate	read	null
2025-09-09	G3CN: Gaussian Topology Refinement Gated Graph Convolutional Network for Skeleton-Based Action Recognition	Haiqing Ren et.al.	2509.07335	translate	read	null
2025-09-08	Video-based Generalized Category Discovery via Memory-Guided Consistency-Aware Contrastive Learning	Zhang Jing et.al.	2509.06306	translate	read	null
2025-09-06	Leveraging Vision-Language Large Models for Interpretable Video Action Recognition with Semantic Tokenization	Jingwei Peng et.al.	2509.05695	translate	read	null
2025-09-05	DuoCLR: Dual-Surrogate Contrastive Learning for Skeleton-based Human Action Segmentation	Haitao Tian et.al.	2509.05543	translate	read	null
2025-09-03	Towards Efficient General Feature Prediction in Masked Skeleton Modeling	Shengkai Sun et.al.	2509.03609	translate	read	null

(<a href=../Action_Recognition.md>back to Action Recognition</a>)