Action Recognition - 2025-05 | Paper Arxiv Daily

Action Recognition - 2025-05

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-05-30	DiG-Net: Enhancing Quality of Life through Hyper-Range Dynamic Gesture Recognition in Assistive Robotics	Eran Bamani Beeri et.al.	2505.24786	translate	read	null
2025-05-30	Beyond FACS: Data-driven Facial Expression Dictionaries, with Application to Predicting Autism	Evangelos Sariyanidi et.al.	2505.24679	translate	read	null
2025-05-30	EgoExOR: An Ego-Exo-Centric Operating Room Dataset for Surgical Activity Understanding	Ege Özsoy et.al.	2505.24287	translate	read	null
2025-05-29	Autoregressive Meta-Actions for Unified Controllable Trajectory Generation	Jianbo Zhao et.al.	2505.23612	translate	read	null
2025-05-29	CLIP-AE: CLIP-assisted Cross-view Audio-Visual Enhancement for Unsupervised Temporal Action Localization	Rui Xia et.al.	2505.23524	translate	read	null
2025-05-29	Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition	Shanaka Ramesh Gunasekara et.al.	2505.23012	translate	read	link
2025-05-28	PRISM: Video Dataset Condensation with Progressive Refinement and Insertion for Sparse Motion	Jaehyun Choi et.al.	2505.22564	translate	read	null
2025-05-27	DeepConvContext: A Multi-Scale Approach to Timeseries Classification in Human Activity Recognition	Marius Bock et.al.	2505.20894	translate	read	link
2025-05-27	TrustSkin: A Fairness Pipeline for Trustworthy Facial Affect Analysis Across Skin Tone	Ana M. Cabanas et.al.	2505.20637	translate	read	null
2025-05-26	Data-Free Class-Incremental Gesture Recognition with Prototype-Guided Pseudo Feature Replay	Hongsong Wang et.al.	2505.20049	translate	read	link
2025-05-26	PHI: Bridging Domain Shift in Long-Term Action Quality Assessment via Progressive Hierarchical Instruction	Kanglei Zhou et.al.	2505.19972	translate	read	link
2025-05-26	The Role of Video Generation in Enhancing Data-Limited Action Understanding	Wei Li et.al.	2505.19495	translate	read	null
2025-05-24	ProphetDWM: A Driving World Model for Rolling Out Future Actions and Videos	Xiaodong Wang et.al.	2505.18650	translate	read	null
2025-05-27	SHARDeg: A Benchmark for Skeletal Human Action Recognition in Degraded Scenarios	Simon Malzard et.al.	2505.18048	translate	read	null
2025-05-23	3D Face Reconstruction Error Decomposed: A Modular Benchmark for Fair and Fast Method Evaluation	Evangelos Sariyanidi et.al.	2505.18025	translate	read	null
2025-05-23	Multi-task Learning For Joint Action and Gesture Recognition	Konstantinos Spathis et.al.	2505.17867	translate	read	null
2025-05-23	Temporal Consistency Constrained Transferable Adversarial Attacks with Background Mixup for Action Recognition	Ping Li et.al.	2505.17807	translate	read	link
2025-05-23	Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour	Bálint Gyevnár et.al.	2505.17801	translate	read	null
2025-05-23	SVL: Spike-based Vision-language Pretraining for Efficient 3D Open-world Understanding	Xuerui Qiu et.al.	2505.17674	translate	read	null
2025-05-23	ProTAL: A Drag-and-Link Video Programming Framework for Temporal Action Localization	Yuchen He et.al.	2505.17555	translate	read	null
2025-05-22	UAV Control with Vision-based Hand Gesture Recognition over Edge-Computing	Sousannah Abdalla et.al.	2505.17303	translate	read	null
2025-05-22	CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning	Jiange Yang et.al.	2505.17006	translate	read	null
2025-05-21	Towards Zero-Shot Differential Morphing Attack Detection with Multimodal Large Language Models	Ria Shekhawat et.al.	2505.15332	translate	read	null
2025-05-21	DiffProb: Data Pruning for Face Recognition	Eduarda Caldeira et.al.	2505.15272	translate	read	link
2025-05-21	Leveraging Foundation Models for Multimodal Graph-Based Action Recognition	Fatemeh Ziaeetabar et.al.	2505.15192	translate	read	null
2025-05-20	Egocentric Action-aware Inertial Localization in Point Clouds	Mingfang Zhang et.al.	2505.14346	translate	read	link
2025-05-20	Transfer Learning from Visual Speech Recognition to Mouthing Recognition in German Sign Language	Dinh Nam Pham et.al.	2505.13784	translate	read	link
2025-05-18	MTIL: Encoding Full History with Mamba for Temporal Imitation Learning	Yulin Zhou et.al.	2505.12410	translate	read	link
2025-05-20	Aux-Think: Exploring Reasoning Strategies for Data-Efficient Vision-Language Navigation	Shuo Wang et.al.	2505.11886	translate	read	null
2025-05-16	Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation	Zihan Wang et.al.	2505.11383	translate	read	link
2025-05-15	NeoLightning: A Modern Reimagination of Gesture-Based Sound Design	Yonghyun Kim et.al.	2505.10686	translate	read	link
2025-05-15	Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized?	Jianyang Xie et.al.	2505.10679	translate	read	link
2025-05-14	Mission Balance: Generating Under-represented Class Samples using Video Diffusion Models	Danush Kumar Venkatesh et.al.	2505.09858	translate	read	link
2025-05-13	Reinforcement Learning meets Masked Video Modeling : Trajectory-Guided Adaptive Token Selection	Ayush K. Rai et.al.	2505.08561	translate	read	null
2025-05-17	Training Strategies for Efficient Embodied Reasoning	William Chen et.al.	2505.08243	translate	read	null
2025-05-12	H $^{\mathbf{3}}$ DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning	Yiyang Lu et.al.	2505.07819	translate	read	null
2025-05-11	DeepSORT-Driven Visual Tracking Approach for Gesture Recognition in Interactive Systems	Tong Zhang et.al.	2505.07110	translate	read	null
2025-05-10	A Short Overview of Multi-Modal Wi-Fi Sensing	Zijian Zhao et.al.	2505.06682	translate	read	link
2025-05-09	Context Informed Incremental Learning Improves Myoelectric Control Performance in Virtual Reality Object Manipulation Tasks	Gabriel Gagné et.al.	2505.06064	translate	read	link
2025-05-09	Task-Adapter++: Task-specific Adaptation with Order-aware Alignment for Few-shot Action Recognition	Congqi Cao et.al.	2505.06002	translate	read	link
2025-05-07	DetReIDX: A Stress-Test Dataset for Real-World UAV-Based Person Recognition	Kailash A. Hambarde et.al.	2505.04793	translate	read	link
2025-05-07	Comparison of Visual Trackers for Biomechanical Analysis of Running	Luis F. Gomez et.al.	2505.04713	translate	read	null
2025-05-07	Trajectory Entropy Reinforcement Learning for Predictable and Robust Control	Bang You et.al.	2505.04193	translate	read	null
2025-05-07	FoodTrack: Estimating Handheld Food Portions with Egocentric Video	Ervin Wang et.al.	2505.04055	translate	read	null
2025-05-06	Action Spotting and Precise Event Detection in Sports: Datasets, Methods, and Challenges	Hao Xu et.al.	2505.03991	translate	read	null
2025-05-03	A Multimodal Framework for Explainable Evaluation of Soft Skills in Educational Environments	Jared D. T. Guerrero-Sosa et.al.	2505.01794	translate	read	null
2025-05-01	Predicting Estimated Times of Restoration for Electrical Outages Using Longitudinal Tabular Transformers	Bogireddy Sai Prasanna Teja et.al.	2505.00225	translate	read	null

(<a href=../Action_Recognition.md>back to Action Recognition</a>)