Action Recognition - 2025-12 | Paper Arxiv Daily

Action Recognition - 2025-12

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-12-31	Spatial4D-Bench: A Versatile 4D Spatial Intelligence Benchmark	Pan Wang et.al.	2601.00092	translate	read	null
2025-12-31	FineTec: Fine-Grained Action Recognition Under Temporal Corruption via Skeleton Decomposition and Sequence Completion	Dian Shao et.al.	2512.25067	translate	read	null
2025-12-31	VLN-MME: Diagnosing MLLMs as Language-guided Visual Navigation agents	Xunyi Zhao et.al.	2512.24851	translate	read	null
2025-12-30	AI-Driven Evaluation of Surgical Skill via Action Recognition	Yan Meng et.al.	2512.24411	translate	read	null
2025-12-29	Lifelong Domain Adaptive 3D Human Pose Estimation	Qucheng Peng et.al.	2512.23860	translate	read	null
2025-12-29	Act2Goal: From World Model To General Goal-conditioned Policy	Pengfei Zhou et.al.	2512.23541	translate	read	null
2025-12-29	Multi-Track Multimodal Learning on iMiGUE: Micro-Gesture and Emotion Recognition	Arman Martirosyan et.al.	2512.23291	translate	read	null
2025-12-27	Autoregressive Flow Matching for Motion Prediction	Johnathan Xie et.al.	2512.22688	translate	read	null
2025-12-27	Clutter-Resistant Vision-Language-Action Models through Object-Centric and Geometry Grounding	Khoa Vo et.al.	2512.22519	translate	read	null
2025-12-22	Signal-SGN++: Topology-Enhanced Time-Frequency Spiking Graph Network for Skeleton-Based Action Recognition	Naichuan Zheng et.al.	2512.22214	translate	read	null
2025-12-26	Patch as Node: Human-Centric Graph Representation Learning for Multimodal Action Recognition	Zeyu Liang et.al.	2512.21916	translate	read	null
2025-12-24	EVE: A Generator-Verifier System for Generative Policies	Yusuf Ali et.al.	2512.21430	translate	read	null
2025-12-24	ElfCore: A 28nm Neural Processor Enabling Dynamic Structured Sparse Training and Online Self-Supervised Learning with Activity-Dependent Weight Update	Zhe Su et.al.	2512.21153	translate	read	null
2025-12-23	Bridging Modalities and Transferring Knowledge: Enhanced Multimodal Understanding and Recognition	Gorjan Radevski et.al.	2512.20501	translate	read	null
2025-12-23	Beyond Motion Pattern: An Empirical Study of Physical Forces for Human Motion Understanding	Anh Dao et.al.	2512.20451	translate	read	null
2025-12-23	DETACH : Decomposed Spatio-Temporal Alignment for Exocentric Video and Ambient Sensors with Staged Learning	Junho Yoon et.al.	2512.20409	translate	read	null
2025-12-23	Effect of Activation Function and Model Optimizer on the Performance of Human Activity Recognition System Using Various Deep Learning Models	Subrata Kumer Paula et.al.	2512.20104	translate	read	null
2025-12-23	A Contextual Analysis of Driver-Facing and Dual-View Video Inputs for Distraction Detection in Naturalistic Driving Environments	Anthony Dontoh et.al.	2512.20025	translate	read	null
2025-12-22	Distinguishing Visually Similar Actions: Prompt-Guided Semantic Prototype Modulation for Few-Shot Action Recognition	Xiaoyang Li et.al.	2512.19036	translate	read	null
2025-12-21	Context-Aware Network Based on Multi-scale Spatio-temporal Attention for Action Recognition in Videos	Xiaoyang Li et.al.	2512.18750	translate	read	null
2025-12-21	Hierarchical Bayesian Framework for Multisource Domain Adaptation	Alexander M. Glandon et.al.	2512.18553	translate	read	null
2025-12-17	Seeing Beyond the Scene: Analyzing and Mitigating Background Bias in Action Recognition	Ellie Zhou et.al.	2512.17953	translate	read	null
2025-12-19	Xiaomi MiMo-VL-Miloco Technical Report	Jiaze Li et.al.	2512.17436	translate	read	null
2025-12-18	OMG-Bench: A New Challenging Benchmark for Skeleton-based Online Micro Hand Gesture Recognition	Haochen Chang et.al.	2512.16727	translate	read	null
2025-12-18	Skeleton-Snippet Contrastive Learning with Multiscale Feature Fusion for Action Localization	Qiushuo Cheng et.al.	2512.16504	translate	read	null
2025-12-06	Smart Surveillance: Identifying IoT Device Behaviours using ML-Powered Traffic Analysis	Reza Ryan et.al.	2512.13709	translate	read	null
2025-12-15	Recurrent Video Masked Autoencoders	Daniel Zoran et.al.	2512.13684	translate	read	null
2025-12-14	StegaVAR: Privacy-Preserving Video Action Recognition via Steganographic Domain Analysis	Lixin Chen et.al.	2512.12586	translate	read	null
2025-12-13	From Human Intention to Action Prediction: A Comprehensive Benchmark for Intention-driven End-to-End Autonomous Driving	Huan Zheng et.al.	2512.12302	translate	read	null
2025-12-12	DynaPURLS: Dynamic Refinement of Part-aware Representations for Skeleton-based Zero-Shot Action Recognition	Jingmin Zhu et.al.	2512.11941	translate	read	null
2025-12-05	Explainable Adversarial-Robust Vision-Language-Action Model for Robotic Manipulation	Ju-Young Kim et.al.	2512.11865	translate	read	null
2025-12-12	TSkel-Mamba: Temporal Dynamic Modeling via State Space Model for Human Skeleton-based Action Recognition	Yanan Liu et.al.	2512.11503	translate	read	null
2025-12-12	Boosting Skeleton-based Zero-Shot Action Recognition with Training-Free Test-Time Adaptation	Jingmin Zhu et.al.	2512.11458	translate	read	null
2025-12-12	Task-Specific Distance Correlation Matching for Few-Shot Action Recognition	Fei Long et.al.	2512.11340	translate	read	null
2025-12-12	Breast-Rehab: A Postoperative Breast Cancer Rehabilitation Training Assessment System Based on Human Action Recognition	Zikang Chen et.al.	2512.11245	translate	read	null
2025-12-12	Multi-task Learning with Extended Temporal Shift Module for Temporal Action Localization	Anh-Kiet Duong et.al.	2512.11189	translate	read	null
2025-12-11	Deep Photonic Reservoir Computing with On-chip Nonlinearity	Jinlong Xiang et.al.	2512.10626	translate	read	null
2025-12-11	Lang2Motion: Bridging Language and Motion through Joint Embedding Spaces	Bishoy Galoaa et.al.	2512.10617	translate	read	null
2025-12-11	Lies We Can Trust: Quantifying Action Uncertainty with Inaccurate Stochastic Dynamics through Conformalized Nonholonomic Lie Groups	Luís Marques et.al.	2512.10294	translate	read	null
2025-12-10	GLaD: Geometric Latent Distillation for Vision-Language-Action Models	Minghao Guo et.al.	2512.09619	translate	read	null
2025-12-09	Neural Ordinary Differential Equations for Simulating Metabolic Pathway Dynamics from Time-Series Multiomics Data	Udesh Habaraduwa et.al.	2512.08732	translate	read	null
2025-12-09	Aerial Vision-Language Navigation with a Unified Framework for Spatial, Temporal and Embodied Reasoning	Huilin Xu et.al.	2512.08639	translate	read	null
2025-12-09	Mind to Hand: Purposeful Robotic Control via Embodied Reasoning	Peijun Tang et.al.	2512.08580	translate	read	null
2025-12-08	A Comparative Study of EMG- and IMU-based Gesture Recognition at the Wrist and Forearm	Soroush Baghernezhad et.al.	2512.07997	translate	read	null
2025-12-08	Improving action classification with brain-inspired deep networks	Aidas Aglinskas et.al.	2512.07729	translate	read	null
2025-12-08	A Large-Scale Multimodal Dataset and Benchmarks for Human Activity Scene Understanding and Reasoning	Siyang Jiang et.al.	2512.07136	translate	read	null
2025-12-07	VideoVLA: Video Generators Can Be Generalizable Robot Manipulators	Yichao Shen et.al.	2512.06963	translate	read	null
2025-12-04	Towards Adaptive Fusion of Multimodal Deep Networks for Human Action Recognition	Novanto Yudistira et.al.	2512.04943	translate	read	null
2025-12-04	CIG-MAE: Cross-Modal Information-Guided Masked Autoencoder for Self-Supervised WiFi Sensing	Gang Liu et.al.	2512.04723	translate	read	null
2025-12-04	WiFi-based Cross-Domain Gesture Recognition Using Attention Mechanism	Ruijing Liu et.al.	2512.04521	translate	read	null
2025-12-03	Heatmap Pooling Network for Action Recognition from RGB Videos	Mengyuan Liu et.al.	2512.03837	translate	read	null
2025-12-02	SAM2Grasp: Resolve Multi-modal Grasping via Prompt-conditioned Temporal Action Prediction	Shengkai Wu et.al.	2512.02609	translate	read	null
2025-12-01	TBT-Former: Learning Temporal Boundary Distributions for Action Localization	Thisara Rathnayaka et.al.	2512.01298	translate	read	null

(<a href=../Action_Recognition.md>back to Action Recognition</a>)