Action Recognition - 2026-01 | Paper Arxiv Daily

Publish Date	Title	Authors	PDF	Code
2026-01-30	MapDream: Task-Driven Map Learning for Vision-Language Navigation	Guoxin Lian et al.	2602.00222	null
2026-01-30	Leveraging Convolutional Sparse Autoencoders for Robust Movement Classification from Low-Density sEMG	Blagoj Hristov et al.	2601.23011	null
2026-01-30	μTouch: Enabling Accurate, Lightweight Self-Touch Sensing with Passive Magnets	Siyuan Wang et al.	2601.22864	null
2026-01-30	Fire on Motion: Optimizing Video Pass-bands for Efficient Spiking Action Recognition	Shuhan Ye et al.	2601.22675	null
2026-01-29	Causal World Modeling for Robot Control	Lin Li et al.	2601.21998	null
2026-01-29	WheelArm-Sim: A Manipulation and Navigation Combined Multimodal Synthetic Data Generation Simulator for Unified Control in Assistive Robotics	Guangping Liu et al.	2601.21129	null
2026-01-28	Towards Mitigating Modality Bias in Vision-Language Models for Temporal Action Localization	Jiaqi Li et al.	2601.21078	null
2026-01-23	Affinity Contrastive Learning for Skeleton-based Human Activity Understanding	Hongda Liu et al.	2601.16694	null
2026-01-23	Low-Power On-Device Gesture Recognition with Einsum Networks	Sahar Golipoor et al.	2601.16662	null
2026-01-22	Angle of Arrival Estimation for Gesture Recognition from reflective body-worn tags	Sahar Golipoor et al.	2601.16303	null
2026-01-22	Gesture Recognition from body-Worn RFID under Missing Data	Sahar Golipoor et al.	2601.16301	null
2026-01-22	GameTalk: Training LLMs for Strategic Conversation	Victor Conchello Vendrell et al.	2601.16276	null
2026-01-22	Why Can’t I Open My Drawer? Mitigating Object-Driven Shortcuts in Zero-Shot Compositional Action Recognition	Geo Ahn et al.	2601.16211	null
2026-01-22	PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation	Onkar Susladkar et al.	2601.16210	null
2026-01-22	Decoupling Return-to-Go for Efficient Decision Transformer	Yongyi Wang et al.	2601.15953	null
2026-01-20	Curriculum-Based Strategies for Efficient Cross-Domain Action Recognition	Emily Kim et al.	2601.14101	null
2026-01-20	Two-Stream temporal transformer for video action classification	Nattapong Kurpukdee et al.	2601.14086	null
2026-01-20	Unsupervised Video Class-Incremental Learning via Deep Embedded Clustering Management	Nattapong Kurpukdee et al.	2601.14069	null
2026-01-20	Variational Dual-path Attention Network for CSI-Based Gesture Recognition	N. Zhang et al.	2601.13745	null
2026-01-20	GeoDynamics: A Geometric State-Space Neural Network for Understanding Brain Dynamics on Riemannian Manifolds	Tingting Dan et al.	2601.13570	null
2026-01-19	Dynamic Hand Gesture Recognition for Robot Manipulator Tasks	Dharmendra Sharma et al.	2601.12918	null
2026-01-15	Effects of Different Attention Mechanisms Applied on 3D Models in Video Classification	Mohammad Rasras et al.	2601.10854	null
2026-01-15	Can Vision-Language Models Understand Construction Workers? An Exploratory Study	Hieu Bui et al.	2601.10835	null
2026-01-11	Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration	Sen Wang et al.	2601.10744	null
2026-01-06	Millimeter-Wave Gesture Recognition in ISAC: Does Reducing Sensing Airtime Hamper Accuracy?	Jakob Struye et al.	2601.10733	null
2026-01-15	Action100M: A Large-scale Video Action Dataset	Delong Chen et al.	2601.10592	link
2026-01-15	BikeActions: An Open Platform and Benchmark for Cyclist-Centric VRU Action Recognition	Max A. Buettner et al.	2601.10521	null
2026-01-14	COMPOSE: Hypergraph Cover Optimization for Multi-view 3D Human Pose Estimation	Tony Danjun Wang et al.	2601.09698	null
2026-01-13	ActiveVLA: Injecting Active Perception into Vision-Language-Action Models for Precise 3D Robotic Manipulation	Zhenyang Liu et al.	2601.08325	null
2026-01-13	VGG Induced Deep Hand Sign Language Detection	Subham Sharma et al.	2601.08262	null
2026-01-12	Video Generation Models in Robotics – Applications, Research Challenges, Future Directions	Zhiting Mei et al.	2601.07823	null
2026-01-12	Variational Contrastive Learning for Skeleton-based Action Recognition	Dang Dinh Nguyen et al.	2601.07666	null
2026-01-12	Motion Focus Recognition in Fast-Moving Egocentric Video	Daniel Hong et al.	2601.07154	null
2026-01-10	Context Matters: Peer-Aware Student Behavioral Engagement Measurement via VLM Action Parsing and LLM Sequence Classification	Ahmed Abdelkawy et al.	2601.06394	null
2026-01-09	LatentVLA: Efficient Vision-Language Models for Autonomous Driving via Latent Action Prediction	Chengen Xie et al.	2601.05611	null
2026-01-08	When to Act: Calibrated Confidence for Reliable Human Intention Prediction in Assistive Robotics	Johannes A. Gaus et al.	2601.04982	null
2026-01-08	CounterVid: Counterfactual Video Generation for Mitigating Action and Temporal Hallucinations in Video-Language Models	Tobia Poppi et al.	2601.04778	null
2026-01-07	Lightweight Test-Time Adaptation for EMG-Based Gesture Recognition	Nia Touko et al.	2601.04181	null
2026-01-07	Beyond Physical Labels: Redefining Domains for Robust WiFi-based Gesture Recognition	Xiang Zhang et al.	2601.03825	null
2026-01-07	TRec: Learning Hand-Object Interactions through 2D Point Track Motion	Dennis Holzmann et al.	2601.03667	null
2026-01-04	Action-Sketcher: From Reasoning to Action via Visual Sketches for Long-Horizon Robotic Manipulation	Huajie Tan et al.	2601.01618	null
2026-01-01	BHaRNet: Reliability-Aware Body-Hand Modality Expertized Networks for Fine-grained Skeleton Action Recognition	Seungyeon Cho et al.	2601.00369	null
2026-01-01	Effects of Limited Field of View on Musical Collaboration Experience with Avatars in Extended Reality	Suibi Che-Chuan Weng et al.	2601.00333	null
