Action Recognition - 2026-01

Publish Date Title Authors PDF Translate Read Code
2026-01-30 MapDream: Task-Driven Map Learning for Vision-Language Navigation Guoxin Lian et.al. 2602.00222 translate read null
2026-01-30 Leveraging Convolutional Sparse Autoencoders for Robust Movement Classification from Low-Density sEMG Blagoj Hristov et.al. 2601.23011 translate read null
2026-01-30 μTouch: Enabling Accurate, Lightweight Self-Touch Sensing with Passive Magnets Siyuan Wang et.al. 2601.22864 translate read null
2026-01-30 Fire on Motion: Optimizing Video Pass-bands for Efficient Spiking Action Recognition Shuhan Ye et.al. 2601.22675 translate read null
2026-01-29 Causal World Modeling for Robot Control Lin Li et.al. 2601.21998 translate read null
2026-01-29 WheelArm-Sim: A Manipulation and Navigation Combined Multimodal Synthetic Data Generation Simulator for Unified Control in Assistive Robotics Guangping Liu et.al. 2601.21129 translate read null
2026-01-28 Towards Mitigating Modality Bias in Vision-Language Models for Temporal Action Localization Jiaqi Li et.al. 2601.21078 translate read null
2026-01-23 Affinity Contrastive Learning for Skeleton-based Human Activity Understanding Hongda Liu et.al. 2601.16694 translate read null
2026-01-23 Low-Power On-Device Gesture Recognition with Einsum Networks Sahar Golipoor et.al. 2601.16662 translate read null
2026-01-22 Angle of Arrival Estimation for Gesture Recognition from reflective body-worn tags Sahar Golipoor et.al. 2601.16303 translate read null
2026-01-22 Gesture Recognition from body-Worn RFID under Missing Data Sahar Golipoor et.al. 2601.16301 translate read null
2026-01-22 GameTalk: Training LLMs for Strategic Conversation Victor Conchello Vendrell et.al. 2601.16276 translate read null
2026-01-22 Why Can’t I Open My Drawer? Mitigating Object-Driven Shortcuts in Zero-Shot Compositional Action Recognition Geo Ahn et.al. 2601.16211 translate read null
2026-01-22 PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation Onkar Susladkar et.al. 2601.16210 translate read null
2026-01-22 Decoupling Return-to-Go for Efficient Decision Transformer Yongyi Wang et.al. 2601.15953 translate read null
2026-01-20 Curriculum-Based Strategies for Efficient Cross-Domain Action Recognition Emily Kim et.al. 2601.14101 translate read null
2026-01-20 Two-Stream temporal transformer for video action classification Nattapong Kurpukdee et.al. 2601.14086 translate read null
2026-01-20 Unsupervised Video Class-Incremental Learning via Deep Embedded Clustering Management Nattapong Kurpukdee et.al. 2601.14069 translate read null
2026-01-20 Variational Dual-path Attention Network for CSI-Based Gesture Recognition N. Zhang et.al. 2601.13745 translate read null
2026-01-20 GeoDynamics: A Geometric State-Space Neural Network for Understanding Brain Dynamics on Riemannian Manifolds Tingting Dan et.al. 2601.13570 translate read null
2026-01-19 Dynamic Hand Gesture Recognition for Robot Manipulator Tasks Dharmendra Sharma et.al. 2601.12918 translate read null
2026-01-15 Effects of Different Attention Mechanisms Applied on 3D Models in Video Classification Mohammad Rasras et.al. 2601.10854 translate read null
2026-01-15 Can Vision-Language Models Understand Construction Workers? An Exploratory Study Hieu Bui et.al. 2601.10835 translate read null
2026-01-11 Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration Sen Wang et.al. 2601.10744 translate read null
2026-01-06 Millimeter-Wave Gesture Recognition in ISAC: Does Reducing Sensing Airtime Hamper Accuracy? Jakob Struye et.al. 2601.10733 translate read null
2026-01-15 Action100M: A Large-scale Video Action Dataset Delong Chen et.al. 2601.10592 translate read link
2026-01-15 BikeActions: An Open Platform and Benchmark for Cyclist-Centric VRU Action Recognition Max A. Buettner et.al. 2601.10521 translate read null
2026-01-14 COMPOSE: Hypergraph Cover Optimization for Multi-view 3D Human Pose Estimation Tony Danjun Wang et.al. 2601.09698 translate read null
2026-01-13 ActiveVLA: Injecting Active Perception into Vision-Language-Action Models for Precise 3D Robotic Manipulation Zhenyang Liu et.al. 2601.08325 translate read null
2026-01-13 VGG Induced Deep Hand Sign Language Detection Subham Sharma et.al. 2601.08262 translate read null
2026-01-12 Video Generation Models in Robotics – Applications, Research Challenges, Future Directions Zhiting Mei et.al. 2601.07823 translate read null
2026-01-12 Variational Contrastive Learning for Skeleton-based Action Recognition Dang Dinh Nguyen et.al. 2601.07666 translate read null
2026-01-12 Motion Focus Recognition in Fast-Moving Egocentric Video Daniel Hong et.al. 2601.07154 translate read null
2026-01-10 Context Matters: Peer-Aware Student Behavioral Engagement Measurement via VLM Action Parsing and LLM Sequence Classification Ahmed Abdelkawy et.al. 2601.06394 translate read null
2026-01-09 LatentVLA: Efficient Vision-Language Models for Autonomous Driving via Latent Action Prediction Chengen Xie et.al. 2601.05611 translate read null
2026-01-08 When to Act: Calibrated Confidence for Reliable Human Intention Prediction in Assistive Robotics Johannes A. Gaus et.al. 2601.04982 translate read null
2026-01-08 CounterVid: Counterfactual Video Generation for Mitigating Action and Temporal Hallucinations in Video-Language Models Tobia Poppi et.al. 2601.04778 translate read null
2026-01-07 Lightweight Test-Time Adaptation for EMG-Based Gesture Recognition Nia Touko et.al. 2601.04181 translate read null
2026-01-07 Beyond Physical Labels: Redefining Domains for Robust WiFi-based Gesture Recognition Xiang Zhang et.al. 2601.03825 translate read null
2026-01-07 TRec: Learning Hand-Object Interactions through 2D Point Track Motion Dennis Holzmann et.al. 2601.03667 translate read null
2026-01-04 Action-Sketcher: From Reasoning to Action via Visual Sketches for Long-Horizon Robotic Manipulation Huajie Tan et.al. 2601.01618 translate read null
2026-01-01 BHaRNet: Reliability-Aware Body-Hand Modality Expertized Networks for Fine-grained Skeleton Action Recognition Seungyeon Cho et.al. 2601.00369 translate read null
2026-01-01 Effects of Limited Field of View on Musical Collaboration Experience with Avatars in Extended Reality Suibi Che-Chuan Weng et.al. 2601.00333 translate read null

(<a href=../Action_Recognition.md>back to Action Recognition</a>)