Action Recognition - 2026-01
Action Recognition - 2026-01
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2026-01-30 | MapDream: Task-Driven Map Learning for Vision-Language Navigation | Guoxin Lian et.al. | 2602.00222 | translate | read | null |
| 2026-01-30 | Leveraging Convolutional Sparse Autoencoders for Robust Movement Classification from Low-Density sEMG | Blagoj Hristov et.al. | 2601.23011 | translate | read | null |
| 2026-01-30 | μTouch: Enabling Accurate, Lightweight Self-Touch Sensing with Passive Magnets | Siyuan Wang et.al. | 2601.22864 | translate | read | null |
| 2026-01-30 | Fire on Motion: Optimizing Video Pass-bands for Efficient Spiking Action Recognition | Shuhan Ye et.al. | 2601.22675 | translate | read | null |
| 2026-01-29 | Causal World Modeling for Robot Control | Lin Li et.al. | 2601.21998 | translate | read | null |
| 2026-01-29 | WheelArm-Sim: A Manipulation and Navigation Combined Multimodal Synthetic Data Generation Simulator for Unified Control in Assistive Robotics | Guangping Liu et.al. | 2601.21129 | translate | read | null |
| 2026-01-28 | Towards Mitigating Modality Bias in Vision-Language Models for Temporal Action Localization | Jiaqi Li et.al. | 2601.21078 | translate | read | null |
| 2026-01-23 | Affinity Contrastive Learning for Skeleton-based Human Activity Understanding | Hongda Liu et.al. | 2601.16694 | translate | read | null |
| 2026-01-23 | Low-Power On-Device Gesture Recognition with Einsum Networks | Sahar Golipoor et.al. | 2601.16662 | translate | read | null |
| 2026-01-22 | Angle of Arrival Estimation for Gesture Recognition from reflective body-worn tags | Sahar Golipoor et.al. | 2601.16303 | translate | read | null |
| 2026-01-22 | Gesture Recognition from body-Worn RFID under Missing Data | Sahar Golipoor et.al. | 2601.16301 | translate | read | null |
| 2026-01-22 | GameTalk: Training LLMs for Strategic Conversation | Victor Conchello Vendrell et.al. | 2601.16276 | translate | read | null |
| 2026-01-22 | Why Can’t I Open My Drawer? Mitigating Object-Driven Shortcuts in Zero-Shot Compositional Action Recognition | Geo Ahn et.al. | 2601.16211 | translate | read | null |
| 2026-01-22 | PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation | Onkar Susladkar et.al. | 2601.16210 | translate | read | null |
| 2026-01-22 | Decoupling Return-to-Go for Efficient Decision Transformer | Yongyi Wang et.al. | 2601.15953 | translate | read | null |
| 2026-01-20 | Curriculum-Based Strategies for Efficient Cross-Domain Action Recognition | Emily Kim et.al. | 2601.14101 | translate | read | null |
| 2026-01-20 | Two-Stream temporal transformer for video action classification | Nattapong Kurpukdee et.al. | 2601.14086 | translate | read | null |
| 2026-01-20 | Unsupervised Video Class-Incremental Learning via Deep Embedded Clustering Management | Nattapong Kurpukdee et.al. | 2601.14069 | translate | read | null |
| 2026-01-20 | Variational Dual-path Attention Network for CSI-Based Gesture Recognition | N. Zhang et.al. | 2601.13745 | translate | read | null |
| 2026-01-20 | GeoDynamics: A Geometric State-Space Neural Network for Understanding Brain Dynamics on Riemannian Manifolds | Tingting Dan et.al. | 2601.13570 | translate | read | null |
| 2026-01-19 | Dynamic Hand Gesture Recognition for Robot Manipulator Tasks | Dharmendra Sharma et.al. | 2601.12918 | translate | read | null |
| 2026-01-15 | Effects of Different Attention Mechanisms Applied on 3D Models in Video Classification | Mohammad Rasras et.al. | 2601.10854 | translate | read | null |
| 2026-01-15 | Can Vision-Language Models Understand Construction Workers? An Exploratory Study | Hieu Bui et.al. | 2601.10835 | translate | read | null |
| 2026-01-11 | Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration | Sen Wang et.al. | 2601.10744 | translate | read | null |
| 2026-01-06 | Millimeter-Wave Gesture Recognition in ISAC: Does Reducing Sensing Airtime Hamper Accuracy? | Jakob Struye et.al. | 2601.10733 | translate | read | null |
| 2026-01-15 | Action100M: A Large-scale Video Action Dataset | Delong Chen et.al. | 2601.10592 | translate | read | link |
| 2026-01-15 | BikeActions: An Open Platform and Benchmark for Cyclist-Centric VRU Action Recognition | Max A. Buettner et.al. | 2601.10521 | translate | read | null |
| 2026-01-14 | COMPOSE: Hypergraph Cover Optimization for Multi-view 3D Human Pose Estimation | Tony Danjun Wang et.al. | 2601.09698 | translate | read | null |
| 2026-01-13 | ActiveVLA: Injecting Active Perception into Vision-Language-Action Models for Precise 3D Robotic Manipulation | Zhenyang Liu et.al. | 2601.08325 | translate | read | null |
| 2026-01-13 | VGG Induced Deep Hand Sign Language Detection | Subham Sharma et.al. | 2601.08262 | translate | read | null |
| 2026-01-12 | Video Generation Models in Robotics – Applications, Research Challenges, Future Directions | Zhiting Mei et.al. | 2601.07823 | translate | read | null |
| 2026-01-12 | Variational Contrastive Learning for Skeleton-based Action Recognition | Dang Dinh Nguyen et.al. | 2601.07666 | translate | read | null |
| 2026-01-12 | Motion Focus Recognition in Fast-Moving Egocentric Video | Daniel Hong et.al. | 2601.07154 | translate | read | null |
| 2026-01-10 | Context Matters: Peer-Aware Student Behavioral Engagement Measurement via VLM Action Parsing and LLM Sequence Classification | Ahmed Abdelkawy et.al. | 2601.06394 | translate | read | null |
| 2026-01-09 | LatentVLA: Efficient Vision-Language Models for Autonomous Driving via Latent Action Prediction | Chengen Xie et.al. | 2601.05611 | translate | read | null |
| 2026-01-08 | When to Act: Calibrated Confidence for Reliable Human Intention Prediction in Assistive Robotics | Johannes A. Gaus et.al. | 2601.04982 | translate | read | null |
| 2026-01-08 | CounterVid: Counterfactual Video Generation for Mitigating Action and Temporal Hallucinations in Video-Language Models | Tobia Poppi et.al. | 2601.04778 | translate | read | null |
| 2026-01-07 | Lightweight Test-Time Adaptation for EMG-Based Gesture Recognition | Nia Touko et.al. | 2601.04181 | translate | read | null |
| 2026-01-07 | Beyond Physical Labels: Redefining Domains for Robust WiFi-based Gesture Recognition | Xiang Zhang et.al. | 2601.03825 | translate | read | null |
| 2026-01-07 | TRec: Learning Hand-Object Interactions through 2D Point Track Motion | Dennis Holzmann et.al. | 2601.03667 | translate | read | null |
| 2026-01-04 | Action-Sketcher: From Reasoning to Action via Visual Sketches for Long-Horizon Robotic Manipulation | Huajie Tan et.al. | 2601.01618 | translate | read | null |
| 2026-01-01 | BHaRNet: Reliability-Aware Body-Hand Modality Expertized Networks for Fine-grained Skeleton Action Recognition | Seungyeon Cho et.al. | 2601.00369 | translate | read | null |
| 2026-01-01 | Effects of Limited Field of View on Musical Collaboration Experience with Avatars in Extended Reality | Suibi Che-Chuan Weng et.al. | 2601.00333 | translate | read | null |
(<a href=../Action_Recognition.md>back to Action Recognition</a>)