Action Recognition - 2025-07

| Publish Date | Title | Authors | PDF | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2025-07-22 | Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition | Zefeng Qian et al. | 2507.16287 | translate | read | null |
| 2025-07-22 | SPACT18: Spiking Human Action Recognition Benchmark Dataset with Complementary RGB and Thermal Modalities | Yasser Ashraf et al. | 2507.16151 | translate | read | null |
| 2025-07-20 | Light Future: Multimodal Action Frame Prediction via InstructPix2Pix | Zesen Zhong et al. | 2507.14809 | translate | read | null |
| 2025-07-17 | A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains | Antonio Finocchiaro et al. | 2507.13326 | translate | read | null |
| 2025-07-17 | Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities | Liuyi Wang et al. | 2507.13019 | translate | read | null |
| 2025-07-17 | Generalist Bimanual Manipulation via Foundation Video Diffusion Models | Yao Feng et al. | 2507.12898 | translate | read | null |
| 2025-07-16 | Predicting Soccer Penalty Kick Direction Using Human Action Recognition | David Freire-Obregón et al. | 2507.12617 | translate | read | null |
| 2025-07-18 | DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition | Hayat Ullah et al. | 2507.12426 | translate | read | null |
| 2025-07-16 | Calisthenics Skills Temporal Video Segmentation | Antonio Finocchiaro et al. | 2507.12245 | translate | read | null |
| 2025-07-15 | Diffusion-Based Imaginative Coordination for Bimanual Manipulation | Huilin Xu et al. | 2507.11296 | translate | read | null |
| 2025-07-15 | Women Sport Actions Dataset for Visual Classification Using Small Scale Training Data | Palash Ray et al. | 2507.10969 | translate | read | null |
| 2025-07-14 | Hand Gesture Recognition for Collaborative Robots Using Lightweight Deep Learning in Real-Time Robotic Systems | Muhtadin et al. | 2507.10055 | translate | read | null |
| 2025-07-13 | Online Micro-gesture Recognition Using Data Augmentation and Spatial-Temporal Attention | Pengyu Liu et al. | 2507.09512 | translate | read | null |
| 2025-07-11 | MM-Gesture: Towards Precise Micro-Gesture Recognition through Multimodal Fusion | Jihao Gu et al. | 2507.08344 | translate | read | null |
| 2025-07-10 | Multimodal Framework for Explainable Autonomous Driving: Integrating Video, Sensor, and Textual Data for Enhanced Decision-Making and Transparency | Abolfazl Zarghani et al. | 2507.07938 | translate | read | null |
| 2025-07-10 | EEvAct: Early Event-Based Action Recognition with High-Rate Two-Stream Spiking Neural Networks | Michael Neumeier et al. | 2507.07734 | translate | read | null |
| 2025-07-09 | Cross-Modal Dual-Causal Learning for Long-Term Action Recognition | Xu Shaowu et al. | 2507.06603 | translate | read | null |
| 2025-07-08 | Hierarchical Multi-Stage Transformer Architecture for Context-Aware Temporal Action Localization | Hayat Ullah et al. | 2507.06411 | translate | read | null |
| 2025-07-10 | VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting | Juyi Lin et al. | 2507.05116 | translate | read | link |
| 2025-07-07 | HV-MMBench: Benchmarking MLLMs for Human-Centric Video Understanding | Yuxuan Cai et al. | 2507.04909 | translate | read | null |
| 2025-07-06 | Visual Hand Gesture Recognition with Deep Learning: A Comprehensive Review of Methods, Datasets, Challenges and Future Research Directions | Konstantinos Foteinos et al. | 2507.04465 | translate | read | null |
| 2025-07-06 | DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge | Wenyao Zhang et al. | 2507.04447 | translate | read | link |
| 2025-07-04 | Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos | Yufan Zhou et al. | 2507.03393 | translate | read | link |
| 2025-07-05 | AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation | Sixiang Chen et al. | 2507.01961 | translate | read | null |
| 2025-07-02 | Variational Graph Convolutional Neural Networks | Illia Oleksiienko et al. | 2507.01699 | translate | read | null |
| 2025-07-01 | Zero-shot Skeleton-based Action Recognition with Prototype-guided Feature Alignment | Kai Zhou et al. | 2507.00566 | translate | read | null |

(<a href=../Action_Recognition.md>back to Action Recognition</a>)