Action Recognition - 2025-07
Action Recognition - 2025-07
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2025-07-22 | Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition | Zefeng Qian et.al. | 2507.16287 | translate | read | null |
| 2025-07-22 | SPACT18: Spiking Human Action Recognition Benchmark Dataset with Complementary RGB and Thermal Modalities | Yasser Ashraf et.al. | 2507.16151 | translate | read | null |
| 2025-07-20 | Light Future: Multimodal Action Frame Prediction via InstructPix2Pix | Zesen Zhong et.al. | 2507.14809 | translate | read | null |
| 2025-07-17 | A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains | Antonio Finocchiaro et.al. | 2507.13326 | translate | read | null |
| 2025-07-17 | Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities | Liuyi Wang et.al. | 2507.13019 | translate | read | null |
| 2025-07-17 | Generalist Bimanual Manipulation via Foundation Video Diffusion Models | Yao Feng et.al. | 2507.12898 | translate | read | null |
| 2025-07-16 | Predicting Soccer Penalty Kick Direction Using Human Action Recognition | David Freire-Obregón et.al. | 2507.12617 | translate | read | null |
| 2025-07-18 | DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition | Hayat Ullah et.al. | 2507.12426 | translate | read | null |
| 2025-07-16 | Calisthenics Skills Temporal Video Segmentation | Antonio Finocchiaro et.al. | 2507.12245 | translate | read | null |
| 2025-07-15 | Diffusion-Based Imaginative Coordination for Bimanual Manipulation | Huilin Xu et.al. | 2507.11296 | translate | read | null |
| 2025-07-15 | Women Sport Actions Dataset for Visual Classification Using Small Scale Training Data | Palash Ray et.al. | 2507.10969 | translate | read | null |
| 2025-07-14 | Hand Gesture Recognition for Collaborative Robots Using Lightweight Deep Learning in Real-Time Robotic Systems | Muhtadin et.al. | 2507.10055 | translate | read | null |
| 2025-07-13 | Online Micro-gesture Recognition Using Data Augmentation and Spatial-Temporal Attention | Pengyu Liu et.al. | 2507.09512 | translate | read | null |
| 2025-07-11 | MM-Gesture: Towards Precise Micro-Gesture Recognition through Multimodal Fusion | Jihao Gu et.al. | 2507.08344 | translate | read | null |
| 2025-07-10 | Multimodal Framework for Explainable Autonomous Driving: Integrating Video, Sensor, and Textual Data for Enhanced Decision-Making and Transparency | Abolfazl Zarghani et.al. | 2507.07938 | translate | read | null |
| 2025-07-10 | EEvAct: Early Event-Based Action Recognition with High-Rate Two-Stream Spiking Neural Networks | Michael Neumeier et.al. | 2507.07734 | translate | read | null |
| 2025-07-09 | Cross-Modal Dual-Causal Learning for Long-Term Action Recognition | Xu Shaowu et.al. | 2507.06603 | translate | read | null |
| 2025-07-08 | Hierarchical Multi-Stage Transformer Architecture for Context-Aware Temporal Action Localization | Hayat Ullah et.al. | 2507.06411 | translate | read | null |
| 2025-07-10 | VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting | Juyi Lin et.al. | 2507.05116 | translate | read | link |
| 2025-07-07 | HV-MMBench: Benchmarking MLLMs for Human-Centric Video Understanding | Yuxuan Cai et.al. | 2507.04909 | translate | read | null |
| 2025-07-06 | Visual Hand Gesture Recognition with Deep Learning: A Comprehensive Review of Methods, Datasets, Challenges and Future Research Directions | Konstantinos Foteinos et.al. | 2507.04465 | translate | read | null |
| 2025-07-06 | DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge | Wenyao Zhang et.al. | 2507.04447 | translate | read | link |
| 2025-07-04 | Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos | Yufan Zhou et.al. | 2507.03393 | translate | read | link |
| 2025-07-05 | AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation | Sixiang Chen et.al. | 2507.01961 | translate | read | null |
| 2025-07-02 | Variational Graph Convolutional Neural Networks | Illia Oleksiienko et.al. | 2507.01699 | translate | read | null |
| 2025-07-01 | Zero-shot Skeleton-based Action Recognition with Prototype-guided Feature Alignment | Kai Zhou et.al. | 2507.00566 | translate | read | null |
(<a href=../Action_Recognition.md>back to Action Recognition</a>)