Action Recognition - 2024-11

| Publish Date | Title | Authors | PDF | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2024-11-29 | CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation | Qixiu Li et.al. | 2411.19650 | translate | read | null |
| 2024-11-29 | SkelMamba: A State Space Model for Efficient Skeleton Action Recognition of Neurological Disorders | Niki Martinel et.al. | 2411.19544 | translate | read | null |
| 2024-11-29 | Hierarchical Framework for Retrosynthesis Prediction with Enhanced Reaction Center Localization | Seongeun Yun et.al. | 2411.19503 | translate | read | null |
| 2024-11-28 | TAMT: Temporal-Aware Model Tuning for Cross-Domain Few-Shot Action Recognition | Yilong Wang et.al. | 2411.19041 | translate | read | null |
| 2024-11-28 | Revealing Key Details to See Differences: A Novel Prototypical Perspective for Skeleton-based Action Recognition | Hongda Liu et.al. | 2411.18941 | translate | read | link |
| 2024-11-27 | Robust Dynamic Gesture Recognition at Ultra-Long Distances | Eran Bamani Beeri et.al. | 2411.18413 | translate | read | null |
| 2024-11-27 | EventCrab: Harnessing Frame and Point Synergy for Event-based Action Recognition and Beyond | Meiqi Cao et.al. | 2411.18328 | translate | read | null |
| 2024-11-27 | An End-to-End Two-Stream Network Based on RGB Flow and Representation Flow for Human Action Recognition | Song-Jiang Lai et.al. | 2411.18002 | translate | read | null |
| 2024-11-26 | Pre-training for Action Recognition with Automatically Generated Fractal Datasets | Davyd Svyezhentsev et.al. | 2411.17584 | translate | read | link |
| 2024-11-26 | Real-Time Multimodal Signal Processing for HRI in RoboCup: Understanding a Human Referee | Filippo Ansalone et.al. | 2411.17347 | translate | read | null |
| 2024-11-22 | TSkips: Efficiency Through Explicit Temporal Delay Connections in Spiking Neural Networks | Prajna G. Malettira et.al. | 2411.16711 | translate | read | null |
| 2024-11-24 | OccludeNet: A Causal Journey into Mixed-View Actor-Centric Video Action Recognition under Occlusions | Guanyu Zhou et.al. | 2411.15729 | translate | read | link |
| 2024-11-23 | Machine Learning-based sEMG Signal Classification for Hand Gesture Recognition | Parshuram N. Aarotale et.al. | 2411.15655 | translate | read | null |
| 2024-11-23 | Optimizing Gesture Recognition for Seamless UI Interaction Using Convolutional Neural Networks | Qi Sun et.al. | 2411.15598 | translate | read | null |
| 2024-11-22 | When Spatial meets Temporal in Action Recognition | Huilin Chen et.al. | 2411.15284 | translate | read | null |
| 2024-11-22 | Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections | Youwei Zhou et.al. | 2411.14796 | translate | read | null |
| 2024-11-22 | Aim My Robot: Precision Local Navigation to Any Object | Xiangyun Meng et.al. | 2411.14770 | translate | read | null |
| 2024-11-21 | Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning | Jiange Yang et.al. | 2411.14519 | translate | read | null |
| 2024-11-18 | Enhancing Bidirectional Sign Language Communication: Integrating YOLOv8 and NLP for Real-Time Gesture Recognition & Translation | Hasnat Jamil Bhuiyan et.al. | 2411.13597 | translate | read | null |
| 2024-11-23 | AzSLD: Azerbaijani Sign Language Dataset for Fingerspelling, Word, and Sentence Translation with Baseline Software | Nigar Alishzade et.al. | 2411.12865 | translate | read | null |
| 2024-11-20 | Topological Symmetry Enhanced Graph Convolution for Skeleton-Based Action Recognition | Zeyu Liang et.al. | 2411.12560 | translate | read | link |
| 2024-11-19 | Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization | Quang Vinh Nguyen et.al. | 2411.12525 | translate | read | null |
| 2024-11-18 | Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition | Hanyu Guo et.al. | 2411.11335 | translate | read | null |
| 2024-11-18 | Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition | Yang Chen et.al. | 2411.11288 | translate | read | null |
| 2024-11-18 | Efficient Transfer Learning for Video-language Foundation Models | Haoxing Chen et.al. | 2411.11223 | translate | read | link |
| 2024-11-16 | TDSM:Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action Recognition | Jeonghyeok Do et.al. | 2411.10745 | translate | read | link |
| 2024-11-15 | KuaiFormer: Transformer-Based Retrieval at Kuaishou | Chi Liu et.al. | 2411.10057 | translate | read | null |
| 2024-11-14 | Towards Scalable Handwriting Communication via EEG Decoding and Latent Embedding Integration | Jun-Young Kim et.al. | 2411.09170 | translate | read | null |
| 2024-11-14 | VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation | Youpeng Wen et.al. | 2411.09153 | translate | read | null |
| 2024-11-13 | Can MLLMs Guide Weakly-Supervised Temporal Action Localization Tasks? | Quan Zhang et.al. | 2411.08466 | translate | read | null |
| 2024-11-13 | Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case Study | Jinbo Wen et.al. | 2411.08341 | translate | read | null |
| 2024-11-12 | LapGSR: Laplacian Reconstructive Network for Guided Thermal Super-Resolution | Aditya Kasliwal et.al. | 2411.07750 | translate | read | null |
| 2024-11-12 | OWLed: Outlier-weighed Layerwise Pruning for Efficient Autonomous Driving Framework | Jiaxi Li et.al. | 2411.07711 | translate | read | null |
| 2024-11-11 | ConvMixFormer- A Resource-efficient Convolution Mixer for Transformer-based Dynamic Hand Gesture Recognition | Mallika Garg et.al. | 2411.07118 | translate | read | link |
| 2024-11-10 | Extended multi-stream temporal-attention module for skeleton-based human action recognition (HAR) | Faisal Mehmood et.al. | 2411.06553 | translate | read | null |
| 2024-11-10 | SuperResolution Radar Gesture Recognitio | Netanel Blumenfeld et.al. | 2411.06410 | translate | read | null |
| 2024-11-08 | Video RWKV:Video Action Recognition Based RWKV | Zhuowen Yin et.al. | 2411.05636 | translate | read | null |
| 2024-11-06 | Object Recognition in Human Computer Interaction:- A Comparative Analysis | Kaushik Ranade et.al. | 2411.04263 | translate | read | null |
| 2024-11-06 | Explaining Human Activity Recognition with SHAP: Validating Insights with Perturbation and Quantitative Measures | Felix Tempel et.al. | 2411.03714 | translate | read | link |
| 2024-11-05 | One-Stage-TFS: Thai One-Stage Fingerspelling Dataset for Fingerspelling Recognition Frameworks | Siriwiwat Lata et.al. | 2411.02768 | translate | read | null |
| 2024-11-04 | TI-PREGO: Chain of Thought and In-Context Learning for Online Mistake Detection in PRocedural EGOcentric Videos | Leonardo Plini et.al. | 2411.02570 | translate | read | null |
| 2024-11-04 | AM Flow: Adapters for Temporal Processing in Action Recognition | Tanay Agrawal et.al. | 2411.02065 | translate | read | null |
| 2024-11-04 | ARN-LSTM: A Multi-Stream Attention-Based Model for Action Recognition with Temporal Dynamics | Chuanchuan Wang et.al. | 2411.01769 | translate | read | null |
| 2024-11-01 | STAA: Spatio-Temporal Attention Attribution for Real-Time Interpreting Transformer-based Video Models | Zerui Wang et.al. | 2411.00630 | translate | read | link |
| 2024-11-01 | Human Action Recognition (HAR) Using Skeleton-based Spatial Temporal Relative Transformer Network: ST-RTR | Faisal Mehmood et.al. | 2410.23806 | translate | read | null |
