Action Recognition - 2024-04

Publish Date Title Authors PDF Translate Read Code
2024-04-30 One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features Trung Thanh Nguyen et.al. 2404.19542 translate read link
2024-04-30 Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition Zhendong Liu et.al. 2404.19383 translate read null
2024-04-28 Enhancing Action Recognition from Low-Quality Skeleton Data via Part-Level Knowledge Distillation Cuiwei Liu et.al. 2404.18206 translate read null
2024-04-26 SDFD: Building a Versatile Synthetic Face Image Dataset with Diverse Attributes Georgia Baltsou et.al. 2404.17255 translate read null
2024-04-25 Learning Discriminative Spatio-temporal Representations for Semi-supervised Action Recognition Yu Wang et.al. 2404.16416 translate read null
2024-04-25 An Improved Graph Pooling Network for Skeleton-Based Action Recognition Cong Wu et.al. 2404.16359 translate read null
2024-04-24 Unimodal and Multimodal Sensor Fusion for Wearable Activity Recognition Hymalai Bello et.al. 2404.16005 translate read null
2024-04-24 3D Face Morphing Attack Generation using Non-Rigid Registration Jag Mohan Singh et.al. 2404.15765 translate read null
2024-04-25 HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition Jinfu Liu et.al. 2404.15719 translate read link
2024-04-23 Combating Missing Modalities in Egocentric Videos at Test Time Merey Ramazanova et.al. 2404.15161 translate read null
2024-04-23 G3R: Generating Rich and Fine-grained mmWave Radar Data from 2D Videos for Generalized Gesture Recognition Kaikai Deng et.al. 2404.14934 translate read null
2024-04-23 Driver Activity Classification Using Generalizable Representations from Vision-Language Models Ross Greer et.al. 2404.14906 translate read null
2024-04-23 DENOISER: Rethinking the Robustness for Open-Vocabulary Action Recognition Haozhe Cheng et.al. 2404.14890 translate read null
2024-04-22 1st Place Solution to the 1st SkatingVerse Challenge Tao Sun et.al. 2404.14032 translate read null
2024-04-22 CoFInAl: Enhancing Action Quality Assessment with Coarse-to-Fine Instruction Alignment Kanglei Zhou et.al. 2404.13999 translate read link
2024-04-21 Attack on Scene Flow using Point Clouds Haniyeh Ehsani Oskouie et.al. 2404.13621 translate read null
2024-04-20 STAT: Towards Generalizable Temporal Action Localization Yangcen Liu et.al. 2404.13311 translate read null
2024-04-19 Ring-a-Pose: A Ring for Continuous Hand Pose Tracking Tianhong Catherine Yu et.al. 2404.12980 translate read null
2024-04-19 VoxAtnNet: A 3D Point Clouds Convolutional Neural Network for Generalizable Face Presentation Attack Detection Raghavendra Ramachandra et.al. 2404.12680 translate read null
2024-04-18 DeepLocalization: Using change point detection for Temporal Action Localization Mohammed Shaiqur Rahman et.al. 2404.12258 translate read null
2024-04-18 Aligning Actions and Walking to LLM-Generated Textual Descriptions Radu Chivereanu et.al. 2404.12192 translate read link
2024-04-18 Simultaneous Detection and Interaction Reasoning for Object-Centric Action Recognition Xunsong Li et.al. 2404.11903 translate read null
2024-04-18 sEMG-based Fine-grained Gesture Recognition via Improved LightGBM Model Xiupeng Qiao et.al. 2404.11861 translate read null
2024-04-17 VG4D: Vision-Language Model Goes 4D Video Recognition Zhichao Deng et.al. 2404.11605 translate read link
2024-04-17 A Data-Driven Representation for Sign Language Production Harry Walsh et.al. 2404.11499 translate read link
2024-04-17 Lower Limb Movements Recognition Based on Feature Recursive Elimination and Backpropagation Neural Network Yongkai Ma et.al. 2404.11383 translate read null
2024-04-17 Revisiting Noise Resilience Strategies in Gesture Recognition: Short-Term Enhancement in Surface Electromyographic Signal Analysis Weiyu Guo et.al. 2404.11213 translate read null
2024-04-17 Kathakali Hand Gesture Recognition With Minimal Data Kavitha Raju et.al. 2404.11205 translate read null
2024-04-16 HumMUSS: Human Motion Understanding using State Space Models Arnab Kumar Mondal et.al. 2404.10880 translate read null
2024-04-17 Learning to Score Sign Language with Two-stage Method Hongli Wen et.al. 2404.10383 translate read null
2024-04-16 MK-SGN: A Spiking Graph Convolutional Network with Multimodal Fusion and Knowledge Distillation for Skeleton-based Action Recognition Naichuan Zheng et.al. 2404.10210 translate read null
2024-04-15 Design and Analysis of Efficient Attention in Transformers for Social Group Activity Recognition Masato Tamura et.al. 2404.09964 translate read null
2024-04-15 A Diffusion-based Data Generator for Training Object Recognition Models in Ultra-Range Distance Eran Bamani et.al. 2404.09846 translate read null
2024-04-15 Leveraging Temporal Contextualization for Video Action Recognition Minji Kim et.al. 2404.09490 translate read link
2024-04-14 In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition Wiktor Mucha et.al. 2404.09308 translate read null
2024-04-13 Exploring Explainability in Video Action Recognition Avinab Saha et.al. 2404.09067 translate read null
2024-04-12 MSSTNet: A Multi-Scale Spatio-Temporal CNN-Transformer Network for Dynamic Facial Expression Recognition Linhuang Wang et.al. 2404.08433 translate read null
2024-04-11 Graph Integrated Language Transformers for Next Action Prediction in Complex Phone Calls Amin Hosseiny Marani et.al. 2404.08155 translate read null
2024-04-11 Simba: Mamba augmented U-ShiftGCN for Skeletal Action Recognition in Videos Soumyabrata Chaudhuri et.al. 2404.07645 translate read null
2024-04-15 Fine-Grained Side Information Guided Dual-Prompts for Zero-Shot Skeleton Action Recognition Yang Chen et.al. 2404.07487 translate read null
2024-04-10 O-TALC: Steps Towards Combating Oversegmentation within Online Action Segmentation Matthew Kent Myers et.al. 2404.06894 translate read null
2024-04-10 An Animation-based Augmentation Approach for Action Recognition from Discontinuous Video Xingyu Song et.al. 2404.06741 translate read null
2024-04-07 X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Model Jan Held et.al. 2404.06332 translate read null
2024-04-10 Algorithms for Caching and MTS with reduced number of predictions Karim Abdel Sadek et.al. 2404.06280 translate read null
2024-04-09 ActNetFormer: Transformer-ResNet Hybrid Method for Semi-Supervised Action Recognition in Videos Sharana Dharshikgan Suresh Dass et.al. 2404.06243 translate read link
2024-04-08 Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder Halil Ismail Helvaci et.al. 2404.05849 translate read null
2024-04-09 TIM: A Time Interval Machine for Audio-Visual Action Recognition Jacob Chalk et.al. 2404.05559 translate read link
2024-04-11 Test-Time Zero-Shot Temporal Action Localization Benedetta Liberatori et.al. 2404.05426 translate read link
2024-04-09 SDFR: Synthetic Data for Face Recognition Competition Hatef Otroshi Shahreza et.al. 2404.04580 translate read null
2024-04-05 PhysPT: Physics-aware Pretrained Transformer for Estimating Human Dynamics from Monocular Videos Yufei Zhang et.al. 2404.04430 translate read null
2024-04-05 Koala: Key frame-conditioned long video-LLM Reuben Tan et.al. 2404.04346 translate read null
2024-04-04 UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization Tiantian Geng et.al. 2404.03179 translate read null
2024-04-03 Optimizing the Deployment of Tiny Transformers on Low-Power MCUs Victor J. B. Jung et.al. 2404.02945 translate read link
2024-04-03 Multi-Scale Spatial-Temporal Self-Attention Graph Convolutional Networks for Skeleton-based Action Recognition Ikuo Nakamura et.al. 2404.02624 translate read null
2024-04-02 PREGO: online mistake detection in PRocedural EGOcentric videos Alessandro Flaborea et.al. 2404.01933 translate read link
2024-04-02 Disentangled Pre-training for Human-Object Interaction Detection Zhuolong Li et.al. 2404.01725 translate read link
2024-04-02 Language Model Guided Interpretable Video Action Reasoning Ning Wang et.al. 2404.01591 translate read null
2024-04-02 Leveraging YOLO-World and GPT-4V LMMs for Zero-Shot Person Detection and Action Recognition in Drone Imagery Christian Limberg et.al. 2404.01571 translate read null
2024-04-01 LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization Akshita Gupta et.al. 2404.01282 translate read null

(<a href=../Action_Recognition.md>back to Action Recognition</a>)