Action Recognition - 2024-04
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2024-04-30 | One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features | Trung Thanh Nguyen et.al. | 2404.19542 | translate | read | link |
| 2024-04-30 | Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition | Zhendong Liu et.al. | 2404.19383 | translate | read | null |
| 2024-04-28 | Enhancing Action Recognition from Low-Quality Skeleton Data via Part-Level Knowledge Distillation | Cuiwei Liu et.al. | 2404.18206 | translate | read | null |
| 2024-04-26 | SDFD: Building a Versatile Synthetic Face Image Dataset with Diverse Attributes | Georgia Baltsou et.al. | 2404.17255 | translate | read | null |
| 2024-04-25 | Learning Discriminative Spatio-temporal Representations for Semi-supervised Action Recognition | Yu Wang et.al. | 2404.16416 | translate | read | null |
| 2024-04-25 | An Improved Graph Pooling Network for Skeleton-Based Action Recognition | Cong Wu et.al. | 2404.16359 | translate | read | null |
| 2024-04-24 | Unimodal and Multimodal Sensor Fusion for Wearable Activity Recognition | Hymalai Bello et.al. | 2404.16005 | translate | read | null |
| 2024-04-24 | 3D Face Morphing Attack Generation using Non-Rigid Registration | Jag Mohan Singh et.al. | 2404.15765 | translate | read | null |
| 2024-04-25 | HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition | Jinfu Liu et.al. | 2404.15719 | translate | read | link |
| 2024-04-23 | Combating Missing Modalities in Egocentric Videos at Test Time | Merey Ramazanova et.al. | 2404.15161 | translate | read | null |
| 2024-04-23 | G3R: Generating Rich and Fine-grained mmWave Radar Data from 2D Videos for Generalized Gesture Recognition | Kaikai Deng et.al. | 2404.14934 | translate | read | null |
| 2024-04-23 | Driver Activity Classification Using Generalizable Representations from Vision-Language Models | Ross Greer et.al. | 2404.14906 | translate | read | null |
| 2024-04-23 | DENOISER: Rethinking the Robustness for Open-Vocabulary Action Recognition | Haozhe Cheng et.al. | 2404.14890 | translate | read | null |
| 2024-04-22 | 1st Place Solution to the 1st SkatingVerse Challenge | Tao Sun et.al. | 2404.14032 | translate | read | null |
| 2024-04-22 | CoFInAl: Enhancing Action Quality Assessment with Coarse-to-Fine Instruction Alignment | Kanglei Zhou et.al. | 2404.13999 | translate | read | link |
| 2024-04-21 | Attack on Scene Flow using Point Clouds | Haniyeh Ehsani Oskouie et.al. | 2404.13621 | translate | read | null |
| 2024-04-20 | STAT: Towards Generalizable Temporal Action Localization | Yangcen Liu et.al. | 2404.13311 | translate | read | null |
| 2024-04-19 | Ring-a-Pose: A Ring for Continuous Hand Pose Tracking | Tianhong Catherine Yu et.al. | 2404.12980 | translate | read | null |
| 2024-04-19 | VoxAtnNet: A 3D Point Clouds Convolutional Neural Network for Generalizable Face Presentation Attack Detection | Raghavendra Ramachandra et.al. | 2404.12680 | translate | read | null |
| 2024-04-18 | DeepLocalization: Using change point detection for Temporal Action Localization | Mohammed Shaiqur Rahman et.al. | 2404.12258 | translate | read | null |
| 2024-04-18 | Aligning Actions and Walking to LLM-Generated Textual Descriptions | Radu Chivereanu et.al. | 2404.12192 | translate | read | link |
| 2024-04-18 | Simultaneous Detection and Interaction Reasoning for Object-Centric Action Recognition | Xunsong Li et.al. | 2404.11903 | translate | read | null |
| 2024-04-18 | sEMG-based Fine-grained Gesture Recognition via Improved LightGBM Model | Xiupeng Qiao et.al. | 2404.11861 | translate | read | null |
| 2024-04-17 | VG4D: Vision-Language Model Goes 4D Video Recognition | Zhichao Deng et.al. | 2404.11605 | translate | read | link |
| 2024-04-17 | A Data-Driven Representation for Sign Language Production | Harry Walsh et.al. | 2404.11499 | translate | read | link |
| 2024-04-17 | Lower Limb Movements Recognition Based on Feature Recursive Elimination and Backpropagation Neural Network | Yongkai Ma et.al. | 2404.11383 | translate | read | null |
| 2024-04-17 | Revisiting Noise Resilience Strategies in Gesture Recognition: Short-Term Enhancement in Surface Electromyographic Signal Analysis | Weiyu Guo et.al. | 2404.11213 | translate | read | null |
| 2024-04-17 | Kathakali Hand Gesture Recognition With Minimal Data | Kavitha Raju et.al. | 2404.11205 | translate | read | null |
| 2024-04-16 | HumMUSS: Human Motion Understanding using State Space Models | Arnab Kumar Mondal et.al. | 2404.10880 | translate | read | null |
| 2024-04-17 | Learning to Score Sign Language with Two-stage Method | Hongli Wen et.al. | 2404.10383 | translate | read | null |
| 2024-04-16 | MK-SGN: A Spiking Graph Convolutional Network with Multimodal Fusion and Knowledge Distillation for Skeleton-based Action Recognition | Naichuan Zheng et.al. | 2404.10210 | translate | read | null |
| 2024-04-15 | Design and Analysis of Efficient Attention in Transformers for Social Group Activity Recognition | Masato Tamura et.al. | 2404.09964 | translate | read | null |
| 2024-04-15 | A Diffusion-based Data Generator for Training Object Recognition Models in Ultra-Range Distance | Eran Bamani et.al. | 2404.09846 | translate | read | null |
| 2024-04-15 | Leveraging Temporal Contextualization for Video Action Recognition | Minji Kim et.al. | 2404.09490 | translate | read | link |
| 2024-04-14 | In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition | Wiktor Mucha et.al. | 2404.09308 | translate | read | null |
| 2024-04-13 | Exploring Explainability in Video Action Recognition | Avinab Saha et.al. | 2404.09067 | translate | read | null |
| 2024-04-12 | MSSTNet: A Multi-Scale Spatio-Temporal CNN-Transformer Network for Dynamic Facial Expression Recognition | Linhuang Wang et.al. | 2404.08433 | translate | read | null |
| 2024-04-11 | Graph Integrated Language Transformers for Next Action Prediction in Complex Phone Calls | Amin Hosseiny Marani et.al. | 2404.08155 | translate | read | null |
| 2024-04-11 | Simba: Mamba augmented U-ShiftGCN for Skeletal Action Recognition in Videos | Soumyabrata Chaudhuri et.al. | 2404.07645 | translate | read | null |
| 2024-04-15 | Fine-Grained Side Information Guided Dual-Prompts for Zero-Shot Skeleton Action Recognition | Yang Chen et.al. | 2404.07487 | translate | read | null |
| 2024-04-10 | O-TALC: Steps Towards Combating Oversegmentation within Online Action Segmentation | Matthew Kent Myers et.al. | 2404.06894 | translate | read | null |
| 2024-04-10 | An Animation-based Augmentation Approach for Action Recognition from Discontinuous Video | Xingyu Song et.al. | 2404.06741 | translate | read | null |
| 2024-04-07 | X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Model | Jan Held et.al. | 2404.06332 | translate | read | null |
| 2024-04-10 | Algorithms for Caching and MTS with reduced number of predictions | Karim Abdel Sadek et.al. | 2404.06280 | translate | read | null |
| 2024-04-09 | ActNetFormer: Transformer-ResNet Hybrid Method for Semi-Supervised Action Recognition in Videos | Sharana Dharshikgan Suresh Dass et.al. | 2404.06243 | translate | read | link |
| 2024-04-08 | Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder | Halil Ismail Helvaci et.al. | 2404.05849 | translate | read | null |
| 2024-04-09 | TIM: A Time Interval Machine for Audio-Visual Action Recognition | Jacob Chalk et.al. | 2404.05559 | translate | read | link |
| 2024-04-11 | Test-Time Zero-Shot Temporal Action Localization | Benedetta Liberatori et.al. | 2404.05426 | translate | read | link |
| 2024-04-09 | SDFR: Synthetic Data for Face Recognition Competition | Hatef Otroshi Shahreza et.al. | 2404.04580 | translate | read | null |
| 2024-04-05 | PhysPT: Physics-aware Pretrained Transformer for Estimating Human Dynamics from Monocular Videos | Yufei Zhang et.al. | 2404.04430 | translate | read | null |
| 2024-04-05 | Koala: Key frame-conditioned long video-LLM | Reuben Tan et.al. | 2404.04346 | translate | read | null |
| 2024-04-04 | UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization | Tiantian Geng et.al. | 2404.03179 | translate | read | null |
| 2024-04-03 | Optimizing the Deployment of Tiny Transformers on Low-Power MCUs | Victor J. B. Jung et.al. | 2404.02945 | translate | read | link |
| 2024-04-03 | Multi-Scale Spatial-Temporal Self-Attention Graph Convolutional Networks for Skeleton-based Action Recognition | Ikuo Nakamura et.al. | 2404.02624 | translate | read | null |
| 2024-04-02 | PREGO: online mistake detection in PRocedural EGOcentric videos | Alessandro Flaborea et.al. | 2404.01933 | translate | read | link |
| 2024-04-02 | Disentangled Pre-training for Human-Object Interaction Detection | Zhuolong Li et.al. | 2404.01725 | translate | read | link |
| 2024-04-02 | Language Model Guided Interpretable Video Action Reasoning | Ning Wang et.al. | 2404.01591 | translate | read | null |
| 2024-04-02 | Leveraging YOLO-World and GPT-4V LMMs for Zero-Shot Person Detection and Action Recognition in Drone Imagery | Christian Limberg et.al. | 2404.01571 | translate | read | null |
| 2024-04-01 | LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization | Akshita Gupta et.al. | 2404.01282 | translate | read | null |