Action Recognition - 2024-11
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2024-11-29 | CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation | Qixiu Li et.al. | 2411.19650 | translate | read | null |
| 2024-11-29 | SkelMamba: A State Space Model for Efficient Skeleton Action Recognition of Neurological Disorders | Niki Martinel et.al. | 2411.19544 | translate | read | null |
| 2024-11-29 | Hierarchical Framework for Retrosynthesis Prediction with Enhanced Reaction Center Localization | Seongeun Yun et.al. | 2411.19503 | translate | read | null |
| 2024-11-28 | TAMT: Temporal-Aware Model Tuning for Cross-Domain Few-Shot Action Recognition | Yilong Wang et.al. | 2411.19041 | translate | read | null |
| 2024-11-28 | Revealing Key Details to See Differences: A Novel Prototypical Perspective for Skeleton-based Action Recognition | Hongda Liu et.al. | 2411.18941 | translate | read | link |
| 2024-11-27 | Robust Dynamic Gesture Recognition at Ultra-Long Distances | Eran Bamani Beeri et.al. | 2411.18413 | translate | read | null |
| 2024-11-27 | EventCrab: Harnessing Frame and Point Synergy for Event-based Action Recognition and Beyond | Meiqi Cao et.al. | 2411.18328 | translate | read | null |
| 2024-11-27 | An End-to-End Two-Stream Network Based on RGB Flow and Representation Flow for Human Action Recognition | Song-Jiang Lai et.al. | 2411.18002 | translate | read | null |
| 2024-11-26 | Pre-training for Action Recognition with Automatically Generated Fractal Datasets | Davyd Svyezhentsev et.al. | 2411.17584 | translate | read | link |
| 2024-11-26 | Real-Time Multimodal Signal Processing for HRI in RoboCup: Understanding a Human Referee | Filippo Ansalone et.al. | 2411.17347 | translate | read | null |
| 2024-11-22 | TSkips: Efficiency Through Explicit Temporal Delay Connections in Spiking Neural Networks | Prajna G. Malettira et.al. | 2411.16711 | translate | read | null |
| 2024-11-24 | OccludeNet: A Causal Journey into Mixed-View Actor-Centric Video Action Recognition under Occlusions | Guanyu Zhou et.al. | 2411.15729 | translate | read | link |
| 2024-11-23 | Machine Learning-based sEMG Signal Classification for Hand Gesture Recognition | Parshuram N. Aarotale et.al. | 2411.15655 | translate | read | null |
| 2024-11-23 | Optimizing Gesture Recognition for Seamless UI Interaction Using Convolutional Neural Networks | Qi Sun et.al. | 2411.15598 | translate | read | null |
| 2024-11-22 | When Spatial meets Temporal in Action Recognition | Huilin Chen et.al. | 2411.15284 | translate | read | null |
| 2024-11-22 | Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections | Youwei Zhou et.al. | 2411.14796 | translate | read | null |
| 2024-11-22 | Aim My Robot: Precision Local Navigation to Any Object | Xiangyun Meng et.al. | 2411.14770 | translate | read | null |
| 2024-11-21 | Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning | Jiange Yang et.al. | 2411.14519 | translate | read | null |
| 2024-11-18 | Enhancing Bidirectional Sign Language Communication: Integrating YOLOv8 and NLP for Real-Time Gesture Recognition & Translation | Hasnat Jamil Bhuiyan et.al. | 2411.13597 | translate | read | null |
| 2024-11-23 | AzSLD: Azerbaijani Sign Language Dataset for Fingerspelling, Word, and Sentence Translation with Baseline Software | Nigar Alishzade et.al. | 2411.12865 | translate | read | null |
| 2024-11-20 | Topological Symmetry Enhanced Graph Convolution for Skeleton-Based Action Recognition | Zeyu Liang et.al. | 2411.12560 | translate | read | link |
| 2024-11-19 | Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization | Quang Vinh Nguyen et.al. | 2411.12525 | translate | read | null |
| 2024-11-18 | Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition | Hanyu Guo et.al. | 2411.11335 | translate | read | null |
| 2024-11-18 | Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition | Yang Chen et.al. | 2411.11288 | translate | read | null |
| 2024-11-18 | Efficient Transfer Learning for Video-language Foundation Models | Haoxing Chen et.al. | 2411.11223 | translate | read | link |
| 2024-11-16 | TDSM:Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action Recognition | Jeonghyeok Do et.al. | 2411.10745 | translate | read | link |
| 2024-11-15 | KuaiFormer: Transformer-Based Retrieval at Kuaishou | Chi Liu et.al. | 2411.10057 | translate | read | null |
| 2024-11-14 | Towards Scalable Handwriting Communication via EEG Decoding and Latent Embedding Integration | Jun-Young Kim et.al. | 2411.09170 | translate | read | null |
| 2024-11-14 | VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation | Youpeng Wen et.al. | 2411.09153 | translate | read | null |
| 2024-11-13 | Can MLLMs Guide Weakly-Supervised Temporal Action Localization Tasks? | Quan Zhang et.al. | 2411.08466 | translate | read | null |
| 2024-11-13 | Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case Study | Jinbo Wen et.al. | 2411.08341 | translate | read | null |
| 2024-11-12 | LapGSR: Laplacian Reconstructive Network for Guided Thermal Super-Resolution | Aditya Kasliwal et.al. | 2411.07750 | translate | read | null |
| 2024-11-12 | OWLed: Outlier-weighed Layerwise Pruning for Efficient Autonomous Driving Framework | Jiaxi Li et.al. | 2411.07711 | translate | read | null |
| 2024-11-11 | ConvMixFormer- A Resource-efficient Convolution Mixer for Transformer-based Dynamic Hand Gesture Recognition | Mallika Garg et.al. | 2411.07118 | translate | read | link |
| 2024-11-10 | Extended multi-stream temporal-attention module for skeleton-based human action recognition (HAR) | Faisal Mehmood et.al. | 2411.06553 | translate | read | null |
| 2024-11-10 | SuperResolution Radar Gesture Recognition | Netanel Blumenfeld et.al. | 2411.06410 | translate | read | null |
| 2024-11-08 | Video RWKV:Video Action Recognition Based RWKV | Zhuowen Yin et.al. | 2411.05636 | translate | read | null |
| 2024-11-06 | Object Recognition in Human Computer Interaction:- A Comparative Analysis | Kaushik Ranade et.al. | 2411.04263 | translate | read | null |
| 2024-11-06 | Explaining Human Activity Recognition with SHAP: Validating Insights with Perturbation and Quantitative Measures | Felix Tempel et.al. | 2411.03714 | translate | read | link |
| 2024-11-05 | One-Stage-TFS: Thai One-Stage Fingerspelling Dataset for Fingerspelling Recognition Frameworks | Siriwiwat Lata et.al. | 2411.02768 | translate | read | null |
| 2024-11-04 | TI-PREGO: Chain of Thought and In-Context Learning for Online Mistake Detection in PRocedural EGOcentric Videos | Leonardo Plini et.al. | 2411.02570 | translate | read | null |
| 2024-11-04 | AM Flow: Adapters for Temporal Processing in Action Recognition | Tanay Agrawal et.al. | 2411.02065 | translate | read | null |
| 2024-11-04 | ARN-LSTM: A Multi-Stream Attention-Based Model for Action Recognition with Temporal Dynamics | Chuanchuan Wang et.al. | 2411.01769 | translate | read | null |
| 2024-11-01 | STAA: Spatio-Temporal Attention Attribution for Real-Time Interpreting Transformer-based Video Models | Zerui Wang et.al. | 2411.00630 | translate | read | link |
| 2024-11-01 | Human Action Recognition (HAR) Using Skeleton-based Spatial Temporal Relative Transformer Network: ST-RTR | Faisal Mehmood et.al. | 2410.23806 | translate | read | null |