Action Recognition - 2024-07
Action Recognition - 2024-07
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-07-31 | Explainable Artificial Intelligence for Quantifying Interfering and High-Risk Behaviors in Autism Spectrum Disorder in a Real-World Classroom Environment Using Privacy-Preserving Video Analysis | Barun Das et.al. | 2407.21691 | translate | read | null |
| 2024-07-31 | Skeleton-Based Action Recognition with Spatial-Structural Graph Convolution | Jingyao Wang et.al. | 2407.21525 | translate | read | null |
| 2024-07-31 | Dynamic Gesture Recognition in Ultra-Range Distance for Effective Human-Robot Interaction | Eran Bamani Beeri et.al. | 2407.21374 | translate | read | null |
| 2024-07-29 | Adversarial Robustness in RGB-Skeleton Action Recognition: Leveraging Attention Modality Reweighter | Chao Liu et.al. | 2407.19981 | translate | read | null |
| 2024-07-29 | ActivityCLIP: Enhancing Group Activity Recognition by Mining Complementary Information from Text to Supplement Image Modality | Guoliang Xu et.al. | 2407.19820 | translate | read | null |
| 2024-07-29 | PredIN: Towards Open-Set Gesture Recognition via Prediction Inconsistency | Chen Liu et.al. | 2407.19753 | translate | read | null |
| 2024-07-28 | Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph | Zhengcen Li et.al. | 2407.19497 | translate | read | link |
| 2024-07-25 | MARINE: A Computer Vision Model for Detecting Rare Predator-Prey Interactions in Animal Videos | Zsófia Katona et.al. | 2407.18289 | translate | read | null |
| 2024-07-25 | Trajectory-aligned Space-time Tokens for Few-shot Action Recognition | Pulkit Kumar et.al. | 2407.18249 | translate | read | null |
| 2024-07-26 | Harnessing Temporal Causality for Advanced Temporal Action Detection | Shuming Liu et.al. | 2407.17792 | translate | read | link |
| 2024-07-23 | Fusion and Cross-Modal Transfer for Zero-Shot Human Action Recognition | Abhi Kamboj et.al. | 2407.16803 | translate | read | null |
| 2024-07-23 | PLM-Net: Perception Latency Mitigation Network for Vision-Based Lateral Control of Autonomous Vehicles | Aws Khalil et.al. | 2407.16740 | translate | read | link |
| 2024-07-24 | SOAP: Enhancing Spatio-Temporal Relation and Motion Information Capturing for Few-Shot Action Recognition | Wenbo Huang et.al. | 2407.16344 | translate | read | link |
| 2024-07-22 | Efficient and generalizable prediction of molecular alterations in multiple cancer cohorts using H&E whole slide images | Kshitij Ingale et.al. | 2407.15816 | translate | read | null |
| 2024-07-25 | Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition | Jinfu Liu et.al. | 2407.15706 | translate | read | link |
| 2024-07-21 | Semi-Supervised Pipe Video Temporal Defect Interval Localization | Zhu Huang et.al. | 2407.15170 | translate | read | null |
| 2024-07-20 | Automated Patient Positioning with Learned 3D Hand Gestures | Zhongpai Gao et.al. | 2407.14903 | translate | read | null |
| 2024-07-20 | Can VLMs be used on videos for action recognition? LLMs are Visual Reasoning Coordinators | Harsh Lunia et.al. | 2407.14834 | translate | read | null |
| 2024-07-20 | Decoupled Prompt-Adapter Tuning for Continual Activity Recognition | Di Fu et.al. | 2407.14811 | translate | read | null |
| 2024-07-20 | A Comprehensive Review of Few-shot Action Recognition | Yuyang Wanyan et.al. | 2407.14744 | translate | read | null |
| 2024-07-19 | LORTSAR: Low-Rank Transformer for Skeleton-based Action Recognition | Soroush Oraki et.al. | 2407.14655 | translate | read | null |
| 2024-07-19 | Fine-grained Knowledge Graph-driven Video-Language Learning for Action Recognition | Rui Zhang et.al. | 2407.14146 | translate | read | null |
| 2024-07-19 | Zero-Shot Underwater Gesture Recognition | Sandipan Sarma et.al. | 2407.14103 | translate | read | link |
| 2024-07-18 | Pose-guided multi-task video transformer for driver action recognition | Ricardo Pizarro et.al. | 2407.13750 | translate | read | null |
| 2024-07-18 | SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders | Sheng-Wei Li et.al. | 2407.13460 | translate | read | link |
| 2024-07-18 | QuIIL at T3 challenge: Towards Automation in Life-Saving Intervention Procedures from First-Person View | Trinh T. L. Vuong et.al. | 2407.13216 | translate | read | link |
| 2024-07-18 | Enhancing Temporal Action Localization: Advanced S6 Modeling with Recurrent Mechanism | Sangyoun Lee et.al. | 2407.13078 | translate | read | link |
| 2024-07-17 | ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos | Hyolim Kang et.al. | 2407.12987 | translate | read | link |
| 2024-07-17 | NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models | Gengze Zhou et.al. | 2407.12366 | translate | read | link |
| 2024-07-17 | Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer | Wenhan Wu et.al. | 2407.12322 | translate | read | null |
| 2024-07-17 | Shap-Mix: Shapley Value Guided Mixing for Long-Tailed Skeleton Based Action Recognition | Jiahang Zhang et.al. | 2407.12312 | translate | read | null |
| 2024-07-16 | Enhancing Split Computing and Early Exit Applications through Predefined Sparsity | Luigi Capogrosso et.al. | 2407.11763 | translate | read | link |
| 2024-07-10 | Exploring the Boundaries of On-Device Inference: When Tiny Falls Short, Go Hierarchical | Adarsh Prasad Behera et.al. | 2407.11061 | translate | read | null |
| 2024-07-15 | STARS: Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences | Soroush Mehraban et.al. | 2407.10935 | translate | read | null |
| 2024-07-15 | Human-Centric Transformer for Domain Adaptive Action Recognition | Kun-Yu Lin et.al. | 2407.10860 | translate | read | null |
| 2024-07-17 | Augmented Neural Fine-Tuning for Efficient Backdoor Purification | Nazmul Karim et.al. | 2407.10052 | translate | read | link |
| 2024-07-13 | Region-aware Image-based Human Action Retrieval with Transformers | Hongsong Wang et.al. | 2407.09924 | translate | read | null |
| 2024-07-16 | OmniRace: 6D Hand Pose Estimation for Intuitive Guidance of Racing Drone | Valerii Serpiva et.al. | 2407.09841 | translate | read | link |
| 2024-07-12 | Full-Stage Pseudo Label Quality Enhancement for Weakly-supervised Temporal Action Localization | Qianhan Feng et.al. | 2407.08971 | translate | read | link |
| 2024-07-11 | Boosting Adversarial Transferability for Skeleton-based Action Recognition via Exploring the Model Posterior Space | Yunfeng Diao et.al. | 2407.08572 | translate | read | null |
| 2024-07-12 | Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization | Feixiang Zhou et.al. | 2407.07673 | translate | read | null |
| 2024-07-10 | EA-VTR: Event-Aware Video-Text Retrieval | Zongyang Ma et.al. | 2407.07478 | translate | read | null |
| 2024-07-09 | Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization | Jeongseok Hyun et.al. | 2407.07024 | translate | read | link |
| 2024-07-09 | Rethinking Image-to-Video Adaptation: An Object-centric Perspective | Rui Qian et.al. | 2407.06871 | translate | read | null |
| 2024-07-09 | Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition | Mingfang Zhang et.al. | 2407.06628 | translate | read | null |
| 2024-07-08 | Noise-Free Explanation for Driving Action Prediction | Hongbo Zhu et.al. | 2407.06339 | translate | read | link |
| 2024-07-08 | C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition | Rongchang Li et.al. | 2407.06113 | translate | read | link |
| 2024-07-08 | DMSD-CDFSAR: Distillation from Mixed-Source Domain for Cross-Domain Few-shot Action Recognition | Fei Guo et.al. | 2407.05657 | translate | read | null |
| 2024-07-11 | Helios: An extremely low power event-based gesture recognition for always-on smart eyewear | Prarthana Bhattacharyya et.al. | 2407.05206 | translate | read | null |
| 2024-07-06 | DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition | Qi Wang et.al. | 2407.05106 | translate | read | link |
| 2024-07-05 | AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation | Yuhan Zhu et.al. | 2407.04603 | translate | read | null |
| 2024-07-05 | TF-SASM: Training-free Spatial-aware Sparse Memory for Multi-object Tracking | Thuc Nguyen-Quang et.al. | 2407.04327 | translate | read | null |
| 2024-07-05 | Computer Vision for Clinical Gait Analysis: A Gait Abnormality Video Dataset | Rahm Ranjan et.al. | 2407.04190 | translate | read | null |
| 2024-07-04 | Robust Policy Learning for Multi-UAV Collision Avoidance with Causal Feature Selection | Jiafan Zhuang et.al. | 2407.04056 | translate | read | null |
| 2024-07-04 | On-Device Training Empowered Transfer Learning For Human Activity Recognition | Pixi Kang et.al. | 2407.03644 | translate | read | null |
| 2024-07-03 | Motion meets Attention: Video Motion Prompts | Qixiang Chen et.al. | 2407.03179 | translate | read | null |
| 2024-07-02 | Advancing Compressed Video Action Recognition through Progressive Knowledge Distillation | Efstathia Soufleri et.al. | 2407.02713 | translate | read | link |
| 2024-07-02 | Novel Human Machine Interface via Robust Hand Gesture Recognition System using Channel Pruned YOLOv5s Model | Abir Sen et.al. | 2407.02585 | translate | read | null |
| 2024-07-02 | Referring Atomic Video Action Recognition | Kunyu Peng et.al. | 2407.01872 | translate | read | link |
| 2024-07-01 | Mask and Compress: Efficient Skeleton-based Action Recognition in Continual Learning | Matteo Mosconi et.al. | 2407.01397 | translate | read | link |
| 2024-07-01 | EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation | Baoqi Pei et.al. | 2406.18070 | translate | read | link |
(<a href=../Action_Recognition.md>back to Action Recognition</a>)