Action Recognition - 2024-09
Action Recognition - 2024-09
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-09-30 | SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning for Surgical Phase Recognition | Shu Yang et.al. | 2409.20083 | translate | read | null |
| 2024-09-28 | Gesture Recognition for Feedback Based Mixed Reality and Robotic Fabrication: A Case Study of the UnLog Tower | Alexander Htet Kyaw et.al. | 2409.19281 | translate | read | null |
| 2024-09-26 | SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining | Ruiqi Xian et.al. | 2409.18300 | translate | read | null |
| 2024-09-26 | Spatial Hierarchy and Temporal Attention Guided Cross Masking for Self-supervised Skeleton-based Action Recognition | Xinpeng Yin et.al. | 2409.17951 | translate | read | link |
| 2024-09-26 | EAGLE: Egocentric AGgregated Language-video Engine | Jing Bi et.al. | 2409.17523 | translate | read | null |
| 2024-09-25 | Path-adaptive Spatio-Temporal State Space Model for Event-based Recognition with Arbitrary Duration | Jiazhou Zhou et.al. | 2409.16953 | translate | read | null |
| 2024-09-25 | Dynamic Obstacle Avoidance through Uncertainty-Based Adaptive Planning with Diffusion | Vineet Punyamoorty et.al. | 2409.16950 | translate | read | null |
| 2024-09-24 | Hand Gesture Classification Based on Forearm Ultrasound Video Snippets Using 3D Convolutional Neural Networks | Keshav Bimbraw et.al. | 2409.16431 | translate | read | null |
| 2024-09-22 | Zero-Shot Skeleton-based Action Recognition with Dual Visual-Text Alignment | Jidong Kuang et.al. | 2409.14336 | translate | read | null |
| 2024-09-21 | Egocentric zone-aware action recognition across environments | Simone Alberto Peirone et.al. | 2409.14205 | translate | read | null |
| 2024-09-19 | Interpretable Action Recognition on Hard to Classify Actions | Anastasia Anichenko et.al. | 2409.13091 | translate | read | null |
| 2024-09-18 | Distillation-free Scaling of Large SSMs for Images and Videos | Hamid Suleman et.al. | 2409.11867 | translate | read | null |
| 2024-09-17 | Mamba Fusion: Learning Actions Through Questioning | Zhikang Dong et.al. | 2409.11513 | translate | read | link |
| 2024-09-16 | Forearm Ultrasound based Gesture Recognition on Edge | Keshav Bimbraw et.al. | 2409.09915 | translate | read | null |
| 2024-09-15 | Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition | Cagri Gungor et.al. | 2409.09611 | translate | read | null |
| 2024-09-14 | MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction | Yan Feng et.al. | 2409.09446 | translate | read | link |
| 2024-09-14 | KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition | Zhaoyu Chen et.al. | 2409.09444 | translate | read | null |
| 2024-09-14 | ChildPlay-Hand: A Dataset of Hand Manipulations in the Wild | Arya Farkhondeh et.al. | 2409.09319 | translate | read | link |
| 2024-09-13 | Using The Concept Hierarchy for Household Action Recognition | Andrei Costinescu et.al. | 2409.08853 | translate | read | null |
| 2024-09-12 | Customized Mid-Air Gestures for Accessibility: A $B Recognizer for Multi-Dimensional Biosignal Gestures | Momona Yamagami et.al. | 2409.08402 | translate | read | null |
| 2024-09-12 | Spatial Adaptation Layer: Interpretable Domain Adaptation For Biosignal Sensor Array Applications | Joao Pereira et.al. | 2409.08058 | translate | read | null |
| 2024-09-16 | InterACT: Inter-dependency Aware Action Chunking with Hierarchical Attention Transformers for Bimanual Manipulation | Andrew Lee et.al. | 2409.07914 | translate | read | null |
| 2024-09-11 | 2D bidirectional gated recurrent unit convolutional Neural networks for end-to-end violence detection In videos | Abdarahmane Traoré et.al. | 2409.07588 | translate | read | null |
| 2024-09-10 | Data Collection-free Masked Video Modeling | Yuchi Ishikawa et.al. | 2409.06665 | translate | read | null |
| 2024-09-10 | Advancements in Gesture Recognition Techniques and Machine Learning for Enhanced Human-Robot Interaction: A Comprehensive Review | Sajjad Hussain et.al. | 2409.06503 | translate | read | null |
| 2024-09-10 | Learning Generative Interactive Environments By Trained Agent Exploration | Naser Kazemi et.al. | 2409.06445 | translate | read | link |
| 2024-09-09 | ReL-SAR: Representation Learning for Skeleton Action Recognition with Convolutional Transformers and BYOL | Safwen Naimi et.al. | 2409.05749 | translate | read | null |
| 2024-09-11 | Real-Time Human Action Recognition on Embedded Platforms | Ruiqi Wang et.al. | 2409.05662 | translate | read | null |
| 2024-09-06 | Self-Supervised Contrastive Learning for Videos using Differentiable Local Alignment | Keyne Oei et.al. | 2409.04607 | translate | read | null |
| 2024-09-05 | MVTN: A Multiscale Video Transformer Network for Hand Gesture Recognition | Mallika Garg et.al. | 2409.03890 | translate | read | link |
| 2024-09-05 | UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking | Md. Mahfuzur Rahman et.al. | 2409.03245 | translate | read | null |
| 2024-09-04 | SITAR: Semi-supervised Image Transformer for Action Recognition | Owais Iqbal et.al. | 2409.02910 | translate | read | null |
| 2024-09-04 | TASAR: Transferable Attack on Skeletal Action Recognition | Yunfeng Diao et.al. | 2409.02483 | translate | read | link |
| 2024-09-04 | Unified Framework with Consistency across Modalities for Human Activity Recognition | Tuyen Tran et.al. | 2409.02385 | translate | read | null |
| 2024-09-07 | Unfolding Videos Dynamics via Taylor Expansion | Siyi Chen et.al. | 2409.02371 | translate | read | null |
| 2024-09-03 | ADHD diagnosis based on action characteristics recorded in videos using machine learning | Yichun Li et.al. | 2409.02274 | translate | read | null |
| 2024-09-03 | Action-Based ADHD Diagnosis in Video | Yichun Li et.al. | 2409.02261 | translate | read | null |
| 2024-09-03 | ReSpike: Residual Frames-based Hybrid Spiking Neural Networks for Efficient Action Recognition | Shiting Xiao et.al. | 2409.01564 | translate | read | null |
| 2024-09-02 | FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition | Ishan Rajendrakumar Dave et.al. | 2409.01448 | translate | read | null |
| 2024-09-01 | Fisher Information guided Purification against Backdoor Attacks | Nazmul Karim et.al. | 2409.00863 | translate | read | link |
| 2024-09-01 | A Critical Analysis on Machine Learning Techniques for Video-based Human Activity Recognition of Surveillance Systems: A Review | Shahriar Jahan et.al. | 2409.00731 | translate | read | null |
| 2024-09-03 | Open-vocabulary Temporal Action Localization using VLMs | Naoki Wake et.al. | 2408.17422 | translate | read | null |
| 2024-09-04 | Hand1000: Generating Realistic Hands from Text with Only 1,000 Images | Haozhuo Zhang et.al. | 2408.15461 | translate | read | null |
(<a href=../Action_Recognition.md>back to Action Recognition</a>)