Action Recognition - 2024-09

Publish Date Title Authors PDF Translate Read Code
2024-09-30 SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning for Surgical Phase Recognition Shu Yang et.al. 2409.20083 translate read null
2024-09-28 Gesture Recognition for Feedback Based Mixed Reality and Robotic Fabrication: A Case Study of the UnLog Tower Alexander Htet Kyaw et.al. 2409.19281 translate read null
2024-09-26 SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining Ruiqi Xian et.al. 2409.18300 translate read null
2024-09-26 Spatial Hierarchy and Temporal Attention Guided Cross Masking for Self-supervised Skeleton-based Action Recognition Xinpeng Yin et.al. 2409.17951 translate read link
2024-09-26 EAGLE: Egocentric AGgregated Language-video Engine Jing Bi et.al. 2409.17523 translate read null
2024-09-25 Path-adaptive Spatio-Temporal State Space Model for Event-based Recognition with Arbitrary Duration Jiazhou Zhou et.al. 2409.16953 translate read null
2024-09-25 Dynamic Obstacle Avoidance through Uncertainty-Based Adaptive Planning with Diffusion Vineet Punyamoorty et.al. 2409.16950 translate read null
2024-09-24 Hand Gesture Classification Based on Forearm Ultrasound Video Snippets Using 3D Convolutional Neural Networks Keshav Bimbraw et.al. 2409.16431 translate read null
2024-09-22 Zero-Shot Skeleton-based Action Recognition with Dual Visual-Text Alignment Jidong Kuang et.al. 2409.14336 translate read null
2024-09-21 Egocentric zone-aware action recognition across environments Simone Alberto Peirone et.al. 2409.14205 translate read null
2024-09-19 Interpretable Action Recognition on Hard to Classify Actions Anastasia Anichenko et.al. 2409.13091 translate read null
2024-09-18 Distillation-free Scaling of Large SSMs for Images and Videos Hamid Suleman et.al. 2409.11867 translate read null
2024-09-17 Mamba Fusion: Learning Actions Through Questioning Zhikang Dong et.al. 2409.11513 translate read link
2024-09-16 Forearm Ultrasound based Gesture Recognition on Edge Keshav Bimbraw et.al. 2409.09915 translate read null
2024-09-15 Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition Cagri Gungor et.al. 2409.09611 translate read null
2024-09-14 MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction Yan Feng et.al. 2409.09446 translate read link
2024-09-14 KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition Zhaoyu Chen et.al. 2409.09444 translate read null
2024-09-14 ChildPlay-Hand: A Dataset of Hand Manipulations in the Wild Arya Farkhondeh et.al. 2409.09319 translate read link
2024-09-13 Using The Concept Hierarchy for Household Action Recognition Andrei Costinescu et.al. 2409.08853 translate read null
2024-09-12 Customized Mid-Air Gestures for Accessibility: A $B Recognizer for Multi-Dimensional Biosignal Gestures Momona Yamagami et.al. 2409.08402 translate read null
2024-09-12 Spatial Adaptation Layer: Interpretable Domain Adaptation For Biosignal Sensor Array Applications Joao Pereira et.al. 2409.08058 translate read null
2024-09-16 InterACT: Inter-dependency Aware Action Chunking with Hierarchical Attention Transformers for Bimanual Manipulation Andrew Lee et.al. 2409.07914 translate read null
2024-09-11 2D bidirectional gated recurrent unit convolutional Neural networks for end-to-end violence detection In videos Abdarahmane Traoré et.al. 2409.07588 translate read null
2024-09-10 Data Collection-free Masked Video Modeling Yuchi Ishikawa et.al. 2409.06665 translate read null
2024-09-10 Advancements in Gesture Recognition Techniques and Machine Learning for Enhanced Human-Robot Interaction: A Comprehensive Review Sajjad Hussain et.al. 2409.06503 translate read null
2024-09-10 Learning Generative Interactive Environments By Trained Agent Exploration Naser Kazemi et.al. 2409.06445 translate read link
2024-09-09 ReL-SAR: Representation Learning for Skeleton Action Recognition with Convolutional Transformers and BYOL Safwen Naimi et.al. 2409.05749 translate read null
2024-09-11 Real-Time Human Action Recognition on Embedded Platforms Ruiqi Wang et.al. 2409.05662 translate read null
2024-09-06 Self-Supervised Contrastive Learning for Videos using Differentiable Local Alignment Keyne Oei et.al. 2409.04607 translate read null
2024-09-05 MVTN: A Multiscale Video Transformer Network for Hand Gesture Recognition Mallika Garg et.al. 2409.03890 translate read link
2024-09-05 UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking Md. Mahfuzur Rahman et.al. 2409.03245 translate read null
2024-09-04 SITAR: Semi-supervised Image Transformer for Action Recognition Owais Iqbal et.al. 2409.02910 translate read null
2024-09-04 TASAR: Transferable Attack on Skeletal Action Recognition Yunfeng Diao et.al. 2409.02483 translate read link
2024-09-04 Unified Framework with Consistency across Modalities for Human Activity Recognition Tuyen Tran et.al. 2409.02385 translate read null
2024-09-07 Unfolding Videos Dynamics via Taylor Expansion Siyi Chen et.al. 2409.02371 translate read null
2024-09-03 ADHD diagnosis based on action characteristics recorded in videos using machine learning Yichun Li et.al. 2409.02274 translate read null
2024-09-03 Action-Based ADHD Diagnosis in Video Yichun Li et.al. 2409.02261 translate read null
2024-09-03 ReSpike: Residual Frames-based Hybrid Spiking Neural Networks for Efficient Action Recognition Shiting Xiao et.al. 2409.01564 translate read null
2024-09-02 FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition Ishan Rajendrakumar Dave et.al. 2409.01448 translate read null
2024-09-01 Fisher Information guided Purification against Backdoor Attacks Nazmul Karim et.al. 2409.00863 translate read link
2024-09-01 A Critical Analysis on Machine Learning Techniques for Video-based Human Activity Recognition of Surveillance Systems: A Review Shahriar Jahan et.al. 2409.00731 translate read null
2024-09-03 Open-vocabulary Temporal Action Localization using VLMs Naoki Wake et.al. 2408.17422 translate read null
2024-09-04 Hand1000: Generating Realistic Hands from Text with Only 1,000 Images Haozhuo Zhang et.al. 2408.15461 translate read null

(<a href=../Action_Recognition.md>back to Action Recognition</a>)