Action Recognition - 2024-09 | Paper Arxiv Daily

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-09-30	SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning for Surgical Phase Recognition	Shu Yang et.al.	2409.20083	translate	read	null
2024-09-28	Gesture Recognition for Feedback Based Mixed Reality and Robotic Fabrication: A Case Study of the UnLog Tower	Alexander Htet Kyaw et.al.	2409.19281	translate	read	null
2024-09-26	SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining	Ruiqi Xian et.al.	2409.18300	translate	read	null
2024-09-26	Spatial Hierarchy and Temporal Attention Guided Cross Masking for Self-supervised Skeleton-based Action Recognition	Xinpeng Yin et.al.	2409.17951	translate	read	link
2024-09-26	EAGLE: Egocentric AGgregated Language-video Engine	Jing Bi et.al.	2409.17523	translate	read	null
2024-09-25	Path-adaptive Spatio-Temporal State Space Model for Event-based Recognition with Arbitrary Duration	Jiazhou Zhou et.al.	2409.16953	translate	read	null
2024-09-25	Dynamic Obstacle Avoidance through Uncertainty-Based Adaptive Planning with Diffusion	Vineet Punyamoorty et.al.	2409.16950	translate	read	null
2024-09-24	Hand Gesture Classification Based on Forearm Ultrasound Video Snippets Using 3D Convolutional Neural Networks	Keshav Bimbraw et.al.	2409.16431	translate	read	null
2024-09-22	Zero-Shot Skeleton-based Action Recognition with Dual Visual-Text Alignment	Jidong Kuang et.al.	2409.14336	translate	read	null
2024-09-21	Egocentric zone-aware action recognition across environments	Simone Alberto Peirone et.al.	2409.14205	translate	read	null
2024-09-19	Interpretable Action Recognition on Hard to Classify Actions	Anastasia Anichenko et.al.	2409.13091	translate	read	null
2024-09-18	Distillation-free Scaling of Large SSMs for Images and Videos	Hamid Suleman et.al.	2409.11867	translate	read	null
2024-09-17	Mamba Fusion: Learning Actions Through Questioning	Zhikang Dong et.al.	2409.11513	translate	read	link
2024-09-16	Forearm Ultrasound based Gesture Recognition on Edge	Keshav Bimbraw et.al.	2409.09915	translate	read	null
2024-09-15	Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition	Cagri Gungor et.al.	2409.09611	translate	read	null
2024-09-14	MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction	Yan Feng et.al.	2409.09446	translate	read	link
2024-09-14	KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition	Zhaoyu Chen et.al.	2409.09444	translate	read	null
2024-09-14	ChildPlay-Hand: A Dataset of Hand Manipulations in the Wild	Arya Farkhondeh et.al.	2409.09319	translate	read	link
2024-09-13	Using The Concept Hierarchy for Household Action Recognition	Andrei Costinescu et.al.	2409.08853	translate	read	null
2024-09-12	Customized Mid-Air Gestures for Accessibility: A $B Recognizer for Multi-Dimensional Biosignal Gestures	Momona Yamagami et.al.	2409.08402	translate	read	null
2024-09-12	Spatial Adaptation Layer: Interpretable Domain Adaptation For Biosignal Sensor Array Applications	Joao Pereira et.al.	2409.08058	translate	read	null
2024-09-16	InterACT: Inter-dependency Aware Action Chunking with Hierarchical Attention Transformers for Bimanual Manipulation	Andrew Lee et.al.	2409.07914	translate	read	null
2024-09-11	2D bidirectional gated recurrent unit convolutional Neural networks for end-to-end violence detection In videos	Abdarahmane Traoré et.al.	2409.07588	translate	read	null
2024-09-10	Data Collection-free Masked Video Modeling	Yuchi Ishikawa et.al.	2409.06665	translate	read	null
2024-09-10	Advancements in Gesture Recognition Techniques and Machine Learning for Enhanced Human-Robot Interaction: A Comprehensive Review	Sajjad Hussain et.al.	2409.06503	translate	read	null
2024-09-10	Learning Generative Interactive Environments By Trained Agent Exploration	Naser Kazemi et.al.	2409.06445	translate	read	link
2024-09-09	ReL-SAR: Representation Learning for Skeleton Action Recognition with Convolutional Transformers and BYOL	Safwen Naimi et.al.	2409.05749	translate	read	null
2024-09-11	Real-Time Human Action Recognition on Embedded Platforms	Ruiqi Wang et.al.	2409.05662	translate	read	null
2024-09-06	Self-Supervised Contrastive Learning for Videos using Differentiable Local Alignment	Keyne Oei et.al.	2409.04607	translate	read	null
2024-09-05	MVTN: A Multiscale Video Transformer Network for Hand Gesture Recognition	Mallika Garg et.al.	2409.03890	translate	read	link
2024-09-05	UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking	Md. Mahfuzur Rahman et.al.	2409.03245	translate	read	null
2024-09-04	SITAR: Semi-supervised Image Transformer for Action Recognition	Owais Iqbal et.al.	2409.02910	translate	read	null
2024-09-04	TASAR: Transferable Attack on Skeletal Action Recognition	Yunfeng Diao et.al.	2409.02483	translate	read	link
2024-09-04	Unified Framework with Consistency across Modalities for Human Activity Recognition	Tuyen Tran et.al.	2409.02385	translate	read	null
2024-09-07	Unfolding Videos Dynamics via Taylor Expansion	Siyi Chen et.al.	2409.02371	translate	read	null
2024-09-03	ADHD diagnosis based on action characteristics recorded in videos using machine learning	Yichun Li et.al.	2409.02274	translate	read	null
2024-09-03	Action-Based ADHD Diagnosis in Video	Yichun Li et.al.	2409.02261	translate	read	null
2024-09-03	ReSpike: Residual Frames-based Hybrid Spiking Neural Networks for Efficient Action Recognition	Shiting Xiao et.al.	2409.01564	translate	read	null
2024-09-02	FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition	Ishan Rajendrakumar Dave et.al.	2409.01448	translate	read	null
2024-09-01	Fisher Information guided Purification against Backdoor Attacks	Nazmul Karim et.al.	2409.00863	translate	read	link
2024-09-01	A Critical Analysis on Machine Learning Techniques for Video-based Human Activity Recognition of Surveillance Systems: A Review	Shahriar Jahan et.al.	2409.00731	translate	read	null
2024-09-03	Open-vocabulary Temporal Action Localization using VLMs	Naoki Wake et.al.	2408.17422	translate	read	null
2024-09-04	Hand1000: Generating Realistic Hands from Text with Only 1,000 Images	Haozhuo Zhang et.al.	2408.15461	translate	read	null
