Action Recognition - 2024-07 | Paper Arxiv Daily

Publish Date	Title	Authors	arXiv ID	Code
2024-07-31	Explainable Artificial Intelligence for Quantifying Interfering and High-Risk Behaviors in Autism Spectrum Disorder in a Real-World Classroom Environment Using Privacy-Preserving Video Analysis	Barun Das et al.	2407.21691	null
2024-07-31	Skeleton-Based Action Recognition with Spatial-Structural Graph Convolution	Jingyao Wang et al.	2407.21525	null
2024-07-31	Dynamic Gesture Recognition in Ultra-Range Distance for Effective Human-Robot Interaction	Eran Bamani Beeri et al.	2407.21374	null
2024-07-29	Adversarial Robustness in RGB-Skeleton Action Recognition: Leveraging Attention Modality Reweighter	Chao Liu et al.	2407.19981	null
2024-07-29	ActivityCLIP: Enhancing Group Activity Recognition by Mining Complementary Information from Text to Supplement Image Modality	Guoliang Xu et al.	2407.19820	null
2024-07-29	PredIN: Towards Open-Set Gesture Recognition via Prediction Inconsistency	Chen Liu et al.	2407.19753	null
2024-07-28	Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph	Zhengcen Li et al.	2407.19497	link
2024-07-25	MARINE: A Computer Vision Model for Detecting Rare Predator-Prey Interactions in Animal Videos	Zsófia Katona et al.	2407.18289	null
2024-07-25	Trajectory-aligned Space-time Tokens for Few-shot Action Recognition	Pulkit Kumar et al.	2407.18249	null
2024-07-26	Harnessing Temporal Causality for Advanced Temporal Action Detection	Shuming Liu et al.	2407.17792	link
2024-07-23	Fusion and Cross-Modal Transfer for Zero-Shot Human Action Recognition	Abhi Kamboj et al.	2407.16803	null
2024-07-23	PLM-Net: Perception Latency Mitigation Network for Vision-Based Lateral Control of Autonomous Vehicles	Aws Khalil et al.	2407.16740	link
2024-07-24	SOAP: Enhancing Spatio-Temporal Relation and Motion Information Capturing for Few-Shot Action Recognition	Wenbo Huang et al.	2407.16344	link
2024-07-22	Efficient and generalizable prediction of molecular alterations in multiple cancer cohorts using H&E whole slide images	Kshitij Ingale et al.	2407.15816	null
2024-07-25	Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition	Jinfu Liu et al.	2407.15706	link
2024-07-21	Semi-Supervised Pipe Video Temporal Defect Interval Localization	Zhu Huang et al.	2407.15170	null
2024-07-20	Automated Patient Positioning with Learned 3D Hand Gestures	Zhongpai Gao et al.	2407.14903	null
2024-07-20	Can VLMs be used on videos for action recognition? LLMs are Visual Reasoning Coordinators	Harsh Lunia et al.	2407.14834	null
2024-07-20	Decoupled Prompt-Adapter Tuning for Continual Activity Recognition	Di Fu et al.	2407.14811	null
2024-07-20	A Comprehensive Review of Few-shot Action Recognition	Yuyang Wanyan et al.	2407.14744	null
2024-07-19	LORTSAR: Low-Rank Transformer for Skeleton-based Action Recognition	Soroush Oraki et al.	2407.14655	null
2024-07-19	Fine-grained Knowledge Graph-driven Video-Language Learning for Action Recognition	Rui Zhang et al.	2407.14146	null
2024-07-19	Zero-Shot Underwater Gesture Recognition	Sandipan Sarma et al.	2407.14103	link
2024-07-18	Pose-guided multi-task video transformer for driver action recognition	Ricardo Pizarro et al.	2407.13750	null
2024-07-18	SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders	Sheng-Wei Li et al.	2407.13460	link
2024-07-18	QuIIL at T3 challenge: Towards Automation in Life-Saving Intervention Procedures from First-Person View	Trinh T. L. Vuong et al.	2407.13216	link
2024-07-18	Enhancing Temporal Action Localization: Advanced S6 Modeling with Recurrent Mechanism	Sangyoun Lee et al.	2407.13078	link
2024-07-17	ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos	Hyolim Kang et al.	2407.12987	link
2024-07-17	NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models	Gengze Zhou et al.	2407.12366	link
2024-07-17	Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer	Wenhan Wu et al.	2407.12322	null
2024-07-17	Shap-Mix: Shapley Value Guided Mixing for Long-Tailed Skeleton Based Action Recognition	Jiahang Zhang et al.	2407.12312	null
2024-07-16	Enhancing Split Computing and Early Exit Applications through Predefined Sparsity	Luigi Capogrosso et al.	2407.11763	link
2024-07-10	Exploring the Boundaries of On-Device Inference: When Tiny Falls Short, Go Hierarchical	Adarsh Prasad Behera et al.	2407.11061	null
2024-07-15	STARS: Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences	Soroush Mehraban et al.	2407.10935	null
2024-07-15	Human-Centric Transformer for Domain Adaptive Action Recognition	Kun-Yu Lin et al.	2407.10860	null
2024-07-17	Augmented Neural Fine-Tuning for Efficient Backdoor Purification	Nazmul Karim et al.	2407.10052	link
2024-07-13	Region-aware Image-based Human Action Retrieval with Transformers	Hongsong Wang et al.	2407.09924	null
2024-07-16	OmniRace: 6D Hand Pose Estimation for Intuitive Guidance of Racing Drone	Valerii Serpiva et al.	2407.09841	link
2024-07-12	Full-Stage Pseudo Label Quality Enhancement for Weakly-supervised Temporal Action Localization	Qianhan Feng et al.	2407.08971	link
2024-07-11	Boosting Adversarial Transferability for Skeleton-based Action Recognition via Exploring the Model Posterior Space	Yunfeng Diao et al.	2407.08572	null
2024-07-12	Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization	Feixiang Zhou et al.	2407.07673	null
2024-07-10	EA-VTR: Event-Aware Video-Text Retrieval	Zongyang Ma et al.	2407.07478	null
2024-07-09	Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization	Jeongseok Hyun et al.	2407.07024	link
2024-07-09	Rethinking Image-to-Video Adaptation: An Object-centric Perspective	Rui Qian et al.	2407.06871	null
2024-07-09	Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition	Mingfang Zhang et al.	2407.06628	null
2024-07-08	Noise-Free Explanation for Driving Action Prediction	Hongbo Zhu et al.	2407.06339	link
2024-07-08	C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition	Rongchang Li et al.	2407.06113	link
2024-07-08	DMSD-CDFSAR: Distillation from Mixed-Source Domain for Cross-Domain Few-shot Action Recognition	Fei Guo et al.	2407.05657	null
2024-07-11	Helios: An extremely low power event-based gesture recognition for always-on smart eyewear	Prarthana Bhattacharyya et al.	2407.05206	null
2024-07-06	DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition	Qi Wang et al.	2407.05106	link
2024-07-05	AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation	Yuhan Zhu et al.	2407.04603	null
2024-07-05	TF-SASM: Training-free Spatial-aware Sparse Memory for Multi-object Tracking	Thuc Nguyen-Quang et al.	2407.04327	null
2024-07-05	Computer Vision for Clinical Gait Analysis: A Gait Abnormality Video Dataset	Rahm Ranjan et al.	2407.04190	null
2024-07-04	Robust Policy Learning for Multi-UAV Collision Avoidance with Causal Feature Selection	Jiafan Zhuang et al.	2407.04056	null
2024-07-04	On-Device Training Empowered Transfer Learning For Human Activity Recognition	Pixi Kang et al.	2407.03644	null
2024-07-03	Motion meets Attention: Video Motion Prompts	Qixiang Chen et al.	2407.03179	null
2024-07-02	Advancing Compressed Video Action Recognition through Progressive Knowledge Distillation	Efstathia Soufleri et al.	2407.02713	link
2024-07-02	Novel Human Machine Interface via Robust Hand Gesture Recognition System using Channel Pruned YOLOv5s Model	Abir Sen et al.	2407.02585	null
2024-07-02	Referring Atomic Video Action Recognition	Kunyu Peng et al.	2407.01872	link
2024-07-01	Mask and Compress: Efficient Skeleton-based Action Recognition in Continual Learning	Matteo Mosconi et al.	2407.01397	link
2024-07-01	EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation	Baoqi Pei et al.	2406.18070	link
