Action Recognition - 2024-05 | Paper Arxiv Daily

Action Recognition - 2024-05

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-05-31	Action-OOD: An End-to-End Skeleton-Based Model for Robust Out-of-Distribution Human Action Detection	Jing Xu et.al.	2405.20633	translate	read	link
2024-05-31	Vision-Language Meets the Skeleton: Progressively Distillation with Cross-Modal Knowledge for 3D Action Representation Learning	Yang Chen et.al.	2405.20606	translate	read	null
2024-05-30	ENTIRe-ID: An Extensive and Diverse Dataset for Person Re-Identification	Serdar Yildiz et.al.	2405.20465	translate	read	null
2024-05-30	From Forest to Zoo: Great Ape Behavior Recognition with ChimpBehave	Michael Fuchs et.al.	2405.20025	translate	read	null
2024-05-31	Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition	Masashi Hatano et.al.	2405.19917	translate	read	null
2024-05-30	EgoSurgery-Phase: A Dataset of Surgical Phase Recognition from Egocentric Open Surgery Videos	Ryo Fujii et.al.	2405.19644	translate	read	link
2024-05-30	SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation	Junjie Zhang et.al.	2405.19586	translate	read	null
2024-05-29	Matrix Manifold Neural Networks++	Xuan Son Nguyen et.al.	2405.19206	translate	read	null
2024-05-29	Exploring AI-based Anonymization of Industrial Image and Video Data in the Context of Feature Preservation	Sabrina Cynthia Triess et.al.	2405.19173	translate	read	null
2024-05-28	Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition	Muhammad Adi Nugroho et.al.	2405.18012	translate	read	null
2024-05-30	Benchmarking Skeleton-based Motion Encoder Models for Clinical Applications: Estimating Parkinson’s Disease Severity in Walking Sequences	Vida Adeli et.al.	2405.17817	translate	read	link
2024-05-28	Hierarchical Action Recognition: A Contrastive Video-Language Approach with Hierarchical Interactions	Rui Zhang et.al.	2405.17729	translate	read	null
2024-05-28	EgoNCE++: Do Egocentric Video-Language Models Really Understand Hand-Object Interactions?	Boshen Xu et.al.	2405.17719	translate	read	link
2024-05-27	Advancements in Tactile Hand Gesture Recognition for Enhanced Human-Machine Interaction	Chiara Fumelli et.al.	2405.17038	translate	read	null
2024-05-27	A Cross-Dataset Study for Text-based 3D Human Motion Retrieval	Léore Bensabath et.al.	2405.16909	translate	read	null
2024-05-26	Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion Perception	Shuangpeng Han et.al.	2405.16493	translate	read	null
2024-05-25	Application of Artificial Intelligence in Hand Gesture Recognition with Virtual Reality: Survey and Analysis of Hand Gesture Hardware Selection	Jindi Wang et.al.	2405.16264	translate	read	null
2024-05-22	From CNNs to Transformers in Multimodal Human Action Recognition: A Survey	Muhammad Bilal Shaikh et.al.	2405.15813	translate	read	null
2024-05-24	V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM	Abdur Rahman et.al.	2405.15341	translate	read	link
2024-05-23	Enhanced Spatiotemporal Prediction Using Physical-guided And Frequency-enhanced Recurrent Neural Networks	Xuanle Zhao et.al.	2405.14504	translate	read	null
2024-05-23	SpGesture: Source-Free Domain-adaptive sEMG-based Gesture Recognition with Jaccard Attentive Spiking Neural Network	Weiyu Guo et.al.	2405.14398	translate	read	null
2024-05-23	MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models	Jiuming Liu et.al.	2405.14338	translate	read	null
2024-05-22	Counterfactual Gradients-based Quantification of Prediction Trust in Neural Networks	Mohit Prabhushankar et.al.	2405.13758	translate	read	null
2024-05-21	Identity-free Artificial Emotional Intelligence via Micro-Gesture Understanding	Rong Gao et.al.	2405.13206	translate	read	null
2024-05-22	Building Temporal Kernels with Orthogonal Polynomials	Yan Ru Pei et.al.	2405.12179	translate	read	link
2024-05-18	GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition	Mallika Garg et.al.	2405.11180	translate	read	link
2024-05-17	Air Signing and Privacy-Preserving Signature Verification for Digital Documents	P. Sarveswarasarma et.al.	2405.10868	translate	read	null
2024-05-17	MC-GPT: Empowering Vision-and-Language Navigation with Memory Map and Reasoning Chains	Zhaohuan Zhan et.al.	2405.10620	translate	read	null
2024-05-06	MEET: Mixture of Experts Extra Tree-Based sEMG Hand Gesture Identification	Naveen Gehlot et.al.	2405.09562	translate	read	null
2024-05-14	Wearable Sensor-Based Few-Shot Continual Learning on Hand Gestures for Motor-Impaired Individuals via Latent Embedding Exploitation	Riyad Bin Rafiq et.al.	2405.08969	translate	read	link
2024-05-14	The impact of Compositionality in Zero-shot Multi-label action recognition for Object-based tasks	Carmela Calabrese et.al.	2405.08695	translate	read	null
2024-05-15	POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning	Chang Huang et.al.	2405.08036	translate	read	null
2024-05-13	Coarse or Fine? Recognising Action End States without Labels	Davide Moltisanti et.al.	2405.07723	translate	read	link
2024-05-11	PRENet: A Plane-Fit Redundancy Encoding Point Cloud Sequence Network for Real-Time 3D Action Recognition	Shenglin He et.al.	2405.06929	translate	read	null
2024-05-10	CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras	James Tang et.al.	2405.06845	translate	read	link
2024-05-09	A Survey on Backbones for Deep Video Action Recognition	Zixuan Tang et.al.	2405.05584	translate	read	null
2024-05-06	OmniActions: Predicting Digital Actions in Response to Real-World Multimodal Sensory Inputs with LLMs	Jiahao Nick Li et.al.	2405.03901	translate	read	null
2024-05-05	JOSENet: A Joint Stream Embedding Network for Violence Detection in Surveillance Videos	Pietro Nardelli et.al.	2405.02961	translate	read	null
2024-05-03	On the Utility of External Agent Intention Predictor for Human-AI Coordination	Chenxu Wang et.al.	2405.02229	translate	read	null
2024-05-11	MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition	Hongyu Qu et.al.	2405.02077	translate	read	null
2024-05-03	Enhancing Micro Gesture Recognition for Emotion Understanding via Context-aware Visual-Text Contrastive Learning	Deng Li et.al.	2405.01885	translate	read	link
2024-05-02	Multi-view Action Recognition via Directed Gromov-Wasserstein Discrepancy	Hoang-Quan Nguyen et.al.	2405.01337	translate	read	null
2024-05-07	Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration	Praveen Kumar Chandaliya et.al.	2405.01273	translate	read	null

(<a href=../Action_Recognition.md>back to Action Recognition</a>)