Action Recognition - 2024-03 | Paper Arxiv Daily

Action Recognition - 2024-03

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-03-31	LLMs are Good Action Recognizers	Haoxuan Qu et.al.	2404.00532	translate	read	null
2024-03-29	Latent Embedding Clustering for Occlusion Robust Head Pose Estimation	José Celestino et.al.	2403.20251	translate	read	null
2024-03-29	A Unified Framework for Human-centric Point Cloud Video Understanding	Yiteng Xu et.al.	2403.20031	translate	read	null
2024-03-28	Zero-shot Prompt-based Video Encoder for Surgical Gesture Recognition	Mingxing Rao et.al.	2403.19786	translate	read	link
2024-03-28	Hypergraph-based Multi-View Action Recognition using Event Cameras	Yue Gao et.al.	2403.19316	translate	read	null
2024-03-27	PLOT-TAL – Prompt Learning with Optimal Transport for Few-Shot Temporal Action Localization	Edward Fish et.al.	2403.18915	translate	read	null
2024-03-27	iFace: Hand-Over-Face Gesture Recognition Leveraging Impedance Sensing	Mengxi Liu et.al.	2403.18433	translate	read	null
2024-03-27	An Evolutionary Network Architecture Search Framework with Adaptive Multimodal Fusion for Hand Gesture Recognition	Yizhang Xia et.al.	2403.18208	translate	read	null
2024-03-26	OmniVid: A Generative Framework for Universal Video Understanding	Junke Wang et.al.	2403.17935	translate	read	link
2024-03-25	Understanding Long Videos in One Multimodal Language Model Pass	Kanchana Ranasinghe et.al.	2403.16998	translate	read	link
2024-03-25	Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects	Zicong Fan et.al.	2403.16428	translate	read	null
2024-03-24	Emotion Recognition from the perspective of Activity Recognition	Savinay Nagendra et.al.	2403.16263	translate	read	null
2024-03-22	InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding	Yi Wang et.al.	2403.15377	translate	read	link
2024-03-22	Gesture-Controlled Aerial Robot Formation for Human-Swarm Interaction in Safety Monitoring Applications	Vít Krátký et.al.	2403.15333	translate	read	null
2024-03-22	GCN-DevLSTM: Path Development for Skeleton-Based Action Recognition	Lei Jiang et.al.	2403.15212	translate	read	link
2024-03-21	Transfer Learning for Cross-dataset Isolated Sign Language Recognition in Under-Resourced Datasets	Ahmet Alp Kindiroglu et.al.	2403.14534	translate	read	link
2024-03-20	Hierarchical NeuroSymbolic Approach for Action Quality Assessment	Lauren Okamoto et.al.	2403.13798	translate	read	null
2024-03-19	Selective, Interpretable, and Motion Consistent Privacy Attribute Obfuscation for Action Recognition	Filip Ilic et.al.	2403.12710	translate	read	null
2024-03-19	ExACT: Language-guided Conceptual Reasoning and Uncertainty Estimation for Event-based Action Recognition and More	Jiazhou Zhou et.al.	2403.12534	translate	read	null
2024-03-19	VideoBadminton: A Video Dataset for Badminton Action Recognition	Qi Li et.al.	2403.12385	translate	read	null
2024-03-19	Multi-View Video-Based Learning: Leveraging Weak Labels for Frame-Level Perception	Vijay John et.al.	2403.11616	translate	read	null
2024-03-19	VIHE: Virtual In-Hand Eye Transformer for 3D Robotic Manipulation	Weiyao Wang et.al.	2403.11461	translate	read	null
2024-03-17	A Lie Group Approach to Riemannian Batch Normalization	Ziheng Chen et.al.	2403.11261	translate	read	link
2024-03-17	Boosting Semi-Supervised Temporal Action Localization by Learning from Non-Target Classes	Kun Xia et.al.	2403.11189	translate	read	null
2024-03-16	CoPlay: Audio-agnostic Cognitive Scaling for Acoustic Sensing	Yin Li et.al.	2403.10796	translate	read	null
2024-03-15	CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner	Tingbing Yan et.al.	2403.10082	translate	read	null
2024-03-15	Skeleton-Based Human Action Recognition with Noisy Labels	Yi Xu et.al.	2403.09975	translate	read	null
2024-03-14	On the Utility of 3D Hand Poses for Action Recognition	Md Salman Shamil et.al.	2403.09805	translate	read	null
2024-03-14	3D-VLA: A 3D Vision-Language-Action Generative World Model	Haoyu Zhen et.al.	2403.09631	translate	read	link
2024-03-14	SkateFormer: Skeletal-Temporal Transformer for Human Action Recognition	Jeonghyeok Do et.al.	2403.09508	translate	read	link
2024-03-14	EventRPG: Event Data Augmentation with Relevance Propagation Guidance	Mingyuan Sun et.al.	2403.09274	translate	read	link
2024-03-14	Leveraging Foundation Model Automatic Data Augmentation Strategies and Skeletal Points for Hands Action Recognition in Industrial Assembly Lines	Liang Wu et.al.	2403.09056	translate	read	null
2024-03-13	Low-Cost and Real-Time Industrial Human Action Recognitions Based on Large-Scale Foundation Models	Wensheng Liang et.al.	2403.08420	translate	read	null
2024-03-13	NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual Manipulation	Ran Xu et.al.	2403.08355	translate	read	null
2024-03-13	ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation	Guanxing Lu et.al.	2403.08321	translate	read	link
2024-03-12	NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning	Bingqian Lin et.al.	2403.07376	translate	read	link
2024-03-12	BID: Boundary-Interior Decoding for Unsupervised Temporal Action Localization Pre-Trainin	Qihang Fang et.al.	2403.07354	translate	read	null
2024-03-11	Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Spatiotemporal Modeling	Wele Gedara Chaminda Bandara et.al.	2403.06978	translate	read	link
2024-03-11	Deep Learning Approaches for Human Action Recognition in Video Data	Yufei Xie et.al.	2403.06810	translate	read	null
2024-03-11	Real-Time Multimodal Cognitive Assistant for Emergency Medical Services	Keshara Weerasinghe et.al.	2403.06734	translate	read	null
2024-03-11	Multimodal Transformers for Real-Time Surgical Activity Prediction	Keshara Weerasinghe et.al.	2403.06705	translate	read	link
2024-03-11	epsilon-Mesh Attack: A Surface-based Adversarial Point Cloud Attack for Facial Expression Recognition	Batuhan Cengiz et.al.	2403.06661	translate	read	null
2024-03-11	Density-Guided Label Smoothing for Temporal Localization of Driving Actions	Tunc Alkanat et.al.	2403.06616	translate	read	null
2024-03-11	Transformer-based Fusion of 2D-pose and Spatio-temporal Embeddings for Distracted Driver Action Recognition	Erkut Akdag et.al.	2403.06577	translate	read	null
2024-03-10	Coherent Temporal Synthesis for Incremental Action Segmentation	Guodong Ding et.al.	2403.06102	translate	read	null
2024-03-09	Dissecting Deep RL with High Update Ratios: Combatting Value Overestimation and Divergence	Marcel Hussing et.al.	2403.05996	translate	read	null
2024-03-08	Benchmarking Micro-action Recognition: Dataset, Methods, and Applications	Dan Guo et.al.	2403.05234	translate	read	link
2024-03-06	Video Relationship Detection Using Mixture of Experts	Ala Shaabana et.al.	2403.03994	translate	read	link
2024-03-05	Behavior Generation with Latent Actions	Seungjae Lee et.al.	2403.03181	translate	read	link
2024-03-05	Learning to Use Tools via Cooperative and Interactive Agents	Zhengliang Shi et.al.	2403.03031	translate	read	null
2024-03-04	Gesture recognition with Brownian reservoir computing using geometrically confined skyrmion dynamics	Grischa Beneke et.al.	2403.01877	translate	read	null
2024-03-04	A Simple Baseline for Efficient Hand Mesh Reconstruction	Zhishan Zhou et.al.	2403.01813	translate	read	null
2024-03-03	A Unified Model Selection Technique for Spectral Clustering Based Motion Segmentation	Yuxiang Huang et.al.	2403.01606	translate	read	null
2024-03-03	Rethinking CLIP-based Video Learners in Cross-Domain Open-Vocabulary Action Recognition	Kun-Yu Lin et.al.	2403.01560	translate	read	link
2024-03-02	Dynamic 3D Point Cloud Sequences as 2D Videos	Yiming Zeng et.al.	2403.01129	translate	read	null

(<a href=../Action_Recognition.md>back to Action Recognition</a>)