Action Recognition - 2024-04 | Paper Arxiv Daily

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-04-30	One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features	Trung Thanh Nguyen et.al.	2404.19542	translate	read	link
2024-04-30	Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition	Zhendong Liu et.al.	2404.19383	translate	read	null
2024-04-28	Enhancing Action Recognition from Low-Quality Skeleton Data via Part-Level Knowledge Distillation	Cuiwei Liu et.al.	2404.18206	translate	read	null
2024-04-26	SDFD: Building a Versatile Synthetic Face Image Dataset with Diverse Attributes	Georgia Baltsou et.al.	2404.17255	translate	read	null
2024-04-25	Learning Discriminative Spatio-temporal Representations for Semi-supervised Action Recognition	Yu Wang et.al.	2404.16416	translate	read	null
2024-04-25	An Improved Graph Pooling Network for Skeleton-Based Action Recognition	Cong Wu et.al.	2404.16359	translate	read	null
2024-04-24	Unimodal and Multimodal Sensor Fusion for Wearable Activity Recognition	Hymalai Bello et.al.	2404.16005	translate	read	null
2024-04-24	3D Face Morphing Attack Generation using Non-Rigid Registration	Jag Mohan Singh et.al.	2404.15765	translate	read	null
2024-04-25	HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition	Jinfu Liu et.al.	2404.15719	translate	read	link
2024-04-23	Combating Missing Modalities in Egocentric Videos at Test Time	Merey Ramazanova et.al.	2404.15161	translate	read	null
2024-04-23	G3R: Generating Rich and Fine-grained mmWave Radar Data from 2D Videos for Generalized Gesture Recognition	Kaikai Deng et.al.	2404.14934	translate	read	null
2024-04-23	Driver Activity Classification Using Generalizable Representations from Vision-Language Models	Ross Greer et.al.	2404.14906	translate	read	null
2024-04-23	DENOISER: Rethinking the Robustness for Open-Vocabulary Action Recognition	Haozhe Cheng et.al.	2404.14890	translate	read	null
2024-04-22	1st Place Solution to the 1st SkatingVerse Challenge	Tao Sun et.al.	2404.14032	translate	read	null
2024-04-22	CoFInAl: Enhancing Action Quality Assessment with Coarse-to-Fine Instruction Alignment	Kanglei Zhou et.al.	2404.13999	translate	read	link
2024-04-21	Attack on Scene Flow using Point Clouds	Haniyeh Ehsani Oskouie et.al.	2404.13621	translate	read	null
2024-04-20	STAT: Towards Generalizable Temporal Action Localization	Yangcen Liu et.al.	2404.13311	translate	read	null
2024-04-19	Ring-a-Pose: A Ring for Continuous Hand Pose Tracking	Tianhong Catherine Yu et.al.	2404.12980	translate	read	null
2024-04-19	VoxAtnNet: A 3D Point Clouds Convolutional Neural Network for Generalizable Face Presentation Attack Detection	Raghavendra Ramachandra et.al.	2404.12680	translate	read	null
2024-04-18	DeepLocalization: Using change point detection for Temporal Action Localization	Mohammed Shaiqur Rahman et.al.	2404.12258	translate	read	null
2024-04-18	Aligning Actions and Walking to LLM-Generated Textual Descriptions	Radu Chivereanu et.al.	2404.12192	translate	read	link
2024-04-18	Simultaneous Detection and Interaction Reasoning for Object-Centric Action Recognition	Xunsong Li et.al.	2404.11903	translate	read	null
2024-04-18	sEMG-based Fine-grained Gesture Recognition via Improved LightGBM Model	Xiupeng Qiao et.al.	2404.11861	translate	read	null
2024-04-17	VG4D: Vision-Language Model Goes 4D Video Recognition	Zhichao Deng et.al.	2404.11605	translate	read	link
2024-04-17	A Data-Driven Representation for Sign Language Production	Harry Walsh et.al.	2404.11499	translate	read	link
2024-04-17	Lower Limb Movements Recognition Based on Feature Recursive Elimination and Backpropagation Neural Network	Yongkai Ma et.al.	2404.11383	translate	read	null
2024-04-17	Revisiting Noise Resilience Strategies in Gesture Recognition: Short-Term Enhancement in Surface Electromyographic Signal Analysis	Weiyu Guo et.al.	2404.11213	translate	read	null
2024-04-17	Kathakali Hand Gesture Recognition With Minimal Data	Kavitha Raju et.al.	2404.11205	translate	read	null
2024-04-16	HumMUSS: Human Motion Understanding using State Space Models	Arnab Kumar Mondal et.al.	2404.10880	translate	read	null
2024-04-17	Learning to Score Sign Language with Two-stage Method	Hongli Wen et.al.	2404.10383	translate	read	null
2024-04-16	MK-SGN: A Spiking Graph Convolutional Network with Multimodal Fusion and Knowledge Distillation for Skeleton-based Action Recognition	Naichuan Zheng et.al.	2404.10210	translate	read	null
2024-04-15	Design and Analysis of Efficient Attention in Transformers for Social Group Activity Recognition	Masato Tamura et.al.	2404.09964	translate	read	null
2024-04-15	A Diffusion-based Data Generator for Training Object Recognition Models in Ultra-Range Distance	Eran Bamani et.al.	2404.09846	translate	read	null
2024-04-15	Leveraging Temporal Contextualization for Video Action Recognition	Minji Kim et.al.	2404.09490	translate	read	link
2024-04-14	In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition	Wiktor Mucha et.al.	2404.09308	translate	read	null
2024-04-13	Exploring Explainability in Video Action Recognition	Avinab Saha et.al.	2404.09067	translate	read	null
2024-04-12	MSSTNet: A Multi-Scale Spatio-Temporal CNN-Transformer Network for Dynamic Facial Expression Recognition	Linhuang Wang et.al.	2404.08433	translate	read	null
2024-04-11	Graph Integrated Language Transformers for Next Action Prediction in Complex Phone Calls	Amin Hosseiny Marani et.al.	2404.08155	translate	read	null
2024-04-11	Simba: Mamba augmented U-ShiftGCN for Skeletal Action Recognition in Videos	Soumyabrata Chaudhuri et.al.	2404.07645	translate	read	null
2024-04-15	Fine-Grained Side Information Guided Dual-Prompts for Zero-Shot Skeleton Action Recognition	Yang Chen et.al.	2404.07487	translate	read	null
2024-04-10	O-TALC: Steps Towards Combating Oversegmentation within Online Action Segmentation	Matthew Kent Myers et.al.	2404.06894	translate	read	null
2024-04-10	An Animation-based Augmentation Approach for Action Recognition from Discontinuous Video	Xingyu Song et.al.	2404.06741	translate	read	null
2024-04-07	X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Model	Jan Held et.al.	2404.06332	translate	read	null
2024-04-10	Algorithms for Caching and MTS with reduced number of predictions	Karim Abdel Sadek et.al.	2404.06280	translate	read	null
2024-04-09	ActNetFormer: Transformer-ResNet Hybrid Method for Semi-Supervised Action Recognition in Videos	Sharana Dharshikgan Suresh Dass et.al.	2404.06243	translate	read	link
2024-04-08	Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder	Halil Ismail Helvaci et.al.	2404.05849	translate	read	null
2024-04-09	TIM: A Time Interval Machine for Audio-Visual Action Recognition	Jacob Chalk et.al.	2404.05559	translate	read	link
2024-04-11	Test-Time Zero-Shot Temporal Action Localization	Benedetta Liberatori et.al.	2404.05426	translate	read	link
2024-04-09	SDFR: Synthetic Data for Face Recognition Competition	Hatef Otroshi Shahreza et.al.	2404.04580	translate	read	null
2024-04-05	PhysPT: Physics-aware Pretrained Transformer for Estimating Human Dynamics from Monocular Videos	Yufei Zhang et.al.	2404.04430	translate	read	null
2024-04-05	Koala: Key frame-conditioned long video-LLM	Reuben Tan et.al.	2404.04346	translate	read	null
2024-04-04	UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization	Tiantian Geng et.al.	2404.03179	translate	read	null
2024-04-03	Optimizing the Deployment of Tiny Transformers on Low-Power MCUs	Victor J. B. Jung et.al.	2404.02945	translate	read	link
2024-04-03	Multi-Scale Spatial-Temporal Self-Attention Graph Convolutional Networks for Skeleton-based Action Recognition	Ikuo Nakamura et.al.	2404.02624	translate	read	null
2024-04-02	PREGO: online mistake detection in PRocedural EGOcentric videos	Alessandro Flaborea et.al.	2404.01933	translate	read	link
2024-04-02	Disentangled Pre-training for Human-Object Interaction Detection	Zhuolong Li et.al.	2404.01725	translate	read	link
2024-04-02	Language Model Guided Interpretable Video Action Reasoning	Ning Wang et.al.	2404.01591	translate	read	null
2024-04-02	Leveraging YOLO-World and GPT-4V LMMs for Zero-Shot Person Detection and Action Recognition in Drone Imagery	Christian Limberg et.al.	2404.01571	translate	read	null
2024-04-01	LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization	Akshita Gupta et.al.	2404.01282	translate	read	null
