Action Recognition - 2024-11 | Paper Arxiv Daily

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-11-29	CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation	Qixiu Li et al.	2411.19650	translate	read	null
2024-11-29	SkelMamba: A State Space Model for Efficient Skeleton Action Recognition of Neurological Disorders	Niki Martinel et al.	2411.19544	translate	read	null
2024-11-29	Hierarchical Framework for Retrosynthesis Prediction with Enhanced Reaction Center Localization	Seongeun Yun et al.	2411.19503	translate	read	null
2024-11-28	TAMT: Temporal-Aware Model Tuning for Cross-Domain Few-Shot Action Recognition	Yilong Wang et al.	2411.19041	translate	read	null
2024-11-28	Revealing Key Details to See Differences: A Novel Prototypical Perspective for Skeleton-based Action Recognition	Hongda Liu et al.	2411.18941	translate	read	link
2024-11-27	Robust Dynamic Gesture Recognition at Ultra-Long Distances	Eran Bamani Beeri et al.	2411.18413	translate	read	null
2024-11-27	EventCrab: Harnessing Frame and Point Synergy for Event-based Action Recognition and Beyond	Meiqi Cao et al.	2411.18328	translate	read	null
2024-11-27	An End-to-End Two-Stream Network Based on RGB Flow and Representation Flow for Human Action Recognition	Song-Jiang Lai et al.	2411.18002	translate	read	null
2024-11-26	Pre-training for Action Recognition with Automatically Generated Fractal Datasets	Davyd Svyezhentsev et al.	2411.17584	translate	read	link
2024-11-26	Real-Time Multimodal Signal Processing for HRI in RoboCup: Understanding a Human Referee	Filippo Ansalone et al.	2411.17347	translate	read	null
2024-11-22	TSkips: Efficiency Through Explicit Temporal Delay Connections in Spiking Neural Networks	Prajna G. Malettira et al.	2411.16711	translate	read	null
2024-11-24	OccludeNet: A Causal Journey into Mixed-View Actor-Centric Video Action Recognition under Occlusions	Guanyu Zhou et al.	2411.15729	translate	read	link
2024-11-23	Machine Learning-based sEMG Signal Classification for Hand Gesture Recognition	Parshuram N. Aarotale et al.	2411.15655	translate	read	null
2024-11-23	Optimizing Gesture Recognition for Seamless UI Interaction Using Convolutional Neural Networks	Qi Sun et al.	2411.15598	translate	read	null
2024-11-22	When Spatial meets Temporal in Action Recognition	Huilin Chen et al.	2411.15284	translate	read	null
2024-11-22	Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections	Youwei Zhou et al.	2411.14796	translate	read	null
2024-11-22	Aim My Robot: Precision Local Navigation to Any Object	Xiangyun Meng et al.	2411.14770	translate	read	null
2024-11-21	Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning	Jiange Yang et al.	2411.14519	translate	read	null
2024-11-18	Enhancing Bidirectional Sign Language Communication: Integrating YOLOv8 and NLP for Real-Time Gesture Recognition & Translation	Hasnat Jamil Bhuiyan et al.	2411.13597	translate	read	null
2024-11-23	AzSLD: Azerbaijani Sign Language Dataset for Fingerspelling, Word, and Sentence Translation with Baseline Software	Nigar Alishzade et al.	2411.12865	translate	read	null
2024-11-20	Topological Symmetry Enhanced Graph Convolution for Skeleton-Based Action Recognition	Zeyu Liang et al.	2411.12560	translate	read	link
2024-11-19	Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization	Quang Vinh Nguyen et al.	2411.12525	translate	read	null
2024-11-18	Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition	Hanyu Guo et al.	2411.11335	translate	read	null
2024-11-18	Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition	Yang Chen et al.	2411.11288	translate	read	null
2024-11-18	Efficient Transfer Learning for Video-language Foundation Models	Haoxing Chen et al.	2411.11223	translate	read	link
2024-11-16	TDSM: Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action Recognition	Jeonghyeok Do et al.	2411.10745	translate	read	link
2024-11-15	KuaiFormer: Transformer-Based Retrieval at Kuaishou	Chi Liu et al.	2411.10057	translate	read	null
2024-11-14	Towards Scalable Handwriting Communication via EEG Decoding and Latent Embedding Integration	Jun-Young Kim et al.	2411.09170	translate	read	null
2024-11-14	VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation	Youpeng Wen et al.	2411.09153	translate	read	null
2024-11-13	Can MLLMs Guide Weakly-Supervised Temporal Action Localization Tasks?	Quan Zhang et al.	2411.08466	translate	read	null
2024-11-13	Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case Study	Jinbo Wen et al.	2411.08341	translate	read	null
2024-11-12	LapGSR: Laplacian Reconstructive Network for Guided Thermal Super-Resolution	Aditya Kasliwal et al.	2411.07750	translate	read	null
2024-11-12	OWLed: Outlier-weighed Layerwise Pruning for Efficient Autonomous Driving Framework	Jiaxi Li et al.	2411.07711	translate	read	null
2024-11-11	ConvMixFormer- A Resource-efficient Convolution Mixer for Transformer-based Dynamic Hand Gesture Recognition	Mallika Garg et al.	2411.07118	translate	read	link
2024-11-10	Extended multi-stream temporal-attention module for skeleton-based human action recognition (HAR)	Faisal Mehmood et al.	2411.06553	translate	read	null
2024-11-10	SuperResolution Radar Gesture Recognition	Netanel Blumenfeld et al.	2411.06410	translate	read	null
2024-11-08	Video RWKV: Video Action Recognition Based RWKV	Zhuowen Yin et al.	2411.05636	translate	read	null
2024-11-06	Object Recognition in Human Computer Interaction:- A Comparative Analysis	Kaushik Ranade et al.	2411.04263	translate	read	null
2024-11-06	Explaining Human Activity Recognition with SHAP: Validating Insights with Perturbation and Quantitative Measures	Felix Tempel et al.	2411.03714	translate	read	link
2024-11-05	One-Stage-TFS: Thai One-Stage Fingerspelling Dataset for Fingerspelling Recognition Frameworks	Siriwiwat Lata et al.	2411.02768	translate	read	null
2024-11-04	TI-PREGO: Chain of Thought and In-Context Learning for Online Mistake Detection in PRocedural EGOcentric Videos	Leonardo Plini et al.	2411.02570	translate	read	null
2024-11-04	AM Flow: Adapters for Temporal Processing in Action Recognition	Tanay Agrawal et al.	2411.02065	translate	read	null
2024-11-04	ARN-LSTM: A Multi-Stream Attention-Based Model for Action Recognition with Temporal Dynamics	Chuanchuan Wang et al.	2411.01769	translate	read	null
2024-11-01	STAA: Spatio-Temporal Attention Attribution for Real-Time Interpreting Transformer-based Video Models	Zerui Wang et al.	2411.00630	translate	read	link
2024-11-01	Human Action Recognition (HAR) Using Skeleton-based Spatial Temporal Relative Transformer Network: ST-RTR	Faisal Mehmood et al.	2410.23806	translate	read	null