Action Recognition - 2024-12 | Paper Arxiv Daily

Action Recognition - 2024-12

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-12-31	M2I2: Learning Efficient Multi-Agent Communication via Masked State Modeling and Intention Inference	Chuxiong Sun et.al.	2501.00312	translate	read	null
2024-12-30	A Large-Scale Study on Video Action Dataset Condensation	Yang Chen et.al.	2412.21197	translate	read	null
2024-12-30	Frequency-aware Event Cloud Network	Hongwei Ren et.al.	2412.20803	translate	read	null
2024-12-29	FreqMixFormerV2: Lightweight Frequency-aware Mixed Transformer for Human Skeleton Action Recognition	Wenhan Wu et.al.	2412.20621	translate	read	link
2024-12-29	Exploiting Aggregation and Segregation of Representations for Domain Adaptive Human Pose Estimation	Qucheng Peng et.al.	2412.20538	translate	read	link
2024-12-29	Improving Vision-Language-Action Models via Chain-of-Affordance	Jinming Li et.al.	2412.20451	translate	read	null
2024-12-28	DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments	Xijun Wang et.al.	2412.20042	translate	read	null
2024-12-27	Generalized Uncertainty-Based Evidential Fusion with Hybrid Multi-Head Attention for Weak-Supervised Temporal Action Localization	Yuanpeng He et.al.	2412.19418	translate	read	link
2024-12-25	SWAG: Long-term Surgical Workflow Prediction with Generative-based Anticipation	Maxence Boels et.al.	2412.18849	translate	read	null
2024-12-25	Skeleton-based Action Recognition with Non-linear Dependency Modeling and Hilbert-Schmidt Independence Criterion	Yuheng Yang et.al.	2412.18780	translate	read	link
2024-12-24	Computer Vision-Driven Gesture Recognition: Toward Natural and Intuitive Human-Computer	Fenghua Shao et.al.	2412.18321	translate	read	null
2024-12-23	HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data	Ting Zhou et.al.	2412.17574	translate	read	null
2024-12-22	Video Domain Incremental Learning for Human Action Recognition in Home Environments	Yuanda Hu et.al.	2412.16946	translate	read	null
2024-12-21	Optical Wireless Communications: Enabling the Next Generation Network of Networks	Aravindh Krishnamoorthy et.al.	2412.16798	translate	read	null
2024-12-21	FACTS: Fine-Grained Action Classification for Tactical Sports	Christopher Lai et.al.	2412.16454	translate	read	null
2024-12-20	iRadar: Synthesizing Millimeter-Waves from Wearable Inertial Inputs for Human Gesture Sensing	Huanqi Yang et.al.	2412.15980	translate	read	null
2024-12-19	Synchronized and Fine-Grained Head for Skeleton-Based Ambiguous Action Recognition	Hao Huang et.al.	2412.14833	translate	read	null
2024-12-19	Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition	Kun Li et.al.	2412.14719	translate	read	link
2024-12-24	Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models	Xinghang Li et.al.	2412.14058	translate	read	link
2024-12-18	Do Language Models Understand Time?	Xi Ding et.al.	2412.13845	translate	read	link
2024-12-17	CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices	Andrei Znobishchev et.al.	2412.13273	translate	read	null
2024-12-20	Future Aspects in Human Action Recognition: Exploring Emerging Techniques and Ethical Influences	Antonios Gasteratos et.al.	2412.12990	translate	read	null
2024-12-16	Designing Semi-Structured Pruning of Graph Convolutional Networks for Skeleton-based Recognition	Hichem Sahbi et.al.	2412.11813	translate	read	null
2024-12-13	TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies	Ruijie Zheng et.al.	2412.10345	translate	read	null
2024-12-13	Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP	Yating Yu et.al.	2412.09895	translate	read	link
2024-12-14	USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation	Wanjiang Weng et.al.	2412.09220	translate	read	link
2024-12-13	Temporal Action Localization with Cross Layer Task Decoupling and Refinement	Qiang Li et.al.	2412.09202	translate	read	link
2024-12-12	Goal-Conditioned Supervised Learning for Multi-Objective Recommendation	Shijun Li et.al.	2412.08911	translate	read	null
2024-12-10	SAT: Spatial Aptitude Training for Multimodal Language Models	Arijit Ray et.al.	2412.07755	translate	read	link
2024-12-10	Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence	Wenbo Huang et.al.	2412.07481	translate	read	null
2024-12-09	Mining Limited Data Sufficiently: A BERT-inspired Approach for CSI Time Series Application in Wireless Communication and Sensing	Zijian Zhao et.al.	2412.06861	translate	read	link
2024-12-09	Exploring the Impact of Synthetic Data on Human Gesture Recognition Tasks Using GANs	George Kontogiannis et.al.	2412.06389	translate	read	null
2024-12-07	Action Recognition based Industrial Safety Violation Detection	Surya N Reddy et.al.	2412.05531	translate	read	null
2024-12-06	CCS: Continuous Learning for Customized Incremental Wireless Sensing Services	Qunhang Fu et.al.	2412.04821	translate	read	null
2024-12-06	KNN-MMD: Cross Domain Wi-Fi Sensing Based on Local Distribution Alignment	Zijian Zhao et.al.	2412.04783	translate	read	link
2024-12-03	Proximal Control of UAVs with Federated Learning for Human-Robot Collaborative Domains	Lucas Nogueira Nobrega et.al.	2412.02863	translate	read	null
2024-12-03	Planning-Guided Diffusion Policy Learning for Generalizable Contact-Rich Bimanual Manipulation	Xuanlin Li et.al.	2412.02676	translate	read	null
2024-12-02	Human-Machine Interfaces for Subsea Telerobotics: From Soda-straw to Natural Language Interactions	Adnan Abdullah et.al.	2412.01753	translate	read	null
2024-12-02	HaGRIDv2: 1M Images for Static and Dynamic Hand Gesture Recognition	Anton Nuzhdin et.al.	2412.01508	translate	read	link
2024-12-02	EdgeOAR: Real-time Online Action Recognition On Edge Devices	Wei Luo et.al.	2412.01267	translate	read	null

(<a href=../Action_Recognition.md>back to Action Recognition</a>)