Action Recognition - 2024-12

Publish Date Title Authors PDF Translate Read Code
2024-12-31 M2I2: Learning Efficient Multi-Agent Communication via Masked State Modeling and Intention Inference Chuxiong Sun et.al. 2501.00312 translate read null
2024-12-30 A Large-Scale Study on Video Action Dataset Condensation Yang Chen et.al. 2412.21197 translate read null
2024-12-30 Frequency-aware Event Cloud Network Hongwei Ren et.al. 2412.20803 translate read null
2024-12-29 FreqMixFormerV2: Lightweight Frequency-aware Mixed Transformer for Human Skeleton Action Recognition Wenhan Wu et.al. 2412.20621 translate read link
2024-12-29 Exploiting Aggregation and Segregation of Representations for Domain Adaptive Human Pose Estimation Qucheng Peng et.al. 2412.20538 translate read link
2024-12-29 Improving Vision-Language-Action Models via Chain-of-Affordance Jinming Li et.al. 2412.20451 translate read null
2024-12-28 DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments Xijun Wang et.al. 2412.20042 translate read null
2024-12-27 Generalized Uncertainty-Based Evidential Fusion with Hybrid Multi-Head Attention for Weak-Supervised Temporal Action Localization Yuanpeng He et.al. 2412.19418 translate read link
2024-12-25 SWAG: Long-term Surgical Workflow Prediction with Generative-based Anticipation Maxence Boels et.al. 2412.18849 translate read null
2024-12-25 Skeleton-based Action Recognition with Non-linear Dependency Modeling and Hilbert-Schmidt Independence Criterion Yuheng Yang et.al. 2412.18780 translate read link
2024-12-24 Computer Vision-Driven Gesture Recognition: Toward Natural and Intuitive Human-Computer Fenghua Shao et.al. 2412.18321 translate read null
2024-12-23 HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data Ting Zhou et.al. 2412.17574 translate read null
2024-12-22 Video Domain Incremental Learning for Human Action Recognition in Home Environments Yuanda Hu et.al. 2412.16946 translate read null
2024-12-21 Optical Wireless Communications: Enabling the Next Generation Network of Networks Aravindh Krishnamoorthy et.al. 2412.16798 translate read null
2024-12-21 FACTS: Fine-Grained Action Classification for Tactical Sports Christopher Lai et.al. 2412.16454 translate read null
2024-12-20 iRadar: Synthesizing Millimeter-Waves from Wearable Inertial Inputs for Human Gesture Sensing Huanqi Yang et.al. 2412.15980 translate read null
2024-12-19 Synchronized and Fine-Grained Head for Skeleton-Based Ambiguous Action Recognition Hao Huang et.al. 2412.14833 translate read null
2024-12-19 Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition Kun Li et.al. 2412.14719 translate read link
2024-12-24 Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models Xinghang Li et.al. 2412.14058 translate read link
2024-12-18 Do Language Models Understand Time? Xi Ding et.al. 2412.13845 translate read link
2024-12-17 CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices Andrei Znobishchev et.al. 2412.13273 translate read null
2024-12-20 Future Aspects in Human Action Recognition: Exploring Emerging Techniques and Ethical Influences Antonios Gasteratos et.al. 2412.12990 translate read null
2024-12-16 Designing Semi-Structured Pruning of Graph Convolutional Networks for Skeleton-based Recognition Hichem Sahbi et.al. 2412.11813 translate read null
2024-12-13 TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies Ruijie Zheng et.al. 2412.10345 translate read null
2024-12-13 Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP Yating Yu et.al. 2412.09895 translate read link
2024-12-14 USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation Wanjiang Weng et.al. 2412.09220 translate read link
2024-12-13 Temporal Action Localization with Cross Layer Task Decoupling and Refinement Qiang Li et.al. 2412.09202 translate read link
2024-12-12 Goal-Conditioned Supervised Learning for Multi-Objective Recommendation Shijun Li et.al. 2412.08911 translate read null
2024-12-10 SAT: Spatial Aptitude Training for Multimodal Language Models Arijit Ray et.al. 2412.07755 translate read link
2024-12-10 Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence Wenbo Huang et.al. 2412.07481 translate read null
2024-12-09 Mining Limited Data Sufficiently: A BERT-inspired Approach for CSI Time Series Application in Wireless Communication and Sensing Zijian Zhao et.al. 2412.06861 translate read link
2024-12-09 Exploring the Impact of Synthetic Data on Human Gesture Recognition Tasks Using GANs George Kontogiannis et.al. 2412.06389 translate read null
2024-12-07 Action Recognition based Industrial Safety Violation Detection Surya N Reddy et.al. 2412.05531 translate read null
2024-12-06 CCS: Continuous Learning for Customized Incremental Wireless Sensing Services Qunhang Fu et.al. 2412.04821 translate read null
2024-12-06 KNN-MMD: Cross Domain Wi-Fi Sensing Based on Local Distribution Alignment Zijian Zhao et.al. 2412.04783 translate read link
2024-12-03 Proximal Control of UAVs with Federated Learning for Human-Robot Collaborative Domains Lucas Nogueira Nobrega et.al. 2412.02863 translate read null
2024-12-03 Planning-Guided Diffusion Policy Learning for Generalizable Contact-Rich Bimanual Manipulation Xuanlin Li et.al. 2412.02676 translate read null
2024-12-02 Human-Machine Interfaces for Subsea Telerobotics: From Soda-straw to Natural Language Interactions Adnan Abdullah et.al. 2412.01753 translate read null
2024-12-02 HaGRIDv2: 1M Images for Static and Dynamic Hand Gesture Recognition Anton Nuzhdin et.al. 2412.01508 translate read link
2024-12-02 EdgeOAR: Real-time Online Action Recognition On Edge Devices Wei Luo et.al. 2412.01267 translate read null

(<a href=../Action_Recognition.md>back to Action Recognition</a>)