Action Recognition - 2024-12
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2024-12-31 | M2I2: Learning Efficient Multi-Agent Communication via Masked State Modeling and Intention Inference | Chuxiong Sun et.al. | 2501.00312 | translate | read | null |
| 2024-12-30 | A Large-Scale Study on Video Action Dataset Condensation | Yang Chen et.al. | 2412.21197 | translate | read | null |
| 2024-12-30 | Frequency-aware Event Cloud Network | Hongwei Ren et.al. | 2412.20803 | translate | read | null |
| 2024-12-29 | FreqMixFormerV2: Lightweight Frequency-aware Mixed Transformer for Human Skeleton Action Recognition | Wenhan Wu et.al. | 2412.20621 | translate | read | link |
| 2024-12-29 | Exploiting Aggregation and Segregation of Representations for Domain Adaptive Human Pose Estimation | Qucheng Peng et.al. | 2412.20538 | translate | read | link |
| 2024-12-29 | Improving Vision-Language-Action Models via Chain-of-Affordance | Jinming Li et.al. | 2412.20451 | translate | read | null |
| 2024-12-28 | DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments | Xijun Wang et.al. | 2412.20042 | translate | read | null |
| 2024-12-27 | Generalized Uncertainty-Based Evidential Fusion with Hybrid Multi-Head Attention for Weak-Supervised Temporal Action Localization | Yuanpeng He et.al. | 2412.19418 | translate | read | link |
| 2024-12-25 | SWAG: Long-term Surgical Workflow Prediction with Generative-based Anticipation | Maxence Boels et.al. | 2412.18849 | translate | read | null |
| 2024-12-25 | Skeleton-based Action Recognition with Non-linear Dependency Modeling and Hilbert-Schmidt Independence Criterion | Yuheng Yang et.al. | 2412.18780 | translate | read | link |
| 2024-12-24 | Computer Vision-Driven Gesture Recognition: Toward Natural and Intuitive Human-Computer | Fenghua Shao et.al. | 2412.18321 | translate | read | null |
| 2024-12-23 | HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data | Ting Zhou et.al. | 2412.17574 | translate | read | null |
| 2024-12-22 | Video Domain Incremental Learning for Human Action Recognition in Home Environments | Yuanda Hu et.al. | 2412.16946 | translate | read | null |
| 2024-12-21 | Optical Wireless Communications: Enabling the Next Generation Network of Networks | Aravindh Krishnamoorthy et.al. | 2412.16798 | translate | read | null |
| 2024-12-21 | FACTS: Fine-Grained Action Classification for Tactical Sports | Christopher Lai et.al. | 2412.16454 | translate | read | null |
| 2024-12-20 | iRadar: Synthesizing Millimeter-Waves from Wearable Inertial Inputs for Human Gesture Sensing | Huanqi Yang et.al. | 2412.15980 | translate | read | null |
| 2024-12-19 | Synchronized and Fine-Grained Head for Skeleton-Based Ambiguous Action Recognition | Hao Huang et.al. | 2412.14833 | translate | read | null |
| 2024-12-19 | Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition | Kun Li et.al. | 2412.14719 | translate | read | link |
| 2024-12-24 | Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models | Xinghang Li et.al. | 2412.14058 | translate | read | link |
| 2024-12-18 | Do Language Models Understand Time? | Xi Ding et.al. | 2412.13845 | translate | read | link |
| 2024-12-17 | CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices | Andrei Znobishchev et.al. | 2412.13273 | translate | read | null |
| 2024-12-20 | Future Aspects in Human Action Recognition: Exploring Emerging Techniques and Ethical Influences | Antonios Gasteratos et.al. | 2412.12990 | translate | read | null |
| 2024-12-16 | Designing Semi-Structured Pruning of Graph Convolutional Networks for Skeleton-based Recognition | Hichem Sahbi et.al. | 2412.11813 | translate | read | null |
| 2024-12-13 | TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies | Ruijie Zheng et.al. | 2412.10345 | translate | read | null |
| 2024-12-13 | Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP | Yating Yu et.al. | 2412.09895 | translate | read | link |
| 2024-12-14 | USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation | Wanjiang Weng et.al. | 2412.09220 | translate | read | link |
| 2024-12-13 | Temporal Action Localization with Cross Layer Task Decoupling and Refinement | Qiang Li et.al. | 2412.09202 | translate | read | link |
| 2024-12-12 | Goal-Conditioned Supervised Learning for Multi-Objective Recommendation | Shijun Li et.al. | 2412.08911 | translate | read | null |
| 2024-12-10 | SAT: Spatial Aptitude Training for Multimodal Language Models | Arijit Ray et.al. | 2412.07755 | translate | read | link |
| 2024-12-10 | Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence | Wenbo Huang et.al. | 2412.07481 | translate | read | null |
| 2024-12-09 | Mining Limited Data Sufficiently: A BERT-inspired Approach for CSI Time Series Application in Wireless Communication and Sensing | Zijian Zhao et.al. | 2412.06861 | translate | read | link |
| 2024-12-09 | Exploring the Impact of Synthetic Data on Human Gesture Recognition Tasks Using GANs | George Kontogiannis et.al. | 2412.06389 | translate | read | null |
| 2024-12-07 | Action Recognition based Industrial Safety Violation Detection | Surya N Reddy et.al. | 2412.05531 | translate | read | null |
| 2024-12-06 | CCS: Continuous Learning for Customized Incremental Wireless Sensing Services | Qunhang Fu et.al. | 2412.04821 | translate | read | null |
| 2024-12-06 | KNN-MMD: Cross Domain Wi-Fi Sensing Based on Local Distribution Alignment | Zijian Zhao et.al. | 2412.04783 | translate | read | link |
| 2024-12-03 | Proximal Control of UAVs with Federated Learning for Human-Robot Collaborative Domains | Lucas Nogueira Nobrega et.al. | 2412.02863 | translate | read | null |
| 2024-12-03 | Planning-Guided Diffusion Policy Learning for Generalizable Contact-Rich Bimanual Manipulation | Xuanlin Li et.al. | 2412.02676 | translate | read | null |
| 2024-12-02 | Human-Machine Interfaces for Subsea Telerobotics: From Soda-straw to Natural Language Interactions | Adnan Abdullah et.al. | 2412.01753 | translate | read | null |
| 2024-12-02 | HaGRIDv2: 1M Images for Static and Dynamic Hand Gesture Recognition | Anton Nuzhdin et.al. | 2412.01508 | translate | read | link |
| 2024-12-02 | EdgeOAR: Real-time Online Action Recognition On Edge Devices | Wei Luo et.al. | 2412.01267 | translate | read | null |