Action Recognition - 2024-06
Action Recognition - 2024-06
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-06-30 | Graph in Graph Neural Network | Jiongshu Wang et.al. | 2407.00696 | translate | read | link |
| 2024-06-29 | Diving Deeper Into Pedestrian Behavior Understanding: Intention Estimation, Action Prediction, and Event Risk Assessment | Amir Rasouli et.al. | 2407.00446 | translate | read | link |
| 2024-06-29 | PerAct2: A Perceiver Actor Framework for Bimanual Manipulation Tasks | Markus Grotz et.al. | 2407.00278 | translate | read | null |
| 2024-06-27 | VideoMambaPro: A Leap Forward for Mamba in Video Understanding | Hui Lu et.al. | 2406.19006 | translate | read | link |
| 2024-06-28 | CSI4Free: GAN-Augmented mmWave CSI for Improved Pose Classification | Nabeel Nisar Bhat et.al. | 2406.18684 | translate | read | null |
| 2024-06-26 | The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval | Meinardus Boris et.al. | 2406.18113 | translate | read | link |
| 2024-06-26 | Expressive Keypoints for Skeleton-based Action Recognition via Skeleton Transformation | Yijie Yang et.al. | 2406.18011 | translate | read | link |
| 2024-06-25 | Using joint angles based on the international biomechanical standards for human action recognition and related tasks | Kevin Schlegel et.al. | 2406.17443 | translate | read | null |
| 2024-06-21 | Open-Vocabulary Temporal Action Localization using Multimodal Guidance | Akshita Gupta et.al. | 2406.15556 | translate | read | null |
| 2024-06-21 | SVFormer: A Direct Training Spiking Transformer for Efficient Video Action Recognition | Liutao Yu et.al. | 2406.15034 | translate | read | null |
| 2024-06-21 | Real-Time Hand Gesture Recognition: Integrating Skeleton-Based Data Fusion and Multi-Stream CNN | Oluwaleke Yusuf et.al. | 2406.15003 | translate | read | link |
| 2024-06-20 | Self-supervised Multi-actor Social Activity Understanding in Streaming Videos | Shubham Trehan et.al. | 2406.14472 | translate | read | null |
| 2024-06-19 | An Efficient yet High-Performance Method for Precise Radar-Based Imaging of Human Hand Poses | Johanna Bräunig et.al. | 2406.13464 | translate | read | null |
| 2024-06-19 | Part-aware Unified Representation of Language and Skeleton for Zero-shot Action Recognition | Anqi Zhu et.al. | 2406.13327 | translate | read | link |
| 2024-06-21 | Underwater Human-Robot and Human-Swarm Interaction: A Review and Perspective | Sara Aldhaheri et.al. | 2406.12473 | translate | read | null |
| 2024-06-18 | Deep self-supervised learning with visualisation for automatic gesture recognition | Fabien Allemand et.al. | 2406.12440 | translate | read | null |
| 2024-06-17 | Brain-inspired Computational Modeling of Action Recognition with Recurrent Spiking Neural Networks Equipped with Reinforcement Delay Learning | Alireza Nadafian et.al. | 2406.11778 | translate | read | null |
| 2024-06-18 | CM2-Net: Continual Cross-Modal Mapping Network for Driver Action Recognition | Ruoyu Wang et.al. | 2406.11340 | translate | read | null |
| 2024-06-17 | Expanding the Design Space of Computer Vision-based Interactive Systems for Group Dance Practice | Soohwan Lee et.al. | 2406.11236 | translate | read | null |
| 2024-06-14 | Nymeria: A Massive Collection of Multimodal Egocentric Daily Motion in the Wild | Lingni Ma et.al. | 2406.09905 | translate | read | null |
| 2024-06-12 | Enhancing End-to-End Autonomous Driving with Latent World Model | Yingyan Li et.al. | 2406.08481 | translate | read | link |
| 2024-06-09 | ALGO: Object-Grounded Visual Commonsense Reasoning for Open-World Egocentric Action Recognition | Sanjoy Kundu et.al. | 2406.05722 | translate | read | null |
| 2024-06-07 | SMART: Scene-motion-aware human action recognition framework for mental disorder group | Zengyuan Lai et.al. | 2406.04649 | translate | read | link |
| 2024-06-06 | Enhancing Sign Language Detection through Mediapipe and Convolutional Neural Networks (CNN) | Aditya Raj Verma et.al. | 2406.03729 | translate | read | null |
| 2024-06-05 | The Logarithmic Memristor-Based Bayesian Machine | Clément Turck et.al. | 2406.03492 | translate | read | null |
| 2024-06-05 | FILS: Self-Supervised Video Feature Prediction In Semantic Language Space | Mona Ahmadian et.al. | 2406.03447 | translate | read | null |
| 2024-06-05 | Self-Supervised Skeleton Action Representation Learning: A Benchmark and Beyond | Jiahang Zhang et.al. | 2406.02978 | translate | read | null |
| 2024-06-04 | Contrastive Language Video Time Pre-training | Hengyue Liu et.al. | 2406.02631 | translate | read | null |
| 2024-06-04 | DL-KDD: Dual-Light Knowledge Distillation for Action Recognition in the Dark | Chi-Jui Chang et.al. | 2406.02468 | translate | read | null |
| 2024-06-04 | A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies | Md Mirajul Islam et.al. | 2406.02450 | translate | read | null |
| 2024-06-04 | Analyzing the Feature Extractor Networks for Face Image Synthesis | Erdi Sarıtaş et.al. | 2406.02153 | translate | read | link |
| 2024-06-04 | Analyzing the Effect of Combined Degradations on Face Recognition | Erdi Sarıtaş et.al. | 2406.02142 | translate | read | link |
| 2024-06-03 | ELSA: Evaluating Localization of Social Activities in Urban Streets | Maryam Hosseini et.al. | 2406.01551 | translate | read | null |
| 2024-06-03 | HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models | Mengcheng Li et.al. | 2406.01334 | translate | read | null |
| 2024-06-03 | Augmented Commonsense Knowledge for Remote Object Grounding | Bahram Mohammadi et.al. | 2406.01256 | translate | read | link |
| 2024-06-03 | Understanding the Cross-Domain Capabilities of Video-Based Few-Shot Action Recognition Models | Georgia Markham et.al. | 2406.01073 | translate | read | null |
| 2024-06-02 | An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition | Haojun Xu et.al. | 2406.00639 | translate | read | null |
(<a href=../Action_Recognition.md>back to Action Recognition</a>)