Action Recognition - 2024-06 | Paper Arxiv Daily

Action Recognition - 2024-06

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-06-30	Graph in Graph Neural Network	Jiongshu Wang et.al.	2407.00696	translate	read	link
2024-06-29	Diving Deeper Into Pedestrian Behavior Understanding: Intention Estimation, Action Prediction, and Event Risk Assessment	Amir Rasouli et.al.	2407.00446	translate	read	link
2024-06-29	PerAct2: A Perceiver Actor Framework for Bimanual Manipulation Tasks	Markus Grotz et.al.	2407.00278	translate	read	null
2024-06-27	VideoMambaPro: A Leap Forward for Mamba in Video Understanding	Hui Lu et.al.	2406.19006	translate	read	link
2024-06-28	CSI4Free: GAN-Augmented mmWave CSI for Improved Pose Classification	Nabeel Nisar Bhat et.al.	2406.18684	translate	read	null
2024-06-26	The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval	Meinardus Boris et.al.	2406.18113	translate	read	link
2024-06-26	Expressive Keypoints for Skeleton-based Action Recognition via Skeleton Transformation	Yijie Yang et.al.	2406.18011	translate	read	link
2024-06-25	Using joint angles based on the international biomechanical standards for human action recognition and related tasks	Kevin Schlegel et.al.	2406.17443	translate	read	null
2024-06-21	Open-Vocabulary Temporal Action Localization using Multimodal Guidance	Akshita Gupta et.al.	2406.15556	translate	read	null
2024-06-21	SVFormer: A Direct Training Spiking Transformer for Efficient Video Action Recognition	Liutao Yu et.al.	2406.15034	translate	read	null
2024-06-21	Real-Time Hand Gesture Recognition: Integrating Skeleton-Based Data Fusion and Multi-Stream CNN	Oluwaleke Yusuf et.al.	2406.15003	translate	read	link
2024-06-20	Self-supervised Multi-actor Social Activity Understanding in Streaming Videos	Shubham Trehan et.al.	2406.14472	translate	read	null
2024-06-19	An Efficient yet High-Performance Method for Precise Radar-Based Imaging of Human Hand Poses	Johanna Bräunig et.al.	2406.13464	translate	read	null
2024-06-19	Part-aware Unified Representation of Language and Skeleton for Zero-shot Action Recognition	Anqi Zhu et.al.	2406.13327	translate	read	link
2024-06-21	Underwater Human-Robot and Human-Swarm Interaction: A Review and Perspective	Sara Aldhaheri et.al.	2406.12473	translate	read	null
2024-06-18	Deep self-supervised learning with visualisation for automatic gesture recognition	Fabien Allemand et.al.	2406.12440	translate	read	null
2024-06-17	Brain-inspired Computational Modeling of Action Recognition with Recurrent Spiking Neural Networks Equipped with Reinforcement Delay Learning	Alireza Nadafian et.al.	2406.11778	translate	read	null
2024-06-18	CM2-Net: Continual Cross-Modal Mapping Network for Driver Action Recognition	Ruoyu Wang et.al.	2406.11340	translate	read	null
2024-06-17	Expanding the Design Space of Computer Vision-based Interactive Systems for Group Dance Practice	Soohwan Lee et.al.	2406.11236	translate	read	null
2024-06-14	Nymeria: A Massive Collection of Multimodal Egocentric Daily Motion in the Wild	Lingni Ma et.al.	2406.09905	translate	read	null
2024-06-12	Enhancing End-to-End Autonomous Driving with Latent World Model	Yingyan Li et.al.	2406.08481	translate	read	link
2024-06-09	ALGO: Object-Grounded Visual Commonsense Reasoning for Open-World Egocentric Action Recognition	Sanjoy Kundu et.al.	2406.05722	translate	read	null
2024-06-07	SMART: Scene-motion-aware human action recognition framework for mental disorder group	Zengyuan Lai et.al.	2406.04649	translate	read	link
2024-06-06	Enhancing Sign Language Detection through Mediapipe and Convolutional Neural Networks (CNN)	Aditya Raj Verma et.al.	2406.03729	translate	read	null
2024-06-05	The Logarithmic Memristor-Based Bayesian Machine	Clément Turck et.al.	2406.03492	translate	read	null
2024-06-05	FILS: Self-Supervised Video Feature Prediction In Semantic Language Space	Mona Ahmadian et.al.	2406.03447	translate	read	null
2024-06-05	Self-Supervised Skeleton Action Representation Learning: A Benchmark and Beyond	Jiahang Zhang et.al.	2406.02978	translate	read	null
2024-06-04	Contrastive Language Video Time Pre-training	Hengyue Liu et.al.	2406.02631	translate	read	null
2024-06-04	DL-KDD: Dual-Light Knowledge Distillation for Action Recognition in the Dark	Chi-Jui Chang et.al.	2406.02468	translate	read	null
2024-06-04	A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies	Md Mirajul Islam et.al.	2406.02450	translate	read	null
2024-06-04	Analyzing the Feature Extractor Networks for Face Image Synthesis	Erdi Sarıtaş et.al.	2406.02153	translate	read	link
2024-06-04	Analyzing the Effect of Combined Degradations on Face Recognition	Erdi Sarıtaş et.al.	2406.02142	translate	read	link
2024-06-03	ELSA: Evaluating Localization of Social Activities in Urban Streets	Maryam Hosseini et.al.	2406.01551	translate	read	null
2024-06-03	HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models	Mengcheng Li et.al.	2406.01334	translate	read	null
2024-06-03	Augmented Commonsense Knowledge for Remote Object Grounding	Bahram Mohammadi et.al.	2406.01256	translate	read	link
2024-06-03	Understanding the Cross-Domain Capabilities of Video-Based Few-Shot Action Recognition Models	Georgia Markham et.al.	2406.01073	translate	read	null
2024-06-02	An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition	Haojun Xu et.al.	2406.00639	translate	read	null

(<a href=../Action_Recognition.md>back to Action Recognition</a>)