Action Recognition - 2024-10 | Paper Arxiv Daily

Publish Date	Title	Authors	arXiv ID	Code
2024-10-31	Technical Report for ActivityNet Challenge 2022 – Temporal Action Localization	Shimin Chen et al.	2411.00883	null
2024-10-30	A Simple and Effective Temporal Grounding Pipeline for Basketball Broadcast Footage	Levi Harris et al.	2411.00862	null
2024-10-31	Recovering Complete Actions for Cross-dataset Skeleton Action Recognition	Hanchao Liu et al.	2410.23641	null
2024-10-30	Keypoint Abstraction using Large Models for Object-Relative Imitation Learning	Xiaolin Fang et al.	2410.23254	null
2024-10-30	AtGCN: A Graph Convolutional Network For Ataxic Gait Detection	Karan Bania et al.	2410.22862	null
2024-10-29	ProMQA: Question Answering Dataset for Multimodal Procedural Activity Understanding	Kimihiro Hasegawa et al.	2410.22211	link
2024-10-29	Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets	Adrian Iordache et al.	2410.22184	link
2024-10-28	Enhancing Action Recognition by Leveraging the Hierarchical Structure of Actions and Textual Context	Manuel Benavent-Lledo et al.	2410.21275	link
2024-10-28	One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation	Zhendong Wang et al.	2410.21257	null
2024-10-28	Zero-Shot Action Recognition in Surveillance Videos	Joao Pereira et al.	2410.21113	null
2024-10-28	LiGAR: LiDAR-Guided Hierarchical Transformer for Multi-Modal Group Activity Recognition	Naga Venkata Sai Raviteja Chappa et al.	2410.21108	null
2024-10-27	Exocentric To Egocentric Transfer For Action Recognition: A Short Survey	Anirudh Thatipelli et al.	2410.20621	null
2024-10-27	Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition	Lilang Lin et al.	2410.20349	null
2024-10-28	x-RAGE: eXtended Reality – Action & Gesture Events Dataset	Vivek Parmar et al.	2410.19486	null
2024-10-24	Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms	Zhangheng Li et al.	2410.18967	link
2024-10-24	Research on gesture recognition method based on SEDCNN-SVM	Mingjin Zhang et al.	2410.18557	null
2024-10-23	Unsupervised Domain Adaptation for Action Recognition via Self-Ensembling and Conditional Embedding Alignment	Indrajeet Ghosh et al.	2410.17489	link
2024-10-22	Are Visual-Language Models Effective in Action Recognition? A Comparative Study	Mahmoud Ali et al.	2410.17149	null
2024-10-22	Masked Differential Privacy	David Schneider et al.	2410.17098	null
2024-10-22	SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition	Jiaqi Chen et al.	2410.16746	link
2024-10-21	Improving the Multi-label Atomic Activity Recognition by Robust Visual Feature and Advanced Attention @ ROAD++ Atomic Activity Recognition 2024	Jiamin Cao et al.	2410.16037	null
2024-10-19	CAGE: Causal Attention Enables Data-Efficient Generalizable Robotic Manipulation	Shangning Xia et al.	2410.14974	null
2024-10-18	DFlow: Diverse Dialogue Flow Simulation with Large Language Models	Wanyu Du et al.	2410.14853	null
2024-10-18	Storyboard guided Alignment for Fine-grained Video Action Recognition	Enqi Liu et al.	2410.14238	null
2024-10-17	SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs	Yuling Gu et al.	2410.13648	null
2024-10-16	In-Context Learning Enables Robot Action Prediction in LLMs	Yida Yin et al.	2410.12782	null
2024-10-14	Continual Learning Improves Zero-Shot Action Recognition	Shreyank N Gowda et al.	2410.10497	null
2024-10-16	PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation	Kaidong Zhang et al.	2410.10394	null
2024-10-13	EITNet: An IoT-Enhanced Framework for Real-Time Basketball Action Recognition	Jingyu Liu et al.	2410.09954	null
2024-10-13	Multi class activity classification in videos using Motion History Image generation	Senthilkumar Gopal et al.	2410.09902	link
2024-10-12	Advanced Gesture Recognition in Autism: Integrating YOLOv7, Video Augmentation and VideoMAE for Video Analysis	Amit Kumar Singh et al.	2410.09339	null
2024-10-11	Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning	Yunpeng Gao et al.	2410.08500	null
2024-10-10	Human Stone Toolmaking Action Grammar (HSTAG): A Challenging Benchmark for Fine-grained Motor Behavior Recognition	Cheng Liu et al.	2410.08410	null
2024-10-10	Understanding Spatio-Temporal Relations in Human-Object Interaction using Pyramid Graph Convolutional Network	Hao Xing et al.	2410.07912	null
2024-10-09	CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition	Yuhang Wen et al.	2410.07153	link
2024-10-09	Fourier-based Action Recognition for Wildlife Behavior Quantification with Event Cameras	Friedhelm Hamann et al.	2410.06698	null
2024-10-08	GR-2: A Generative Video-Language-Action Model with Web-Scale Knowledge for Robot Manipulation	Chi-Lam Cheang et al.	2410.06158	null
2024-10-10	ActionAtlas: A VideoQA Benchmark for Domain-specialized Action Recognition	Mohammadreza Salehi et al.	2410.05774	null
2024-10-07	Exploring Gestural Interaction with a Cushion Interface for Smart Home Control	Yuri Suzuki et al.	2410.04730	null
2024-10-05	TR-LLM: Integrating Trajectory Data for Scene-Aware LLM-Based Human Action Prediction	Kojiro Takeyama et al.	2410.03993	null
2024-10-04	Shadow Augmentation for Handwashing Action Recognition: from Synthetic to Real Datasets	Shengtai Ju et al.	2410.03984	null
2024-10-04	Action Selection Learning for Multi-label Multi-view Action Recognition	Trung Thanh Nguyen et al.	2410.03302	link
2024-10-03	DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects	Zhaowei Wang et al.	2410.02730	link
2024-10-03	An Evaluation of Large Pre-Trained Models for Gesture Recognition using Synthetic Videos	Arun Reddy et al.	2410.02152	null
2024-10-02	Language Supervised Human Action Recognition with Salient Fusion: Construction Worker Action Recognition as a Use Case	Mohammad Mahdavian et al.	2410.01962	null
2024-10-02	Sparse Covariance Neural Networks	Andrea Cavallo et al.	2410.01669	link
2024-10-02	Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy	Ricardo Garcia et al.	2410.01345	link
2024-10-01	Dynamic Planning for LLM-based Graphical User Interface Automation	Shaoqing Zhang et al.	2410.00467	link
