Action Recognition - 2025-02
Action Recognition - 2025-02
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2025-02-28 | BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports | Jing-Yuan Chang et.al. | 2502.21085 | translate | read | null |
| 2025-02-27 | Learning to Generalize without Bias for Open-Vocabulary Action Recognition | Yating Yu et.al. | 2502.20158 | translate | read | link |
| 2025-02-27 | QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects | Elkhan Ismayilzada et.al. | 2502.19769 | translate | read | null |
| 2025-02-26 | Deep Learning For Time Series Analysis With Application On Human Motion | Ali Ismail-Fawaz et.al. | 2502.19364 | translate | read | null |
| 2025-02-26 | UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering | Langming Liu et.al. | 2502.19178 | translate | read | link |
| 2025-02-25 | EgoSim: An Egocentric Multi-view Simulator and Real Dataset for Body-worn Cameras during Motion and Activity | Dominik Hollidt et.al. | 2502.18373 | translate | read | link |
| 2025-02-25 | Edge Training and Inference with Analog ReRAM Technology for Hand Gesture Recognition | Victoria Clerico et.al. | 2502.18152 | translate | read | null |
| 2025-02-23 | Trunk-branch Contrastive Network with Multi-view Deformable Aggregation for Multi-view Action Recognition | Yingyuan Yang et.al. | 2502.16493 | translate | read | null |
| 2025-02-20 | Online hand gesture recognition using Continual Graph Transformers | Rim Slama et.al. | 2502.14939 | translate | read | null |
| 2025-02-19 | Are Rules Meant to be Broken? Understanding Multilingual Moral Reasoning as a Computational Pipeline with UniMoral | Shivani Kumar et.al. | 2502.14083 | translate | read | null |
| 2025-02-19 | PSCon: Toward Conversational Product Search | Jie Zou et.al. | 2502.13881 | translate | read | link |
| 2025-02-19 | SNN-Driven Multimodal Human Action Recognition via Event Camera and Skeleton Data Fusion | Naichuan Zheng et.al. | 2502.13385 | translate | read | null |
| 2025-02-18 | Beyond Timesteps: A Novel Activation-wise Membrane Potential Propagation Mechanism for Spiking Neural Networks in 3D cloud | Jian Song et.al. | 2502.12791 | translate | read | null |
| 2025-02-18 | Adaptive Prototype Model for Attribute-based Multi-label Few-shot Action Recognition | Juefeng Xiao et.al. | 2502.12582 | translate | read | null |
| 2025-02-25 | Duo Streamers: A Streaming Gesture Recognition Framework | Boxuan Zhu et.al. | 2502.12297 | translate | read | link |
| 2025-02-17 | Can LLMs Simulate Social Media Engagement? A Study on Action-Guided Response Generation | Zhongyi Qiu et.al. | 2502.12073 | translate | read | null |
| 2025-02-14 | ManiTrend: Bridging Future Generation and Action Prediction with 3D Flow for Robotic Manipulation | Yuxin He et.al. | 2502.10028 | translate | read | null |
| 2025-02-14 | VicKAM: Visual Conceptual Knowledge Guided Action Map for Weakly Supervised Group Activity Recognition | Zhuming Wang et.al. | 2502.09967 | translate | read | null |
| 2025-02-13 | CellFlow: Simulating Cellular Morphology Changes via Flow Matching | Yuhui Zhang et.al. | 2502.09775 | translate | read | link |
| 2025-02-12 | Measuring Anxiety Levels with Head Motion Patterns in Severe Depression Population | Fouad Boualeb et.al. | 2502.08813 | translate | read | null |
| 2025-02-18 | Robot Data Curation with Mutual Information Estimators | Joey Hejna et.al. | 2502.08623 | translate | read | null |
| 2025-02-12 | DGSense: A Domain Generalization Framework for Wireless Sensing | Rui Zhou et.al. | 2502.08155 | translate | read | null |
| 2025-02-11 | Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis | Amir Hosein Fadaei et.al. | 2502.07277 | translate | read | null |
| 2025-02-10 | From Image to Video: An Empirical Study of Diffusion Representations | Pedro Vélez et.al. | 2502.07001 | translate | read | null |
| 2025-02-10 | Conformal Predictions for Human Action Recognition with Vision-Language Models | Bary Tim et.al. | 2502.06631 | translate | read | null |
| 2025-02-10 | AppVLM: A Lightweight Vision Language Model for Online App Control | Georgios Papoudakis et.al. | 2502.06395 | translate | read | null |
| 2025-02-09 | Preventing Rogue Agents Improves Multi-Agent Collaboration | Ohav Barbi et.al. | 2502.05986 | translate | read | link |
| 2025-02-09 | HyLiFormer: Hyperbolic Linear Attention for Skeleton-based Human Action Recognition | Yue Li et.al. | 2502.05869 | translate | read | null |
| 2025-02-11 | HAMSTER: Hierarchical Action Models For Open-World Robot Manipulation | Yi Li et.al. | 2502.05485 | translate | read | link |
| 2025-02-06 | HD-EPIC: A Highly-Detailed Egocentric Video Dataset | Toby Perrett et.al. | 2502.04144 | translate | read | null |
| 2025-02-06 | MD-BERT: Action Recognition in Dark Videos via Dynamic Multi-Stream Fusion and Temporal Modeling | Sharana Dharshikgan Suresh Dass et.al. | 2502.03724 | translate | read | null |
| 2025-02-10 | Kronecker Mask and Interpretive Prompts are Language-Action Video Learners | Jingyi Yang et.al. | 2502.03549 | translate | read | link |
| 2025-02-05 | SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living | Arkaprava Sinha et.al. | 2502.03459 | translate | read | null |
| 2025-02-01 | Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues | Rohit Girmaji et.al. | 2502.00397 | translate | read | null |
| 2025-02-03 | Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models | Hao Dong et.al. | 2501.18592 | translate | read | link |
(<a href=../Action_Recognition.md>back to Action Recognition</a>)