Pose Estimation - 2025-09
Pose Estimation - 2025-09
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2025-09-30 | TTT3R: 3D Reconstruction as Test-Time Training | Xingyu Chen et.al. | 2509.26645 | translate | read | link |
| 2025-09-30 | A Multi-purpose Tracking Framework for Salmon Welfare Monitoring in Challenging Environments | Espen Uri Høgstedt et.al. | 2509.25969 | translate | read | null |
| 2025-09-30 | Physics-Informed Learning for Human Whole-Body Kinematics Prediction via Sparse IMUs | Cheng Guo et.al. | 2509.25704 | translate | read | null |
| 2025-09-29 | Robust Visual Localization in Compute-Constrained Environments by Salient Edge Rendering and Weighted Hamming Similarity | Tu-Hoa Pham et.al. | 2509.25520 | translate | read | null |
| 2025-09-29 | VGGT-X: When VGGT Meets Dense Novel View Synthesis | Yang Liu et.al. | 2509.25191 | translate | read | link |
| 2025-09-29 | PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos | Ting-Hsuan Liao et.al. | 2509.25183 | translate | read | null |
| 2025-09-29 | SDPose: Exploiting Diffusion Priors for Out-of-Domain and Robust Pose Estimation | Shuang Liang et.al. | 2509.24980 | translate | read | link |
| 2025-09-29 | PoseDiff: A Unified Diffusion Model Bridging Robot Pose Estimation and Video-to-Action Control | Haozhuo Zhang et.al. | 2509.24591 | translate | read | null |
| 2025-09-29 | SCOPE: Semantic Conditioning for Sim2Real Category-Level Object Pose Estimation in Robotics | Peter Hönig et.al. | 2509.24572 | translate | read | null |
| 2025-09-28 | GRS-SLAM3R: Real-Time Dense SLAM with Gated Recurrent State | Guole Shen et.al. | 2509.23737 | translate | read | null |
| 2025-09-28 | Color-Pair Guided Robust Zero-Shot 6D Pose Estimation and Tracking of Cluttered Objects on Edge Devices | Xingjian Yang et.al. | 2509.23647 | translate | read | null |
| 2025-09-27 | 3DPCNet: Pose Canonicalization for Robust Viewpoint-Invariant 3D Kinematic Analysis from Monocular RGB cameras | Tharindu Ekanayake et.al. | 2509.23455 | translate | read | null |
| 2025-09-27 | Generative Modeling of Shape-Dependent Self-Contact Human Poses | Takehiko Ohkawa et.al. | 2509.23393 | translate | read | null |
| 2025-09-27 | UniPose: Unified Cross-modality Pose Prior Propagation towards RGB-D data for Weakly Supervised 3D Human Pose Estimation | Jinghong Zheng et.al. | 2509.23376 | translate | read | null |
| 2025-09-27 | GeLoc3r: Enhancing Relative Camera Pose Regression with Geometric Consistency Regularization | Jingxing Li et.al. | 2509.23038 | translate | read | null |
| 2025-09-26 | Good Weights: Proactive, Adaptive Dead Reckoning Fusion for Continuous and Robust Visual SLAM | Yanwei Du et.al. | 2509.22910 | translate | read | null |
| 2025-09-26 | ControlEvents: Controllable Synthesis of Event Camera Datawith Foundational Prior from Image Diffusion Models | Yixuan Hu et.al. | 2509.22864 | translate | read | null |
| 2025-09-26 | An Adaptive ICP LiDAR Odometry Based on Reliable Initial Pose | Qifeng Wang et.al. | 2509.22058 | translate | read | null |
| 2025-09-26 | SingRef6D: Monocular Novel Object Pose Estimation with a Single RGB Reference | Jiahui Wang et.al. | 2509.21927 | translate | read | null |
| 2025-09-24 | mmHSense: Multi-Modal and Distributed mmWave ISAC Datasets for Human Sensing | Nabeel Nisar Bhat et.al. | 2509.21396 | translate | read | null |
| 2025-09-25 | Finding 3D Positions of Distant Objects from Noisy Camera Movement and Semantic Segmentation Sequences | Julius Pesonen et.al. | 2509.20906 | translate | read | null |
| 2025-09-25 | AI-Enabled Crater-Based Navigation for Lunar Mapping | Sofia McLeod et.al. | 2509.20748 | translate | read | null |
| 2025-09-25 | EEG-Driven AR-Robot System for Zero-Touch Grasping Manipulation | Junzhe Wang et.al. | 2509.20656 | translate | read | null |
| 2025-09-24 | Reflect3r: Single-View 3D Stereo Reconstruction Aided by Mirror Reflections | Jing Wu et.al. | 2509.20607 | translate | read | null |
| 2025-09-24 | AJAHR: Amputated Joint Aware 3D Human Mesh Recovery | Hyunjin Cho et.al. | 2509.19939 | translate | read | null |
| 2025-09-23 | Category-Level Object Shape and Pose Estimation in Less Than a Millisecond | Lorenzo Shaikewitz et.al. | 2509.18979 | translate | read | null |
| 2025-09-23 | Towards Robust LiDAR Localization: Deep Learning-based Uncertainty Estimation | Minoo Dolatabadi et.al. | 2509.18954 | translate | read | null |
| 2025-09-23 | Human-Interpretable Uncertainty Explanations for Point Cloud Registration | Johannes A. Gaus et.al. | 2509.18786 | translate | read | null |
| 2025-09-23 | SINGER: An Onboard Generalist Vision-Language Navigation Policy for Drones | Maximilian Adang et.al. | 2509.18610 | translate | read | null |
| 2025-09-22 | Selecting Optimal Camera Views for Gait Analysis: A Multi-Metric Assessment of 2D Projections | Dong Chen et.al. | 2509.17805 | translate | read | null |
| 2025-09-22 | Evict3R: Training-Free Token Eviction for Memory-Bounded Streaming Visual Geometry Transformers | Soroush Mahdi et.al. | 2509.17650 | translate | read | null |
| 2025-09-22 | VideoArtGS: Building Digital Twins of Articulated Objects from Monocular Video | Yu Liu et.al. | 2509.17647 | translate | read | null |
| 2025-09-22 | Pose Estimation of a Cable-Driven Serpentine Manipulator Utilizing Intrinsic Dynamics via Physical Reservoir Computing | Kazutoshi Tanaka et.al. | 2509.17308 | translate | read | null |
| 2025-09-21 | SPFSplatV2: Efficient Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views | Ranran Huang et.al. | 2509.17246 | translate | read | null |
| 2025-09-21 | Leveraging RGB Images for Pre-Training of Event-Based Hand Pose Estimation | Ruicong Liu et.al. | 2509.16949 | translate | read | null |
| 2025-09-19 | UniTac2Pose: A Unified Approach Learned in Simulation for Category-level Visuotactile In-hand Pose Estimation | Mingdong Wu et.al. | 2509.15934 | translate | read | null |
| 2025-09-19 | Self-Supervised Cross-Modal Learning for Image-to-Point Cloud Registration | Xingmei Wang et.al. | 2509.15882 | translate | read | null |
| 2025-09-19 | STARC: See-Through-Wall Augmented Reality Framework for Human-Robot Collaboration in Emergency Response | Shenghai Yuan et.al. | 2509.15507 | translate | read | null |
| 2025-09-18 | NeRF-based Visualization of 3D Cues Supporting Data-Driven Spacecraft Pose Estimation | Antoine Legrand et.al. | 2509.14890 | translate | read | null |
| 2025-09-17 | SWA-PF: Semantic-Weighted Adaptive Particle Filter for Memory-Efficient 4-DoF UAV Localization in GNSS-Denied Environments | Jiayu Yuan et.al. | 2509.13795 | translate | read | null |
| 2025-09-17 | Bridging the Synthetic-Real Gap: Supervised Domain Adaptation for Robust Spacecraft 6-DoF Pose Estimation | Inder Pal Singh et.al. | 2509.13792 | translate | read | null |
| 2025-09-17 | UM-Depth : Uncertainty Masked Self-Supervised Monocular Depth Estimation with Visual Odometry | Tae-Wook Um et.al. | 2509.13713 | translate | read | null |
| 2025-09-17 | Gaussian Alignment for Relative Camera Pose Estimation via Single-View Reconstruction | Yumin Li et.al. | 2509.13652 | translate | read | null |
| 2025-09-16 | Object Pose Estimation through Dexterous Touch | Amir-Hossein Shahidzadeh et.al. | 2509.13591 | translate | read | null |
| 2025-09-16 | Using Visual Language Models to Control Bionic Hands: Assessment of Object Perception and Grasp Inference | Ozan Karaali et.al. | 2509.13572 | translate | read | null |
| 2025-09-16 | ROOM: A Physics-Based Continuum Robot Simulator for Photorealistic Medical Datasets Generation | Salvatore Esposito et.al. | 2509.13177 | translate | read | link |
| 2025-09-15 | 3D Human Pose and Shape Estimation from LiDAR Point Clouds: A Review | Salma Galaaoui et.al. | 2509.12197 | translate | read | null |
| 2025-09-15 | Robust Fetal Pose Estimation across Gestational Ages via Cross-Population Augmentation | Sebastian Diaz et.al. | 2509.12062 | translate | read | null |
| 2025-09-15 | Segmentation-Driven Initialization for Sparse-view 3D Gaussian Splatting | Yi-Hsin Li et.al. | 2509.11853 | translate | read | null |
| 2025-09-15 | IMD: A 6-DoF Pose Estimation Benchmark for Industrial Metallic Objects | Ruimin Ma et.al. | 2509.11680 | translate | read | null |
| 2025-09-14 | ActivePose: Active 6D Object Pose Estimation and Tracking for Robotic Manipulation | Sheng Liu et.al. | 2509.11364 | translate | read | null |
| 2025-09-13 | AutoOEP – A Multi-modal Framework for Online Exam Proctoring | Aryan Kashyap Naveen et.al. | 2509.10887 | translate | read | null |
| 2025-09-09 | HiLWS: A Human-in-the-Loop Weak Supervision Framework for Curating Clinical and Home Video Data for Neurological Assessment | Atefeh Irani et.al. | 2509.10557 | translate | read | null |
| 2025-09-12 | Self-supervised Learning Of Visual Pose Estimation Without Pose Labels By Classifying LED States | Nicholas Carlotti et.al. | 2509.10405 | translate | read | null |
| 2025-09-11 | MimicDroid: In-Context Learning for Humanoid Robot Manipulation from Human Play Videos | Rutav Shah et.al. | 2509.09769 | translate | read | link |
| 2025-09-10 | MultimodalHugs: Enabling Sign Language Processing in Hugging Face | Gerard Sant et.al. | 2509.09729 | translate | read | null |
| 2025-09-09 | Australian Supermarket Object Set (ASOS): A Benchmark Dataset of Physical Objects and 3D Models for Robotics and Computer Vision | Akansel Cosgun et.al. | 2509.09720 | translate | read | null |
| 2025-09-10 | iMatcher: Improve matching in point cloud registration via local-to-global geometric consistency learning | Karim Slimani et.al. | 2509.08982 | translate | read | null |
| 2025-09-10 | PianoVAM: A Multimodal Piano Performance Dataset | Yonghyun Kim et.al. | 2509.08800 | translate | read | null |
| 2025-09-10 | Deep Visual Odometry for Stereo Event Cameras | Sheng Zhong et.al. | 2509.08235 | translate | read | null |
| 2025-09-09 | SVN-ICP: Uncertainty Estimation of ICP-based LiDAR Odometry using Stein Variational Newton | Shiping Ma et.al. | 2509.08069 | translate | read | null |
| 2025-09-09 | One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation | Zheng Geng et.al. | 2509.07978 | translate | read | link |
| 2025-09-09 | Parse Graph-Based Visual-Language Interaction for Human Pose Estimation | Shibang Liu et.al. | 2509.07385 | translate | read | null |
| 2025-09-08 | H $_{2}$ OT: Hierarchical Hourglass Tokenizer for Efficient Video Pose Transformers | Wenhao Li et.al. | 2509.06956 | translate | read | link |
| 2025-09-08 | Musculoskeletal simulation of limb movement biomechanics in Drosophila melanogaster | Pembe Gizem Özdil et.al. | 2509.06426 | translate | read | null |
| 2025-09-07 | DVLO4D: Deep Visual-Lidar Odometry with Sparse Spatial-temporal Fusion | Mengmeng Liu et.al. | 2509.06023 | translate | read | null |
| 2025-09-07 | Motion Aware ViT-based Framework for Monocular 6-DoF Spacecraft Pose Estimation | Jose Sosa et.al. | 2509.06000 | translate | read | null |
| 2025-09-06 | Multi-LVI-SAM: A Robust LiDAR-Visual-Inertial Odometry for Multiple Fisheye Cameras | Xinyu Zhang et.al. | 2509.05740 | translate | read | null |
| 2025-09-05 | WinT3R: Window-Based Streaming Reconstruction with Camera Token Pool | Zizun Li et.al. | 2509.05296 | translate | read | link |
| 2025-09-04 | Odometry Calibration and Pose Estimation of a 4WIS4WID Mobile Wall Climbing Robot | Branimir Ćaran et.al. | 2509.04016 | translate | read | null |
| 2025-09-03 | SmartPoser: Arm Pose Estimation with a Smartphone and Smartwatch Using UWB and IMU Data | Nathan DeVrio et.al. | 2509.03451 | translate | read | null |
| 2025-09-03 | Towards Realistic Hand-Object Interaction with Gravity-Field Based Diffusion Bridge | Miao Xu et.al. | 2509.03114 | translate | read | null |
| 2025-09-03 | IL-SLAM: Intelligent Line-assisted SLAM Based on Feature Awareness for Dynamic Environments | Haolan Zhang et.al. | 2509.02972 | translate | read | null |
| 2025-09-02 | Robotic 3D Flower Pose Estimation for Small-Scale Urban Farms | Harsh Muriki et.al. | 2509.02870 | translate | read | null |
| 2025-09-02 | Generalizing Unsupervised Lidar Odometry Model from Normal to Snowy Weather Conditions | Beibei Zhou et.al. | 2509.02011 | translate | read | null |
| 2025-09-02 | Doctoral Thesis: Geometric Deep Learning For Camera Pose Prediction, Registration, Depth Estimation, and 3D Reconstruction | Xueyang Kang et.al. | 2509.01873 | translate | read | null |
| 2025-09-01 | FGO-SLAM: Enhancing Gaussian SLAM with Globally Consistent Opacity Radiance Field | Fan Zhu et.al. | 2509.01547 | translate | read | null |
| 2025-09-01 | Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation | Lee Chae-Yeon et.al. | 2509.01242 | translate | read | null |
| 2025-09-01 | SR-SLAM: Scene-reliability Based RGB-D SLAM in Diverse Environments | Haolan Zhang et.al. | 2509.01111 | translate | read | null |
| 2025-09-01 | An End-to-End Framework for Video Multi-Person Pose Estimation | Zhihong Wei et.al. | 2509.01095 | translate | read | null |
(<a href=../Pose_Estimation.md>back to Pose Estimation</a>)