Pose Estimation - 2025-11
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2025-11-29 | CC-FMO: Camera-Conditioned Zero-Shot Single Image to 3D Scene Generation with Foundation Model Orchestration | Boshi Tang et.al. | 2512.00493 | translate | read | null |
| 2025-11-03 | Learning from Watching: Scalable Extraction of Manipulation Trajectories from Human Videos | X. Hu et.al. | 2512.00024 | translate | read | null |
| 2025-11-28 | Zero-Shot Multi-Criteria Visual Quality Inspection for Semi-Controlled Industrial Environments via Real-Time 3D Digital Twin Simulation | Jose Moises Araya-Martinez et.al. | 2511.23214 | translate | read | null |
| 2025-11-28 | DiskChunGS: Large-Scale 3D Gaussian SLAM Through Chunk-Based Memory Management | Casimir Feldmann et.al. | 2511.23030 | translate | read | null |
| 2025-11-28 | Threat-Aware UAV Dodging of Human-Thrown Projectiles with an RGB-D Camera | Yuying Zhang et.al. | 2511.22847 | translate | read | null |
| 2025-11-27 | Emergent Extreme-View Geometry in 3D Foundation Models | Yiwen Zhang et.al. | 2511.22686 | translate | read | null |
| 2025-11-27 | UAV-MM3D: A Large-Scale Synthetic Benchmark for 3D Perception of Unmanned Aerial Vehicles with Multi-Modal Data | Longkun Zou et.al. | 2511.22404 | translate | read | null |
| 2025-11-27 | ColonAdapter: Geometry Estimation Through Foundation Model Adaptation for Colonoscopy | Zhiyi Jiang et.al. | 2511.22250 | translate | read | null |
| 2025-11-26 | Seeing without Pixels: Perception from Camera Trajectories | Zihui Xue et.al. | 2511.21681 | translate | read | null |
| 2025-11-26 | Uncertainty Quantification for Visual Object Pose Estimation | Lorenzo Shaikewitz et.al. | 2511.21666 | translate | read | null |
| 2025-11-26 | Enhanced Landmark Detection Model in Pelvic Fluoroscopy using 2D/3D Registration Loss | Chou Mo et.al. | 2511.21575 | translate | read | null |
| 2025-11-25 | Metric, inertially aligned monocular state estimation via kinetodynamic priors | Jiaxin Liu et.al. | 2511.20496 | translate | read | null |
| 2025-11-25 | Dance Style Classification using Laban-Inspired and Frequency-Domain Motion Features | Ben Hamscher et.al. | 2511.20469 | translate | read | null |
| 2025-11-25 | VGGT4D: Mining Motion Cues in Visual Geometry Transformers for 4D Scene Reconstruction | Yu Hu et.al. | 2511.19971 | translate | read | null |
| 2025-11-24 | The Determinant Ratio Matrix Approach to Solving 3D Matching and 2D Orthographic Projection Alignment Tasks | Andrew J. Hanson et.al. | 2511.19511 | translate | read | null |
| 2025-11-18 | PuzzlePoles: Cylindrical Fiducial Markers Based on the PuzzleBoard Pattern | Juri Zach et.al. | 2511.19448 | translate | read | null |
| 2025-11-24 | Graph-based 3D Human Pose Estimation using WiFi Signals | Jichao Chen et.al. | 2511.19105 | translate | read | null |
| 2025-11-24 | Analysis of Deep-Learning Methods in an ISO/TS 15066-Compliant Human-Robot Safety Framework | David Bricher et.al. | 2511.19094 | translate | read | null |
| 2025-11-24 | LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space | Hai Wu et.al. | 2511.19057 | translate | read | null |
| 2025-11-24 | Robust Long-term Test-Time Adaptation for 3D Human Pose Estimation through Motion Discretization | Yilin Wen et.al. | 2511.18851 | translate | read | null |
| 2025-11-24 | CNN-Based Camera Pose Estimation and Localisation of Scan Images for Aircraft Visual Inspection | Xueyan Oh et.al. | 2511.18702 | translate | read | null |
| 2025-11-23 | Expanding the Workspace of Electromagnetic Navigation Systems Using Dynamic Feedback for Single- and Multi-agent Control | Jasan Zughaibi et.al. | 2511.18486 | translate | read | null |
| 2025-11-22 | Muskie: Multi-view Masked Image Modeling for 3D Vision Pre-training | Wenyu Li et.al. | 2511.18115 | translate | read | null |
| 2025-11-21 | NoPe-NeRF++: Local-to-Global Optimization of NeRF with No Pose Prior | Dongbo Shi et.al. | 2511.17322 | translate | read | null |
| 2025-11-21 | MuM: Multi-View Masked Image Modeling for 3D Vision | David Nordström et.al. | 2511.17309 | translate | read | null |
| 2025-11-21 | BiFingerPose: Bimodal Finger Pose Estimation for Touch Devices | Xiongjun Guan et.al. | 2511.17306 | translate | read | null |
| 2025-11-21 | RacketVision: A Multiple Racket Sports Benchmark for Unified Ball and Racket Analysis | Linfeng Dong et.al. | 2511.17045 | translate | read | null |
| 2025-11-21 | MobileOcc: A Human-Aware Semantic Occupancy Dataset for Mobile Robots | Junseo Kim et.al. | 2511.16949 | translate | read | null |
| 2025-11-20 | BOP-ASK: Object-Interaction Reasoning for Vision-Language Models | Vineet Bhat et.al. | 2511.16857 | translate | read | null |
| 2025-11-20 | NoPo-Avatar: Generalizable and Animatable Avatars from Sparse Inputs without Human Poses | Jing Wen et.al. | 2511.16673 | translate | read | null |
| 2025-11-20 | EOGS++: Earth Observation Gaussian Splatting with Internal Camera Refinement and Direct Panchromatic Rendering | Pierrick Bournez et.al. | 2511.16542 | translate | read | null |
| 2025-11-20 | Physics-Informed Machine Learning for Efficient Sim-to-Real Data Augmentation in Micro-Object Pose Estimation | Zongcai Tan et.al. | 2511.16494 | translate | read | null |
| 2025-11-20 | End-to-End Motion Capture from Rigid Body Markers with Geodesic Loss | Hai Lan et.al. | 2511.16418 | translate | read | null |
| 2025-11-19 | Box6D: Zero-shot Category-level 6D Pose Estimation of Warehouse Boxes | Yintao Ma et.al. | 2511.15884 | translate | read | null |
| 2025-11-19 | WALDO: Where Unseen Model-based 6D Pose Estimation Meets Occlusion | Sajjad Pakdamansavoji et.al. | 2511.15874 | translate | read | null |
| 2025-11-19 | Scriboora: Rethinking Human Pose Forecasting | Daniel Bermuth et.al. | 2511.15565 | translate | read | null |
| 2025-11-18 | RocSync: Millisecond-Accurate Temporal Synchronization for Heterogeneous Camera Systems | Jaro Meyer et.al. | 2511.14948 | translate | read | null |
| 2025-11-18 | A Quantitative Method for Shoulder Presentation Evaluation in Biometric Identity Documents | Alfonso Pedro Ridao et.al. | 2511.14376 | translate | read | null |
| 2025-11-18 | Simultaneous Localization and 3D-Semi Dense Mapping for Micro Drones Using Monocular Camera and Inertial Sensors | Jeryes Danial et.al. | 2511.14335 | translate | read | null |
| 2025-11-18 | LSP-YOLO: A Lightweight Single-Stage Network for Sitting Posture Recognition on Embedded Devices | Nanjun Li et.al. | 2511.14322 | translate | read | null |
| 2025-11-18 | iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting Inversion | Hao Wang et.al. | 2511.14149 | translate | read | null |
| 2025-11-17 | GRLoc: Geometric Representation Regression for Visual Localization | Changyang Li et.al. | 2511.13864 | translate | read | null |
| 2025-11-17 | RSPose: Ranking Based Losses for Human Pose Estimation | Muhammed Can Keles et.al. | 2511.13857 | translate | read | null |
| 2025-11-17 | GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models | Yushuo Zheng et.al. | 2511.13259 | translate | read | null |
| 2025-11-17 | GaRLILEO: Gravity-aligned Radar-Leg-Inertial Enhanced Odometry | Chiyun Noh et.al. | 2511.13216 | translate | read | null |
| 2025-11-17 | End-to-End Multi-Person Pose Estimation with Pose-Aware Video Transformer | Yonghui Yu et.al. | 2511.13208 | translate | read | null |
| 2025-11-17 | CapeNext: Rethinking and Refining Dynamic Support Information for Category-Agnostic Pose Estimation | Yu Zhu et.al. | 2511.13102 | translate | read | null |
| 2025-11-17 | PFAvatar: Pose-Fusion 3D Personalized Avatar Reconstruction from Real-World Outfit-of-the-Day Photos | Dianbing Xi et.al. | 2511.12935 | translate | read | null |
| 2025-11-17 | CoordAR: One-Reference 6D Pose Estimation of Novel Objects via Autoregressive Coordinate Map Generation | Dexin Zuo et.al. | 2511.12919 | translate | read | null |
| 2025-11-16 | OPFormer: Object Pose Estimation leveraging foundation model with geometric encoding | Artem Moroz et.al. | 2511.12614 | translate | read | null |
| 2025-11-16 | Visible Structure Retrieval for Lightweight Image-Based Relocalisation | Fereidoon Zangeneh et.al. | 2511.12503 | translate | read | null |
| 2025-11-15 | Changes in Real Time: Online Scene Change Detection with Multi-View Fusion | Chamuditha Jayanga Galappaththige et.al. | 2511.12370 | translate | read | null |
| 2025-11-15 | AURA: Development and Validation of an Augmented Unplanned Removal Alert System using Synthetic ICU Videos | Junhyuk Seo et.al. | 2511.12241 | translate | read | null |
| 2025-11-15 | VPHO: Joint Visual-Physical Cue Learning and Aggregation for Hand-Object Pose Estimation | Jun Zhou et.al. | 2511.12030 | translate | read | null |
| 2025-11-12 | Understanding the Representation of Older Adults in Motion Capture Locomotion Datasets | Yunkai Yu et.al. | 2511.11713 | translate | read | null |
| 2025-11-14 | YCB-Ev SD: Synthetic event-vision dataset for 6DoF object pose estimation | Pavel Rojtberg et.al. | 2511.11344 | translate | read | null |
| 2025-11-14 | 6D Strawberry Pose Estimation: Real-time and Edge AI Solutions Using Purely Synthetic Training Data | Saptarshi Neil Sinha et.al. | 2511.11307 | translate | read | null |
| 2025-11-13 | Depth Anything 3: Recovering the Visual Space from Any Views | Haotong Lin et.al. | 2511.10647 | translate | read | null |
| 2025-11-13 | OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer | Haosong Peng et.al. | 2511.10560 | translate | read | null |
| 2025-11-12 | STORM: Segment, Track, and Object Re-Localization from a Single Image | Yu Deng et.al. | 2511.09771 | translate | read | null |
| 2025-11-12 | DreamPose3D: Hallucinative Diffusion with Prompt Learning for 3D Human Pose Estimation | Jerrin Bright et.al. | 2511.09502 | translate | read | null |
| 2025-11-12 | SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields | Sangheon Yang et.al. | 2511.09072 | translate | read | null |
| 2025-11-12 | RadHARSimulator V2: Video to Doppler Generator | Weicheng Gao et.al. | 2511.09022 | translate | read | null |
| 2025-11-12 | SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation | Hu Cui et.al. | 2511.08872 | translate | read | null |
| 2025-11-11 | Adaptive graph Kolmogorov-Arnold network for 3D human pose estimation | Abu Taib Mohammed Shahjahan et.al. | 2511.08809 | translate | read | null |
| 2025-11-11 | RAPTR: Radar-based 3D Pose Estimation using Transformer | Sorachi Kato et.al. | 2511.08387 | translate | read | null |
| 2025-11-11 | SkelSplat: Robust Multi-view 3D Human Pose Estimation with Differentiable Gaussian Rendering | Laura Bragagnolo et.al. | 2511.08294 | translate | read | null |
| 2025-11-11 | An Image-Based Path Planning Algorithm Using a UAV Equipped with Stereo Vision | Selim Ahmet Iz et.al. | 2511.07928 | translate | read | null |
| 2025-11-10 | LeCoT: revisiting network architecture for two-view correspondence pruning | Luanyuan Dai et.al. | 2511.07078 | translate | read | null |
| 2025-11-10 | Robust and High-Fidelity 3D Gaussian Splatting: Fusing Pose Priors and Geometry Constraints for Texture-Deficient Outdoor Scenes | Meijun Guo et.al. | 2511.06765 | translate | read | null |
| 2025-11-10 | Semi-distributed Cross-modal Air-Ground Relative Localization | Weining Lu et.al. | 2511.06749 | translate | read | null |
| 2025-11-09 | VDNeRF: Vision-only Dynamic Neural Radiance Field for Urban Scenes | Zhengyu Zou et.al. | 2511.06408 | translate | read | null |
| 2025-11-07 | Pedicle Screw Pairing and Registration for Screw Pose Estimation from Dual C-arm Images Using CAD Models | Yehyun Suh et.al. | 2511.05702 | translate | read | null |
| 2025-11-07 | Multi-modal Loop Closure Detection with Foundation Models in Severely Unstructured Environments | Laura Alejandra Encinar Gonzalez et.al. | 2511.05404 | translate | read | null |
| 2025-11-07 | No Pose Estimation? No Problem: Pose-Agnostic and Instance-Aware Test-Time Adaptation for Monocular Depth Estimation | Mingyu Sung et.al. | 2511.05055 | translate | read | null |
| 2025-11-06 | Synchronous Observer Design for Landmark-Inertial SLAM with Almost-Global Convergence | Arkadeep Saha et.al. | 2511.04531 | translate | read | null |
| 2025-11-06 | A Two-stage Adaptive Lifting PINN Framework for Solving Viscous Approximations to Hyperbolic Conservation Laws | Yameng Zhu et.al. | 2511.04490 | translate | read | null |
| 2025-11-06 | Deep Dictionary-Free Method for Identifying Linear Model of Nonlinear System with Input Delay | Patrik Valábek et.al. | 2511.04451 | translate | read | null |
| 2025-11-06 | MedSapiens: Taking a Pose to Rethink Medical Imaging Landmark Detection | Marawan Elbatel et.al. | 2511.04255 | translate | read | null |
| 2025-11-06 | DMSORT: An efficient parallel maritime multi-object tracking architecture for unmanned vessel platforms | Shengyu Tang et.al. | 2511.04128 | translate | read | null |
| 2025-11-06 | Simple 3D Pose Features Support Human and Machine Social Scene Understanding | Wenshuo Qin et.al. | 2511.03988 | translate | read | null |
| 2025-11-05 | CORE - A Cell-Level Coarse-to-Fine Image Registration Engine for Multi-stain Image Alignment | Esha Sadia Nasir et.al. | 2511.03826 | translate | read | null |
| 2025-11-05 | FusionDP: Foundation Model-Assisted Differentially Private Learning for Partially Sensitive Features | Linghui Zeng et.al. | 2511.03806 | translate | read | null |
| 2025-11-04 | Densemarks: Learning Canonical Embeddings for Human Heads Images via Point Tracks | Dmitrii Pozdeev et.al. | 2511.02830 | translate | read | null |
| 2025-11-04 | Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization | Tao Liu et.al. | 2511.02489 | translate | read | link |
| 2025-11-04 | A New Perspective on Precision and Recall for Generative Models | Benjamin Sykes et.al. | 2511.02414 | translate | read | null |
| 2025-11-04 | Cycle-Sync: Robust Global Camera Pose Estimation through Enhanced Cycle-Consistent Synchronization | Shaohan Li et.al. | 2511.02329 | translate | read | null |
| 2025-11-04 | Are Euler angles a useful rotation parameterisation for pose estimation with Normalizing Flows? | Giorgos Sfikas et.al. | 2511.02277 | translate | read | null |
| 2025-11-04 | A Joint Variational Framework for Multimodal X-ray Ptychography and Fluorescence Reconstruction | Eric Zou et.al. | 2511.02153 | translate | read | null |
| 2025-11-04 | A new approach for the analysis of evolution partial differential equations on a finite interval | Türker Özsarı et.al. | 2511.02145 | translate | read | null |
| 2025-11-03 | HGFreNet: Hop-hybrid GraphFomer for 3D Human Pose Estimation with Trajectory Consistency in Frequency Domain | Kai Zhai et.al. | 2511.01756 | translate | read | null |
| 2025-11-03 | Clutter Suppression in Bistatic ISAC with Joint Angle and Doppler Estimation | M. Ertug Pihtili et.al. | 2511.01599 | translate | read | null |
| 2025-11-03 | Defining Energy Indicators for Impact Identification on Aerospace Composites: A Physics-Informed Machine Learning Perspective | Natália Ribeiro Marinho et.al. | 2511.01592 | translate | read | null |
| 2025-11-03 | SE(3)-PoseFlow: Estimating 6D Pose Distributions for Uncertainty-Aware Robotic Manipulation | Yufeng Jin et.al. | 2511.01501 | translate | read | null |
| 2025-11-03 | Floor Plan-Guided Visual Navigation Incorporating Depth and Directional Cues | Wei Huang et.al. | 2511.01493 | translate | read | null |
| 2025-11-03 | Tackling the Kidnapped Robot Problem via Sparse Feasible Hypothesis Sampling and Reliable Batched Multi-Stage Inference | Muhua Zhang et.al. | 2511.01219 | translate | read | null |
| 2025-11-03 | LiDAR-VGGT: Cross-Modal Coarse-to-Fine Fusion for Globally Consistent and Metric-Scale Dense Mapping | Lijie Wang et.al. | 2511.01186 | translate | read | null |
| 2025-11-03 | Web-Scale Collection of Video Data for 4D Animal Reconstruction | Brian Nlong Zhao et.al. | 2511.01169 | translate | read | null |
| 2025-11-01 | Active learning-based variance reduction for Monte Carlo simulations: A feasibility study for the nanodosimetry around a gold nanoparticle | Leo Thomas et.al. | 2511.00563 | translate | read | null |
(<a href=../Pose_Estimation.md>back to Pose Estimation</a>)