Pose Estimation - 2025-11 | Paper Arxiv Daily

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-11-29	CC-FMO: Camera-Conditioned Zero-Shot Single Image to 3D Scene Generation with Foundation Model Orchestration	Boshi Tang et.al.	2512.00493	translate	read	null
2025-11-03	Learning from Watching: Scalable Extraction of Manipulation Trajectories from Human Videos	X. Hu et.al.	2512.00024	translate	read	null
2025-11-28	Zero-Shot Multi-Criteria Visual Quality Inspection for Semi-Controlled Industrial Environments via Real-Time 3D Digital Twin Simulation	Jose Moises Araya-Martinez et.al.	2511.23214	translate	read	null
2025-11-28	DiskChunGS: Large-Scale 3D Gaussian SLAM Through Chunk-Based Memory Management	Casimir Feldmann et.al.	2511.23030	translate	read	null
2025-11-28	Threat-Aware UAV Dodging of Human-Thrown Projectiles with an RGB-D Camera	Yuying Zhang et.al.	2511.22847	translate	read	null
2025-11-27	Emergent Extreme-View Geometry in 3D Foundation Models	Yiwen Zhang et.al.	2511.22686	translate	read	null
2025-11-27	UAV-MM3D: A Large-Scale Synthetic Benchmark for 3D Perception of Unmanned Aerial Vehicles with Multi-Modal Data	Longkun Zou et.al.	2511.22404	translate	read	null
2025-11-27	ColonAdapter: Geometry Estimation Through Foundation Model Adaptation for Colonoscopy	Zhiyi Jiang et.al.	2511.22250	translate	read	null
2025-11-26	Seeing without Pixels: Perception from Camera Trajectories	Zihui Xue et.al.	2511.21681	translate	read	null
2025-11-26	Uncertainty Quantification for Visual Object Pose Estimation	Lorenzo Shaikewitz et.al.	2511.21666	translate	read	null
2025-11-26	Enhanced Landmark Detection Model in Pelvic Fluoroscopy using 2D/3D Registration Loss	Chou Mo et.al.	2511.21575	translate	read	null
2025-11-25	Metric, inertially aligned monocular state estimation via kinetodynamic priors	Jiaxin Liu et.al.	2511.20496	translate	read	null
2025-11-25	Dance Style Classification using Laban-Inspired and Frequency-Domain Motion Features	Ben Hamscher et.al.	2511.20469	translate	read	null
2025-11-25	VGGT4D: Mining Motion Cues in Visual Geometry Transformers for 4D Scene Reconstruction	Yu Hu et.al.	2511.19971	translate	read	null
2025-11-24	The Determinant Ratio Matrix Approach to Solving 3D Matching and 2D Orthographic Projection Alignment Tasks	Andrew J. Hanson et.al.	2511.19511	translate	read	null
2025-11-18	PuzzlePoles: Cylindrical Fiducial Markers Based on the PuzzleBoard Pattern	Juri Zach et.al.	2511.19448	translate	read	null
2025-11-24	Graph-based 3D Human Pose Estimation using WiFi Signals	Jichao Chen et.al.	2511.19105	translate	read	null
2025-11-24	Analysis of Deep-Learning Methods in an ISO/TS 15066-Compliant Human-Robot Safety Framework	David Bricher et.al.	2511.19094	translate	read	null
2025-11-24	LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space	Hai Wu et.al.	2511.19057	translate	read	null
2025-11-24	Robust Long-term Test-Time Adaptation for 3D Human Pose Estimation through Motion Discretization	Yilin Wen et.al.	2511.18851	translate	read	null
2025-11-24	CNN-Based Camera Pose Estimation and Localisation of Scan Images for Aircraft Visual Inspection	Xueyan Oh et.al.	2511.18702	translate	read	null
2025-11-23	Expanding the Workspace of Electromagnetic Navigation Systems Using Dynamic Feedback for Single- and Multi-agent Control	Jasan Zughaibi et.al.	2511.18486	translate	read	null
2025-11-22	Muskie: Multi-view Masked Image Modeling for 3D Vision Pre-training	Wenyu Li et.al.	2511.18115	translate	read	null
2025-11-21	NoPe-NeRF++: Local-to-Global Optimization of NeRF with No Pose Prior	Dongbo Shi et.al.	2511.17322	translate	read	null
2025-11-21	MuM: Multi-View Masked Image Modeling for 3D Vision	David Nordström et.al.	2511.17309	translate	read	null
2025-11-21	BiFingerPose: Bimodal Finger Pose Estimation for Touch Devices	Xiongjun Guan et.al.	2511.17306	translate	read	null
2025-11-21	RacketVision: A Multiple Racket Sports Benchmark for Unified Ball and Racket Analysis	Linfeng Dong et.al.	2511.17045	translate	read	null
2025-11-21	MobileOcc: A Human-Aware Semantic Occupancy Dataset for Mobile Robots	Junseo Kim et.al.	2511.16949	translate	read	null
2025-11-20	BOP-ASK: Object-Interaction Reasoning for Vision-Language Models	Vineet Bhat et.al.	2511.16857	translate	read	null
2025-11-20	NoPo-Avatar: Generalizable and Animatable Avatars from Sparse Inputs without Human Poses	Jing Wen et.al.	2511.16673	translate	read	null
2025-11-20	EOGS++: Earth Observation Gaussian Splatting with Internal Camera Refinement and Direct Panchromatic Rendering	Pierrick Bournez et.al.	2511.16542	translate	read	null
2025-11-20	Physics-Informed Machine Learning for Efficient Sim-to-Real Data Augmentation in Micro-Object Pose Estimation	Zongcai Tan et.al.	2511.16494	translate	read	null
2025-11-20	End-to-End Motion Capture from Rigid Body Markers with Geodesic Loss	Hai Lan et.al.	2511.16418	translate	read	null
2025-11-19	Box6D: Zero-shot Category-level 6D Pose Estimation of Warehouse Boxes	Yintao Ma et.al.	2511.15884	translate	read	null
2025-11-19	WALDO: Where Unseen Model-based 6D Pose Estimation Meets Occlusion	Sajjad Pakdamansavoji et.al.	2511.15874	translate	read	null
2025-11-19	Scriboora: Rethinking Human Pose Forecasting	Daniel Bermuth et.al.	2511.15565	translate	read	null
2025-11-18	RocSync: Millisecond-Accurate Temporal Synchronization for Heterogeneous Camera Systems	Jaro Meyer et.al.	2511.14948	translate	read	null
2025-11-18	A Quantitative Method for Shoulder Presentation Evaluation in Biometric Identity Documents	Alfonso Pedro Ridao et.al.	2511.14376	translate	read	null
2025-11-18	Simultaneous Localization and 3D-Semi Dense Mapping for Micro Drones Using Monocular Camera and Inertial Sensors	Jeryes Danial et.al.	2511.14335	translate	read	null
2025-11-18	LSP-YOLO: A Lightweight Single-Stage Network for Sitting Posture Recognition on Embedded Devices	Nanjun Li et.al.	2511.14322	translate	read	null
2025-11-18	iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting Inversion	Hao Wang et.al.	2511.14149	translate	read	null
2025-11-17	GRLoc: Geometric Representation Regression for Visual Localization	Changyang Li et.al.	2511.13864	translate	read	null
2025-11-17	RSPose: Ranking Based Losses for Human Pose Estimation	Muhammed Can Keles et.al.	2511.13857	translate	read	null
2025-11-17	GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models	Yushuo Zheng et.al.	2511.13259	translate	read	null
2025-11-17	GaRLILEO: Gravity-aligned Radar-Leg-Inertial Enhanced Odometry	Chiyun Noh et.al.	2511.13216	translate	read	null
2025-11-17	End-to-End Multi-Person Pose Estimation with Pose-Aware Video Transformer	Yonghui Yu et.al.	2511.13208	translate	read	null
2025-11-17	CapeNext: Rethinking and Refining Dynamic Support Information for Category-Agnostic Pose Estimation	Yu Zhu et.al.	2511.13102	translate	read	null
2025-11-17	PFAvatar: Pose-Fusion 3D Personalized Avatar Reconstruction from Real-World Outfit-of-the-Day Photos	Dianbing Xi et.al.	2511.12935	translate	read	null
2025-11-17	CoordAR: One-Reference 6D Pose Estimation of Novel Objects via Autoregressive Coordinate Map Generation	Dexin Zuo et.al.	2511.12919	translate	read	null
2025-11-16	OPFormer: Object Pose Estimation leveraging foundation model with geometric encoding	Artem Moroz et.al.	2511.12614	translate	read	null
2025-11-16	Visible Structure Retrieval for Lightweight Image-Based Relocalisation	Fereidoon Zangeneh et.al.	2511.12503	translate	read	null
2025-11-15	Changes in Real Time: Online Scene Change Detection with Multi-View Fusion	Chamuditha Jayanga Galappaththige et.al.	2511.12370	translate	read	null
2025-11-15	AURA: Development and Validation of an Augmented Unplanned Removal Alert System using Synthetic ICU Videos	Junhyuk Seo et.al.	2511.12241	translate	read	null
2025-11-15	VPHO: Joint Visual-Physical Cue Learning and Aggregation for Hand-Object Pose Estimation	Jun Zhou et.al.	2511.12030	translate	read	null
2025-11-12	Understanding the Representation of Older Adults in Motion Capture Locomotion Datasets	Yunkai Yu et.al.	2511.11713	translate	read	null
2025-11-14	YCB-Ev SD: Synthetic event-vision dataset for 6DoF object pose estimation	Pavel Rojtberg et.al.	2511.11344	translate	read	null
2025-11-14	6D Strawberry Pose Estimation: Real-time and Edge AI Solutions Using Purely Synthetic Training Data	Saptarshi Neil Sinha et.al.	2511.11307	translate	read	null
2025-11-13	Depth Anything 3: Recovering the Visual Space from Any Views	Haotong Lin et.al.	2511.10647	translate	read	null
2025-11-13	OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer	Haosong Peng et.al.	2511.10560	translate	read	null
2025-11-12	STORM: Segment, Track, and Object Re-Localization from a Single Image	Yu Deng et.al.	2511.09771	translate	read	null
2025-11-12	DreamPose3D: Hallucinative Diffusion with Prompt Learning for 3D Human Pose Estimation	Jerrin Bright et.al.	2511.09502	translate	read	null
2025-11-12	SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields	Sangheon Yang et.al.	2511.09072	translate	read	null
2025-11-12	RadHARSimulator V2: Video to Doppler Generator	Weicheng Gao et.al.	2511.09022	translate	read	null
2025-11-12	SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation	Hu Cui et.al.	2511.08872	translate	read	null
2025-11-11	Adaptive graph Kolmogorov-Arnold network for 3D human pose estimation	Abu Taib Mohammed Shahjahan et.al.	2511.08809	translate	read	null
2025-11-11	RAPTR: Radar-based 3D Pose Estimation using Transformer	Sorachi Kato et.al.	2511.08387	translate	read	null
2025-11-11	SkelSplat: Robust Multi-view 3D Human Pose Estimation with Differentiable Gaussian Rendering	Laura Bragagnolo et.al.	2511.08294	translate	read	null
2025-11-11	An Image-Based Path Planning Algorithm Using a UAV Equipped with Stereo Vision	Selim Ahmet Iz et.al.	2511.07928	translate	read	null
2025-11-10	LeCoT: revisiting network architecture for two-view correspondence pruning	Luanyuan Dai et.al.	2511.07078	translate	read	null
2025-11-10	Robust and High-Fidelity 3D Gaussian Splatting: Fusing Pose Priors and Geometry Constraints for Texture-Deficient Outdoor Scenes	Meijun Guo et.al.	2511.06765	translate	read	null
2025-11-10	Semi-distributed Cross-modal Air-Ground Relative Localization	Weining Lu et.al.	2511.06749	translate	read	null
2025-11-09	VDNeRF: Vision-only Dynamic Neural Radiance Field for Urban Scenes	Zhengyu Zou et.al.	2511.06408	translate	read	null
2025-11-07	Pedicle Screw Pairing and Registration for Screw Pose Estimation from Dual C-arm Images Using CAD Models	Yehyun Suh et.al.	2511.05702	translate	read	null
2025-11-07	Multi-modal Loop Closure Detection with Foundation Models in Severely Unstructured Environments	Laura Alejandra Encinar Gonzalez et.al.	2511.05404	translate	read	null
2025-11-07	No Pose Estimation? No Problem: Pose-Agnostic and Instance-Aware Test-Time Adaptation for Monocular Depth Estimation	Mingyu Sung et.al.	2511.05055	translate	read	null
2025-11-06	Synchronous Observer Design for Landmark-Inertial SLAM with Almost-Global Convergence	Arkadeep Saha et.al.	2511.04531	translate	read	null
2025-11-06	A Two-stage Adaptive Lifting PINN Framework for Solving Viscous Approximations to Hyperbolic Conservation Laws	Yameng Zhu et.al.	2511.04490	translate	read	null
2025-11-06	Deep Dictionary-Free Method for Identifying Linear Model of Nonlinear System with Input Delay	Patrik Valábek et.al.	2511.04451	translate	read	null
2025-11-06	MedSapiens: Taking a Pose to Rethink Medical Imaging Landmark Detection	Marawan Elbatel et.al.	2511.04255	translate	read	null
2025-11-06	DMSORT: An efficient parallel maritime multi-object tracking architecture for unmanned vessel platforms	Shengyu Tang et.al.	2511.04128	translate	read	null
2025-11-06	Simple 3D Pose Features Support Human and Machine Social Scene Understanding	Wenshuo Qin et.al.	2511.03988	translate	read	null
2025-11-05	CORE - A Cell-Level Coarse-to-Fine Image Registration Engine for Multi-stain Image Alignment	Esha Sadia Nasir et.al.	2511.03826	translate	read	null
2025-11-05	FusionDP: Foundation Model-Assisted Differentially Private Learning for Partially Sensitive Features	Linghui Zeng et.al.	2511.03806	translate	read	null
2025-11-04	Densemarks: Learning Canonical Embeddings for Human Heads Images via Point Tracks	Dmitrii Pozdeev et.al.	2511.02830	translate	read	null
2025-11-04	Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization	Tao Liu et.al.	2511.02489	translate	read	link
2025-11-04	A New Perspective on Precision and Recall for Generative Models	Benjamin Sykes et.al.	2511.02414	translate	read	null
2025-11-04	Cycle-Sync: Robust Global Camera Pose Estimation through Enhanced Cycle-Consistent Synchronization	Shaohan Li et.al.	2511.02329	translate	read	null
2025-11-04	Are Euler angles a useful rotation parameterisation for pose estimation with Normalizing Flows?	Giorgos Sfikas et.al.	2511.02277	translate	read	null
2025-11-04	A Joint Variational Framework for Multimodal X-ray Ptychography and Fluorescence Reconstruction	Eric Zou et.al.	2511.02153	translate	read	null
2025-11-04	A new approach for the analysis of evolution partial differential equations on a finite interval	Türker Özsarı et.al.	2511.02145	translate	read	null
2025-11-03	HGFreNet: Hop-hybrid GraphFomer for 3D Human Pose Estimation with Trajectory Consistency in Frequency Domain	Kai Zhai et.al.	2511.01756	translate	read	null
2025-11-03	Clutter Suppression in Bistatic ISAC with Joint Angle and Doppler Estimation	M. Ertug Pihtili et.al.	2511.01599	translate	read	null
2025-11-03	Defining Energy Indicators for Impact Identification on Aerospace Composites: A Physics-Informed Machine Learning Perspective	Natália Ribeiro Marinho et.al.	2511.01592	translate	read	null
2025-11-03	SE(3)-PoseFlow: Estimating 6D Pose Distributions for Uncertainty-Aware Robotic Manipulation	Yufeng Jin et.al.	2511.01501	translate	read	null
2025-11-03	Floor Plan-Guided Visual Navigation Incorporating Depth and Directional Cues	Wei Huang et.al.	2511.01493	translate	read	null
2025-11-03	Tackling the Kidnapped Robot Problem via Sparse Feasible Hypothesis Sampling and Reliable Batched Multi-Stage Inference	Muhua Zhang et.al.	2511.01219	translate	read	null
2025-11-03	LiDAR-VGGT: Cross-Modal Coarse-to-Fine Fusion for Globally Consistent and Metric-Scale Dense Mapping	Lijie Wang et.al.	2511.01186	translate	read	null
2025-11-03	Web-Scale Collection of Video Data for 4D Animal Reconstruction	Brian Nlong Zhao et.al.	2511.01169	translate	read	null
2025-11-01	Active learning-based variance reduction for Monte Carlo simulations: A feasibility study for the nanodosimetry around a gold nanoparticle	Leo Thomas et.al.	2511.00563	translate	read	null
