Depth Estimation - 2025-03 | Paper Arxiv Daily

Depth Estimation - 2025-03

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-03-31	ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image	Tianyi Gong et.al.	2503.23881	translate	read	null
2025-03-31	Detail-aware multi-view stereo network for depth estimation	Haitao Tian et.al.	2503.23684	translate	read	null
2025-03-30	Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries	Wei Xu et.al.	2503.23606	translate	read	null
2025-03-30	Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model	Jannik Endres et.al.	2503.23502	translate	read	link
2025-03-28	SemAlign3D: Semantic Correspondence between RGB-Images through Aligning 3D Object-Class Representations	Krispin Wandel et.al.	2503.22462	translate	read	null
2025-03-28	EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian Splatting	Xu Wang et.al.	2503.22437	translate	read	link
2025-03-28	MVSAnywhere: Zero-Shot Multi-View Stereo	Sergio Izquierdo et.al.	2503.22430	translate	read	null
2025-03-28	One Look is Enough: A Novel Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation Models on High-Resolution Images	Byeongjun Kwon et.al.	2503.22351	translate	read	null
2025-03-28	Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces	Wonhyeok Choi et.al.	2503.22209	translate	read	null
2025-03-28	Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges	Ukcheol Shin et.al.	2503.22060	translate	read	link
2025-03-27	A Unified Image-Dense Annotation Generation Model for Underwater Scenes	Hongkai Lin et.al.	2503.21771	translate	read	link
2025-03-27	ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo	Yuxi Hu et.al.	2503.21525	translate	read	null
2025-03-26	Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors	Weilong Yan et.al.	2503.20211	translate	read	link
2025-03-26	FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion	Pihai Sun et.al.	2503.19739	translate	read	link
2025-03-25	Semi-SD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving	Yusen Xie et.al.	2503.19713	translate	read	link
2025-03-25	StableGS: A Floater-Free Framework for 3D Gaussian Splatting	Luchao Wang et.al.	2503.18458	translate	read	null
2025-03-24	PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes	Xinhua Xu et.al.	2503.18393	translate	read	null
2025-03-23	Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images	Yara AlaaEldin et.al.	2503.17982	translate	read	link
2025-03-21	Radar-Guided Polynomial Fitting for Metric Depth Estimation	Patrick Rim et.al.	2503.17182	translate	read	null
2025-03-21	AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process	Junjie Hu et.al.	2503.17029	translate	read	null
2025-03-21	Distilling Monocular Foundation Model for Fine-grained Depth Completion	Yingping Liang et.al.	2503.16970	translate	read	null
2025-03-20	QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge	Xuan Shen et.al.	2503.16709	translate	read	link
2025-03-20	A Recipe for Generating 3D Worlds From a Single Image	Katja Schwarz et.al.	2503.16611	translate	read	null
2025-03-20	Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras	Beilei Cui et.al.	2503.15917	translate	read	null
2025-03-20	Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation	Jiyuan Wang et.al.	2503.15905	translate	read	null
2025-03-19	TULIP: Towards Unified Language-Image Pretraining	Zineng Tang et.al.	2503.15485	translate	read	null
2025-03-19	EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining	Boshen Xu et.al.	2503.15470	translate	read	null
2025-03-19	USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network	Joseph Emmanuel DL Dayo et.al.	2503.14950	translate	read	null
2025-03-18	Multi-view Reconstruction via SfM-guided Monocular Depth Estimation	Haoyu Guo et.al.	2503.14483	translate	read	null
2025-03-18	DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers	Mert Bulent Sariyildiz et.al.	2503.14405	translate	read	null
2025-03-18	3D Densification for Multi-Map Monocular VSLAM in Endoscopy	X. Anadón et.al.	2503.14346	translate	read	null
2025-03-17	MonoCT: Overcoming Monocular 3D Detection Domain Shift with Consistent Teacher Models	Johannes Meier et.al.	2503.13743	translate	read	null
2025-03-17	Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios	Iryna Repinetska et.al.	2503.13710	translate	read	null
2025-03-19	FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis	Luxi Chen et.al.	2503.13265	translate	read	null
2025-03-17	MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs	Erik Daxberger et.al.	2503.13111	translate	read	null
2025-03-17	TransDiff: Diffusion-Based Method for Manipulating Transparent Objects Using a Single RGB-D Image	Haoxiao Wang et.al.	2503.12779	translate	read	null
2025-03-16	UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing	Tsu-Jui Fu et.al.	2503.12652	translate	read	null
2025-03-16	Deblur Gaussian Splatting SLAM	Francesco Girlanda et.al.	2503.12572	translate	read	null
2025-03-14	VGGT: Visual Geometry Grounded Transformer	Jianyuan Wang et.al.	2503.11651	translate	read	null
2025-03-14	Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation	Hongyu Wen et.al.	2503.11633	translate	read	null
2025-03-14	Simulating Dual-Pixel Images From Ray Tracing For Depth Estimation	Fengchen He et.al.	2503.11213	translate	read	null
2025-03-13	Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations	Xunzhi Zheng et.al.	2503.10464	translate	read	null
2025-03-15	WonderVerse: Extendable 3D Scene Generation with Video Generative Models	Hao Feng et.al.	2503.09160	translate	read	null
2025-03-11	Language-Depth Navigated Thermal and Visible Image Fusion	Jinchang Zhang et.al.	2503.08676	translate	read	null
2025-03-11	CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning	Kaiqiang Xiong et.al.	2503.08219	translate	read	null
2025-03-10	SIRE: SE(3) Intrinsic Rigidity Embeddings	Cameron Smith et.al.	2503.07739	translate	read	null
2025-03-10	LBM: Latent Bridge Matching for Fast Image-to-Image Translation	Clément Chadebec et.al.	2503.07535	translate	read	link
2025-03-12	Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion	Mona Sheikh Zeinoddin et.al.	2503.07204	translate	read	null
2025-03-11	LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation	Quanjian Song et.al.	2503.06508	translate	read	null
2025-03-08	Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity	Xiaohao Xu et.al.	2503.06014	translate	read	link
2025-03-07	TomatoScanner: phenotyping tomato fruit based on only RGB image	Xiaobei Zhao et.al.	2503.05568	translate	read	null
2025-03-07	Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects	Justin Yu et.al.	2503.05189	translate	read	null
2025-03-05	RTFusion: A depth estimation network based on multimodal fusion in challenging scenarios	Zelin Meng et.al.	2503.04821	translate	read	null
2025-03-06	A Novel Solution for Drone Photogrammetry with Low-overlap Aerial Images using Monocular Depth Estimation	Jiageng Zhong et.al.	2503.04513	translate	read	null
2025-03-08	EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images	Rohit Menon et.al.	2503.04441	translate	read	null
2025-03-06	H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision	Yunxiao Shi et.al.	2503.04059	translate	read	null
2025-03-05	Task-Agnostic Attacks Against Vision Foundation Models	Brian Pulfer et.al.	2503.03842	translate	read	null
2025-03-05	Multi-View Depth Consistent Image Generation Using Generative AI Models: Application on Architectural Design of University Buildings	Xusheng Du et.al.	2503.03068	translate	read	null
2025-03-04	RGBSQGrasp: Inferring Local Superquadric Primitives from Single RGB Image for Graspability-Aware Bin Picking	Yifeng Xu et.al.	2503.02387	translate	read	null
2025-03-03	MUSt3R: Multi-view Network for Stereo 3D Reconstruction	Yohann Cabon et.al.	2503.01661	translate	read	null
2025-03-02	Bridging Spectral-wise and Multi-spectral Depth Estimation via Geometry-guided Contrastive Learning	Ukcheol Shin et.al.	2503.00793	translate	read	null
2025-03-03	Monocular Depth Estimation and Segmentation for Transparent Object with Iterative Semantic and Geometric Fusion	Jiangyuan Liu et.al.	2502.14616	translate	read	link

(<a href=../Depth_Estimation.md>back to Depth Estimation</a>)