Depth Estimation - 2026-03 | Paper Arxiv Daily

Publish Date	Title	Authors	PDF (arXiv ID)	Translate	Read	Code
2026-03-31	Extend3D: Town-Scale 3D Generation	Seungwoo Yoon et.al.	2603.29387	translate	read	null
2026-03-31	StereoVGGT: A Training-Free Visual Geometry Transformer for Stereo Vision	Ziyang Chen et.al.	2603.29368	translate	read	null
2026-03-25	EndoVGGT: GNN-Enhanced Depth Estimation for Surgical 3D Reconstruction	Falong Fan et.al.	2603.24577	translate	read	null
2026-03-24	One View Is Enough! Monocular Training for In-the-Wild Novel View Generation	Adrien Ramanana Rahary et.al.	2603.23488	translate	read	null
2026-03-24	Active Robotic Perception for Disease Detection and Mapping in Apple Trees	Hayden Feddock et.al.	2603.23112	translate	read	null
2026-03-24	Generative Event Pretraining with Foundation Model Alignment	Jianwen Cao et.al.	2603.23032	translate	read	null
2026-03-23	GenOpticalFlow: A Generative Approach to Unsupervised Optical Flow Learning	Yixuan Luo et.al.	2603.22270	translate	read	null
2026-03-22	PAS3R: Pose-Adaptive Streaming 3D Reconstruction for Long Video Sequences	Lanbo Xu et.al.	2603.21436	translate	read	null
2026-03-22	Single-Eye View: Monocular Real-time Perception Package for Autonomous Driving	Haixi Zhang et.al.	2603.21061	translate	read	null
2026-03-21	The Role and Relationship of Initialization and Densification in 3D Gaussian Splatting	Ivan Desiatov et.al.	2603.20714	translate	read	null
2026-03-20	CeRLP: A Cross-embodiment Robot Local Planning Framework for Visual Navigation	Haoyu Xi et.al.	2603.19602	translate	read	null
2026-03-20	StreetForward: Perceiving Dynamic Street with Feedforward Causal Attention	Zhongrui Yu et.al.	2603.19552	translate	read	null
2026-03-20	SeeClear: Reliable Transparent Object Depth Estimation via Generative Opacification	Xiaoying Wang et.al.	2603.19547	translate	read	null
2026-03-19	VGGT-360: Geometry-Consistent Zero-Shot Panoramic Depth Estimation	Jiayi Yuan et.al.	2603.18943	translate	read	null
2026-03-18	Semantic Segmentation and Depth Estimation for Real-Time Lunar Surface Mapping Using 3D Gaussian Splatting	Guillem Casadesus Vila et.al.	2603.18218	translate	read	null
2026-03-18	UniSem: Generalizable Semantic 3D Reconstruction from Sparse Unposed Images	Guibiao Liao et.al.	2603.17519	translate	read	null
2026-03-18	Stereo World Model: Camera-Guided Stereo Video Generation	Yang-Tian Sun et.al.	2603.17375	translate	read	null
2026-03-17	LLM-Powered Flood Depth Estimation from Social Media Imagery: A Vision-Language Model Framework with Mechanistic Interpretability for Transportation Resilience	Nafis Fuad et.al.	2603.17108	translate	read	null
2026-03-17	MessyKitchens: Contact-rich object-level 3D scene reconstruction	Junaid Ahmed Ansari et.al.	2603.16868	translate	read	null
2026-03-17	WildDepth: A Multimodal Dataset for 3D Wildlife Perception and Depth Estimation	Muhammad Aamir et.al.	2603.16816	translate	read	null
2026-03-17	$D^3$-RSMDE: 40$\times$ Faster and High-Fidelity Remote Sensing Monocular Depth Estimation	Ruizhi Wang et.al.	2603.16362	translate	read	null
2026-03-17	Iris: Bringing Real-World Priors into Diffusion Model for Monocular Depth Estimation	Xinhao Cai et.al.	2603.16340	translate	read	null
2026-03-17	PureCLIP-Depth: Prompt-Free and Decoder-Free Monocular Depth Estimation within CLIP Embedding Space	Ryutaro Miya et.al.	2603.16238	translate	read	null
2026-03-17	Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generation	Yiming Huang et.al.	2603.16211	translate	read	null
2026-03-16	Pointing-Based Object Recognition	Lukáš Hajdúch et.al.	2603.15403	translate	read	null
2026-03-16	Spectral Rectification for Parameter-Efficient Adaptation of Foundation Models in Colonoscopy Depth Estimation	Xiaoxian Zhang et.al.	2603.15374	translate	read	null
2026-03-16	Reference-Free Omnidirectional Stereo Matching via Multi-View Consistency Maximization	Lehuai Xu et.al.	2603.15019	translate	read	null
2026-03-16	Thermal Image Refinement with Depth Estimation using Recurrent Networks for Monocular ORB-SLAM3	Hürkan Şahin et.al.	2603.14998	translate	read	null
2026-03-16	Fractal Autoregressive Depth Estimation with Continuous Token Diffusion	Jinchang Zhang et.al.	2603.14702	translate	read	null
2026-03-16	E2EGS: Event-to-Edge Gaussian Splatting for Pose-Free 3D Reconstruction	Yunsoo Kim et.al.	2603.14684	translate	read	null
2026-03-15	V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning	Lorenzo Mur-Labadia et.al.	2603.14482	translate	read	null
2026-03-12	DVD: Deterministic Video Depth Estimation with Generative Priors	Hongfei Zhang et.al.	2603.12250	translate	read	null
2026-03-12	R4Det: 4D Radar-Camera Fusion for High-Performance 3D Object Detection	Zhongyu Xia et.al.	2603.11566	translate	read	null
2026-03-11	WalkGPT: Grounded Vision-Language Conversation with Depth-Aware Segmentation for Pedestrian Navigation	Rafi Ibn Sultan et.al.	2603.10703	translate	read	null
2026-03-11	AsyncMDE: Real-Time Monocular Depth Estimation via Asynchronous Spatial Memory	Lianjie Ma et.al.	2603.10438	translate	read	null
2026-03-10	SurgFed: Language-guided Multi-Task Federated Learning for Surgical Video Understanding	Zheng Fang et.al.	2603.09496	translate	read	null
2026-03-10	EventVGGT: Exploring Cross-Modal Distillation for Consistent Event-based Depth Estimation	Yinrui Ren et.al.	2603.09385	translate	read	null
2026-03-10	SpaceSense-Bench: A Large-Scale Multi-Modal Benchmark for Spacecraft Perception and Pose Estimation	Aodi Wu et.al.	2603.09320	translate	read	null
2026-03-09	Viewpoint-Agnostic Grasp Pipeline using VLM and Partial Observations	Dilermando Almeida et.al.	2603.07866	translate	read	null
2026-03-08	FrameVGGT: Frame Evidence Rolling Memory for streaming VGGT	Zhisong Xu et.al.	2603.07690	translate	read	null
2026-03-06	SurgSync: Time-Synchronized Multi-Modal Data Collection Framework and Dataset for Surgical Robotics	Haoying Zhou et.al.	2603.06919	translate	read	null
2026-03-06	CHMv2: Improvements in Global Canopy Height Mapping using DINOv3	John Brandt et.al.	2603.06382	translate	read	null
2026-03-06	RePer-360: Releasing Perspective Priors for 360$^\circ$ Depth Estimation via Self-Modulation	Cheng Guan et.al.	2603.05999	translate	read	null
2026-03-06	EventGeM: Global-to-Local Feature Matching for Event-Based Visual Place Recognition	Adam D. Hines et.al.	2603.05807	translate	read	null
2026-03-05	EmboAlign: Aligning Video Generation with Compositional Constraints for Zero-Shot Manipulation	Gehao Zhang et.al.	2603.05757	translate	read	null
2026-03-05	Any to Full: Prompting Depth Anything for Depth Completion in One Stage	Zhiyuan Zhou et.al.	2603.05711	translate	read	null
2026-03-04	LiDAR Prompted Spatio-Temporal Multi-View Stereo for Autonomous Driving	Qihao Sun et.al.	2603.03765	translate	read	null
2026-03-03	Confidence-aware Monocular Depth Estimation for Minimally Invasive Surgery	Muhammad Asad et.al.	2603.03571	translate	read	null
2026-03-03	The Dresden Dataset for 4D Reconstruction of Non-Rigid Abdominal Surgical Scenes	Reuben Docea et.al.	2603.02985	translate	read	null
2026-03-03	DREAM: Where Visual Understanding Meets Text-to-Image Generation	Chao Li et.al.	2603.02667	translate	read	null
2026-03-02	Learning Vision-Based Omnidirectional Navigation: A Teacher-Student Approach Using Monocular Depth Estimation	Jan Finke et.al.	2603.01999	translate	read	null
2026-03-02	WildCross: A Cross-Modal Large Scale Benchmark for Place Recognition and Metric Depth Estimation in Natural Environments	Joshua Knights et.al.	2603.01475	translate	read	null
2026-03-01	Dr.Occ: Depth- and Region-Guided 3D Occupancy from Surround-View Cameras for Autonomous Driving	Xubo Zhu et.al.	2603.01007	translate	read	null