Depth Estimation - 2026-03

Publish Date Title Authors PDF Translate Read Code
2026-03-31 Extend3D: Town-Scale 3D Generation Seungwoo Yoon et.al. 2603.29387 translate read null
2026-03-31 StereoVGGT: A Training-Free Visual Geometry Transformer for Stereo Vision Ziyang Chen et.al. 2603.29368 translate read null
2026-03-25 EndoVGGT: GNN-Enhanced Depth Estimation for Surgical 3D Reconstruction Falong Fan et.al. 2603.24577 translate read null
2026-03-24 One View Is Enough! Monocular Training for In-the-Wild Novel View Generation Adrien Ramanana Rahary et.al. 2603.23488 translate read null
2026-03-24 Active Robotic Perception for Disease Detection and Mapping in Apple Trees Hayden Feddock et.al. 2603.23112 translate read null
2026-03-24 Generative Event Pretraining with Foundation Model Alignment Jianwen Cao et.al. 2603.23032 translate read null
2026-03-23 GenOpticalFlow: A Generative Approach to Unsupervised Optical Flow Learning Yixuan Luo et.al. 2603.22270 translate read null
2026-03-22 PAS3R: Pose-Adaptive Streaming 3D Reconstruction for Long Video Sequences Lanbo Xu et.al. 2603.21436 translate read null
2026-03-22 Single-Eye View: Monocular Real-time Perception Package for Autonomous Driving Haixi Zhang et.al. 2603.21061 translate read null
2026-03-21 The Role and Relationship of Initialization and Densification in 3D Gaussian Splatting Ivan Desiatov et.al. 2603.20714 translate read null
2026-03-20 CeRLP: A Cross-embodiment Robot Local Planning Framework for Visual Navigation Haoyu Xi et.al. 2603.19602 translate read null
2026-03-20 StreetForward: Perceiving Dynamic Street with Feedforward Causal Attention Zhongrui Yu et.al. 2603.19552 translate read null
2026-03-20 SeeClear: Reliable Transparent Object Depth Estimation via Generative Opacification Xiaoying Wang et.al. 2603.19547 translate read null
2026-03-19 VGGT-360: Geometry-Consistent Zero-Shot Panoramic Depth Estimation Jiayi Yuan et.al. 2603.18943 translate read null
2026-03-18 Semantic Segmentation and Depth Estimation for Real-Time Lunar Surface Mapping Using 3D Gaussian Splatting Guillem Casadesus Vila et.al. 2603.18218 translate read null
2026-03-18 UniSem: Generalizable Semantic 3D Reconstruction from Sparse Unposed Images Guibiao Liao et.al. 2603.17519 translate read null
2026-03-18 Stereo World Model: Camera-Guided Stereo Video Generation Yang-Tian Sun et.al. 2603.17375 translate read null
2026-03-17 LLM-Powered Flood Depth Estimation from Social Media Imagery: A Vision-Language Model Framework with Mechanistic Interpretability for Transportation Resilience Nafis Fuad et.al. 2603.17108 translate read null
2026-03-17 MessyKitchens: Contact-rich object-level 3D scene reconstruction Junaid Ahmed Ansari et.al. 2603.16868 translate read null
2026-03-17 WildDepth: A Multimodal Dataset for 3D Wildlife Perception and Depth Estimation Muhammad Aamir et.al. 2603.16816 translate read null
2026-03-17 $D^3$-RSMDE: 40$\times$ Faster and High-Fidelity Remote Sensing Monocular Depth Estimation Ruizhi Wang et.al. 2603.16362 translate read null
2026-03-17 Iris: Bringing Real-World Priors into Diffusion Model for Monocular Depth Estimation Xinhao Cai et.al. 2603.16340 translate read null
2026-03-17 PureCLIP-Depth: Prompt-Free and Decoder-Free Monocular Depth Estimation within CLIP Embedding Space Ryutaro Miya et.al. 2603.16238 translate read null
2026-03-17 Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generation Yiming Huang et.al. 2603.16211 translate read null
2026-03-16 Pointing-Based Object Recognition Lukáš Hajdúch et.al. 2603.15403 translate read null
2026-03-16 Spectral Rectification for Parameter-Efficient Adaptation of Foundation Models in Colonoscopy Depth Estimation Xiaoxian Zhang et.al. 2603.15374 translate read null
2026-03-16 Reference-Free Omnidirectional Stereo Matching via Multi-View Consistency Maximization Lehuai Xu et.al. 2603.15019 translate read null
2026-03-16 Thermal Image Refinement with Depth Estimation using Recurrent Networks for Monocular ORB-SLAM3 Hürkan Şahin et.al. 2603.14998 translate read null
2026-03-16 Fractal Autoregressive Depth Estimation with Continuous Token Diffusion Jinchang Zhang et.al. 2603.14702 translate read null
2026-03-16 E2EGS: Event-to-Edge Gaussian Splatting for Pose-Free 3D Reconstruction Yunsoo Kim et.al. 2603.14684 translate read null
2026-03-15 V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning Lorenzo Mur-Labadia et.al. 2603.14482 translate read null
2026-03-12 DVD: Deterministic Video Depth Estimation with Generative Priors Hongfei Zhang et.al. 2603.12250 translate read null
2026-03-12 R4Det: 4D Radar-Camera Fusion for High-Performance 3D Object Detection Zhongyu Xia et.al. 2603.11566 translate read null
2026-03-11 WalkGPT: Grounded Vision-Language Conversation with Depth-Aware Segmentation for Pedestrian Navigation Rafi Ibn Sultan et.al. 2603.10703 translate read null
2026-03-11 AsyncMDE: Real-Time Monocular Depth Estimation via Asynchronous Spatial Memory Lianjie Ma et.al. 2603.10438 translate read null
2026-03-10 SurgFed: Language-guided Multi-Task Federated Learning for Surgical Video Understanding Zheng Fang et.al. 2603.09496 translate read null
2026-03-10 EventVGGT: Exploring Cross-Modal Distillation for Consistent Event-based Depth Estimation Yinrui Ren et.al. 2603.09385 translate read null
2026-03-10 SpaceSense-Bench: A Large-Scale Multi-Modal Benchmark for Spacecraft Perception and Pose Estimation Aodi Wu et.al. 2603.09320 translate read null
2026-03-09 Viewpoint-Agnostic Grasp Pipeline using VLM and Partial Observations Dilermando Almeida et.al. 2603.07866 translate read null
2026-03-08 FrameVGGT: Frame Evidence Rolling Memory for streaming VGGT Zhisong Xu et.al. 2603.07690 translate read null
2026-03-06 SurgSync: Time-Synchronized Multi-Modal Data Collection Framework and Dataset for Surgical Robotics Haoying Zhou et.al. 2603.06919 translate read null
2026-03-06 CHMv2: Improvements in Global Canopy Height Mapping using DINOv3 John Brandt et.al. 2603.06382 translate read null
2026-03-06 RePer-360: Releasing Perspective Priors for 360 $^\circ$ Depth Estimation via Self-Modulation Cheng Guan et.al. 2603.05999 translate read null
2026-03-06 EventGeM: Global-to-Local Feature Matching for Event-Based Visual Place Recognition Adam D. Hines et.al. 2603.05807 translate read null
2026-03-05 EmboAlign: Aligning Video Generation with Compositional Constraints for Zero-Shot Manipulation Gehao Zhang et.al. 2603.05757 translate read null
2026-03-05 Any to Full: Prompting Depth Anything for Depth Completion in One Stage Zhiyuan Zhou et.al. 2603.05711 translate read null
2026-03-04 LiDAR Prompted Spatio-Temporal Multi-View Stereo for Autonomous Driving Qihao Sun et.al. 2603.03765 translate read null
2026-03-03 Confidence-aware Monocular Depth Estimation for Minimally Invasive Surgery Muhammad Asad et.al. 2603.03571 translate read null
2026-03-03 The Dresden Dataset for 4D Reconstruction of Non-Rigid Abdominal Surgical Scenes Reuben Docea et.al. 2603.02985 translate read null
2026-03-03 DREAM: Where Visual Understanding Meets Text-to-Image Generation Chao Li et.al. 2603.02667 translate read null
2026-03-02 Learning Vision-Based Omnidirectional Navigation: A Teacher-Student Approach Using Monocular Depth Estimation Jan Finke et.al. 2603.01999 translate read null
2026-03-02 WildCross: A Cross-Modal Large Scale Benchmark for Place Recognition and Metric Depth Estimation in Natural Environments Joshua Knights et.al. 2603.01475 translate read null
2026-03-01 Dr.Occ: Depth- and Region-Guided 3D Occupancy from Surround-View Cameras for Autonomous Driving Xubo Zhu et.al. 2603.01007 translate read null

(<a href=../Depth_Estimation.md>back to Depth Estimation</a>)