Depth Estimation - 2026-03
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2026-03-31 | Extend3D: Town-Scale 3D Generation | Seungwoo Yoon et al. | 2603.29387 | translate | read | null |
| 2026-03-31 | StereoVGGT: A Training-Free Visual Geometry Transformer for Stereo Vision | Ziyang Chen et al. | 2603.29368 | translate | read | null |
| 2026-03-25 | EndoVGGT: GNN-Enhanced Depth Estimation for Surgical 3D Reconstruction | Falong Fan et al. | 2603.24577 | translate | read | null |
| 2026-03-24 | One View Is Enough! Monocular Training for In-the-Wild Novel View Generation | Adrien Ramanana Rahary et al. | 2603.23488 | translate | read | null |
| 2026-03-24 | Active Robotic Perception for Disease Detection and Mapping in Apple Trees | Hayden Feddock et al. | 2603.23112 | translate | read | null |
| 2026-03-24 | Generative Event Pretraining with Foundation Model Alignment | Jianwen Cao et al. | 2603.23032 | translate | read | null |
| 2026-03-23 | GenOpticalFlow: A Generative Approach to Unsupervised Optical Flow Learning | Yixuan Luo et al. | 2603.22270 | translate | read | null |
| 2026-03-22 | PAS3R: Pose-Adaptive Streaming 3D Reconstruction for Long Video Sequences | Lanbo Xu et al. | 2603.21436 | translate | read | null |
| 2026-03-22 | Single-Eye View: Monocular Real-time Perception Package for Autonomous Driving | Haixi Zhang et al. | 2603.21061 | translate | read | null |
| 2026-03-21 | The Role and Relationship of Initialization and Densification in 3D Gaussian Splatting | Ivan Desiatov et al. | 2603.20714 | translate | read | null |
| 2026-03-20 | CeRLP: A Cross-embodiment Robot Local Planning Framework for Visual Navigation | Haoyu Xi et al. | 2603.19602 | translate | read | null |
| 2026-03-20 | StreetForward: Perceiving Dynamic Street with Feedforward Causal Attention | Zhongrui Yu et al. | 2603.19552 | translate | read | null |
| 2026-03-20 | SeeClear: Reliable Transparent Object Depth Estimation via Generative Opacification | Xiaoying Wang et al. | 2603.19547 | translate | read | null |
| 2026-03-19 | VGGT-360: Geometry-Consistent Zero-Shot Panoramic Depth Estimation | Jiayi Yuan et al. | 2603.18943 | translate | read | null |
| 2026-03-18 | Semantic Segmentation and Depth Estimation for Real-Time Lunar Surface Mapping Using 3D Gaussian Splatting | Guillem Casadesus Vila et al. | 2603.18218 | translate | read | null |
| 2026-03-18 | UniSem: Generalizable Semantic 3D Reconstruction from Sparse Unposed Images | Guibiao Liao et al. | 2603.17519 | translate | read | null |
| 2026-03-18 | Stereo World Model: Camera-Guided Stereo Video Generation | Yang-Tian Sun et al. | 2603.17375 | translate | read | null |
| 2026-03-17 | LLM-Powered Flood Depth Estimation from Social Media Imagery: A Vision-Language Model Framework with Mechanistic Interpretability for Transportation Resilience | Nafis Fuad et al. | 2603.17108 | translate | read | null |
| 2026-03-17 | MessyKitchens: Contact-rich object-level 3D scene reconstruction | Junaid Ahmed Ansari et al. | 2603.16868 | translate | read | null |
| 2026-03-17 | WildDepth: A Multimodal Dataset for 3D Wildlife Perception and Depth Estimation | Muhammad Aamir et al. | 2603.16816 | translate | read | null |
| 2026-03-17 | $D^3$-RSMDE: 40$\times$ Faster and High-Fidelity Remote Sensing Monocular Depth Estimation | Ruizhi Wang et al. | 2603.16362 | translate | read | null |
| 2026-03-17 | Iris: Bringing Real-World Priors into Diffusion Model for Monocular Depth Estimation | Xinhao Cai et al. | 2603.16340 | translate | read | null |
| 2026-03-17 | PureCLIP-Depth: Prompt-Free and Decoder-Free Monocular Depth Estimation within CLIP Embedding Space | Ryutaro Miya et al. | 2603.16238 | translate | read | null |
| 2026-03-17 | Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generation | Yiming Huang et al. | 2603.16211 | translate | read | null |
| 2026-03-16 | Pointing-Based Object Recognition | Lukáš Hajdúch et al. | 2603.15403 | translate | read | null |
| 2026-03-16 | Spectral Rectification for Parameter-Efficient Adaptation of Foundation Models in Colonoscopy Depth Estimation | Xiaoxian Zhang et al. | 2603.15374 | translate | read | null |
| 2026-03-16 | Reference-Free Omnidirectional Stereo Matching via Multi-View Consistency Maximization | Lehuai Xu et al. | 2603.15019 | translate | read | null |
| 2026-03-16 | Thermal Image Refinement with Depth Estimation using Recurrent Networks for Monocular ORB-SLAM3 | Hürkan Şahin et al. | 2603.14998 | translate | read | null |
| 2026-03-16 | Fractal Autoregressive Depth Estimation with Continuous Token Diffusion | Jinchang Zhang et al. | 2603.14702 | translate | read | null |
| 2026-03-16 | E2EGS: Event-to-Edge Gaussian Splatting for Pose-Free 3D Reconstruction | Yunsoo Kim et al. | 2603.14684 | translate | read | null |
| 2026-03-15 | V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning | Lorenzo Mur-Labadia et al. | 2603.14482 | translate | read | null |
| 2026-03-12 | DVD: Deterministic Video Depth Estimation with Generative Priors | Hongfei Zhang et al. | 2603.12250 | translate | read | null |
| 2026-03-12 | R4Det: 4D Radar-Camera Fusion for High-Performance 3D Object Detection | Zhongyu Xia et al. | 2603.11566 | translate | read | null |
| 2026-03-11 | WalkGPT: Grounded Vision-Language Conversation with Depth-Aware Segmentation for Pedestrian Navigation | Rafi Ibn Sultan et al. | 2603.10703 | translate | read | null |
| 2026-03-11 | AsyncMDE: Real-Time Monocular Depth Estimation via Asynchronous Spatial Memory | Lianjie Ma et al. | 2603.10438 | translate | read | null |
| 2026-03-10 | SurgFed: Language-guided Multi-Task Federated Learning for Surgical Video Understanding | Zheng Fang et al. | 2603.09496 | translate | read | null |
| 2026-03-10 | EventVGGT: Exploring Cross-Modal Distillation for Consistent Event-based Depth Estimation | Yinrui Ren et al. | 2603.09385 | translate | read | null |
| 2026-03-10 | SpaceSense-Bench: A Large-Scale Multi-Modal Benchmark for Spacecraft Perception and Pose Estimation | Aodi Wu et al. | 2603.09320 | translate | read | null |
| 2026-03-09 | Viewpoint-Agnostic Grasp Pipeline using VLM and Partial Observations | Dilermando Almeida et al. | 2603.07866 | translate | read | null |
| 2026-03-08 | FrameVGGT: Frame Evidence Rolling Memory for streaming VGGT | Zhisong Xu et al. | 2603.07690 | translate | read | null |
| 2026-03-06 | SurgSync: Time-Synchronized Multi-Modal Data Collection Framework and Dataset for Surgical Robotics | Haoying Zhou et al. | 2603.06919 | translate | read | null |
| 2026-03-06 | CHMv2: Improvements in Global Canopy Height Mapping using DINOv3 | John Brandt et al. | 2603.06382 | translate | read | null |
| 2026-03-06 | RePer-360: Releasing Perspective Priors for 360$^\circ$ Depth Estimation via Self-Modulation | Cheng Guan et al. | 2603.05999 | translate | read | null |
| 2026-03-06 | EventGeM: Global-to-Local Feature Matching for Event-Based Visual Place Recognition | Adam D. Hines et al. | 2603.05807 | translate | read | null |
| 2026-03-05 | EmboAlign: Aligning Video Generation with Compositional Constraints for Zero-Shot Manipulation | Gehao Zhang et al. | 2603.05757 | translate | read | null |
| 2026-03-05 | Any to Full: Prompting Depth Anything for Depth Completion in One Stage | Zhiyuan Zhou et al. | 2603.05711 | translate | read | null |
| 2026-03-04 | LiDAR Prompted Spatio-Temporal Multi-View Stereo for Autonomous Driving | Qihao Sun et al. | 2603.03765 | translate | read | null |
| 2026-03-03 | Confidence-aware Monocular Depth Estimation for Minimally Invasive Surgery | Muhammad Asad et al. | 2603.03571 | translate | read | null |
| 2026-03-03 | The Dresden Dataset for 4D Reconstruction of Non-Rigid Abdominal Surgical Scenes | Reuben Docea et al. | 2603.02985 | translate | read | null |
| 2026-03-03 | DREAM: Where Visual Understanding Meets Text-to-Image Generation | Chao Li et al. | 2603.02667 | translate | read | null |
| 2026-03-02 | Learning Vision-Based Omnidirectional Navigation: A Teacher-Student Approach Using Monocular Depth Estimation | Jan Finke et al. | 2603.01999 | translate | read | null |
| 2026-03-02 | WildCross: A Cross-Modal Large Scale Benchmark for Place Recognition and Metric Depth Estimation in Natural Environments | Joshua Knights et al. | 2603.01475 | translate | read | null |
| 2026-03-01 | Dr.Occ: Depth- and Region-Guided 3D Occupancy from Surround-View Cameras for Autonomous Driving | Xubo Zhu et al. | 2603.01007 | translate | read | null |
(<a href=../Depth_Estimation.md>back to Depth Estimation</a>)