Depth Estimation - 2025-12
Depth Estimation - 2025-12
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2025-12-31 | Projection-based Adversarial Attack using Physics-in-the-Loop Optimization for Monocular Depth Estimation | Takeru Kusakabe et.al. | 2512.24792 | translate | read | null |
| 2025-12-30 | Guided Diffusion-based Generation of Adversarial Objects for Real-World Monocular Depth Estimation Attacks | Yongtao Chen et.al. | 2512.24111 | translate | read | null |
| 2025-12-29 | Leveraging Synthetic Priors for Monocular Depth Estimation in Specular Surgical Environments | Ankan Aich et.al. | 2512.23786 | translate | read | null |
| 2025-12-28 | With Great Context Comes Great Prediction Power: Classifying Objects via Geo-Semantic Scene Graphs | Ciprian Constantinescu et.al. | 2512.23024 | translate | read | null |
| 2025-12-28 | Depth Anything in $360^\circ$ : Towards Scale Invariance in the Wild | Hualie Jiang et.al. | 2512.22819 | translate | read | null |
| 2025-12-27 | Visual Autoregressive Modelling for Monocular Depth Estimation | Amir El-Ghoussani et.al. | 2512.22653 | translate | read | null |
| 2025-12-26 | iOSPointMapper: RealTime Pedestrian and Accessibility Mapping with Mobile AI | Himanshu Naidu et.al. | 2512.22392 | translate | read | null |
| 2025-12-26 | Bab_Sak Robotic Intubation System (BRIS): A Learning-Enabled Control Framework for Safe Fiberoptic Endotracheal Intubation | Saksham Gupta et.al. | 2512.21983 | translate | read | null |
| 2025-12-26 | StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision | Shengliang Deng et.al. | 2512.21970 | translate | read | null |
| 2025-12-22 | CoDrone: Autonomous Drone Navigation Assisted by Edge and Cloud Foundation Models | Pengyu Chen et.al. | 2512.19083 | translate | read | null |
| 2025-12-22 | CETCAM: Camera-Controllable Video Generation via Consistent and Extensible Tokenization | Zelin Zhao et.al. | 2512.19020 | translate | read | null |
| 2025-12-21 | A Study of Finetuning Video Transformers for Multi-view Geometry Tasks | Huimin Wu et.al. | 2512.18684 | translate | read | null |
| 2025-12-20 | EndoStreamDepth: Temporally Consistent Monocular Depth Estimation for Endoscopic Video Streams | Hao Li et.al. | 2512.18159 | translate | read | null |
| 2025-12-17 | A Modular Framework for Single-View 3D Reconstruction of Indoor Environments | Yuxiao Li et.al. | 2512.17955 | translate | read | null |
| 2025-12-19 | Re-Depth Anything: Test-Time Depth Refinement via Self-Supervised Re-lighting | Ananta R. Bhattarai et.al. | 2512.17908 | translate | read | null |
| 2025-12-19 | Long-Range depth estimation using learning based Hybrid Distortion Model for CCTV cameras | Ami Pandat et.al. | 2512.17784 | translate | read | null |
| 2025-12-19 | SAVeD: A First-Person Social Media Video Dataset for ADAS-equipped vehicle Near-Miss and Crash Event Analyses | Shaoyan Zhai et.al. | 2512.17724 | translate | read | null |
| 2025-12-18 | Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation | Min-Jung Kim et.al. | 2512.17040 | translate | read | null |
| 2025-12-18 | Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation | Xin Lin et.al. | 2512.16913 | translate | read | null |
| 2025-12-18 | N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models | Yuxin Wang et.al. | 2512.16561 | translate | read | null |
| 2025-12-17 | In Pursuit of Pixel Supervision for Visual Pre-training | Lihe Yang et.al. | 2512.15715 | translate | read | null |
| 2025-12-16 | DASP: Self-supervised Nighttime Monocular Depth Estimation with Domain Adaptation of Spatiotemporal Priors | Yiheng Huang et.al. | 2512.14536 | translate | read | null |
| 2025-12-16 | Elastic3D: Controllable Stereo Video Conversion with Guided Latent Decoding | Nando Metzger et.al. | 2512.14236 | translate | read | null |
| 2025-12-16 | Robust Single-shot Structured Light 3D Imaging via Neural Feature Decoding | Jiaheng Li et.al. | 2512.14028 | translate | read | null |
| 2025-12-16 | Deep Learning Perspective of Scene Understanding in Autonomous Robots | Afia Maham et.al. | 2512.14020 | translate | read | null |
| 2025-12-15 | StarryGazer: Leveraging Monocular Depth Estimation Models for Domain-Agnostic Single Depth Image Completion | Sangmin Hong et.al. | 2512.13147 | translate | read | null |
| 2025-12-13 | BokehDepth: Enhancing Monocular Depth Estimation through Bokeh Generation | Hangwei Zhang et.al. | 2512.12425 | translate | read | null |
| 2025-12-12 | ProbeMDE: Uncertainty-Guided Active Proprioception for Monocular Depth Estimation in Surgical Robotics | Britton Jordan et.al. | 2512.11773 | translate | read | null |
| 2025-12-11 | Empowering Dynamic Urban Navigation with Stereo and Mid-Level Vision | Wentao Zhou et.al. | 2512.10956 | translate | read | null |
| 2025-12-11 | Video Depth Propagation | Luigi Piccinelli et.al. | 2512.10725 | translate | read | null |
| 2025-12-11 | SpaceDrive: Infusing Spatial Awareness into VLM-based Autonomous Driving | Peizheng Li et.al. | 2512.10719 | translate | read | null |
| 2025-12-11 | Robust Shape from Focus via Multiscale Directional Dilated Laplacian and Recurrent Network | Khurram Ashfaq et.al. | 2512.10498 | translate | read | null |
| 2025-12-09 | Scale-invariant and View-relational Representation Learning for Full Surround Monocular Depth | Kyumin Hwang et.al. | 2512.08700 | translate | read | null |
| 2025-12-09 | Development & first Performance evaluation of multi-element monolithic HPGe detector for X-ray spectroscopy | N. Goyal et.al. | 2512.08389 | translate | read | null |
| 2025-12-09 | Accuracy Does Not Guarantee Human-Likeness in Monocular Depth Estimators | Yuki Kubota et.al. | 2512.08163 | translate | read | null |
| 2025-12-08 | More than Segmentation: Benchmarking SAM 3 for Segmentation, 3D Perception, and Reconstruction in Robotic Surgery | Wenzhen Dong et.al. | 2512.07596 | translate | read | null |
| 2025-12-07 | CoT4Det: A Chain-of-Thought Framework for Perception-Oriented Vision-Language Tasks | Yu Qi et.al. | 2512.06663 | translate | read | null |
| 2025-12-06 | HuPrior3R: Incorporating Human Priors for Better 3D Dynamic Reconstruction from Monocular Videos | Weitao Xiong et.al. | 2512.06368 | translate | read | null |
| 2025-12-05 | See in Depth: Training-Free Surgical Scene Segmentation with Monocular Depth Priors | Kunyi Yang et.al. | 2512.05529 | translate | read | null |
| 2025-12-05 | YOLO and SGBM Integration for Autonomous Tree Branch Detection and Depth Estimation in Radiata Pine Pruning Applications | Yida Lin et.al. | 2512.05412 | translate | read | null |
| 2025-12-03 | Gamma-from-Mono: Road-Relative, Metric, Self-Supervised Monocular Geometry for Vehicular Applications | Gasser Elazab et.al. | 2512.04303 | translate | read | null |
| 2025-12-03 | Unique Lives, Shared World: Learning from Single-Life Videos | Tengda Han et.al. | 2512.04085 | translate | read | null |
| 2025-12-03 | SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL | Siyi Chen et.al. | 2512.04069 | translate | read | null |
| 2025-12-03 | MDE-AgriVLN: Agricultural Vision-and-Language Navigation with Monocular Depth Estimation | Xiaobei Zhao et.al. | 2512.03958 | translate | read | null |
| 2025-12-03 | Generalization Evaluation of Deep Stereo Matching Methods for UAV-Based Forestry Applications | Yida Lin et.al. | 2512.03427 | translate | read | null |
| 2025-12-02 | DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling | Kairun Wen et.al. | 2512.03000 | translate | read | null |
| 2025-12-02 | BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection | Guowen Zhang et.al. | 2512.02972 | translate | read | null |
| 2025-12-01 | DepthScape: Authoring 2.5D Designs via Depth Estimation, Semantic Understanding, and Geometry Extraction | Xia Su et.al. | 2512.02263 | translate | read | null |
| 2025-12-01 | BlinkBud: Detecting Hazards from Behind via Sampled Monocular 3D Detection on a Single Earbud | Yunzhe Li et.al. | 2512.01366 | translate | read | null |
(<a href=../Depth_Estimation.md>back to Depth Estimation</a>)