Depth Estimation - 2025-03
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2025-03-31 | ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image | Tianyi Gong et.al. | 2503.23881 | translate | read | null |
| 2025-03-31 | Detail-aware multi-view stereo network for depth estimation | Haitao Tian et.al. | 2503.23684 | translate | read | null |
| 2025-03-30 | Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries | Wei Xu et.al. | 2503.23606 | translate | read | null |
| 2025-03-30 | Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model | Jannik Endres et.al. | 2503.23502 | translate | read | link |
| 2025-03-28 | SemAlign3D: Semantic Correspondence between RGB-Images through Aligning 3D Object-Class Representations | Krispin Wandel et.al. | 2503.22462 | translate | read | null |
| 2025-03-28 | EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian Splatting | Xu Wang et.al. | 2503.22437 | translate | read | link |
| 2025-03-28 | MVSAnywhere: Zero-Shot Multi-View Stereo | Sergio Izquierdo et.al. | 2503.22430 | translate | read | null |
| 2025-03-28 | One Look is Enough: A Novel Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation Models on High-Resolution Images | Byeongjun Kwon et.al. | 2503.22351 | translate | read | null |
| 2025-03-28 | Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces | Wonhyeok Choi et.al. | 2503.22209 | translate | read | null |
| 2025-03-28 | Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges | Ukcheol Shin et.al. | 2503.22060 | translate | read | link |
| 2025-03-27 | A Unified Image-Dense Annotation Generation Model for Underwater Scenes | Hongkai Lin et.al. | 2503.21771 | translate | read | link |
| 2025-03-27 | ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo | Yuxi Hu et.al. | 2503.21525 | translate | read | null |
| 2025-03-26 | Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors | Weilong Yan et.al. | 2503.20211 | translate | read | link |
| 2025-03-26 | FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion | Pihai Sun et.al. | 2503.19739 | translate | read | link |
| 2025-03-25 | Semi-SD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving | Yusen Xie et.al. | 2503.19713 | translate | read | link |
| 2025-03-25 | StableGS: A Floater-Free Framework for 3D Gaussian Splatting | Luchao Wang et.al. | 2503.18458 | translate | read | null |
| 2025-03-24 | PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes | Xinhua Xu et.al. | 2503.18393 | translate | read | null |
| 2025-03-23 | Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images | Yara AlaaEldin et.al. | 2503.17982 | translate | read | link |
| 2025-03-21 | Radar-Guided Polynomial Fitting for Metric Depth Estimation | Patrick Rim et.al. | 2503.17182 | translate | read | null |
| 2025-03-21 | AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process | Junjie Hu et.al. | 2503.17029 | translate | read | null |
| 2025-03-21 | Distilling Monocular Foundation Model for Fine-grained Depth Completion | Yingping Liang et.al. | 2503.16970 | translate | read | null |
| 2025-03-20 | QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge | Xuan Shen et.al. | 2503.16709 | translate | read | link |
| 2025-03-20 | A Recipe for Generating 3D Worlds From a Single Image | Katja Schwarz et.al. | 2503.16611 | translate | read | null |
| 2025-03-20 | Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras | Beilei Cui et.al. | 2503.15917 | translate | read | null |
| 2025-03-20 | Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation | Jiyuan Wang et.al. | 2503.15905 | translate | read | null |
| 2025-03-19 | TULIP: Towards Unified Language-Image Pretraining | Zineng Tang et.al. | 2503.15485 | translate | read | null |
| 2025-03-19 | EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining | Boshen Xu et.al. | 2503.15470 | translate | read | null |
| 2025-03-19 | USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network | Joseph Emmanuel DL Dayo et.al. | 2503.14950 | translate | read | null |
| 2025-03-18 | Multi-view Reconstruction via SfM-guided Monocular Depth Estimation | Haoyu Guo et.al. | 2503.14483 | translate | read | null |
| 2025-03-18 | DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers | Mert Bulent Sariyildiz et.al. | 2503.14405 | translate | read | null |
| 2025-03-18 | 3D Densification for Multi-Map Monocular VSLAM in Endoscopy | X. Anadón et.al. | 2503.14346 | translate | read | null |
| 2025-03-17 | MonoCT: Overcoming Monocular 3D Detection Domain Shift with Consistent Teacher Models | Johannes Meier et.al. | 2503.13743 | translate | read | null |
| 2025-03-17 | Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios | Iryna Repinetska et.al. | 2503.13710 | translate | read | null |
| 2025-03-19 | FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis | Luxi Chen et.al. | 2503.13265 | translate | read | null |
| 2025-03-17 | MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs | Erik Daxberger et.al. | 2503.13111 | translate | read | null |
| 2025-03-17 | TransDiff: Diffusion-Based Method for Manipulating Transparent Objects Using a Single RGB-D Image | Haoxiao Wang et.al. | 2503.12779 | translate | read | null |
| 2025-03-16 | UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing | Tsu-Jui Fu et.al. | 2503.12652 | translate | read | null |
| 2025-03-16 | Deblur Gaussian Splatting SLAM | Francesco Girlanda et.al. | 2503.12572 | translate | read | null |
| 2025-03-14 | VGGT: Visual Geometry Grounded Transformer | Jianyuan Wang et.al. | 2503.11651 | translate | read | null |
| 2025-03-14 | Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation | Hongyu Wen et.al. | 2503.11633 | translate | read | null |
| 2025-03-14 | Simulating Dual-Pixel Images From Ray Tracing For Depth Estimation | Fengchen He et.al. | 2503.11213 | translate | read | null |
| 2025-03-13 | Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations | Xunzhi Zheng et.al. | 2503.10464 | translate | read | null |
| 2025-03-15 | WonderVerse: Extendable 3D Scene Generation with Video Generative Models | Hao Feng et.al. | 2503.09160 | translate | read | null |
| 2025-03-11 | Language-Depth Navigated Thermal and Visible Image Fusion | Jinchang Zhang et.al. | 2503.08676 | translate | read | null |
| 2025-03-11 | CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning | Kaiqiang Xiong et.al. | 2503.08219 | translate | read | null |
| 2025-03-10 | SIRE: SE(3) Intrinsic Rigidity Embeddings | Cameron Smith et.al. | 2503.07739 | translate | read | null |
| 2025-03-10 | LBM: Latent Bridge Matching for Fast Image-to-Image Translation | Clément Chadebec et.al. | 2503.07535 | translate | read | link |
| 2025-03-12 | Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion | Mona Sheikh Zeinoddin et.al. | 2503.07204 | translate | read | null |
| 2025-03-11 | LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation | Quanjian Song et.al. | 2503.06508 | translate | read | null |
| 2025-03-08 | Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity | Xiaohao Xu et.al. | 2503.06014 | translate | read | link |
| 2025-03-07 | TomatoScanner: phenotyping tomato fruit based on only RGB image | Xiaobei Zhao et.al. | 2503.05568 | translate | read | null |
| 2025-03-07 | Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects | Justin Yu et.al. | 2503.05189 | translate | read | null |
| 2025-03-05 | RTFusion: A depth estimation network based on multimodal fusion in challenging scenarios | Zelin Meng et.al. | 2503.04821 | translate | read | null |
| 2025-03-06 | A Novel Solution for Drone Photogrammetry with Low-overlap Aerial Images using Monocular Depth Estimation | Jiageng Zhong et.al. | 2503.04513 | translate | read | null |
| 2025-03-08 | EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images | Rohit Menon et.al. | 2503.04441 | translate | read | null |
| 2025-03-06 | H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision | Yunxiao Shi et.al. | 2503.04059 | translate | read | null |
| 2025-03-05 | Task-Agnostic Attacks Against Vision Foundation Models | Brian Pulfer et.al. | 2503.03842 | translate | read | null |
| 2025-03-05 | Multi-View Depth Consistent Image Generation Using Generative AI Models: Application on Architectural Design of University Buildings | Xusheng Du et.al. | 2503.03068 | translate | read | null |
| 2025-03-04 | RGBSQGrasp: Inferring Local Superquadric Primitives from Single RGB Image for Graspability-Aware Bin Picking | Yifeng Xu et.al. | 2503.02387 | translate | read | null |
| 2025-03-03 | MUSt3R: Multi-view Network for Stereo 3D Reconstruction | Yohann Cabon et.al. | 2503.01661 | translate | read | null |
| 2025-03-02 | Bridging Spectral-wise and Multi-spectral Depth Estimation via Geometry-guided Contrastive Learning | Ukcheol Shin et.al. | 2503.00793 | translate | read | null |
| 2025-03-03 | Monocular Depth Estimation and Segmentation for Transparent Object with Iterative Semantic and Geometric Fusion | Jiangyuan Liu et.al. | 2502.14616 | translate | read | link |