Depth Estimation - 2025-06
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2025-06-30 | SurgiSR4K: A High-Resolution Endoscopic Video Dataset for Robotic-Assisted Minimally Invasive Procedures | Fengyi Jiang et al. | 2507.00209 | translate | read | null |
| 2025-06-30 | OcRFDet: Object-Centric Radiance Fields for Multi-View 3D Object Detection in Autonomous Driving | Mingqian Ji et al. | 2506.23565 | translate | read | null |
| 2025-06-26 | ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation | Shruti Bansal et al. | 2506.20969 | translate | read | null |
| 2025-06-25 | THIRDEYE: Cue-Aware Monocular Depth Estimation via Brain-Inspired Multi-Stage Fusion | Calin Teodor Ioan et al. | 2506.20877 | translate | read | null |
| 2025-06-30 | StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation | Haodong Li et al. | 2506.20756 | translate | read | null |
| 2025-06-24 | Look to Locate: Vision-Based Multisensory Navigation with 3-D Digital Maps for GNSS-Challenged Environments | Ola Elmaghraby et al. | 2506.19827 | translate | read | null |
| 2025-06-23 | SOF: Sorted Opacity Fields for Fast Unbounded Surface Reconstruction | Lukas Radl et al. | 2506.19139 | translate | read | null |
| 2025-06-23 | BulletGen: Improving 4D Reconstruction with Bullet-Time Generation | Denys Rozumnyi et al. | 2506.18601 | translate | read | null |
| 2025-06-21 | Optimization-Free Patch Attack on Stereo Depth Estimation | Hangcheng Liu et al. | 2506.17632 | translate | read | null |
| 2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et al. | 2506.17206 | translate | read | link |
| 2025-06-20 | RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking | Teng Guo et al. | 2506.17119 | translate | read | link |
| 2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et al. | 2506.17110 | translate | read | null |
| 2025-06-20 | DepthVanish: Optimizing Adversarial Interval Structures for Stereo-Depth-Invisible Patches | Yun Xing et al. | 2506.16690 | translate | read | null |
| 2025-06-19 | EndoMUST: Monocular Depth Estimation for Robotic Endoscopy via End-to-end Multi-step Self-supervised Training | Liangjing Shao et al. | 2506.16017 | translate | read | link |
| 2025-06-18 | RaCalNet: Radar Calibration Network for Sparse-Supervised Metric Depth Estimation | Xingrui Qin et al. | 2506.15560 | translate | read | null |
| 2025-06-17 | Time-Optimized Safe Navigation in Unstructured Environments through Learning Based Depth Completion | Jeffrey Mao et al. | 2506.14975 | translate | read | null |
| 2025-06-17 | DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning | Kunal Swami et al. | 2506.14709 | translate | read | null |
| 2025-06-16 | Test3R: Learning to Reconstruct 3D at Test Time | Yuheng Yuan et al. | 2506.13750 | translate | read | link |
| 2025-06-16 | Multiview Geometric Regularization of Gaussian Splatting for Accurate Radiance Fields | Jungeon Kim et al. | 2506.13508 | translate | read | null |
| 2025-06-17 | Self-Supervised Enhancement for Depth from a Lightweight ToF Sensor with Monocular Images | Laiyan Ding et al. | 2506.13444 | translate | read | null |
| 2025-06-16 | TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast | Beilei Cui et al. | 2506.13387 | translate | read | link |
| 2025-06-17 | 3D Hand Mesh-Guided AI-Generated Malformed Hand Refinement with Hand Pose Transformation via Diffusion Model | Chen-Bin Feng et al. | 2506.12680 | translate | read | null |
| 2025-06-12 | Leveraging 6DoF Pose Foundation Models For Mapping Marine Sediment Burial | Jerry Yan et al. | 2506.10386 | translate | read | link |
| 2025-06-11 | DCIRNet: Depth Completion with Iterative Refinement for Dexterous Grasping of Transparent and Reflective Objects | Guanghu Xie et al. | 2506.09491 | translate | read | null |
| 2025-06-11 | MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning | Tong Wang et al. | 2506.09327 | translate | read | null |
| 2025-06-10 | AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models | Zheda Mai et al. | 2506.09082 | translate | read | null |
| 2025-06-10 | One Patch to Rule Them All: Transforming Static Patches into Dynamic Attacks in the Physical World | Xingshuo Han et al. | 2506.08482 | translate | read | null |
| 2025-06-09 | Jamais Vu: Exposing the Generalization Gap in Supervised Semantic Correspondence | Octave Mariotti et al. | 2506.08220 | translate | read | null |
| 2025-06-09 | Hidden in plain sight: VLMs overlook their visual representations | Stephanie Fu et al. | 2506.08008 | translate | read | null |
| 2025-06-09 | EgoM2P: Egocentric Multimodal Multitask Pretraining | Gen Li et al. | 2506.07886 | translate | read | link |
| 2025-06-09 | Flow-Anything: Learning Real-World Optical Flow Estimation from Large-Scale Single-view Images | Yingping Liang et al. | 2506.07740 | translate | read | null |
| 2025-06-07 | Dark Channel-Assisted Depth-from-Defocus from a Single Image | Moushumi Medhi et al. | 2506.06643 | translate | read | null |
| 2025-06-06 | NTIRE 2025 Challenge on HR Depth from Images of Specular and Transparent Surfaces | Pierluigi Zama Ramirez et al. | 2506.05815 | translate | read | null |
| 2025-06-06 | Advancement and Field Evaluation of a Dual-arm Apple Harvesting Robot | Keyi Zhu et al. | 2506.05714 | translate | read | null |
| 2025-06-06 | Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration | Fanhu Zeng et al. | 2506.05709 | translate | read | null |
| 2025-06-06 | Aerial Multi-View Stereo via Adaptive Depth Range Inference and Normal Cues | Yimei Liu et al. | 2506.05655 | translate | read | null |
| 2025-06-03 | Attacking Attention of Foundation Models Disrupts Downstream Tasks | Hondamunige Prasanna Silva et al. | 2506.05394 | translate | read | null |
| 2025-06-09 | Structure-Aware Radar-Camera Depth Estimation | Fuyi Zhang et al. | 2506.05008 | translate | read | null |
| 2025-06-05 | Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer | Filip Slezak et al. | 2506.04908 | translate | read | null |
| 2025-06-05 | Toward Better SSIM Loss for Unsupervised Monocular Depth Estimation | Yijun Cao et al. | 2506.04758 | translate | read | null |
| 2025-06-04 | JointSplat: Probabilistic Joint Flow-Depth Optimization for Sparse-View Gaussian Splatting | Yang Xiao et al. | 2506.03872 | translate | read | null |
| 2025-06-03 | ViT-Split: Unleashing the Power of Vision Foundation Models via Efficient Splitting Heads | Yifan Li et al. | 2506.03433 | translate | read | null |
| 2025-06-02 | E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models | Wenyan Cong et al. | 2506.01933 | translate | read | null |
| 2025-06-01 | Perceptual Inductive Bias Is What You Need Before Contrastive Learning | Tianqin Li et al. | 2506.01201 | translate | read | null |
(<a href="../Depth_Estimation.md">back to Depth Estimation</a>)