Depth Estimation - 2024-12 | Paper Arxiv Daily

Depth Estimation - 2024-12

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-12-31	Tech Report: Divide and Conquer 3D Real-Time Reconstruction for Improved IGS	Yicheng Zhu et.al.	2501.01465	translate	read	null
2024-12-30	FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI	Zhengdong Li et.al.	2412.20974	translate	read	null
2024-12-29	MetricDepth: Enhancing Monocular Depth Estimation with Deep Metric Learning	Chunpu Liu et.al.	2412.20390	translate	read	null
2024-12-28	Multi-Modality Driven LoRA for Adverse Condition Depth Estimation	Guanglei Yang et.al.	2412.20162	translate	read	null
2024-12-28	DepthMamba with Adaptive Fusion	Zelin Meng et.al.	2412.19964	translate	read	null
2024-12-26	An End-to-End Depth-Based Pipeline for Selfie Image Rectification	Ahmed Alhawwary et.al.	2412.19189	translate	read	null
2024-12-26	Revisiting Monocular 3D Object Detection from Scene-Level Depth Retargeting to Instance-Level Spatial Refinement	Qiude Zhang et.al.	2412.19165	translate	read	null
2024-12-26	MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo	Byeonggwon Lee et.al.	2412.19130	translate	read	null
2024-12-26	Learning Monocular Depth from Events via Egomotion Compensation	Haitao Meng et.al.	2412.19067	translate	read	null
2024-12-24	RSGaussian:3D Gaussian Splatting with LiDAR for Aerial Remote Sensing Novel View Synthesis	Yiling Yao et.al.	2412.18380	translate	read	null
2024-12-27	LiRCDepth: Lightweight Radar-Camera Depth Estimation via Knowledge Distillation and Uncertainty Guidance	Huawei Sun et.al.	2412.16380	translate	read	link
2024-12-19	Flowing from Words to Pixels: A Framework for Cross-Modality Evolution	Qihao Liu et.al.	2412.15213	translate	read	null
2024-12-19	Scaling 4D Representations	João Carreira et.al.	2412.15212	translate	read	null
2024-12-18	Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation	Rémi Marsal et.al.	2412.14103	translate	read	null
2024-12-18	Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation	Haotong Lin et.al.	2412.14015	translate	read	null
2024-12-18	Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion	Massimiliano Viola et.al.	2412.13389	translate	read	null
2024-12-18	Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera	Zhengdi Yu et.al.	2412.12861	translate	read	null
2024-12-17	PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts	Kun Guo et.al.	2412.12460	translate	read	null
2024-12-16	V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations	Jin-Cheng Jhang et.al.	2412.11412	translate	read	null
2024-12-16	Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video	Junkai Fan et.al.	2412.11395	translate	read	null
2024-12-15	ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction	Yi Feng et.al.	2412.11210	translate	read	link
2024-12-14	MAL: Cluster-Masked and Multi-Task Pretraining for Enhanced xLSTM Vision Performance	Wenjun Huang et.al.	2412.10730	translate	read	null
2024-12-12	Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos	Linyi Jin et.al.	2412.09621	translate	read	null
2024-12-12	T-SVG: Text-Driven Stereoscopic Video Generation	Qiao Jin et.al.	2412.09323	translate	read	null
2024-12-12	Cross-View Completion Models are Zero-shot Correspondence Estimators	Honggyu An et.al.	2412.09072	translate	read	null
2024-12-11	BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation	Shengze Wang et.al.	2412.08640	translate	read	null
2024-12-13	Utilizing Multi-step Loss for Single Image Reflection Removal	Abdelrahman Elnenaey et.al.	2412.08582	translate	read	link
2024-12-11	Dense Depth from Event Focal Stack	Kenta Horikawa et.al.	2412.08120	translate	read	null
2024-12-10	Diffusion-Based Attention Warping for Consistent 3D Scene Editing	Eyal Gomel et.al.	2412.07984	translate	read	null
2024-12-10	Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation	Kurt H. W. Stolle et.al.	2412.07966	translate	read	null
2024-12-09	SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception	Yaniv Benny et.al.	2412.06968	translate	read	null
2024-12-09	Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving	Xin Fei et.al.	2412.06777	translate	read	link
2024-12-09	MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views	Antoine Guédon et.al.	2412.06767	translate	read	null
2024-12-09	On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events	Jesse Hagenaars et.al.	2412.06359	translate	read	null
2024-12-09	Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction	Dongxu Wei et.al.	2412.06273	translate	read	null
2024-12-09	Event fields: Capturing light fields at high speed, resolution, and dynamic range	Ziyuan Qu et.al.	2412.06191	translate	read	null
2024-12-08	GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion	Karlo Koledic et.al.	2412.06080	translate	read	null
2024-12-08	Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors	Alex Rich et.al.	2412.05771	translate	read	null
2024-12-10	TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action	Zixian Ma et.al.	2412.05479	translate	read	null
2024-12-06	SimC3D: A Simple Contrastive 3D Pretraining Framework Using RGB Images	Jiahua Dong et.al.	2412.05274	translate	read	null
2024-12-06	Penetrative rotating magnetoconvection subject to lateral variations in temperature gradients	Tirtharaj Barman et.al.	2412.05235	translate	read	null
2024-12-06	PanoDreamer: 3D Panorama Synthesis from a Single Image	Avinash Paliwal et.al.	2412.04827	translate	read	link
2024-12-05	LAA-Net: A Physical-prior-knowledge Based Network for Robust Nighttime Depth Estimation	Kebin Peng et.al.	2412.04666	translate	read	null
2024-12-05	MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos	Zhengqi Li et.al.	2412.04463	translate	read	null
2024-12-05	MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction	Mithun Parab et.al.	2412.03928	translate	read	null
2024-12-04	Perception Tokens Enhance Visual Reasoning in Multimodal Language Models	Mahtab Bigverdi et.al.	2412.03548	translate	read	null
2024-12-04	Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter	Hermes McGriff et.al.	2412.03518	translate	read	null
2024-12-04	MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction	Gangjian Zhang et.al.	2412.03103	translate	read	null
2024-12-05	Align3R: Aligned Monocular Depth Estimation for Dynamic Videos	Jiahao Lu et.al.	2412.03079	translate	read	null
2024-12-03	Single-Shot Metric Depth from Focused Plenoptic Cameras	Blanca Lasheras-Hernandez et.al.	2412.02386	translate	read	null
2024-12-03	Dual Exposure Stereo for Extended Dynamic Range 3D Imaging	Juhyung Choi et.al.	2412.02351	translate	read	null
2024-12-03	Amodal Depth Anything: Amodal Depth Estimation in the Wild	Zhenyu Li et.al.	2412.02336	translate	read	null
2024-12-03	GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos	Zhiyuan Chen et.al.	2412.02267	translate	read	null
2024-12-03	FoveaSPAD: Exploiting Depth Priors for Adaptive and Efficient Single-Photon 3D Imaging	Justin Folden et.al.	2412.02052	translate	read	null
2024-12-02	Mutli-View 3D Reconstruction using Knowledge Distillation	Aditya Dutt et.al.	2412.02039	translate	read	link
2024-12-02	AVS-Net: Audio-Visual Scale Net for Self-supervised Monocular Metric Depth Estimation	Xiaohu Liu et.al.	2412.01637	translate	read	null
2024-12-02	STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation	Sunghun Yang et.al.	2412.01090	translate	read	null
2024-12-01	FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation	Yunpeng Bai et.al.	2412.00671	translate	read	null
