Scene Understanding - 2024-03 | Paper Arxiv Daily

Publish Date	Title	Authors	PDF	Code
2024-03-31	Adapting to Length Shift: FlexiLength Network for Trajectory Prediction	Yi Xu et al.	2404.00742	null
2024-03-31	Neural Radiance Field-based Visual Rendering: A Comprehensive Review	Mingyuan Yao et al.	2404.00714	null
2024-03-29	VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection	Zihua Liu et al.	2404.00149	null
2024-03-29	HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes	Ke Wu et al.	2403.20159	null
2024-03-27	Object Pose Estimation via the Aggregation of Diffusion Features	Tianfu Wang et al.	2403.18791	link
2024-03-25	Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding	Lingdong Kong et al.	2403.17010	link
2024-03-25	Towards Trustworthy Automated Driving through Qualitative Scene Understanding and Explanations	Nassim Belmecheri et al.	2403.16908	null
2024-03-25	DOCTR: Disentangled Object-Centric Transformer for Point Scene Understanding	Xiaoxuan Yu et al.	2403.16431	link
2024-03-24	AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D Scans	Cedric Perauer et al.	2403.16318	null
2024-03-24	Improving Scene Graph Generation with Relation Words’ Debiasing in Vision-Language Models	Yuxuan Wang et al.	2403.16184	null
2024-03-24	Multi-Task Learning with Multi-Task Optimization	Lu Bai et al.	2403.16162	null
2024-03-24	Semantic Is Enough: Only Semantic Information For NeRF Reconstruction	Ruibo Wang et al.	2403.16043	null
2024-03-22	Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting	Jun Guo et al.	2403.15624	null
2024-03-22	DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data	Hanrong Ye et al.	2403.15389	null
2024-03-21	DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation	Zeeshan Hayder et al.	2403.14886	null
2024-03-21	Evaluating Panoramic 3D Estimation in Indoor Lighting Analysis	Zining Cheng et al.	2403.14836	null
2024-03-21	SurroundSDF: Implicit 3D Scene Understanding Based on Signed Distance Field	Lizhe Liu et al.	2403.14366	null
2024-03-21	Exosense: A Vision-Centric Scene Understanding System For Safe Exoskeleton Navigation	Jianeng Wang et al.	2403.14320	null
2024-03-21	Volumetric Environment Representation for Vision-Language Navigation	Rui Liu et al.	2403.14158	null
2024-03-21	3D Object Detection from Point Cloud via Voting Step Diffusion	Haoran Hou et al.	2403.14133	null
2024-03-20	Efficient scene text image super-resolution with semantic guidance	LeoWu TomyEnrique et al.	2403.13330	link
2024-03-19	SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model	Armen Avetisyan et al.	2403.13064	null
2024-03-19	HUGS: Holistic Urban 3D Scene Understanding via Gaussian Splatting	Hongyu Zhou et al.	2403.12722	null
2024-03-19	M2DA: Multi-Modal Fusion Transformer Incorporating Driver Attention for Autonomous Driving	Dongyang Xu et al.	2403.12552	null
2024-03-19	Multi-Object RANSAC: Efficient Plane Clustering Method in a Clutter	Seunghyeon Lim et al.	2403.12449	null
2024-03-19	Geometric Constraints in Deep Learning Frameworks: A Survey	Vibhas K Vats et al.	2403.12431	null
2024-03-18	R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding	Qirui Wu et al.	2403.12301	null
2024-03-18	HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation	Ce Zhang et al.	2403.12033	link
2024-03-18	Agent3D-Zero: An Agent for Zero-shot 3D Understanding	Sha Zhang et al.	2403.11835	null
2024-03-18	OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation	Haochen Jiang et al.	2403.11796	null
2024-03-19	Urban Scene Diffusion through Semantic Occupancy Map	Junge Zhang et al.	2403.11697	null
2024-03-18	Hierarchical Spatial Proximity Reasoning for Vision-and-Language Navigation	Ming Xu et al.	2403.11541	link
2024-03-18	Beyond Uncertainty: Risk-Aware Active View Acquisition for Safe Robot Navigation and 3D Scene Understanding with FisherRF	Guangyi Liu et al.	2403.11396	null
2024-03-17	Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications	Yonggan Fu et al.	2403.11131	link
2024-03-16	N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields	Yash Bhalgat et al.	2403.10997	null
2024-03-16	Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation	Mariia Khan et al.	2403.10780	null
2024-03-15	Robust Shape Fitting for 3D Scene Abstraction	Florian Kluger et al.	2403.10452	link
2024-03-15	Do Visual-Language Maps Capture Latent Semantics?	Matti Pekkanen et al.	2403.10117	null
2024-03-15	Enhancing Human-Centered Dynamic Scene Understanding via Multiple LLMs Collaborated Reasoning	Hang Zhang et al.	2403.10107	null
2024-03-14	GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding	Chengyao Wang et al.	2403.09639	link
2024-03-12	IndicSTR12: A Dataset for Indic Scene Text Recognition	Harsh Lunia et al.	2403.08007	null
2024-03-12	Efficient Global Navigational Planning in 3D Structures based on Point Cloud Tomography	Bowen Yang et al.	2403.07631	link
2024-03-12	Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss	Xuhua Ren et al.	2403.07518	null
2024-03-12	MoAI: Mixture of All Intelligence for Large Language and Vision Models	Byung-Kwan Lee et al.	2403.07508	link
2024-03-11	Mapping High-level Semantic Regions in Indoor Environments without Object Recognition	Roberto Bigazzi et al.	2403.07076	null
2024-03-11	Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot Domain Transfer	Siddhant Satyanaik et al.	2403.06953	null
2024-03-08	Stealing Stable Diffusion Prior for Robust Monocular Depth Estimation	Yifan Mao et al.	2403.05056	link
2024-03-07	Towards Scene Graph Anticipation	Rohith Peddi et al.	2403.04899	null
2024-03-07	Embodied Understanding of Driving Scenarios	Yunsong Zhou et al.	2403.04593	link
2024-03-07	Out of the Room: Generalizing Event-Based Dynamic Motion Segmentation for Complex Scenes	Stamatios Georgoulis et al.	2403.04562	null
2024-03-06	GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding	Zi-Ting Chou et al.	2403.03608	null
2024-03-05	OORD: The Oxford Offroad Radar Dataset	Matthew Gadd et al.	2403.02845	link
2024-03-05	HUNTER: Unsupervised Human-centric 3D Detection via Transferring Knowledge from Synthetic Instances to Real Scenes	Yichen Yao et al.	2403.02769	null
