Semantic Segmentation - 2025-06

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-06-30	SurgiSR4K: A High-Resolution Endoscopic Video Dataset for Robotic-Assisted Minimally Invasive Procedures	Fengyi Jiang et.al.	2507.00209	translate	read	null
2025-06-30	Controllable Reference-Based Real-World Remote Sensing Image Super-Resolution with Generative Diffusion Priors	Ce Wang et.al.	2506.23801	translate	read	null
2025-06-30	Deep Learning-Based Semantic Segmentation for Real-Time Kidney Imaging and Measurements with Augmented Reality-Assisted Ultrasound	Gijs Luijten et.al.	2506.23721	translate	read	null
2025-06-30	PGOV3D: Open-Vocabulary 3D Semantic Segmentation with Partial-to-Global Curriculum	Shiqi Zhang et.al.	2506.23607	translate	read	null
2025-06-30	Interactive Interface For Semantic Segmentation Dataset Synthesis	Ngoc-Do Tran et.al.	2506.23470	translate	read	null
2025-06-30	Contrastive Learning with Diffusion Features for Weakly Supervised Medical Image Segmentation	Dewen Zeng et.al.	2506.23460	translate	read	null
2025-06-29	Layer Decomposition and Morphological Reconstruction for Task-Oriented Infrared Image Enhancement	Siyuan Chai et.al.	2506.23353	translate	read	null
2025-06-29	FastSeg: Efficient Training-Free Open-Vocabulary Segmentation via Hierarchical Attention Refinement Method	Quang-Huy Che et.al.	2506.23323	translate	read	null
2025-06-29	BPD-Neo: An MRI Dataset for Lung-Trachea Segmentation with Clinical Data for Neonatal Bronchopulmonary Dysplasia	Rachit Saluja et.al.	2506.23305	translate	read	null
2025-06-29	High-quality Pseudo-labeling for Point Cloud Segmentation with Scene-level Annotation	Lunhao Duan et.al.	2506.23227	translate	read	null
2025-06-29	DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation	Jihun Kim et.al.	2506.23104	translate	read	null
2025-06-27	Partial CLIP is Enough: Chimera-Seg for Zero-shot Semantic Segmentation	Jialei Chen et.al.	2506.22032	translate	read	null
2025-06-27	TASeg: Text-aware RGB-T Semantic Segmentation based on Fine-tuning Vision Foundation Models	Meng Yu et.al.	2506.21975	translate	read	null
2025-06-27	SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images	Naftaly Wambugu et.al.	2506.21945	translate	read	null
2025-06-26	Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object Detection	Tobias J. Riedlinger et.al.	2506.21486	translate	read	null
2025-06-26	PanSt3R: Multi-view Consistent Panoptic Segmentation	Lojze Zust et.al.	2506.21348	translate	read	null
2025-06-26	HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation	Diego Biagini et.al.	2506.21287	translate	read	null
2025-06-27	ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation	Xiwei Xuan et.al.	2506.21233	translate	read	null
2025-06-26	Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4	Jongyeon Park et.al.	2506.21174	translate	read	null
2025-06-27	DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation	Wenzhou Lyu et.al.	2506.21034	translate	read	null
2025-06-26	TSDASeg: A Two-Stage Model with Direct Alignment for Interactive Point Cloud Segmentation	Chade Li et.al.	2506.20991	translate	read	null
2025-06-26	Segment Anything in Pathology Images with Natural Language	Zhixuan Chen et.al.	2506.20988	translate	read	null
2025-06-25	A Deep Learning Approach to Identify Rock Bolts in Complex 3D Point Clouds of Underground Mines Captured Using Mobile Laser Scanners	Dibyayan Patra et.al.	2506.20464	translate	read	null
2025-06-26	Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition	Man Duc Chuc et.al.	2506.20174	translate	read	null
2025-06-24	A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects	Shulan Ruan et.al.	2506.19769	translate	read	null
2025-06-24	USIS16K: High-Quality Dataset for Underwater Salient Instance Segmentation	Lin Hong et.al.	2506.19472	translate	read	null
2025-06-24	A Global-Local Cross-Attention Network for Ultra-high Resolution Remote Sensing Image Semantic Segmentation	Chen Yi et.al.	2506.19406	translate	read	null
2025-06-25	AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation	Ziyan Zhao et.al.	2506.19269	translate	read	null
2025-06-23	Orthogonal Projection Subspace to Aggregate Online Prior-knowledge for Continual Test-time Adaptation	Jinlong Li et.al.	2506.19022	translate	read	null
2025-06-23	Multi-Scale Spectral Attention Module-based Hyperspectral Segmentation in Autonomous Driving Scenarios	Imad Ali Shah et.al.	2506.18682	translate	read	null
2025-06-23	SafeClick: Error-Tolerant Interactive Segmentation of Any Medical Volumes via Hierarchical Expert Consensus	Yifan Gao et.al.	2506.18404	translate	read	null
2025-06-23	Jet Reconstruction with Mamba Networks in Collider Events	Jinmian Li et.al.	2506.18336	translate	read	null
2025-06-22	OSDMamba: Enhancing Oil Spill Detection from Remote Sensing Images Using Selective State Space Model	Shuaiyu Chen et.al.	2506.18006	translate	read	null
2025-06-22	Relation3D: Enhancing Relation Modeling for Point Cloud Instance Segmentation	Jiahao Lu et.al.	2506.17891	translate	read	null
2025-06-22	Cross-modal State Space Modeling for Real-time RGB-thermal Wild Scene Semantic Segmentation	Xiaodong Guo et.al.	2506.17869	translate	read	null
2025-06-20	Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation	Qing Xu et.al.	2506.17159	translate	read	link
2025-06-20	ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds	Binbin Xiang et.al.	2506.16991	translate	read	link
2025-06-20	LunarLoc: Segment-Based Global Localization on the Moon	Annika Thomas et.al.	2506.16940	translate	read	link
2025-06-19	From Semantic To Instance: A Semi-Self-Supervised Learning Approach	Keyhan Najafian et.al.	2506.16563	translate	read	null
2025-06-19	Structured Semantic 3D Reconstruction (S23DR) Challenge 2025 – Winning solution	Jan Skvrna et.al.	2506.16421	translate	read	null
2025-06-19	LBMamba: Locally Bi-directional Mamba	Jingwei Zhang et.al.	2506.15976	translate	read	link
2025-06-19	Heterogeneous-Modal Unsupervised Domain Adaptation via Latent Space Bridging	Jiawen Yang et.al.	2506.15971	translate	read	null
2025-06-19	Polyline Path Masked Attention for Vision Transformer	Zhongchen Zhao et.al.	2506.15940	translate	read	link
2025-06-18	MapFM: Foundation Model-Driven HD Mapping with Multi-Task Contextual Learning	Leonid Ivanov et.al.	2506.15313	translate	read	link
2025-06-18	Enhancing point cloud analysis via neighbor aggregation correction based on cross-stage structure correlation	Jiaqi Shi et.al.	2506.15160	translate	read	link
2025-06-17	Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset	Nikolaos Dionelis et.al.	2506.14765	translate	read	null
2025-06-17	FocalClick-XL: Towards Unified and High-quality Interactive Segmentation	Xi Chen et.al.	2506.14686	translate	read	null
2025-06-17	VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy	Zhuoyue Tan et.al.	2506.14525	translate	read	null
2025-06-17	DepthSeg: Depth prompting in remote sensing semantic segmentation	Ning Zhou et.al.	2506.14382	translate	read	null
2025-06-17	Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment	Weiming Zhang et.al.	2506.14271	translate	read	null
2025-06-16	HierVL: Semi-Supervised Segmentation leveraging Hierarchical Vision-Language Synergy with Dynamic Text-Spatial Query Alignment	Numair Nadeem et.al.	2506.13925	translate	read	null
2025-06-16	A Comprehensive Survey on Video Scene Parsing: Advances, Challenges, and Prospects	Guohuan Xie et.al.	2506.13552	translate	read	null
2025-06-16	Open-Set LiDAR Panoptic Segmentation Guided by Uncertainty-Aware Learning	Rohit Mohan et.al.	2506.13265	translate	read	null
2025-06-16	ViewPCL: a point cloud based active learning method for multi-view segmentation	Christian Hilaire et.al.	2506.13043	translate	read	null
2025-06-15	A large-scale, physically-based synthetic dataset for satellite pose estimation	Szabolcs Velkei et.al.	2506.12782	translate	read	null
2025-06-15	Unleashing Diffusion and State Space Models for Medical Image Segmentation	Rong Wu et.al.	2506.12747	translate	read	null
2025-06-15	Combining Self-attention and Dilation Convolutional for Semantic Segmentation of Coal Maceral Groups	Zhenghao Xi et.al.	2506.12712	translate	read	null
2025-06-13	O2Former: Direction-Aware and Multi-Scale Query Enhancement for SAR Ship Instance Segmentation	F. Gao et.al.	2506.11913	translate	read	null
2025-06-13	Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling	Yunhan Ren et.al.	2506.11661	translate	read	null
2025-06-13	A$^2$LC: Active and Automated Label Correction for Semantic Segmentation	Youjin Jeon et.al.	2506.11599	translate	read	null
2025-06-13	OV-MAP: Open-Vocabulary Zero-Shot 3D Instance Segmentation Map for Robots	Juno Kim et.al.	2506.11585	translate	read	null
2025-06-12	GynSurg: A Comprehensive Gynecology Laparoscopic Surgery Dataset	Sahar Nasirihaghighi et.al.	2506.11356	translate	read	null
2025-06-12	Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes	Masahiro Yasuda et.al.	2506.10676	translate	read	link
2025-06-12	Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models	Francisco Caetano et.al.	2506.10634	translate	read	link
2025-06-12	Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration	Jun Wang et.al.	2506.10573	translate	read	null
2025-06-12	ALBERT: Advanced Localization and Bidirectional Encoder Representations from Transformers for Automotive Damage Evaluation	Teerapong Panboonyuen et.al.	2506.10524	translate	read	null
2025-06-12	Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation	Shuyang Li et.al.	2506.10503	translate	read	null
2025-06-12	Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success	Che Wang et.al.	2506.10359	translate	read	null
2025-06-11	Deep Semantic Segmentation for Multi-Source Localization Using Angle of Arrival Measurements	Mustafa Atahan Nuhoglu et.al.	2506.10107	translate	read	null
2025-06-11	Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation	Siyu Chen et.al.	2506.09881	translate	read	link
2025-06-11	Accurate and efficient zero-shot 6D pose estimation with frozen foundation models	Andrea Caraffa et.al.	2506.09784	translate	read	null
2025-06-11	The Four Color Theorem for Cell Instance Segmentation	Ye Zhang et.al.	2506.09724	translate	read	link
2025-06-11	Enhancing Human-Robot Collaboration: A Sim2Real Domain Adaptation Algorithm for Point Cloud Segmentation in Industrial Environments	Fatemeh Mohammadi Amin et.al.	2506.09552	translate	read	null
2025-06-12	Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20$^{th}$ century Urban Landscapes with Satellite Imageries	Tianxiang Hao et.al.	2506.09476	translate	read	null
2025-06-11	MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning	Tong Wang et.al.	2506.09327	translate	read	null
2025-06-10	WetCat: Automating Skill Assessment in Wetlab Cataract Surgery Videos	Negin Ghamsarian et.al.	2506.08896	translate	read	null
2025-06-11	RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation	Jiayi Song et.al.	2506.08772	translate	read	null
2025-06-10	ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction	Juan Yeo et.al.	2506.08678	translate	read	null
2025-06-10	ECMNet: Lightweight Semantic Segmentation with Efficient CNN-Mamba Network	Feixiang Du et.al.	2506.08629	translate	read	null
2025-06-09	LogoSP: Local-global Grouping of Superpoints for Unsupervised Semantic Segmentation of 3D Point Clouds	Zihui Zhang et.al.	2506.07857	translate	read	null
2025-06-09	SAM2Auto: Auto Annotation Using FLASH	Arash Rocky et.al.	2506.07850	translate	read	null
2025-06-09	F2Net: A Frequency-Fused Network for Ultra-High Resolution Remote Sensing Segmentation	Hengzhi Chen et.al.	2506.07847	translate	read	null
2025-06-09	Trend-Aware Fashion Recommendation with Visual Segmentation and Semantic Similarity	Mohamed Djilani et.al.	2506.07773	translate	read	null
2025-06-09	OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting	Jens Piekenbrinck et.al.	2506.07697	translate	read	null
2025-06-09	Adapter Naturally Serves as Decoupler for Cross-Domain Few-Shot Semantic Segmentation	Jintao Tong et.al.	2506.07376	translate	read	null
2025-06-09	Multiple Object Stitching for Unsupervised Representation Learning	Chengchao Shen et.al.	2506.07364	translate	read	link
2025-06-08	BRIGHT+: Upgrading the BRIGHT Benchmark with MARCUS, a Multi-Agent RAG Clean-Up Suite	Liyang Chen et.al.	2506.07116	translate	read	null
2025-06-08	Technical Report for ICRA 2025 GOOSE 3D Semantic Segmentation Challenge: Adaptive Point Cloud Understanding for Heterogeneous Robotic Systems	Xiaoya Zhang et.al.	2506.06995	translate	read	null
2025-06-07	Position Prediction Self-Supervised Learning for Multimodal Satellite Imagery Semantic Segmentation	John Waithaka et.al.	2506.06852	translate	read	null
2025-06-06	Rethinking Semi-supervised Segmentation Beyond Accuracy: Reliability and Robustness	Steven Landgraf et.al.	2506.05917	translate	read	null
2025-06-06	You Only Estimate Once: Unified, One-stage, Real-Time Category-level Articulated Object 6D Pose Estimation for Robotic Grasping	Jingshun Huang et.al.	2506.05719	translate	read	null
2025-06-05	FRAME: Pre-Training Video Feature Representations via Anticipation and Memory	Sethuraman TV et.al.	2506.05543	translate	read	null
2025-06-05	U-NetMN and SegNetMN: Modified U-Net and SegNet models for bimodal SAR image segmentation	Marwane Kzadri et.al.	2506.05444	translate	read	null
2025-06-05	Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting	Alfred T. Christiansen et.al.	2506.05009	translate	read	null
2025-06-05	Bringing SAM to new heights: Leveraging elevation data for tree crown segmentation from drone imagery	Mélisande Teng et.al.	2506.04970	translate	read	null
2025-06-05	CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx	Lukas Picek et.al.	2506.04931	translate	read	null
2025-06-05	OpenMaskDINO3D: Reasoning 3D Segmentation via Large Language Model	Kunshen Zhang et.al.	2506.04837	translate	read	null
2025-06-05	Gen-n-Val: Agentic Image Data Generation and Validation	Jing-En Huang et.al.	2506.04676	translate	read	null
2025-06-04	You Only Train Once	Christos Sakaridis et.al.	2506.04349	translate	read	null
2025-06-04	AetherVision-Bench: An Open-Vocabulary RGB-Infrared Benchmark for Multi-Angle Segmentation across Aerial and Ground Perspectives	Aniruddh Sikdar et.al.	2506.03709	translate	read	null
2025-06-04	OV-COAST: Cost Aggregation with Optimal Transport for Open-Vocabulary Semantic Segmentation	Aditya Gandhamal et.al.	2506.03706	translate	read	null
2025-06-04	BiXFormer: A Robust Framework for Maximizing Modality Effectiveness in Multi-Modal Semantic Segmentation	Jialei Chen et.al.	2506.03675	translate	read	null
2025-06-03	Cross-Modal Urban Sensing: Evaluating Sound-Vision Alignment Across Street-Level and Aerial Imagery	Pengyu Chen et.al.	2506.03388	translate	read	null
2025-06-03	Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding	Weiqing Xiao et.al.	2506.03134	translate	read	link
2025-06-03	GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal	Shufan Qing et.al.	2506.02736	translate	read	link
2025-06-03	Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather	Longyu Yang et.al.	2506.02396	translate	read	null
2025-06-04	SAB3R: Semantic-Augmented Backbone in 3D Reconstruction	Xuweiyi Chen et.al.	2506.02112	translate	read	null
2025-06-02	SEMNAV: A Semantic Segmentation-Driven Approach to Visual Semantic Navigation	Rafael Flor-Rodríguez et.al.	2506.01418	translate	read	null
2025-06-01	Perceptual Inductive Bias Is What You Need Before Contrastive Learning	Tianqin Li et.al.	2506.01201	translate	read	null
2025-06-01	GeoChain: Multimodal Chain-of-Thought for Geographic Reasoning	Sahiti Yerramilli et.al.	2506.00785	translate	read	null
2025-06-02	NUC-Net: Non-uniform Cylindrical Partition Network for Efficient LiDAR Semantic Segmentation	Xuzhi Wang et.al.	2505.24634	translate	read	null

(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)