Semantic Segmentation - 2025-06

Publish Date Title Authors PDF Translate Read Code
2025-06-30 SurgiSR4K: A High-Resolution Endoscopic Video Dataset for Robotic-Assisted Minimally Invasive Procedures Fengyi Jiang et.al. 2507.00209 translate read null
2025-06-30 Controllable Reference-Based Real-World Remote Sensing Image Super-Resolution with Generative Diffusion Priors Ce Wang et.al. 2506.23801 translate read null
2025-06-30 Deep Learning-Based Semantic Segmentation for Real-Time Kidney Imaging and Measurements with Augmented Reality-Assisted Ultrasound Gijs Luijten et.al. 2506.23721 translate read null
2025-06-30 PGOV3D: Open-Vocabulary 3D Semantic Segmentation with Partial-to-Global Curriculum Shiqi Zhang et.al. 2506.23607 translate read null
2025-06-30 Interactive Interface For Semantic Segmentation Dataset Synthesis Ngoc-Do Tran et.al. 2506.23470 translate read null
2025-06-30 Contrastive Learning with Diffusion Features for Weakly Supervised Medical Image Segmentation Dewen Zeng et.al. 2506.23460 translate read null
2025-06-29 Layer Decomposition and Morphological Reconstruction for Task-Oriented Infrared Image Enhancement Siyuan Chai et.al. 2506.23353 translate read null
2025-06-29 FastSeg: Efficient Training-Free Open-Vocabulary Segmentation via Hierarchical Attention Refinement Method Quang-Huy Che et.al. 2506.23323 translate read null
2025-06-29 BPD-Neo: An MRI Dataset for Lung-Trachea Segmentation with Clinical Data for Neonatal Bronchopulmonary Dysplasia Rachit Saluja et.al. 2506.23305 translate read null
2025-06-29 High-quality Pseudo-labeling for Point Cloud Segmentation with Scene-level Annotation Lunhao Duan et.al. 2506.23227 translate read null
2025-06-29 DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation Jihun Kim et.al. 2506.23104 translate read null
2025-06-27 Partial CLIP is Enough: Chimera-Seg for Zero-shot Semantic Segmentation Jialei Chen et.al. 2506.22032 translate read null
2025-06-27 TASeg: Text-aware RGB-T Semantic Segmentation based on Fine-tuning Vision Foundation Models Meng Yu et.al. 2506.21975 translate read null
2025-06-27 SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images Naftaly Wambugu et.al. 2506.21945 translate read null
2025-06-26 Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object Detection Tobias J. Riedlinger et.al. 2506.21486 translate read null
2025-06-26 PanSt3R: Multi-view Consistent Panoptic Segmentation Lojze Zust et.al. 2506.21348 translate read null
2025-06-26 HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation Diego Biagini et.al. 2506.21287 translate read null
2025-06-27 ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation Xiwei Xuan et.al. 2506.21233 translate read null
2025-06-26 Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4 Jongyeon Park et.al. 2506.21174 translate read null
2025-06-27 DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation Wenzhou Lyu et.al. 2506.21034 translate read null
2025-06-26 TSDASeg: A Two-Stage Model with Direct Alignment for Interactive Point Cloud Segmentation Chade Li et.al. 2506.20991 translate read null
2025-06-26 Segment Anything in Pathology Images with Natural Language Zhixuan Chen et.al. 2506.20988 translate read null
2025-06-25 A Deep Learning Approach to Identify Rock Bolts in Complex 3D Point Clouds of Underground Mines Captured Using Mobile Laser Scanners Dibyayan Patra et.al. 2506.20464 translate read null
2025-06-26 Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition Man Duc Chuc et.al. 2506.20174 translate read null
2025-06-24 A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects Shulan Ruan et.al. 2506.19769 translate read null
2025-06-24 USIS16K: High-Quality Dataset for Underwater Salient Instance Segmentation Lin Hong et.al. 2506.19472 translate read null
2025-06-24 A Global-Local Cross-Attention Network for Ultra-high Resolution Remote Sensing Image Semantic Segmentation Chen Yi et.al. 2506.19406 translate read null
2025-06-25 AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation Ziyan Zhao et.al. 2506.19269 translate read null
2025-06-23 Orthogonal Projection Subspace to Aggregate Online Prior-knowledge for Continual Test-time Adaptation Jinlong Li et.al. 2506.19022 translate read null
2025-06-23 Multi-Scale Spectral Attention Module-based Hyperspectral Segmentation in Autonomous Driving Scenarios Imad Ali Shah et.al. 2506.18682 translate read null
2025-06-23 SafeClick: Error-Tolerant Interactive Segmentation of Any Medical Volumes via Hierarchical Expert Consensus Yifan Gao et.al. 2506.18404 translate read null
2025-06-23 Jet Reconstruction with Mamba Networks in Collider Events Jinmian Li et.al. 2506.18336 translate read null
2025-06-22 OSDMamba: Enhancing Oil Spill Detection from Remote Sensing Images Using Selective State Space Model Shuaiyu Chen et.al. 2506.18006 translate read null
2025-06-22 Relation3D: Enhancing Relation Modeling for Point Cloud Instance Segmentation Jiahao Lu et.al. 2506.17891 translate read null
2025-06-22 Cross-modal State Space Modeling for Real-time RGB-thermal Wild Scene Semantic Segmentation Xiaodong Guo et.al. 2506.17869 translate read null
2025-06-20 Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation Qing Xu et.al. 2506.17159 translate read link
2025-06-20 ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds Binbin Xiang et.al. 2506.16991 translate read link
2025-06-20 LunarLoc: Segment-Based Global Localization on the Moon Annika Thomas et.al. 2506.16940 translate read link
2025-06-19 From Semantic To Instance: A Semi-Self-Supervised Learning Approach Keyhan Najafian et.al. 2506.16563 translate read null
2025-06-19 Structured Semantic 3D Reconstruction (S23DR) Challenge 2025 – Winning solution Jan Skvrna et.al. 2506.16421 translate read null
2025-06-19 LBMamba: Locally Bi-directional Mamba Jingwei Zhang et.al. 2506.15976 translate read link
2025-06-19 Heterogeneous-Modal Unsupervised Domain Adaptation via Latent Space Bridging Jiawen Yang et.al. 2506.15971 translate read null
2025-06-19 Polyline Path Masked Attention for Vision Transformer Zhongchen Zhao et.al. 2506.15940 translate read link
2025-06-18 MapFM: Foundation Model-Driven HD Mapping with Multi-Task Contextual Learning Leonid Ivanov et.al. 2506.15313 translate read link
2025-06-18 Enhancing point cloud analysis via neighbor aggregation correction based on cross-stage structure correlation Jiaqi Shi et.al. 2506.15160 translate read link
2025-06-17 Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset Nikolaos Dionelis et.al. 2506.14765 translate read null
2025-06-17 FocalClick-XL: Towards Unified and High-quality Interactive Segmentation Xi Chen et.al. 2506.14686 translate read null
2025-06-17 VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy Zhuoyue Tan et.al. 2506.14525 translate read null
2025-06-17 DepthSeg: Depth prompting in remote sensing semantic segmentation Ning Zhou et.al. 2506.14382 translate read null
2025-06-17 Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment Weiming Zhang et.al. 2506.14271 translate read null
2025-06-16 HierVL: Semi-Supervised Segmentation leveraging Hierarchical Vision-Language Synergy with Dynamic Text-Spatial Query Alignment Numair Nadeem et.al. 2506.13925 translate read null
2025-06-16 A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects Guohuan Xie et.al. 2506.13552 translate read null
2025-06-16 Open-Set LiDAR Panoptic Segmentation Guided by Uncertainty-Aware Learning Rohit Mohan et.al. 2506.13265 translate read null
2025-06-16 ViewPCL: a point cloud based active learning method for multi-view segmentation Christian Hilaire et.al. 2506.13043 translate read null
2025-06-15 A large-scale, physically-based synthetic dataset for satellite pose estimation Szabolcs Velkei et.al. 2506.12782 translate read null
2025-06-15 Unleashing Diffusion and State Space Models for Medical Image Segmentation Rong Wu et.al. 2506.12747 translate read null
2025-06-15 Combining Self-attention and Dilation Convolutional for Semantic Segmentation of Coal Maceral Groups Zhenghao Xi et.al. 2506.12712 translate read null
2025-06-13 O2Former:Direction-Aware and Multi-Scale Query Enhancement for SAR Ship Instance Segmentation F. Gao et.al. 2506.11913 translate read null
2025-06-13 Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling Yunhan Ren et.al. 2506.11661 translate read null
2025-06-13 A $^2$ LC: Active and Automated Label Correction for Semantic Segmentation Youjin Jeon et.al. 2506.11599 translate read null
2025-06-13 OV-MAP : Open-Vocabulary Zero-Shot 3D Instance Segmentation Map for Robots Juno Kim et.al. 2506.11585 translate read null
2025-06-12 GynSurg: A Comprehensive Gynecology Laparoscopic Surgery Dataset Sahar Nasirihaghighi et.al. 2506.11356 translate read null
2025-06-12 Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes Masahiro Yasuda et.al. 2506.10676 translate read link
2025-06-12 Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models Francisco Caetano et.al. 2506.10634 translate read link
2025-06-12 Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration Jun Wang et.al. 2506.10573 translate read null
2025-06-12 ALBERT: Advanced Localization and Bidirectional Encoder Representations from Transformers for Automotive Damage Evaluation Teerapong Panboonyuen et.al. 2506.10524 translate read null
2025-06-12 Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation Shuyang Li et.al. 2506.10503 translate read null
2025-06-12 Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success Che Wang et.al. 2506.10359 translate read null
2025-06-11 Deep Semantic Segmentation for Multi-Source Localization Using Angle of Arrival Measurements Mustafa Atahan Nuhoglu et.al. 2506.10107 translate read null
2025-06-11 Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation Siyu Chen et.al. 2506.09881 translate read link
2025-06-11 Accurate and efficient zero-shot 6D pose estimation with frozen foundation models Andrea Caraffa et.al. 2506.09784 translate read null
2025-06-11 The Four Color Theorem for Cell Instance Segmentation Ye Zhang et.al. 2506.09724 translate read link
2025-06-11 Enhancing Human-Robot Collaboration: A Sim2Real Domain Adaptation Algorithm for Point Cloud Segmentation in Industrial Environments Fatemeh Mohammadi Amin et.al. 2506.09552 translate read null
2025-06-12 Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20 $^{th}$ century Urban Landscapes with Satellite Imageries Tianxiang Hao et.al. 2506.09476 translate read null
2025-06-11 MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning Tong Wang et.al. 2506.09327 translate read null
2025-06-10 WetCat: Automating Skill Assessment in Wetlab Cataract Surgery Videos Negin Ghamsarian et.al. 2506.08896 translate read null
2025-06-11 RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation Jiayi Song et.al. 2506.08772 translate read null
2025-06-10 ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction Juan Yeo et.al. 2506.08678 translate read null
2025-06-10 ECMNet:Lightweight Semantic Segmentation with Efficient CNN-Mamba Network Feixiang Du et.al. 2506.08629 translate read null
2025-06-09 LogoSP: Local-global Grouping of Superpoints for Unsupervised Semantic Segmentation of 3D Point Clouds Zihui Zhang et.al. 2506.07857 translate read null
2025-06-09 SAM2Auto: Auto Annotation Using FLASH Arash Rocky et.al. 2506.07850 translate read null
2025-06-09 F2Net: A Frequency-Fused Network for Ultra-High Resolution Remote Sensing Segmentation Hengzhi Chen et.al. 2506.07847 translate read null
2025-06-09 Trend-Aware Fashion Recommendation with Visual Segmentation and Semantic Similarity Mohamed Djilani et.al. 2506.07773 translate read null
2025-06-09 OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting Jens Piekenbrinck et.al. 2506.07697 translate read null
2025-06-09 Adapter Naturally Serves as Decoupler for Cross-Domain Few-Shot Semantic Segmentation Jintao Tong et.al. 2506.07376 translate read null
2025-06-09 Multiple Object Stitching for Unsupervised Representation Learning Chengchao Shen et.al. 2506.07364 translate read link
2025-06-08 BRIGHT+: Upgrading the BRIGHT Benchmark with MARCUS, a Multi-Agent RAG Clean-Up Suite Liyang Chen et.al. 2506.07116 translate read null
2025-06-08 Technical Report for ICRA 2025 GOOSE 3D Semantic Segmentation Challenge: Adaptive Point Cloud Understanding for Heterogeneous Robotic Systems Xiaoya Zhang et.al. 2506.06995 translate read null
2025-06-07 Position Prediction Self-Supervised Learning for Multimodal Satellite Imagery Semantic Segmentation John Waithaka et.al. 2506.06852 translate read null
2025-06-06 Rethinking Semi-supervised Segmentation Beyond Accuracy: Reliability and Robustness Steven Landgraf et.al. 2506.05917 translate read null
2025-06-06 You Only Estimate Once: Unified, One-stage, Real-Time Category-level Articulated Object 6D Pose Estimation for Robotic Grasping Jingshun Huang et.al. 2506.05719 translate read null
2025-06-05 FRAME: Pre-Training Video Feature Representations via Anticipation and Memory Sethuraman TV et.al. 2506.05543 translate read null
2025-06-05 U-NetMN and SegNetMN: Modified U-Net and SegNet models for bimodal SAR image segmentation Marwane Kzadri et.al. 2506.05444 translate read null
2025-06-05 Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting Alfred T. Christiansen et.al. 2506.05009 translate read null
2025-06-05 Bringing SAM to new heights: Leveraging elevation data for tree crown segmentation from drone imagery Mélisande Teng et.al. 2506.04970 translate read null
2025-06-05 CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx Lukas Picek et.al. 2506.04931 translate read null
2025-06-05 OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language Model Kunshen Zhang et.al. 2506.04837 translate read null
2025-06-05 Gen-n-Val: Agentic Image Data Generation and Validation Jing-En Huang et.al. 2506.04676 translate read null
2025-06-04 You Only Train Once Christos Sakaridis et.al. 2506.04349 translate read null
2025-06-04 AetherVision-Bench: An Open-Vocabulary RGB-Infrared Benchmark for Multi-Angle Segmentation across Aerial and Ground Perspectives Aniruddh Sikdar et.al. 2506.03709 translate read null
2025-06-04 OV-COAST: Cost Aggregation with Optimal Transport for Open-Vocabulary Semantic Segmentation Aditya Gandhamal et.al. 2506.03706 translate read null
2025-06-04 BiXFormer: A Robust Framework for Maximizing Modality Effectiveness in Multi-Modal Semantic Segmentation Jialei Chen et.al. 2506.03675 translate read null
2025-06-03 Cross-Modal Urban Sensing: Evaluating Sound-Vision Alignment Across Street-Level and Aerial Imagery Pengyu Chen et.al. 2506.03388 translate read null
2025-06-03 Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding Weiqing Xiao et.al. 2506.03134 translate read link
2025-06-03 GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal Shufan Qing et.al. 2506.02736 translate read link
2025-06-03 Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather Longyu Yang et.al. 2506.02396 translate read null
2025-06-04 SAB3R: Semantic-Augmented Backbone in 3D Reconstruction Xuweiyi Chen et.al. 2506.02112 translate read null
2025-06-02 SEMNAV: A Semantic Segmentation-Driven Approach to Visual Semantic Navigation Rafael Flor-Rodríguez et.al. 2506.01418 translate read null
2025-06-01 Perceptual Inductive Bias Is What You Need Before Contrastive Learning Tianqin Li et.al. 2506.01201 translate read null
2025-06-01 GeoChain: Multimodal Chain-of-Thought for Geographic Reasoning Sahiti Yerramilli et.al. 2506.00785 translate read null
2025-06-02 NUC-Net: Non-uniform Cylindrical Partition Network for Efficient LiDAR Semantic Segmentation Xuzhi Wang et.al. 2505.24634 translate read null

(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)