Semantic Segmentation - 2025-05

Publish Date Title Authors PDF Translate Read Code
2025-05-31 BAGNet: A Boundary-Aware Graph Attention Network for 3D Point Cloud Semantic Segmentation Wei Tao et.al. 2506.00475 translate read null
2025-05-30 Bi-Manual Joint Camera Calibration and Scene Representation Haozhan Tang et.al. 2505.24819 translate read null
2025-05-30 SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds Cheng Zeng et.al. 2505.24475 translate read null
2025-05-30 Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation Roger Ferrod et.al. 2505.24361 translate read null
2025-05-30 Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors Peiran Xu et.al. 2505.24103 translate read null
2025-05-29 MaskAdapt: Unsupervised Geometry-Aware Domain Adaptation Using Multimodal Contextual Learning and RGB-Depth Masking Numair Nadeem et.al. 2505.24026 translate read null
2025-05-29 Semantics-Guided Generative Image Compression Cheng-Lin Wu et.al. 2505.24015 translate read null
2025-05-29 Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts Xuweiyi Chen et.al. 2505.23926 translate read null
2025-05-29 TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models Yao Xiao et.al. 2505.23769 translate read link
2025-05-29 Bridging Classical and Modern Computer Vision: PerceptiveNet for Tree Crown Semantic Segmentation Georgios Voulgaris et.al. 2505.23597 translate read null
2025-05-29 VITON-DRR: Details Retention Virtual Try-on via Non-rigid Registration Ben Li et.al. 2505.23439 translate read link
2025-05-29 Adaptive Spatial Augmentation for Semi-supervised Semantic Segmentation Lingyan Ran et.al. 2505.23438 translate read null
2025-05-29 Federated Unsupervised Semantic Segmentation Evangelos Charalampakis et.al. 2505.23292 translate read null
2025-05-29 LeMoRe: Learn More Details for Lightweight Semantic Segmentation Mian Muhammad Naeem Abid et.al. 2505.23093 translate read link
2025-05-28 ConfLUNet: Multiple sclerosis lesion instance segmentation in presence of confluent lesions Maxence Wynen et.al. 2505.22537 translate read null
2025-05-28 Universal Domain Adaptation for Semantic Segmentation Seun-An Choe et.al. 2505.22458 translate read null
2025-05-28 LiDAR Based Semantic Perception for Forklifts in Outdoor Environments Benjamin Serfling et.al. 2505.22258 translate read null
2025-05-29 YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction Mingzhuang Wang et.al. 2505.22250 translate read null
2025-05-28 Enjoying Information Dividend: Gaze Track-based Medical Weakly Supervised Segmentation Zhisong Wang et.al. 2505.22230 translate read null
2025-05-28 A Survey on Training-free Open-Vocabulary Semantic Segmentation Naomi Kombol et.al. 2505.22209 translate read null
2025-05-28 S2AFormer: Strip Self-Attention for Efficient Vision Transformer Guoan Xu et.al. 2505.22195 translate read null
2025-05-28 LiDARDustX: A LiDAR Dataset for Dusty Unstructured Road Environments Chenfeng Wei et.al. 2505.21914 translate read null
2025-05-29 CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation Pardis Taghavi et.al. 2505.21904 translate read null
2025-05-28 Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation Mehrdad Noori et.al. 2505.21844 translate read null
2025-05-27 Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO Muzhi Zhu et.al. 2505.21457 translate read link
2025-05-27 Object-Centric Action-Enhanced Representations for Robot Visuo-Motor Policy Learning Nikos Giannakakis et.al. 2505.20962 translate read null
2025-05-27 DSOcc: Leveraging Depth Awareness and Semantic Aid to Boost Camera-Based 3D Semantic Occupancy Prediction Naiyu Fang et.al. 2505.20951 translate read null
2025-05-26 Vision-Based Risk Aware Emergency Landing for UAVs in Complex Urban Environments Julio de la Torre-Vanegas et.al. 2505.20423 translate read null
2025-05-26 A fully automated urban PV parameterization framework for improved estimation of energy production profiles Bowen Tian et.al. 2505.19876 translate read null
2025-05-26 Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation Nagito Saito et.al. 2505.19846 translate read null
2025-05-26 The Missing Point in Vision Transformers for Universal Image Segmentation Sajjad Shahabodini et.al. 2505.19795 translate read link
2025-05-26 ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting Wenhua Wu et.al. 2505.19420 translate read null
2025-05-25 A Joint Learning Framework with Feature Reconstruction and Prediction for Incomplete Satellite Image Time Series in Agricultural Semantic Segmentation Yuze Wang et.al. 2505.19159 translate read link
2025-05-25 SPARS: Self-Play Adversarial Reinforcement Learning for Segmentation of Liver Tumours Catalina Tan et.al. 2505.18989 translate read link
2025-05-25 How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation Yining Pan et.al. 2505.18956 translate read link
2025-05-25 LLM-Guided Taxonomy and Hierarchical Uncertainty for 3D Point CLoud Active Learning Chenxi Li et.al. 2505.18924 translate read null
2025-05-24 ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts Shiu-hong Kao et.al. 2505.18561 translate read null
2025-05-23 REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders Savya Khosla et.al. 2505.18153 translate read null
2025-05-23 SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification Shashank Agnihotri et.al. 2505.18015 translate read null
2025-05-23 Semantic segmentation with reward Xie Ting et.al. 2505.17905 translate read null
2025-05-23 Hephaestus Minicubes: A Global, Multi-Modal Dataset for Volcanic Unrest Monitoring Nikolas Papadopoulos et.al. 2505.17782 translate read null
2025-05-23 EMRA-proxy: Enhancing Multi-Class Region Semantic Segmentation in Remote Sensing Images with Attention Proxy Yichun Yu et.al. 2505.17665 translate read null
2025-05-22 Deep mineralogical segmentation of thin section images based on QEMSCAN maps Jean Pablo Vieira de Mello et.al. 2505.17008 translate read link
2025-05-22 OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning Zongyan Han et.al. 2505.16974 translate read link
2025-05-22 NovelSeek: When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification NovelSeek Team et.al. 2505.16938 translate read link
2025-05-22 TextureSAM: Towards a Texture Aware Foundation Model for Segmentation Inbal Cohen et.al. 2505.16540 translate read null
2025-05-22 Detailed Evaluation of Modern Machine Learning Approaches for Optic Plastics Sorting Vaishali Maheshkar et.al. 2505.16513 translate read null
2025-05-22 Sketchy Bounding-box Supervision for 3D Instance Segmentation Qian Deng et.al. 2505.16399 translate read null
2025-05-22 Style Transfer with Diffusion Models for Synthetic-to-Real Domain Adaptation Estelle Chigot et.al. 2505.16360 translate read link
2025-05-22 RE-TRIP : Reflectivity Instance Augmented Triangle Descriptor for 3D Place Recognition Yechan Park et.al. 2505.16165 translate read link
2025-05-21 VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation Niccolo Avogaro et.al. 2505.15592 translate read null
2025-05-21 UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset Hua Li et.al. 2505.15581 translate read link
2025-05-21 seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic Segmentation Andrew Caunes et.al. 2505.15545 translate read link
2025-05-21 Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation Ce Zhang et.al. 2505.15491 translate read null
2025-05-21 gen2seg: Generative Models Enable Generalizable Instance Segmentation Om Khangaonkar et.al. 2505.15263 translate read link
2025-05-21 Zero-Shot Gaze-based Volumetric Medical Image Segmentation Tatyana Shmykova et.al. 2505.15256 translate read null
2025-05-21 From Pixels to Images: Deep Learning Advances in Remote Sensing Image Semantic Segmentation Quanwei Liu et.al. 2505.15147 translate read null
2025-05-20 Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning Amine Elhafsi et.al. 2505.14938 translate read null
2025-05-20 Instance Segmentation for Point Sets Abhimanyu Talwar et.al. 2505.14583 translate read null
2025-05-20 ReservoirTTA: Prolonged Test-time Adaptation for Evolving and Recurring Domains Guillaume Vray et.al. 2505.14511 translate read link
2025-05-20 Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation Bin-Bin Gao et.al. 2505.14239 translate read link
2025-05-20 Intra-class Patch Swap for Self-Distillation Hongjun Choi et.al. 2505.14124 translate read link
2025-05-20 Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts Xi Chen et.al. 2505.14088 translate read null
2025-05-20 Scaling Vision Mamba Across Resolutions via Fractal Traversal Bo Li et.al. 2505.14062 translate read null
2025-05-20 EGFormer: Towards Efficient and Generalizable Multimodal Semantic Segmentation Zelin Zhang et.al. 2505.14014 translate read null
2025-05-19 Self-Supervised Learning for Image Segmentation: A Comprehensive Survey Thangarajah Akilan et.al. 2505.13584 translate read null
2025-05-19 FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching Alp Eren Sari et.al. 2505.13174 translate read null
2025-05-20 Industrial Synthetic Segment Pre-training Shinichi Mae et.al. 2505.13099 translate read null
2025-05-19 Robust Multimodal Segmentation with Representation Regularization and Hybrid Prototype Distillation Jiaqi Tan et.al. 2505.12861 translate read link
2025-05-19 Enhancing Transformers Through Conditioned Embedded Tokens Hemanth Saratchandran et.al. 2505.12789 translate read null
2025-05-18 Temporal-Spectral-Spatial Unified Remote Sensing Dense Prediction Sijie Zhao et.al. 2505.12280 translate read link
2025-05-17 SoftPQ: Robust Instance Segmentation Evaluation via Soft Matching and Tunable Thresholds Ranit Karmakar et.al. 2505.12155 translate read link
2025-05-17 EarthSynth: Generating Informative Earth Observation with Diffusion Models Jiancheng Pan et.al. 2505.12108 translate read null
2025-05-17 iSegMan: Interactive Segment-and-Manipulate 3D Gaussians Yian Zhao et.al. 2505.11934 translate read null
2025-05-17 Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Boosting Off-Road Segmentation via Photometric Distortion and Exponential Moving Average Wonjune Kim et.al. 2505.11769 translate read null
2025-05-16 DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation Ziyu Zhao et.al. 2505.11676 translate read null
2025-05-16 SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision Utsav Rai et.al. 2505.11439 translate read null
2025-05-16 Pseudo-Label Quality Decoupling and Correction for Semi-Supervised Instance Segmentation Jianghang Lin et.al. 2505.11075 translate read null
2025-05-16 Completely Weakly Supervised Class-Incremental Learning for Semantic Segmentation David Minkwan Kim et.al. 2505.10781 translate read null
2025-05-15 Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis Francisco Raverta Capua et.al. 2505.10751 translate read null
2025-05-15 TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation Manthan Patel et.al. 2505.10696 translate read link
2025-05-15 SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity Shihao Zou et.al. 2505.10352 translate read null
2025-05-15 APCoTTA: Continual Test-Time Adaptation for Semantic Segmentation of Airborne LiDAR Point Clouds Yuan Gao et.al. 2505.09971 translate read link
2025-05-14 FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization Xiaoyang Yu et.al. 2505.09385 translate read null
2025-05-14 MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning Bin-Bin Gao et.al. 2505.09265 translate read link
2025-05-13 MESSI: A Multi-Elevation Semantic Segmentation Image Dataset of an Urban Environment Barak Pinkovich et.al. 2505.08589 translate read null
2025-05-14 The RaspGrade Dataset: Towards Automatic Raspberry Ripeness Grading with Deep Learning Mohamed Lamine Mekhalfi et.al. 2505.08537 translate read null
2025-05-13 Dynamic Snake Upsampling Operater and Boundary-Skeleton Weighted Loss for Tubular Structure Segmentation Yiqi Chen et.al. 2505.08525 translate read null
2025-05-13 Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency Adel Ammar et.al. 2505.08445 translate read null
2025-05-13 GNCAF: A GNN-based Neighboring Context Aggregation Framework for Tertiary Lymphoid Structures Semantic Segmentation in WSI Lei Su et.al. 2505.08430 translate read null
2025-05-12 Vision Foundation Model Embedding-Based Semantic Anomaly Detection Max Peter Ronecker et.al. 2505.07998 translate read null
2025-05-12 Privacy Risks of Robot Vision: A User Study on Image Modalities and Resolution Xuying Huang et.al. 2505.07766 translate read null
2025-05-12 Feedback-Driven Pseudo-Label Reliability Assessment: Redefining Thresholding for Semi-Supervised Semantic Segmentation Negin Ghamsarian et.al. 2505.07691 translate read null
2025-05-12 MAIS: Memory-Attention for Interactive Segmentation Mauricio Orbes-Arteaga et.al. 2505.07511 translate read null
2025-05-13 TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset Olaf Wysocki et.al. 2505.07396 translate read null
2025-05-11 Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution Zihang Liu et.al. 2505.07071 translate read link
2025-05-11 Depth-Sensitive Soft Suppression with RGB-D Inter-Modal Stylization Flow for Domain Generalization Semantic Segmentation Binbin Wei et.al. 2505.07050 translate read null
2025-05-11 Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Leveraging Color Shift Correction, RoPE-Swin Backbone, and Quantile-based Label Denoising Strategy for Robust Outdoor Scene Understanding Chih-Chung Hsu et.al. 2505.06991 translate read null
2025-05-11 Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation Seokjun Kwon et.al. 2505.06951 translate read null
2025-05-10 Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization Xu Zheng et.al. 2505.06635 translate read null
2025-05-10 RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation Zhiwen Zeng et.al. 2505.06515 translate read null
2025-05-09 Brain Hematoma Marker Recognition Using Multitask Learning: SwinTransformer and Swin-Unet Kodai Hirata et.al. 2505.06185 translate read null
2025-05-08 CottonSim: Development of an autonomous visual-guided robotic cotton-picking system in the Gazebo Thevathayarajh Thayananthan et.al. 2505.05317 translate read null
2025-05-08 RepSNet: A Nucleus Instance Segmentation model based on Boundary Regression and Structural Re-parameterization Shengchun Xiong et.al. 2505.05073 translate read null
2025-05-09 UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model Timo Kaiser et.al. 2505.05049 translate read link
2025-05-08 Split Matching for Inductive Zero-shot Semantic Segmentation Jialei Chen et.al. 2505.05023 translate read null
2025-05-08 Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model Navin Ranjan et.al. 2505.04861 translate read null
2025-05-07 Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions? Shashank Agnihotri et.al. 2505.04835 translate read link
2025-05-07 Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer Sainath Dey et.al. 2505.04740 translate read null
2025-05-07 DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception Junjie Wang et.al. 2505.04410 translate read link
2025-05-07 MFSeg: Efficient Multi-frame 3D Semantic Segmentation Chengjie Huang et.al. 2505.04408 translate read null
2025-05-06 Self-Supervised Learning for Robotic Leaf Manipulation: A Hybrid Geometric-Neural Approach Srecharan Selvam et.al. 2505.03702 translate read null
2025-05-06 CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting Huawei Sun et.al. 2505.03679 translate read null
2025-05-06 Panoramic Out-of-Distribution Segmentation Mengfei Duan et.al. 2505.03539 translate read link
2025-05-06 3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation Andrew Caunes et.al. 2505.03300 translate read null
2025-05-05 Platelet enumeration in dense aggregates H. Martin Gillis et.al. 2505.02751 translate read null
2025-05-04 Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation Volodymyr Havrylov et.al. 2505.02075 translate read link
2025-05-04 Segment Any RGB-Thermal Model with Language-aided Distillation Dong Xing et.al. 2505.01950 translate read null
2025-05-03 OODTE: A Differential Testing Engine for the ONNX Optimizer Nikolaos Louloudakis et.al. 2505.01892 translate read null
2025-05-03 A Novel WaveInst-based Network for Tree Trunk Structure Extraction and Pattern Analysis in Forest Inventory Chenyang Fan et.al. 2505.01656 translate read null
2025-05-02 A Sensor Agnostic Domain Generalization Framework for Leveraging Geospatial Foundation Models: Enhancing Semantic Segmentation viaSynergistic Pseudo-Labeling and Generative Learning Anan Yaghmour et.al. 2505.01558 translate read null
2025-05-02 Rethinking RGB-Event Semantic Segmentation with a Novel Bidirectional Motion-enhanced Event Representation Zhen Yao et.al. 2505.01548 translate read link
2025-05-02 Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing Fahong Zhang et.al. 2505.01385 translate read null
2025-05-02 GeloVec: Higher Dimensional Geometric Smoothing for Coherent Visual Feature Extraction in Image Segmentation Boris Kriuk et.al. 2505.01057 translate read null
2025-05-03 Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook Muyi Bao et.al. 2505.00630 translate read null
2025-05-01 Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation Feng Xue et.al. 2505.00378 translate read null

(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)