Semantic Segmentation - 2025-05

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-05-31	BAGNet: A Boundary-Aware Graph Attention Network for 3D Point Cloud Semantic Segmentation	Wei Tao et.al.	2506.00475	translate	read	null
2025-05-30	Bi-Manual Joint Camera Calibration and Scene Representation	Haozhan Tang et.al.	2505.24819	translate	read	null
2025-05-30	SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds	Cheng Zeng et.al.	2505.24475	translate	read	null
2025-05-30	Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation	Roger Ferrod et.al.	2505.24361	translate	read	null
2025-05-30	Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors	Peiran Xu et.al.	2505.24103	translate	read	null
2025-05-29	MaskAdapt: Unsupervised Geometry-Aware Domain Adaptation Using Multimodal Contextual Learning and RGB-Depth Masking	Numair Nadeem et.al.	2505.24026	translate	read	null
2025-05-29	Semantics-Guided Generative Image Compression	Cheng-Lin Wu et.al.	2505.24015	translate	read	null
2025-05-29	Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts	Xuweiyi Chen et.al.	2505.23926	translate	read	null
2025-05-29	TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models	Yao Xiao et.al.	2505.23769	translate	read	link
2025-05-29	Bridging Classical and Modern Computer Vision: PerceptiveNet for Tree Crown Semantic Segmentation	Georgios Voulgaris et.al.	2505.23597	translate	read	null
2025-05-29	VITON-DRR: Details Retention Virtual Try-on via Non-rigid Registration	Ben Li et.al.	2505.23439	translate	read	link
2025-05-29	Adaptive Spatial Augmentation for Semi-supervised Semantic Segmentation	Lingyan Ran et.al.	2505.23438	translate	read	null
2025-05-29	Federated Unsupervised Semantic Segmentation	Evangelos Charalampakis et.al.	2505.23292	translate	read	null
2025-05-29	LeMoRe: Learn More Details for Lightweight Semantic Segmentation	Mian Muhammad Naeem Abid et.al.	2505.23093	translate	read	link
2025-05-28	ConfLUNet: Multiple sclerosis lesion instance segmentation in presence of confluent lesions	Maxence Wynen et.al.	2505.22537	translate	read	null
2025-05-28	Universal Domain Adaptation for Semantic Segmentation	Seun-An Choe et.al.	2505.22458	translate	read	null
2025-05-28	LiDAR Based Semantic Perception for Forklifts in Outdoor Environments	Benjamin Serfling et.al.	2505.22258	translate	read	null
2025-05-29	YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction	Mingzhuang Wang et.al.	2505.22250	translate	read	null
2025-05-28	Enjoying Information Dividend: Gaze Track-based Medical Weakly Supervised Segmentation	Zhisong Wang et.al.	2505.22230	translate	read	null
2025-05-28	A Survey on Training-free Open-Vocabulary Semantic Segmentation	Naomi Kombol et.al.	2505.22209	translate	read	null
2025-05-28	S2AFormer: Strip Self-Attention for Efficient Vision Transformer	Guoan Xu et.al.	2505.22195	translate	read	null
2025-05-28	LiDARDustX: A LiDAR Dataset for Dusty Unstructured Road Environments	Chenfeng Wei et.al.	2505.21914	translate	read	null
2025-05-29	CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation	Pardis Taghavi et.al.	2505.21904	translate	read	null
2025-05-28	Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation	Mehrdad Noori et.al.	2505.21844	translate	read	null
2025-05-27	Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO	Muzhi Zhu et.al.	2505.21457	translate	read	link
2025-05-27	Object-Centric Action-Enhanced Representations for Robot Visuo-Motor Policy Learning	Nikos Giannakakis et.al.	2505.20962	translate	read	null
2025-05-27	DSOcc: Leveraging Depth Awareness and Semantic Aid to Boost Camera-Based 3D Semantic Occupancy Prediction	Naiyu Fang et.al.	2505.20951	translate	read	null
2025-05-26	Vision-Based Risk Aware Emergency Landing for UAVs in Complex Urban Environments	Julio de la Torre-Vanegas et.al.	2505.20423	translate	read	null
2025-05-26	A fully automated urban PV parameterization framework for improved estimation of energy production profiles	Bowen Tian et.al.	2505.19876	translate	read	null
2025-05-26	Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation	Nagito Saito et.al.	2505.19846	translate	read	null
2025-05-26	The Missing Point in Vision Transformers for Universal Image Segmentation	Sajjad Shahabodini et.al.	2505.19795	translate	read	link
2025-05-26	ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting	Wenhua Wu et.al.	2505.19420	translate	read	null
2025-05-25	A Joint Learning Framework with Feature Reconstruction and Prediction for Incomplete Satellite Image Time Series in Agricultural Semantic Segmentation	Yuze Wang et.al.	2505.19159	translate	read	link
2025-05-25	SPARS: Self-Play Adversarial Reinforcement Learning for Segmentation of Liver Tumours	Catalina Tan et.al.	2505.18989	translate	read	link
2025-05-25	How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation	Yining Pan et.al.	2505.18956	translate	read	link
2025-05-25	LLM-Guided Taxonomy and Hierarchical Uncertainty for 3D Point CLoud Active Learning	Chenxi Li et.al.	2505.18924	translate	read	null
2025-05-24	ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts	Shiu-hong Kao et.al.	2505.18561	translate	read	null
2025-05-23	REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders	Savya Khosla et.al.	2505.18153	translate	read	null
2025-05-23	SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification	Shashank Agnihotri et.al.	2505.18015	translate	read	null
2025-05-23	Semantic segmentation with reward	Xie Ting et.al.	2505.17905	translate	read	null
2025-05-23	Hephaestus Minicubes: A Global, Multi-Modal Dataset for Volcanic Unrest Monitoring	Nikolas Papadopoulos et.al.	2505.17782	translate	read	null
2025-05-23	EMRA-proxy: Enhancing Multi-Class Region Semantic Segmentation in Remote Sensing Images with Attention Proxy	Yichun Yu et.al.	2505.17665	translate	read	null
2025-05-22	Deep mineralogical segmentation of thin section images based on QEMSCAN maps	Jean Pablo Vieira de Mello et.al.	2505.17008	translate	read	link
2025-05-22	OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning	Zongyan Han et.al.	2505.16974	translate	read	link
2025-05-22	NovelSeek: When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification	NovelSeek Team et.al.	2505.16938	translate	read	link
2025-05-22	TextureSAM: Towards a Texture Aware Foundation Model for Segmentation	Inbal Cohen et.al.	2505.16540	translate	read	null
2025-05-22	Detailed Evaluation of Modern Machine Learning Approaches for Optic Plastics Sorting	Vaishali Maheshkar et.al.	2505.16513	translate	read	null
2025-05-22	Sketchy Bounding-box Supervision for 3D Instance Segmentation	Qian Deng et.al.	2505.16399	translate	read	null
2025-05-22	Style Transfer with Diffusion Models for Synthetic-to-Real Domain Adaptation	Estelle Chigot et.al.	2505.16360	translate	read	link
2025-05-22	RE-TRIP : Reflectivity Instance Augmented Triangle Descriptor for 3D Place Recognition	Yechan Park et.al.	2505.16165	translate	read	link
2025-05-21	VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation	Niccolo Avogaro et.al.	2505.15592	translate	read	null
2025-05-21	UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset	Hua Li et.al.	2505.15581	translate	read	link
2025-05-21	seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic Segmentation	Andrew Caunes et.al.	2505.15545	translate	read	link
2025-05-21	Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation	Ce Zhang et.al.	2505.15491	translate	read	null
2025-05-21	gen2seg: Generative Models Enable Generalizable Instance Segmentation	Om Khangaonkar et.al.	2505.15263	translate	read	link
2025-05-21	Zero-Shot Gaze-based Volumetric Medical Image Segmentation	Tatyana Shmykova et.al.	2505.15256	translate	read	null
2025-05-21	From Pixels to Images: Deep Learning Advances in Remote Sensing Image Semantic Segmentation	Quanwei Liu et.al.	2505.15147	translate	read	null
2025-05-20	Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning	Amine Elhafsi et.al.	2505.14938	translate	read	null
2025-05-20	Instance Segmentation for Point Sets	Abhimanyu Talwar et.al.	2505.14583	translate	read	null
2025-05-20	ReservoirTTA: Prolonged Test-time Adaptation for Evolving and Recurring Domains	Guillaume Vray et.al.	2505.14511	translate	read	link
2025-05-20	Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation	Bin-Bin Gao et.al.	2505.14239	translate	read	link
2025-05-20	Intra-class Patch Swap for Self-Distillation	Hongjun Choi et.al.	2505.14124	translate	read	link
2025-05-20	Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts	Xi Chen et.al.	2505.14088	translate	read	null
2025-05-20	Scaling Vision Mamba Across Resolutions via Fractal Traversal	Bo Li et.al.	2505.14062	translate	read	null
2025-05-20	EGFormer: Towards Efficient and Generalizable Multimodal Semantic Segmentation	Zelin Zhang et.al.	2505.14014	translate	read	null
2025-05-19	Self-Supervised Learning for Image Segmentation: A Comprehensive Survey	Thangarajah Akilan et.al.	2505.13584	translate	read	null
2025-05-19	FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching	Alp Eren Sari et.al.	2505.13174	translate	read	null
2025-05-20	Industrial Synthetic Segment Pre-training	Shinichi Mae et.al.	2505.13099	translate	read	null
2025-05-19	Robust Multimodal Segmentation with Representation Regularization and Hybrid Prototype Distillation	Jiaqi Tan et.al.	2505.12861	translate	read	link
2025-05-19	Enhancing Transformers Through Conditioned Embedded Tokens	Hemanth Saratchandran et.al.	2505.12789	translate	read	null
2025-05-18	Temporal-Spectral-Spatial Unified Remote Sensing Dense Prediction	Sijie Zhao et.al.	2505.12280	translate	read	link
2025-05-17	SoftPQ: Robust Instance Segmentation Evaluation via Soft Matching and Tunable Thresholds	Ranit Karmakar et.al.	2505.12155	translate	read	link
2025-05-17	EarthSynth: Generating Informative Earth Observation with Diffusion Models	Jiancheng Pan et.al.	2505.12108	translate	read	null
2025-05-17	iSegMan: Interactive Segment-and-Manipulate 3D Gaussians	Yian Zhao et.al.	2505.11934	translate	read	null
2025-05-17	Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Boosting Off-Road Segmentation via Photometric Distortion and Exponential Moving Average	Wonjune Kim et.al.	2505.11769	translate	read	null
2025-05-16	DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation	Ziyu Zhao et.al.	2505.11676	translate	read	null
2025-05-16	SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision	Utsav Rai et.al.	2505.11439	translate	read	null
2025-05-16	Pseudo-Label Quality Decoupling and Correction for Semi-Supervised Instance Segmentation	Jianghang Lin et.al.	2505.11075	translate	read	null
2025-05-16	Completely Weakly Supervised Class-Incremental Learning for Semantic Segmentation	David Minkwan Kim et.al.	2505.10781	translate	read	null
2025-05-15	Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis	Francisco Raverta Capua et.al.	2505.10751	translate	read	null
2025-05-15	TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation	Manthan Patel et.al.	2505.10696	translate	read	link
2025-05-15	SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity	Shihao Zou et.al.	2505.10352	translate	read	null
2025-05-15	APCoTTA: Continual Test-Time Adaptation for Semantic Segmentation of Airborne LiDAR Point Clouds	Yuan Gao et.al.	2505.09971	translate	read	link
2025-05-14	FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization	Xiaoyang Yu et.al.	2505.09385	translate	read	null
2025-05-14	MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning	Bin-Bin Gao et.al.	2505.09265	translate	read	link
2025-05-13	MESSI: A Multi-Elevation Semantic Segmentation Image Dataset of an Urban Environment	Barak Pinkovich et.al.	2505.08589	translate	read	null
2025-05-14	The RaspGrade Dataset: Towards Automatic Raspberry Ripeness Grading with Deep Learning	Mohamed Lamine Mekhalfi et.al.	2505.08537	translate	read	null
2025-05-13	Dynamic Snake Upsampling Operater and Boundary-Skeleton Weighted Loss for Tubular Structure Segmentation	Yiqi Chen et.al.	2505.08525	translate	read	null
2025-05-13	Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency	Adel Ammar et.al.	2505.08445	translate	read	null
2025-05-13	GNCAF: A GNN-based Neighboring Context Aggregation Framework for Tertiary Lymphoid Structures Semantic Segmentation in WSI	Lei Su et.al.	2505.08430	translate	read	null
2025-05-12	Vision Foundation Model Embedding-Based Semantic Anomaly Detection	Max Peter Ronecker et.al.	2505.07998	translate	read	null
2025-05-12	Privacy Risks of Robot Vision: A User Study on Image Modalities and Resolution	Xuying Huang et.al.	2505.07766	translate	read	null
2025-05-12	Feedback-Driven Pseudo-Label Reliability Assessment: Redefining Thresholding for Semi-Supervised Semantic Segmentation	Negin Ghamsarian et.al.	2505.07691	translate	read	null
2025-05-12	MAIS: Memory-Attention for Interactive Segmentation	Mauricio Orbes-Arteaga et.al.	2505.07511	translate	read	null
2025-05-13	TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset	Olaf Wysocki et.al.	2505.07396	translate	read	null
2025-05-11	Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution	Zihang Liu et.al.	2505.07071	translate	read	link
2025-05-11	Depth-Sensitive Soft Suppression with RGB-D Inter-Modal Stylization Flow for Domain Generalization Semantic Segmentation	Binbin Wei et.al.	2505.07050	translate	read	null
2025-05-11	Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Leveraging Color Shift Correction, RoPE-Swin Backbone, and Quantile-based Label Denoising Strategy for Robust Outdoor Scene Understanding	Chih-Chung Hsu et.al.	2505.06991	translate	read	null
2025-05-11	Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation	Seokjun Kwon et.al.	2505.06951	translate	read	null
2025-05-10	Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization	Xu Zheng et.al.	2505.06635	translate	read	null
2025-05-10	RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation	Zhiwen Zeng et.al.	2505.06515	translate	read	null
2025-05-09	Brain Hematoma Marker Recognition Using Multitask Learning: SwinTransformer and Swin-Unet	Kodai Hirata et.al.	2505.06185	translate	read	null
2025-05-08	CottonSim: Development of an autonomous visual-guided robotic cotton-picking system in the Gazebo	Thevathayarajh Thayananthan et.al.	2505.05317	translate	read	null
2025-05-08	RepSNet: A Nucleus Instance Segmentation model based on Boundary Regression and Structural Re-parameterization	Shengchun Xiong et.al.	2505.05073	translate	read	null
2025-05-09	UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model	Timo Kaiser et.al.	2505.05049	translate	read	link
2025-05-08	Split Matching for Inductive Zero-shot Semantic Segmentation	Jialei Chen et.al.	2505.05023	translate	read	null
2025-05-08	Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model	Navin Ranjan et.al.	2505.04861	translate	read	null
2025-05-07	Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions?	Shashank Agnihotri et.al.	2505.04835	translate	read	link
2025-05-07	Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer	Sainath Dey et.al.	2505.04740	translate	read	null
2025-05-07	DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception	Junjie Wang et.al.	2505.04410	translate	read	link
2025-05-07	MFSeg: Efficient Multi-frame 3D Semantic Segmentation	Chengjie Huang et.al.	2505.04408	translate	read	null
2025-05-06	Self-Supervised Learning for Robotic Leaf Manipulation: A Hybrid Geometric-Neural Approach	Srecharan Selvam et.al.	2505.03702	translate	read	null
2025-05-06	CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting	Huawei Sun et.al.	2505.03679	translate	read	null
2025-05-06	Panoramic Out-of-Distribution Segmentation	Mengfei Duan et.al.	2505.03539	translate	read	link
2025-05-06	3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation	Andrew Caunes et.al.	2505.03300	translate	read	null
2025-05-05	Platelet enumeration in dense aggregates	H. Martin Gillis et.al.	2505.02751	translate	read	null
2025-05-04	Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation	Volodymyr Havrylov et.al.	2505.02075	translate	read	link
2025-05-04	Segment Any RGB-Thermal Model with Language-aided Distillation	Dong Xing et.al.	2505.01950	translate	read	null
2025-05-03	OODTE: A Differential Testing Engine for the ONNX Optimizer	Nikolaos Louloudakis et.al.	2505.01892	translate	read	null
2025-05-03	A Novel WaveInst-based Network for Tree Trunk Structure Extraction and Pattern Analysis in Forest Inventory	Chenyang Fan et.al.	2505.01656	translate	read	null
2025-05-02	A Sensor Agnostic Domain Generalization Framework for Leveraging Geospatial Foundation Models: Enhancing Semantic Segmentation viaSynergistic Pseudo-Labeling and Generative Learning	Anan Yaghmour et.al.	2505.01558	translate	read	null
2025-05-02	Rethinking RGB-Event Semantic Segmentation with a Novel Bidirectional Motion-enhanced Event Representation	Zhen Yao et.al.	2505.01548	translate	read	link
2025-05-02	Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing	Fahong Zhang et.al.	2505.01385	translate	read	null
2025-05-02	GeloVec: Higher Dimensional Geometric Smoothing for Coherent Visual Feature Extraction in Image Segmentation	Boris Kriuk et.al.	2505.01057	translate	read	null
2025-05-03	Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook	Muyi Bao et.al.	2505.00630	translate	read	null
2025-05-01	Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation	Feng Xue et.al.	2505.00378	translate	read	null

(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)