Semantic Segmentation - 2025-11

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-11-30	Stronger is not better: Better Augmentations in Contrastive Learning for Medical Image Segmentation	Azeez Idris et.al.	2512.05992	translate	read	null
2025-11-30	Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation	An Yang et.al.	2512.00944	translate	read	null
2025-11-30	The Outline of Deception: Physical Adversarial Attacks on Traffic Signs Using Edge Patches	Haojie Ji et.al.	2512.00765	translate	read	null
2025-11-30	VFM-ISRefiner: Towards Better Adapting Vision Foundation Models for Interactive Segmentation of Remote Sensing Images	Deliang Wang et.al.	2512.00718	translate	read	null
2025-11-29	Doppler-Enhanced Deep Learning: Improving Thyroid Nodule Segmentation with YOLOv5 Instance Segmentation	Mahmoud El Hussieni et.al.	2512.00639	translate	read	null
2025-11-29	EZ-SP: Fast and Lightweight Superpoint-Based 3D Segmentation	Louis Geist et.al.	2512.00385	translate	read	null
2025-11-29	Breaking It Down: Domain-Aware Semantic Segmentation for Retrieval Augmented Generation	Aparajitha Allamraju et.al.	2512.00367	translate	read	null
2025-11-29	Towards aligned body representations in vision models	Andrey Gizdov et.al.	2512.00365	translate	read	null
2025-11-24	Satellite to Street : Disaster Impact Estimator	Sreesritha Sai et.al.	2512.00065	translate	read	null
2025-11-28	Learning to Predict Aboveground Biomass from RGB Images with 3D Synthetic Scenes	Silvia Zuffi et.al.	2511.23249	translate	read	null
2025-11-28	Taming the Light: Illumination-Invariant Semantic 3DGS-SLAM	Shouhe Zhang et.al.	2511.22968	translate	read	null
2025-11-28	Do We Need Perfect Data? Leveraging Noise for Domain Generalized Segmentation	Taeyeong Kim et.al.	2511.22948	translate	read	null
2025-11-27	GazeTrack: High-Precision Eye Tracking Based on Regularization and Spatial Computing	Xiaoyin Yang et.al.	2511.22607	translate	read	null
2025-11-27	3D Affordance Keypoint Detection for Robotic Manipulation	Zhiyang Liu et.al.	2511.22195	translate	read	null
2025-11-26	OpenTwinMap: An Open-Source Digital Twin Generator for Urban Autonomous Driving	Alex Richardson et.al.	2511.21925	translate	read	null
2025-11-26	ReSAM: Refine, Requery, and Reinforce: Self-Prompting Point-Supervised Segmentation for Remote Sensing Images	M. Naseer Subhani et.al.	2511.21606	translate	read	null
2025-11-26	Shift-Equivariant Complex-Valued Convolutional Neural Networks	Quentin Gabot et.al.	2511.21250	translate	read	null
2025-11-25	Open Vocabulary Compositional Explanations for Neuron Alignment	Biagio La Rosa et.al.	2511.20931	translate	read	null
2025-11-25	Automated Monitoring of Cultural Heritage Artifacts Using Semantic Segmentation	Andrea Ranieri et.al.	2511.20541	translate	read	null
2025-11-25	CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation	Shilei Cao et.al.	2511.20302	translate	read	null
2025-11-25	SAM-MI: A Mask-Injected Framework for Enhancing Open-Vocabulary Semantic Segmentation with SAM	Lin Chen et.al.	2511.20027	translate	read	null
2025-11-25	Supervise Less, See More: Training-free Nuclear Instance Segmentation with Prototype-Guided Prompting	Wen Zhang et.al.	2511.19953	translate	read	null
2025-11-24	Lightweight Transformer Framework for Weakly Supervised Semantic Segmentation	Ali Torabi et.al.	2511.19765	translate	read	null
2025-11-24	RADSeg: Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models	Omar Alama et.al.	2511.19704	translate	read	null
2025-11-24	Studying Maps at Scale: A Digital Investigation of Cartography and the Evolution of Figuration	Remi Petitpierre et.al.	2511.19538	translate	read	null
2025-11-24	BackSplit: The Importance of Sub-dividing the Background in Biomedical Lesion Segmentation	Rachit Saluja et.al.	2511.19394	translate	read	null
2025-11-24	nnActive: A Framework for Evaluation of Active Learning in 3D Biomedical Segmentation	Carsten T. Lüth et.al.	2511.19183	translate	read	null
2025-11-24	DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detection	Hai Ci et.al.	2511.19111	translate	read	null
2025-11-24	SupLID: Geometrical Guidance for Out-of-Distribution Detection in Semantic Segmentation	Nimeshika Udayangani et.al.	2511.18816	translate	read	null
2025-11-24	PartDiffuser: Part-wise 3D Mesh Generation via Discrete Diffusion	Yichen Yang et.al.	2511.18801	translate	read	null
2025-11-23	SegSplat: Feed-forward Gaussian Splatting and Open-Set Semantic Segmentation	Peter Siegel et.al.	2511.18386	translate	read	null
2025-11-23	UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization	Siyi Li et.al.	2511.18254	translate	read	null
2025-11-22	Matching-Based Few-Shot Semantic Segmentation Models Are Interpretable by Design	Pasquale De Marinis et.al.	2511.18163	translate	read	null
2025-11-22	AdaPerceiver: Transformers with Adaptive Width, Depth, and Tokens	Purvish Jajal et.al.	2511.18105	translate	read	null
2025-11-18	HSMix: Hard and Soft Mixing Data Augmentation for Medical Image Segmentation	Danyang Sun et.al.	2511.17614	translate	read	null
2025-11-21	Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift	Björn Michele et.al.	2511.17455	translate	read	null
2025-11-21	REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing	Binger Chen et.al.	2511.17442	translate	read	null
2025-11-21	FisheyeGaussianLift: BEV Feature Lifting for Surround-View Fisheye Camera Perception	Shubham Sonarghare et.al.	2511.17210	translate	read	null
2025-11-20	Late-decoupled 3D Hierarchical Semantic Segmentation with Semantic Prototype Discrimination based Bi-branch Supervision	Shuyu Cao et.al.	2511.16650	translate	read	null
2025-11-20	Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling	Minseok Seo et.al.	2511.16301	translate	read	null
2025-11-20	Target Refocusing via Attention Redistribution for Open-Vocabulary Semantic Segmentation: An Explainability Perspective	Jiahao Li et.al.	2511.16170	translate	read	null
2025-11-20	InfoCLIP: Bridging Vision-Language Pretraining and Open-Vocabulary Semantic Segmentation via Information-Theoretic Alignment Transfer	Muyao Yuan et.al.	2511.15967	translate	read	null
2025-11-19	Automatic Uncertainty-Aware Synthetic Data Bootstrapping for Historical Map Segmentation	Lukas Arzoumanidis et.al.	2511.15875	translate	read	null
2025-11-19	GEO-Bench-2: From Performance to Capability, Rethinking Evaluation in Geospatial AI	Naomi Simumba et.al.	2511.15658	translate	read	null
2025-11-19	Multi-Text Guided Few-Shot Semantic Segmentation	Qiang Jiao et.al.	2511.15515	translate	read	null
2025-11-19	WarNav: An Autonomous Driving Benchmark for Segmentation of Navigable Zones in War Scenes	Marc-Emmanuel Coupvent des Graviers et.al.	2511.15429	translate	read	null
2025-11-19	Controlling False Positives in Image Segmentation via Conformal Prediction	Luca Mossina et.al.	2511.15406	translate	read	null
2025-11-18	EGSA-PT:Edge-Guided Spatial Attention with Progressive Training for Monocular Depth Estimation and Segmentation of Transparent Objects	Gbenga Omotara et.al.	2511.14970	translate	read	null
2025-11-18	FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding	Zhenshi Li et.al.	2511.14901	translate	read	null
2025-11-18	Segmentation-Aware Latent Diffusion for Satellite Image Super-Resolution: Enabling Smallholder Farm Boundary Delineation	Aditi Agarwal et.al.	2511.14481	translate	read	null
2025-11-18	Step by Step Network	Dongchen Han et.al.	2511.14329	translate	read	null
2025-11-18	Orion: A Unified Visual Agent for Multimodal Perception, Advanced Visual Reasoning and Execution	N Dinesh Reddy et.al.	2511.14210	translate	read	null
2025-11-17	Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting	Jiangnan Ye et.al.	2511.13684	translate	read	null
2025-11-17	Mapping the Vanishing and Transformation of Urban Villages in China	Wenyu Zhang et.al.	2511.13507	translate	read	null
2025-11-17	Delineate Anything Flow: Fast, Country-Level Field Boundary Detection from Any Source	Mykola Lavreniuk et.al.	2511.13417	translate	read	null
2025-11-17	DiffPixelFormer: Differential Pixel-Aware Transformer for RGB-D Indoor Scene Segmentation	Yan Gong et.al.	2511.13047	translate	read	null
2025-11-15	FaNe: Towards Fine-Grained Cross-Modal Contrast with False-Negative Reduction and Text-Conditioned Sparse Attention	Peng Zhang et.al.	2511.12215	translate	read	null
2025-11-15	Evaluation of Attention Mechanisms in U-Net Architectures for Semantic Segmentation of Brazilian Rock Art Petroglyphs	Leonardi Melo et.al.	2511.11959	translate	read	null
2025-11-14	Chain-of-Generation: Progressive Latent Diffusion for Text-Guided Molecular Design	Lingxiao Li et.al.	2511.11894	translate	read	null
2025-11-14	Advancing Annotat3D with Harpia: A CUDA-Accelerated Library For Large-Scale Volumetric Data Segmentation	Camila Machado de Araujo et.al.	2511.11890	translate	read	null
2025-11-13	AdaptFly: Prompt-Guided Adaptation of Foundation Models for Low-Altitude UAV Networks	Jiao Chen et.al.	2511.11720	translate	read	null
2025-11-12	Enhancing Reinforcement Learning in 3D Environments through Semantic Segmentation: A Case Study in ViZDoom	Hugo Huang et.al.	2511.11703	translate	read	null
2025-11-12	EPSegFZ: Efficient Point Cloud Semantic Segmentation for Few- and Zero-Shot Scenarios with Language Guidance	Jiahui Wang et.al.	2511.11700	translate	read	null
2025-11-14	Terrain Costmap Generation via Scaled Preference Conditioning	Luisa Mao et.al.	2511.11529	translate	read	null
2025-11-13	Histology-informed tiling of whole tissue sections improves the interpretability and predictability of cancer relapse and genetic alterations	Willem Bonnaffé et.al.	2511.10432	translate	read	null
2025-11-13	Domain Adaptation for Camera-Specific Image Characteristics using Shallow Discriminators	Maximiliane Gruber et.al.	2511.10424	translate	read	null
2025-11-13	DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Semantic Instance Segmentation	Xuexun Liu et.al.	2511.10003	translate	read	null
2025-11-12	Soiling detection for Advanced Driver Assistance Systems	Filip Beránek et.al.	2511.09740	translate	read	null
2025-11-12	OUGS: Active View Selection via Object-aware Uncertainty Estimation in 3DGS	Haiyi Li et.al.	2511.09397	translate	read	null
2025-11-11	Empowering DINO Representations for Underwater Instance Segmentation via Aligner and Prompter	Zhiyang Chen et.al.	2511.08334	translate	read	null
2025-11-11	Re-coding for Uncertainties: Edge-awareness Semantic Concordance for Resilient Event-RGB Segmentation	Nan Bao et.al.	2511.08269	translate	read	null
2025-11-11	NERVE: Neighbourhood & Entropy-guided Random-walk for training free open-Vocabulary sEgmentation	Kunal Mahatha et.al.	2511.08248	translate	read	null
2025-11-10	FlowFeat: Pixel-Dense Embedding of Motion Profiles	Nikita Araslanov et.al.	2511.07696	translate	read	null
2025-11-10	Glioma C6: A Novel Dataset for Training and Benchmarking Cell Segmentation	Roman Malashin et.al.	2511.07286	translate	read	null
2025-11-10	StreamKV: Streaming Video Question-Answering with Segment-based KV Cache Retrieval and Compression	Yilong Chen et.al.	2511.07278	translate	read	null
2025-11-10	HENet++: Hybrid Encoding and Multi-task Learning for 3D Perception and End-to-end Autonomous Driving	Zhongyu Xia et.al.	2511.07106	translate	read	null
2025-11-10	Metric Analysis for Spatial Semantic Segmentation of Sound Scenes	Mayank Mishra et.al.	2511.07075	translate	read	null
2025-11-10	TrueCity: Real and Simulated Urban Data for Cross-Domain 3D Scene Understanding	Duc Nguyen et.al.	2511.07007	translate	read	null
2025-11-10	Exploring the “Great Unseen” in Medieval Manuscripts: Instance-Level Labeling of Legacy Image Collections with Zero-Shot Models	Christofer Meinecke et.al.	2511.07004	translate	read	null
2025-11-10	Vision-Aided Online A* Path Planning for Efficient and Safe Navigation of Service Robots	Praveen Kumar et.al.	2511.06801	translate	read	null
2025-11-09	Video Dataset for Surgical Phase, Keypoint, and Instrument Recognition in Laparoscopic Surgery (PhaKIR)	Tobias Rueckert et.al.	2511.06549	translate	read	null
2025-11-09	EIDSeg: A Pixel-Level Semantic Segmentation Dataset for Post-Earthquake Damage Assessment from Social Media Images	Huili Huang et.al.	2511.06456	translate	read	null
2025-11-09	Label-Efficient 3D Forest Mapping: Self-Supervised and Transfer Learning for Individual, Structural, and Species Analysis	Aldino Rizaldy et.al.	2511.06331	translate	read	null
2025-11-09	Temporal-Guided Visual Foundation Models for Event-Based Vision	Ruihao Xia et.al.	2511.06238	translate	read	link
2025-11-08	Polymap: generating high definition map based on rasterized polygons	Shiyu Gao et.al.	2511.05944	translate	read	null
2025-11-07	CoT-X: An Adaptive Framework for Cross-Model Chain-of-Thought Transfer and Optimization	Ziqian Bi et.al.	2511.05747	translate	read	null
2025-11-04	Do Street View Imagery and Public Participation GIS align: Comparative Analysis of Urban Attractiveness	Milad Malekzadeh et.al.	2511.05570	translate	read	null
2025-11-03	Compressing Multi-Task Model for Autonomous Driving via Pruning and Knowledge Distillation	Jiayuan Wang et.al.	2511.05557	translate	read	null
2025-11-07	How Many Tokens Do 3D Point Cloud Transformer Architectures Really Need?	Tuan Anh Tran et.al.	2511.05449	translate	read	null
2025-11-07	Canonical Space Representation for 4D Panoptic Segmentation of Articulated Objects	Manuel Gomes et.al.	2511.05356	translate	read	null
2025-11-07	No Pose Estimation? No Problem: Pose-Agnostic and Instance-Aware Test-Time Adaptation for Monocular Depth Estimation	Mingyu Sung et.al.	2511.05055	translate	read	null
2025-11-07	LG-NuSegHop: A Local-to-Global Self-Supervised Pipeline For Nuclei Instance Segmentation	Vasileios Magoulianitis et.al.	2511.04892	translate	read	null
2025-11-06	An Active Learning Pipeline for Biomedical Image Instance Segmentation with Minimal Human Intervention	Shuo Zhao et.al.	2511.04811	translate	read	null
2025-11-06	Cambrian-S: Towards Spatial Supersensing in Video	Shusheng Yang et.al.	2511.04670	translate	read	null
2025-11-06	Vitessce Link: A Mixed Reality and 2D Display Hybrid Approach for Visual Analysis of 3D Tissue Maps	Eric Mörth et.al.	2511.04262	translate	read	null
2025-11-06	CaRF: Enhancing Multi-View Consistency in Referring 3D Gaussian Splatting Segmentation	Yuwen Tao et.al.	2511.03992	translate	read	null
2025-11-05	Laugh, Relate, Engage: Stylized Comment Generation for Short Videos	Xuan Ouyang et.al.	2511.03757	translate	read	null
2025-11-05	Computational Imaging Meets LLMs: Zero-Shot IDH Mutation Prediction in Brain Gliomas	Syed Muqeem Mahmood et.al.	2511.03376	translate	read	null
2025-11-05	Enhancing Medical Image Segmentation via Heat Conduction Equation	Rong Wu et.al.	2511.03260	translate	read	null
2025-11-05	Diffusion-Guided Mask-Consistent Paired Mixing for Endoscopic Image Segmentation	Pengyu Jie et.al.	2511.03219	translate	read	null
2025-11-05	Subsampled Randomized Fourier GaLore for Adapting Foundation Models in Depth-Driven Liver Landmark Segmentation	Yun-Chen Lin et.al.	2511.03163	translate	read	null
2025-11-05	Accelerating Physical Property Reasoning for Augmented Visual Cognition	Hongbo Lan et.al.	2511.03126	translate	read	null
2025-11-04	Learning with less: label-efficient land cover classification at very high spatial resolution using self-supervised deep learning	Dakota Hester et.al.	2511.03004	translate	read	null
2025-11-04	Comprehensive Assessment of LiDAR Evaluation Metrics: A Comparative Study Using Simulated and Real Data	Syed Mostaquim Ali et.al.	2511.02994	translate	read	null
2025-11-04	Digital Twin-Driven Pavement Health Monitoring and Maintenance Optimization Using Graph Neural Networks	Mohsin Mahmud Topu et.al.	2511.02957	translate	read	null
2025-11-04	Optimizing the nnU-Net model for brain tumor (Glioma) segmentation Using a BraTS Sub-Saharan Africa (SSA) dataset	Chukwuemeka Arua Kalu et.al.	2511.02893	translate	read	null
2025-11-02	Digitizing Spermatogenesis Lineage at Nanoscale Resolution In Tissue-Level Electron Microscopy	Li Xiao et.al.	2511.02860	translate	read	null
2025-11-04	Densemarks: Learning Canonical Embeddings for Human Heads Images via Point Tracks	Dmitrii Pozdeev et.al.	2511.02830	translate	read	null
2025-11-04	PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction & Editing	Antonio Oroz et.al.	2511.02777	translate	read	null
2025-11-04	Resource-efficient Automatic Refinement of Segmentations via Weak Supervision from Light Feedback	Alix de Langlais et.al.	2511.02576	translate	read	null
2025-11-04	ESA: Energy-Based Shot Assembly Optimization for Automatic Video Editing	Yaosen Chen et.al.	2511.02505	translate	read	null
2025-11-04	Synthetic Crop-Weed Image Generation and its Impact on Model Generalization	Garen Boyadjian et.al.	2511.02417	translate	read	null
2025-11-04	Revisiting put-that-there, context aware window interactions via LLMs	Riccardo Bovo et.al.	2511.02378	translate	read	null
2025-11-04	From Instance Segmentation to 3D Growth Trajectory Reconstruction in Planktonic Foraminifera	Huahua Lin et.al.	2511.02142	translate	read	null
2025-11-03	Terrain-Enhanced Resolution-aware Refinement Attention for Off-Road Segmentation	Seongkyu Choi et.al.	2511.01434	translate	read	null
2025-11-03	MIQ-SAM3D: From Single-Point Prompt to Multi-Instance Segmentation via Competitive Query Refinement	Jierui Qu et.al.	2511.01345	translate	read	null
2025-11-03	Source-Only Cross-Weather LiDAR via Geometry-Aware Point Drop	YoungJae Cheong et.al.	2511.01250	translate	read	null
2025-11-03	CenterMamba-SAM: Center-Prioritized Scanning and Temporal Prototypes for Brain Lesion Segmentation	Yu Tian et.al.	2511.01243	translate	read	null
2025-11-03	An Enhanced Proprioceptive Method for Soft Robots Integrating Bend Sensors and IMUs	Dong Heon Han et.al.	2511.01165	translate	read	null
2025-11-03	MicroAUNet: Boundary-Enhanced Multi-scale Fusion with Knowledge Distillation for Colonoscopy Polyp Image Segmentation	Ziyi Wang et.al.	2511.01143	translate	read	null
2025-11-02	URDF-Anything: Constructing Articulated Objects with 3D Multimodal Language Model	Zhe Li et.al.	2511.00940	translate	read	null
2025-11-02	TA-LSDiff:Topology-Aware Diffusion Guided by a Level Set Energy for Pancreas Segmentation	Yue Gou et.al.	2511.00815	translate	read	null
2025-11-02	Rhythm in the Air: Vision-based Real-Time Music Generation through Gestures	Barathi Subramanian et.al.	2511.00793	translate	read	null
2025-11-02	Class-agnostic 3D Segmentation by Granularity-Consistent Automatic 2D Mask Tracking	Juan Wang et.al.	2511.00785	translate	read	null
2025-11-01	Grounding Surgical Action Triplets with Instrument Instance Segmentation: A Dataset and Target-Aware Fusion Approach	Oluwatosin Alabi et.al.	2511.00643	translate	read	null
2025-11-01	Text-guided Fine-Grained Video Anomaly Detection	Jihao Gu et.al.	2511.00524	translate	read	null
2025-11-01	Optimization of continuous-flow over traffic networks with fundamental diagram constraints	Anqi Dong et.al.	2511.00500	translate	read	null
2025-11-01	HumanCrafter: Synergizing Generalizable Human Reconstruction and Semantic 3D Segmentation	Panwang Pan et.al.	2511.00468	translate	read	null
2025-11-01	Tree Training: Accelerating Agentic LLMs Training via Shared Prefix Reuse	Shaojie Wang et.al.	2511.00413	translate	read	null

(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)