Semantic Segmentation - 2025-07

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-07-25	Co-Win: Joint Object Detection and Instance Segmentation in LiDAR Point Clouds via Collaborative Window Processing	Haichuan Li et.al.	2507.19691	translate	read	null
2025-07-25	SurgPIS: Surgical-instrument-level Instances and Part-level Semantics for Weakly-supervised Part-aware Instance Segmentation	Meng Wei et.al.	2507.19592	translate	read	null
2025-07-24	HybridTM: Combining Transformer and Mamba for 3D Semantic Segmentation	Xinyu Wang et.al.	2507.18575	translate	read	null
2025-07-24	Synthetic Data Augmentation for Enhanced Chicken Carcass Instance Segmentation	Yihong Feng et.al.	2507.18558	translate	read	null
2025-07-24	Iwin Transformer: Hierarchical Vision Transformer using Interleaved Windows	Simin Huo et.al.	2507.18405	translate	read	link
2025-07-24	GVCCS: A Dataset for Contrail Identification and Tracking on Visible Whole Sky Camera Sequences	Gabriel Jarry et.al.	2507.18330	translate	read	null
2025-07-24	SemiSegECG: A Multi-Dataset Benchmark for Semi-Supervised Semantic Segmentation in ECG Delineation	Minje Park et.al.	2507.18323	translate	read	link
2025-07-24	Unsupervised Domain Adaptation for 3D LiDAR Semantic Segmentation Using Contrastive Learning and Multi-Model Pseudo Labeling	Abhishek Kaushik et.al.	2507.18176	translate	read	null
2025-07-23	AFRDA: Attentive Feature Refinement for Domain Adaptive Semantic Segmentation	Md. Al-Masrur Khan et.al.	2507.17957	translate	read	link
2025-07-23	Exploring Spatial Diversity for Region-based Active Learning	Lile Cai et.al.	2507.17367	translate	read	null
2025-07-23	Exploring Active Learning for Semiconductor Defect Segmentation	Lile Cai et.al.	2507.17359	translate	read	null
2025-07-23	Swin-TUNA : A Novel PEFT Approach for Accurate Food Image Segmentation	Haotian Chen et.al.	2507.17347	translate	read	null
2025-07-23	On Temporal Guidance and Iterative Refinement in Audio Source Separation	Tobias Morocutti et.al.	2507.17297	translate	read	null
2025-07-23	ScSAM: Debiasing Morphology and Distributional Variability in Subcellular Semantic Segmentation	Bo Fang et.al.	2507.17149	translate	read	null
2025-07-22	MultiTaskDeltaNet: Change Detection-based Image Segmentation for Operando ETEM with Application to Carbon Gasification Kinetics	Yushuo Niu et.al.	2507.16803	translate	read	null
2025-07-22	A2Mamba: Attention-augmented State Space Models for Visual Recognition	Meng Lou et.al.	2507.16624	translate	read	link
2025-07-22	Semantic Segmentation for Preoperative Planning in Transcatheter Aortic Valve Replacement	Cedric Zöllner et.al.	2507.16573	translate	read	null
2025-07-22	Comparative validation of surgical phase recognition, instrument keypoint estimation, and instrument instance segmentation in endoscopy: Results of the PhaKIR 2024 challenge	Tobias Rueckert et.al.	2507.16559	translate	read	null
2025-07-23	EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion	Shang Liu et.al.	2507.16535	translate	read	null
2025-07-22	Advancing Visual Large Language Model for Multi-granular Versatile Perception	Wentao Xiang et.al.	2507.16213	translate	read	null
2025-07-22	AMMNet: An Asymmetric Multi-Modal Network for Remote Sensing Semantic Segmentation	Hui Ye et.al.	2507.16158	translate	read	null
2025-07-21	Improved Semantic Segmentation from Ultra-Low-Resolution RGB Images Applied to Privacy-Preserving Object-Goal Navigation	Xuying Huang et.al.	2507.16034	translate	read	null
2025-07-21	ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction	Danhui Chen et.al.	2507.15803	translate	read	null
2025-07-21	ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting	Ruijie Zhu et.al.	2507.15454	translate	read	link
2025-07-21	Rethinking Occlusion in FER: A Semantic-Aware Perspective and Go Beyond	Huiyu Zhai et.al.	2507.15401	translate	read	null
2025-07-20	Towards Geometric and Textural Consistency 3D Scene Generation via Single Image-guided Model Generation and Layout Optimization	Xiang Tang et.al.	2507.14841	translate	read	null
2025-07-20	A Novel Downsampling Strategy Based on Information Complementarity for Medical Image Segmentation	Wenbo Yue et.al.	2507.14790	translate	read	null
2025-07-19	GTPBD: A Fine-Grained Global Terraced Parcel and Boundary Dataset	Zhiwei Zhang et.al.	2507.14697	translate	read	null
2025-07-19	Artificial Intelligence in the Food Industry: Food Waste Estimation based on Computer Vision, a Brief Case Study in a University Dining Hall	Shayan Rokhva et.al.	2507.14662	translate	read	null
2025-07-19	Multispectral State-Space Feature Fusion: Bridging Shared and Cross-Parametric Interactions for Object Detection	Jifeng Shen et.al.	2507.14643	translate	read	null
2025-07-19	DiSCO-3D : Discovering and segmenting Sub-Concepts from Open-vocabulary queries in NeRF	Doriand Petit et.al.	2507.14596	translate	read	null
2025-07-18	Semantic Segmentation based Scene Understanding in Autonomous Vehicles	Ehsan Rassekh et.al.	2507.14303	translate	read	null
2025-07-18	Leveraging Pathology Foundation Models for Panoptic Segmentation of Melanoma in H&E Images	Jiaqi Lv et.al.	2507.13974	translate	read	null
2025-07-17	SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation	Shiqi Huang et.al.	2507.12857	translate	read	null
2025-07-17	A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique	Homare Sueyoshi et.al.	2507.12730	translate	read	null
2025-07-16	VolSegGS: Segmentation and Tracking in Dynamic Volumetric Scenes via Deformable 3D Gaussians	Siyuan Yao et.al.	2507.12667	translate	read	null
2025-07-16	NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting	Kuangshi Ai et.al.	2507.12621	translate	read	null
2025-07-16	Out-of-distribution data supervision towards biomedical semantic segmentation	Yiquan Gao et.al.	2507.12105	translate	read	null
2025-07-16	Tree-SLAM: semantic object SLAM for efficient mapping of individual trees in orchards	David Rapado-Rincon et.al.	2507.12093	translate	read	null
2025-07-16	Frequency-Dynamic Attention Modulation for Dense Prediction	Linwei Chen et.al.	2507.12006	translate	read	null
2025-07-16	SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation	Jun Yin et.al.	2507.11994	translate	read	null
2025-07-16	Prototypical Progressive Alignment and Reweighting for Generalizable Semantic Segmentation	Yuhang Zhang et.al.	2507.11955	translate	read	null
2025-07-16	Spatial Frequency Modulation for Semantic Segmentation	Linwei Chen et.al.	2507.11893	translate	read	link
2025-07-15	SToFM: a Multi-scale Foundation Model for Spatial Transcriptomics	Suyuan Zhao et.al.	2507.11588	translate	read	null
2025-07-15	Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping	Yujie Zhang et.al.	2507.11279	translate	read	null
2025-07-15	Personalized OVSS: Understanding Personal Concept in Open-Vocabulary Semantic Segmentation	Sunghyun Park et.al.	2507.11030	translate	read	null
2025-07-15	Graph Aggregation Prototype Learning for Semantic Change Detection in Remote Sensing	Zhengyi Xu et.al.	2507.10938	translate	read	null
2025-07-14	Static or Temporal? Semantic Scene Simplification to Aid Wayfinding in Immersive Simulations of Bionic Vision	Justin M. Kasowski et.al.	2507.10813	translate	read	null
2025-07-14	rt-RISeg: Real-Time Model-Free Robot Interactive Segmentation for Active Instance-Level Object Understanding	Howard H. Qian et.al.	2507.10776	translate	read	null
2025-07-14	FGSSNet: Feature-Guided Semantic Segmentation of Real World Floorplans	Hugo Norrby et.al.	2507.10343	translate	read	null
2025-07-14	Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks	Ben Hamscher et.al.	2507.10239	translate	read	null
2025-07-14	Spatial Lifting for Dense Prediction	Mingzhi Xu et.al.	2507.10222	translate	read	null
2025-07-14	DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation	Ivan Martinović et.al.	2507.10118	translate	read	null
2025-07-13	MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression	Ofir Gordon et.al.	2507.09616	translate	read	null
2025-07-13	Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive	You Huang et.al.	2507.09612	translate	read	null
2025-07-13	SegVec3D: A Method for Vector Embedding of 3D Objects Oriented Towards Robot manipulation	Zhihan Kang et.al.	2507.09459	translate	read	null
2025-07-11	Multimodal HD Mapping for Intersections by Intelligent Roadside Units	Zhongzhang Chen et.al.	2507.08903	translate	read	null
2025-07-11	Image Translation with Kernel Prediction Networks for Semantic Segmentation	Cristina Mata et.al.	2507.08554	translate	read	null
2025-07-11	From Enhancement to Understanding: Build a Generalized Bridge for Low-light Vision via Semantically Consistent Unsupervised Fine-tuning	Sen Wang et.al.	2507.08380	translate	read	null
2025-07-11	SurfDist: Interpretable Three-Dimensional Instance Segmentation Using Curved Surface Patches	Jackson Borchardt et.al.	2507.08223	translate	read	null
2025-07-10	RAPS-3D: Efficient interactive segmentation for 3D radiological imaging	Théo Danielou et.al.	2507.07730	translate	read	null
2025-07-10	LOSC: LiDAR Open-voc Segmentation Consolidator	Nermin Samet et.al.	2507.07605	translate	read	null
2025-07-10	Diffusion-Guided Knowledge Distillation for Weakly-Supervised Low-Light Semantic Segmentation	Chunyan Wang et.al.	2507.07578	translate	read	null
2025-07-10	Seg-Wild: Interactive Segmentation based on 3D Gaussian Splatting for Unconstrained Image Collections	Yongtang Bao et.al.	2507.07395	translate	read	null
2025-07-08	CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings	Cristina Mata et.al.	2507.07125	translate	read	null
2025-07-09	A multi-modal dataset for insect biodiversity with imagery and DNA at the trap and individual level	Johanna Orsholm et.al.	2507.06972	translate	read	null
2025-07-09	SemRaFiner: Panoptic Segmentation in Sparse and Noisy Radar Point Clouds	Matthias Zeller et.al.	2507.06906	translate	read	null
2025-07-09	Know Your Attention Maps: Class-specific Token Masking for Weakly Supervised Semantic Segmentation	Joelle Hanna et.al.	2507.06848	translate	read	null
2025-07-09	Ambiguity-aware Point Cloud Segmentation by Adaptive Margin Contrastive Learning	Yang Chen et.al.	2507.06592	translate	read	null
2025-07-08	Centralized Copy-Paste: Enhanced Data Augmentation Strategy for Wildland Fire Semantic Segmentation	Joon Tai Kim et.al.	2507.06321	translate	read	null
2025-07-08	FineGrasp: Towards Robust Grasping for Delicate Objects	Yun Du et.al.	2507.05978	translate	read	null
2025-07-08	Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation	Quanzhu Niu et.al.	2507.05948	translate	read	link
2025-07-08	I $^2$ R: Inter and Intra-image Refinement in Few Shot Segmentation	Ourui Fu et.al.	2507.05838	translate	read	null
2025-07-09	Empowering Bridge Digital Twins by Bridging the Data Gap with a Unified Synthesis Framework	Wang Wang et.al.	2507.05814	translate	read	null
2025-07-08	SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning	Xin Hu et.al.	2507.05798	translate	read	null
2025-07-08	DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation	Young Hun Kim et.al.	2507.05627	translate	read	null
2025-07-07	OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts	Shiting Xiao et.al.	2507.05427	translate	read	null
2025-07-07	Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations	Xiang Xu et.al.	2507.05260	translate	read	null
2025-07-07	All in One: Visual-Description-Guided Unified Point Cloud Segmentation	Zongyan Han et.al.	2507.05211	translate	read	null
2025-07-07	RAM-W600: A Multi-Task Wrist Dataset and Benchmark for Rheumatoid Arthritis	Songxiao Yang et.al.	2507.05193	translate	read	null
2025-07-07	MOSU: Autonomous Long-range Robot Navigation with Multi-modal Scene Understanding	Jing Liang et.al.	2507.04686	translate	read	null
2025-07-06	Street design and driving behavior: evidence from a large-scale study in Milan, Amsterdam, and Dubai	Giacomo Orsi et.al.	2507.04434	translate	read	null
2025-07-06	CLIP-RL: Surgical Scene Segmentation Using Contrastive Language-Vision Pretraining & Reinforcement Learning	Fatmaelzahraa Ali Ahmed et.al.	2507.04317	translate	read	null
2025-07-06	Surg-SegFormer: A Dual Transformer-Based Model for Holistic Surgical Scene Segmentation	Fatimaelzahraa Ahmed et.al.	2507.04304	translate	read	null
2025-07-05	Differentiable High-Performance Ray Tracing-Based Simulation of Radio Propagation with Point Clouds	Niklas Vaara et.al.	2507.04021	translate	read	null
2025-07-05	NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World Models	Siyu Li et.al.	2507.04002	translate	read	null
2025-07-05	CoT-Segmenter: Enhancing OOD Detection in Dense Road Scenes via Chain-of-Thought Reasoning	Jeonghyo Song et.al.	2507.03984	translate	read	null
2025-07-03	LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion	Fangfu Liu et.al.	2507.02813	translate	read	link
2025-07-03	No time to train! Training-Free Reference-Based Instance Segmentation	Miguel Espinosa et.al.	2507.02798	translate	read	link
2025-07-03	From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images	Danrong Zhang et.al.	2507.02781	translate	read	null
2025-07-03	MedFormer: Hierarchical Medical Vision Transformer with Content-Aware Dual Sparse Selection Attention	Zunhui Xia et.al.	2507.02488	translate	read	null
2025-07-03	Continual Multiple Instance Learning with Enhanced Localization for Histopathological Whole Slide Image Analysis	Byung Hyun Lee et.al.	2507.02395	translate	read	null
2025-07-03	Perception Activator: An intuitive and portable framework for brain cognitive exploration	Le Xu et.al.	2507.02311	translate	read	null
2025-07-02	How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks	Rahul Ramachandran et.al.	2507.01955	translate	read	link
2025-07-02	3D Reconstruction and Information Fusion between Dormant and Canopy Seasons in Commercial Orchards Using Deep Learning and Fast GICP	Ranjan Sapkota et.al.	2507.01912	translate	read	null
2025-07-02	A Gift from the Integration of Discriminative and Diffusion-based Generative Learning: Boundary Refinement Remote Sensing Semantic Segmentation	Hao Wang et.al.	2507.01573	translate	read	null
2025-07-02	NOCTIS: Novel Object Cyclic Threshold based Instance Segmentation	Max Gandyra et.al.	2507.01463	translate	read	null
2025-07-01	Towards Open-World Human Action Segmentation Using Graph Convolutional Networks	Hao Xing et.al.	2507.00756	translate	read	null
2025-07-01	Rectifying Magnitude Neglect in Linear Attention	Qihang Fan et.al.	2507.00698	translate	read	link
2025-07-02	ExPaMoE: An Expandable Parallel Mixture of Experts for Continual Test-Time Adaptation	JianChao Zhao et.al.	2507.00502	translate	read	null
2025-07-01	Process-aware and high-fidelity microstructure generation using stable diffusion	Hoang Cuong Phan et.al.	2507.00459	translate	read	null
2025-07-01	PlantSegNeRF: A few-shot, cross-dataset method for plant 3D instance point cloud reconstruction via joint-channel NeRF with multi-view image instance matching	Xin Yang et.al.	2507.00371	translate	read	null

(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)