Semantic Segmentation - 2026-03
Semantic Segmentation - 2026-03
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2026-03-31 | Polyhedral Unmixing: Bridging Semantic Segmentation with Hyperspectral Unmixing via Polyhedral-Cone Partitioning | Antoine Bottenmuller et.al. | 2603.29438 | translate | read | null |
| 2026-03-31 | ConInfer: Context-Aware Inference for Training-Free Open-Vocabulary Remote Sensing Segmentation | Wenyang Chen et.al. | 2603.29271 | translate | read | null |
| 2026-03-30 | Detection of Adversarial Attacks in Robotic Perception | Ziad Sharawy et.al. | 2603.28594 | translate | read | null |
| 2026-03-30 | Unified Restoration-Perception Learning: Maritime Infrared-Visible Image Fusion and Segmentation | Weichao Cai et.al. | 2603.28414 | translate | read | null |
| 2026-03-30 | DinoDental: Benchmarking DINOv3 as a Unified Vision Encoder for Dental Image Analysis | Kun Tang et.al. | 2603.28297 | translate | read | null |
| 2026-03-30 | RecycleLoRA: Rank-Revealing QR-Based Dual-LoRA Subspace Adaptation for Domain Generalized Semantic Segmentation | Chanseul Cho et.al. | 2603.28142 | translate | read | null |
| 2026-03-30 | Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models | Luigi Curini et.al. | 2603.28103 | translate | read | null |
| 2026-03-30 | Adapting SAM to Nuclei Instance Segmentation and Classification via Cooperative Fine-Grained Refinement | Jingze Su et.al. | 2603.28027 | translate | read | null |
| 2026-03-30 | SegRGB-X: General RGB-X Semantic Segmentation Model | Jiong Liu et.al. | 2603.28023 | translate | read | null |
| 2026-03-30 | Progressive Prompt-Guided Cross-Modal Reasoning for Referring Image Segmentation | Jiachen Li et.al. | 2603.27993 | translate | read | null |
| 2026-03-30 | A Cross-Scale Decoder with Token Refinement for Off-Road Semantic Segmentation | Seongkyu Choi Jhonghyun An et.al. | 2603.27931 | translate | read | null |
| 2026-03-30 | ForestSim: A Synthetic Benchmark for Intelligent Vehicle Perception in Unstructured Forest Environments | Pragat Wagle et.al. | 2603.27923 | translate | read | null |
| 2026-03-25 | Heuristic Self-Paced Learning for Domain Adaptive Semantic Segmentation under Adverse Conditions | Shiqin Wang et.al. | 2603.24322 | translate | read | null |
| 2026-03-25 | RS-SSM: Refining Forgotten Specifics in State Space Model for Video Semantic Segmentation | Kai Zhu et.al. | 2603.24295 | translate | read | null |
| 2026-03-25 | InstanceRSR: Real-World Super-Resolution via Instance-Aware Representation Alignment | Zixin Guo et.al. | 2603.24240 | translate | read | null |
| 2026-03-24 | Looking Beyond the Window: Global-Local Aligned CLIP for Training-free Open-Vocabulary Semantic Segmentation | ByeongCheol Lee et.al. | 2603.23030 | translate | read | null |
| 2026-03-23 | Toward Faithful Segmentation Attribution via Benchmarking and Dual-Evidence Fusion | Abu Noman Md Sakib et.al. | 2603.22624 | translate | read | null |
| 2026-03-23 | CanViT: Toward Active-Vision Foundation Models | Yohaï-Eliel Berreby et.al. | 2603.22570 | translate | read | null |
| 2026-03-23 | UrbanVGGT: Scalable Sidewalk Width Estimation from Street View Images | Kaizhen Tan et.al. | 2603.22531 | translate | read | null |
| 2026-03-23 | Spatially-Aware Evaluation Framework for Aerial LiDAR Point Cloud Semantic Segmentation: Distance-Based Metrics on Challenging Regions | Alex Salvatierra et.al. | 2603.22420 | translate | read | null |
| 2026-03-23 | Riverine Land Cover Mapping through Semantic Segmentation of Multispectral Point Clouds | Sopitta Thurachen et.al. | 2603.22230 | translate | read | null |
| 2026-03-23 | Benchmarking Deep Learning Models for Aerial LiDAR Point Cloud Semantic Segmentation under Real Acquisition Conditions: A Case Study in Navarre | Alex Salvatierra et.al. | 2603.22229 | translate | read | null |
| 2026-03-23 | Look, Listen and Segment: Towards Weakly Supervised Audio-visual Semantic Segmentation | Chengzhi Li et.al. | 2603.21948 | translate | read | null |
| 2026-03-23 | CataractSAM-2: A Domain-Adapted Model for Anterior Segment Surgery Segmentation and Scalable Ground-Truth Annotation | Mohammad Eslami et.al. | 2603.21566 | translate | read | null |
| 2026-03-23 | PEARL: Geometry Aligns Semantics for Training-Free Open-Vocabulary Semantic Segmentation | Gensheng Pei et.al. | 2603.21528 | translate | read | null |
| 2026-03-22 | Mitigating Objectness Bias and Region-to-Text Misalignment for Open-Vocabulary Panoptic Segmentation | Nikolay Kormushev et.al. | 2603.21386 | translate | read | null |
| 2026-03-22 | Boundary-Aware Instance Segmentation in Microscopy Imaging | Thomas Mendelson et.al. | 2603.21206 | translate | read | null |
| 2026-03-22 | LiFR-Seg: Anytime High-Frame-Rate Segmentation via Event-Guided Propagation | Xiaoshan Wu et.al. | 2603.21115 | translate | read | null |
| 2026-03-22 | CTFS : Collaborative Teacher Framework for Forward-Looking Sonar Image Semantic Segmentation with Extremely Limited Labels | Ping Guo et.al. | 2603.21071 | translate | read | null |
| 2026-03-21 | Elite Lanes: Evolutionary Generation of Realistic Small-Scale Road Networks | Artur Morys-Magiera et.al. | 2603.20964 | translate | read | null |
| 2026-03-21 | Lean Learning Beyond Clouds: Efficient Discrepancy-Conditioned Optical-SAR Fusion for Semantic Segmentation | Chenxing Meng et.al. | 2603.20811 | translate | read | null |
| 2026-03-21 | OmniPatch: A Universal Adversarial Patch for ViT-CNN Cross-Architecture Transfer in Semantic Segmentation | Aarush Aggarwal et.al. | 2603.20777 | translate | read | null |
| 2026-03-20 | An Open Source Computer Vision and Machine Learning Framework for Affordable Life Science Robotic Automation | Zachary Logan et.al. | 2603.20465 | translate | read | null |
| 2026-03-20 | MFil-Mamba: Multi-Filter Scanning for Spatial Redundancy-Aware Visual State Space Models | Puskal Khadka et.al. | 2603.20074 | translate | read | null |
| 2026-03-20 | SegVGGT: Joint 3D Reconstruction and Instance Segmentation from Multi-View Images | Jinyuan Qu et.al. | 2603.19926 | translate | read | null |
| 2026-03-20 | PanORama: Multiview Consistent Panoptic Segmentation in Operating Rooms | Tuna Gürbüz et.al. | 2603.19920 | translate | read | null |
| 2026-03-20 | Evaluating Vision Foundation Models for Pixel and Object Classification in Microscopy | Carolin Teuber et.al. | 2603.19802 | translate | read | null |
| 2026-03-20 | Uncertainty-aware Prototype Learning with Variational Inference for Few-shot Point Cloud Segmentation | Yifei Zhao et.al. | 2603.19757 | translate | read | null |
| 2026-03-20 | LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment | Shuaibang Peng et.al. | 2603.19609 | translate | read | null |
| 2026-03-20 | MagicSeg: Open-World Segmentation Pretraining via Counterfactural Diffusion-Based Auto-Generation | Kaixin Cai et.al. | 2603.19575 | translate | read | null |
| 2026-03-19 | dinov3.seg: Open-Vocabulary Semantic Segmentation with DINOv3 | Saikat Dutta et.al. | 2603.19531 | translate | read | null |
| 2026-03-19 | DriveTok: 3D Driving Scene Tokenization for Unified Multi-View Reconstruction and Understanding | Dong Zhuo et.al. | 2603.19219 | translate | read | null |
| 2026-03-19 | Reconstruction Matters: Learning Geometry-Aligned BEV Representation through 3D Gaussian Splatting | Yiren Lu et.al. | 2603.19193 | translate | read | null |
| 2026-03-19 | Perceptio: Perception Enhanced Vision Language Models via Spatial Token Generation | Yuchen Li et.al. | 2603.18795 | translate | read | null |
| 2026-03-19 | EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation | Longfei Liu et.al. | 2603.18739 | translate | read | null |
| 2026-03-19 | Towards High-Quality Image Segmentation: Improving Topology Accuracy by Penalizing Neighbor Pixels | Juan Miguel Valverde et.al. | 2603.18671 | translate | read | null |
| 2026-03-19 | R&D: Balancing Reliability and Diversity in Synthetic Data Augmentation for Semantic Segmentation | Huy Che et.al. | 2603.18427 | translate | read | null |
| 2026-03-18 | Semantic Segmentation and Depth Estimation for Real-Time Lunar Surface Mapping Using 3D Gaussian Splatting | Guillem Casadesus Vila et.al. | 2603.18218 | translate | read | null |
| 2026-03-18 | SegFly: A 2D-3D-2D Paradigm for Aerial RGB-Thermal Semantic Segmentation at Scale | Markus Gross et.al. | 2603.17920 | translate | read | null |
| 2026-03-18 | Revisiting foundation models for cell instance segmentation | Anwai Archit et.al. | 2603.17845 | translate | read | null |
| 2026-03-18 | Parameter-Efficient Modality-Balanced Symmetric Fusion for Multimodal Remote Sensing Semantic Segmentation | Haocheng Li et.al. | 2603.17705 | translate | read | null |
| 2026-03-18 | AdaMuS: Adaptive Multi-view Sparsity Learning for Dimensionally Unbalanced Data | Cai Xu et.al. | 2603.17610 | translate | read | null |
| 2026-03-18 | Learning Coordinate-based Convolutional Kernels for Continuous SE(3) Equivariant and Efficient Point Cloud Analysis | Jaein Kim et.al. | 2603.17538 | translate | read | null |
| 2026-03-18 | SafeLand: Safe Autonomous Landing in Unknown Environments with Bayesian Semantic Mapping | Markus Gross et.al. | 2603.17430 | translate | read | null |
| 2026-03-18 | Full Stack Navigation, Mapping, and Planning for the Lunar Autonomy Challenge | Adam Dai et.al. | 2603.17232 | translate | read | null |
| 2026-03-17 | DesertFormer: Transformer-Based Semantic Segmentation for Off-Road Desert Terrain Classification in Autonomous Navigation Systems | Yasaswini Chebolu et.al. | 2603.17056 | translate | read | null |
| 2026-03-17 | vAccSOL: Efficient and Transparent AI Vision Offloading for Mobile Robots | Adam Zahir et.al. | 2603.16685 | translate | read | null |
| 2026-03-17 | TCATSeg: A Tooth Center-Wise Attention Network for 3D Dental Model Semantic Segmentation | Qiang He et.al. | 2603.16620 | translate | read | null |
| 2026-03-17 | Segmentation-Based Attention Entropy: Detecting and Mitigating Object Hallucinations in Large Vision-Language Models | Jiale Song et.al. | 2603.16558 | translate | read | null |
| 2026-03-17 | SF-Mamba: Rethinking State Space Model for Vision | Masakazu Yoshimura et.al. | 2603.16423 | translate | read | null |
| 2026-03-17 | Poisoning the Pixels: Revisiting Backdoor Attacks on Semantic Segmentation | Guangsheng Zhang et.al. | 2603.16405 | translate | read | null |
| 2026-03-17 | Exclusivity-Guided Mask Learning for Semi-Supervised Crowd Instance Segmentation and Counting | Jiyang Huang et.al. | 2603.16241 | translate | read | null |
| 2026-03-17 | Speak, Segment, Track, Navigate: An Interactive System for Video-Guided Skull-Base Surgery | Jecia Z. Y. Mao et.al. | 2603.16024 | translate | read | null |
| 2026-03-16 | Self-Distillation of Hidden Layers for Self-Supervised Representation Learning | Scott C. Lowe et.al. | 2603.15553 | translate | read | null |
| 2026-03-16 | Seeing Beyond: Extrapolative Domain Adaptive Panoramic Segmentation | Yuanfan Zheng et.al. | 2603.15475 | translate | read | null |
| 2026-03-16 | A Tutorial on ALOS2 SAR Utilization: Dataset Preparation, Self-Supervised Pretraining, and Semantic Segmentation | Nevrez Imamoglu et.al. | 2603.15119 | translate | read | null |
| 2026-03-15 | Seeing Where to Deploy: Metric RGB-Based Traversability Analysis for Aerial-to-Ground Hidden Space Inspection | Seoyoung Lee et.al. | 2603.14639 | translate | read | null |
| 2026-03-15 | In-Field 3D Wheat Head Instance Segmentation From TLS Point Clouds Using Deep Learning Without Manual Labels | Tomislav Medic et.al. | 2603.14309 | translate | read | null |
| 2026-03-12 | CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation | Ziqi Ye et.al. | 2603.12008 | translate | read | null |
| 2026-03-12 | ActiveFreq: Integrating Active Learning and Frequency Domain Analysis for Interactive Segmentation | Lijun Guo et.al. | 2603.11498 | translate | read | null |
| 2026-03-11 | World Mouse: Exploring Interactions with a Cross-Reality Cursor | Esen K. Tütüncü et.al. | 2603.10984 | translate | read | null |
| 2026-03-11 | BALD-SAM: Disagreement-based Active Prompting in Interactive Segmentation | Prithwijit Chowdhury et.al. | 2603.10828 | translate | read | null |
| 2026-03-11 | A dataset of medication images with instance segmentation masks for preventing adverse drug events | W. I. Chu et.al. | 2603.10825 | translate | read | null |
| 2026-03-11 | Phase-Interface Instance Segmentation as a Visual Sensor for Laboratory Process Monitoring | Mingyue Li et.al. | 2603.10782 | translate | read | null |
| 2026-03-10 | From Semantics to Pixels: Coarse-to-Fine Masked Autoencoders for Hierarchical Visual Understanding | Wenzhao Xiang et.al. | 2603.09955 | translate | read | null |
| 2026-03-10 | World2Mind: Cognition Toolkit for Allocentric Spatial Reasoning in Foundation Models | Shouwei Ruan et.al. | 2603.09774 | translate | read | null |
| 2026-03-10 | Grounding Synthetic Data Generation With Vision and Language Models | Ümit Mert Çağlar et.al. | 2603.09625 | translate | read | null |
| 2026-03-10 | SpaceSense-Bench: A Large-Scale Multi-Modal Benchmark for Spacecraft Perception and Pose Estimation | Aodi Wu et.al. | 2603.09320 | translate | read | null |
| 2026-03-10 | Towards Instance Segmentation with Polygon Detection Transformers | Jiacheng Sun et.al. | 2603.09245 | translate | read | null |
| 2026-03-10 | RTFDNet: Fusion-Decoupling for Robust RGB-T Segmentation | Kunyu Tan et.al. | 2603.09149 | translate | read | null |
| 2026-03-10 | Rotation Equivariant Mamba for Vision Tasks | Zhongchen Zhao et.al. | 2603.09138 | translate | read | null |
| 2026-03-10 | Intelligent Spatial Estimation for Fire Hazards in Engineering Sites: An Enhanced YOLOv8-Powered Proximity Analysis Framework | Ammar K. AlMhdawi et.al. | 2603.09069 | translate | read | null |
| 2026-03-09 | Weakly Supervised Teacher-Student Framework with Progressive Pseudo-mask Refinement for Gland Segmentation | Hikmat Khan et.al. | 2603.08605 | translate | read | null |
| 2026-03-09 | Viewpoint-Agnostic Grasp Pipeline using VLM and Partial Observations | Dilermando Almeida et.al. | 2603.07866 | translate | read | null |
| 2026-03-08 | Efficient RGB-D Scene Understanding via Multi-task Adaptive Learning and Cross-dimensional Feature Guidance | Guodong Sun et.al. | 2603.07570 | translate | read | null |
| 2026-03-08 | Defect Detection in Magnetic Systems Using U-Net and Statistical Measures | Ross Knapman et.al. | 2603.07542 | translate | read | null |
| 2026-03-08 | SIGMAE: A Spectral-Index-Guided Foundation Model for Multispectral Remote Sensing | Xiaokang Zhang et.al. | 2603.07463 | translate | read | null |
| 2026-03-06 | SG-DOR: Learning Scene Graphs with Direction-Conditioned Occlusion Reasoning for Pepper Plants | Rohit Menon et.al. | 2603.06512 | translate | read | null |
| 2026-03-06 | CLoPA: Continual Low Parameter Adaptation of Interactive Segmentation for Medical Image Annotation | Parhom Esmaeili et.al. | 2603.06426 | translate | read | null |
| 2026-03-06 | Rewis3d: Reconstruction Improves Weakly-Supervised Semantic Segmentation | Jonas Ernst et.al. | 2603.06374 | translate | read | null |
| 2026-03-06 | P-SLCR: Unsupervised Point Cloud Semantic Segmentation via Prototypes Structure Learning and Consistent Reasoning | Lixin Zhan et.al. | 2603.06321 | translate | read | null |
| 2026-03-06 | Making Training-Free Diffusion Segmentors Scale with the Generative Power | Benyuan Meng et.al. | 2603.06178 | translate | read | null |
| 2026-03-06 | JOPP-3D: Joint Open Vocabulary Semantic Segmentation on Point Clouds and Panoramas | Sandeep Inuganti et.al. | 2603.06168 | translate | read | null |
| 2026-03-05 | Safe-SAGE: Social-Semantic Adaptive Guidance for Safe Engagement through Laplace-Modulated Poisson Safety Functions | Lizhi Yang et.al. | 2603.05497 | translate | read | null |
| 2026-03-05 | VinePT-Map: Pole-Trunk Semantic Mapping for Resilient Autonomous Robotics in Vineyards | Giorgio Audrito et.al. | 2603.05070 | translate | read | null |
| 2026-03-05 | Generalizable Multiscale Segmentation of Heterogeneous Map Collections | Remi Petitpierre et.al. | 2603.05037 | translate | read | null |
| 2026-03-04 | Semantic Bridging Domains: Pseudo-Source as Test-Time Connector | Xizhong Yang et.al. | 2603.03844 | translate | read | null |
| 2026-03-04 | Field imaging framework for morphological characterization of aggregates with computer vision: Algorithms and applications | Haohang Huang et.al. | 2603.03654 | translate | read | null |
| 2026-03-04 | LeafInst - Unified Instance Segmentation Network for Fine-Grained Forestry Leaf Phenotype Analysis: A New UAV based Benchmark | Taige Luo et.al. | 2603.03616 | translate | read | null |
| 2026-03-03 | TinyIceNet: Low-Power SAR Sea Ice Segmentation for On-Board FPGA Inference | Mhd Rashed Al Koutayni et.al. | 2603.03075 | translate | read | null |
| 2026-03-03 | Interpretable Motion-Attentive Maps: Spatio-Temporally Localizing Concepts in Video Diffusion Transformers | Youngjun Jun et.al. | 2603.02919 | translate | read | null |
| 2026-03-03 | DREAM: Where Visual Understanding Meets Text-to-Image Generation | Chao Li et.al. | 2603.02667 | translate | read | null |
| 2026-03-03 | SEP-YOLO: Fourier-Domain Feature Representation for Transparent Object Instance Segmentation | Fengming Zhang et.al. | 2603.02648 | translate | read | null |
| 2026-03-03 | CAWM-Mamba: A unified model for infrared-visible image fusion and compound adverse weather restoration | Huichun Liu et.al. | 2603.02560 | translate | read | null |
| 2026-03-03 | Generalizable Knowledge Distillation from Vision Foundation Models for Semantic Segmentation | Chonghua Lv et.al. | 2603.02554 | translate | read | null |
| 2026-03-03 | SGMA: Semantic-Guided Modality-Aware Segmentation for Remote Sensing with Incomplete Multimodal Data | Lekang Wen et.al. | 2603.02505 | translate | read | null |
| 2026-03-02 | Downstream Task Inspired Underwater Image Enhancement: A Perception-Aware Study from Dataset Construction to Network Design | Bosen Lin et.al. | 2603.01767 | translate | read | null |
| 2026-03-02 | Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing | Zijin Yin et.al. | 2603.01535 | translate | read | null |
| 2026-03-02 | SeaVIS: Sound-Enhanced Association for Online Audio-Visual Instance Segmentation | Yingjian Zhu et.al. | 2603.01431 | translate | read | null |
| 2026-03-01 | Open-Vocabulary vs Supervised Learning Methods for Post-Disaster Visual Scene Understanding | Anna Michailidou et.al. | 2603.01324 | translate | read | null |
| 2026-03-01 | CoSMo3D: Open-World Promptable 3D Semantic Part Segmentation through LLM-Guided Canonical Spatial Modeling | Li Jin et.al. | 2603.01205 | translate | read | null |
| 2026-03-01 | Adaptive Augmentation-Aware Latent Learning for Robust LiDAR Semantic Segmentation | Wangkai Li et.al. | 2603.01074 | translate | read | null |
(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)