Semantic Segmentation - 2026-01
Semantic Segmentation - 2026-01
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2026-01-31 | StomataSeg: Semi-Supervised Instance Segmentation for Sorghum Stomatal Components | Zhongtian Huang et.al. | 2602.00703 | translate | read | null |
| 2026-01-31 | SPARK: Stochastic Propagation via Affinity-guided Random walK for training-free unsupervised segmentation | Kunal Mahatha et.al. | 2602.00516 | translate | read | null |
| 2026-01-31 | ZS-TreeSeg: A Zero-Shot Framework for Tree Crown Instance Segmentation | Pengyu Chen et.al. | 2602.00470 | translate | read | null |
| 2026-01-30 | Vision-Language Model Purified Semi-Supervised Semantic Segmentation for Remote Sensing Images | Shanwen Wang et.al. | 2602.00202 | translate | read | null |
| 2026-01-29 | YOLOE-26: Integrating YOLO26 with YOLOE for Real-Time Open-Vocabulary Instance Segmentation | Ranjan Sapkota et.al. | 2602.00168 | translate | read | null |
| 2026-01-30 | Segment Any Events with Language | Seungjun Lee et.al. | 2601.23159 | translate | read | link |
| 2026-01-30 | ExpAlign: Expectation-Guided Vision-Language Alignment for Open-Vocabulary Grounding | Junyi Hu et.al. | 2601.22666 | translate | read | null |
| 2026-01-30 | SHED Light on Segmentation for Dense Prediction | Seung Hyun Lee et.al. | 2601.22529 | translate | read | null |
| 2026-01-30 | Class-Aware Permutation-Invariant Signal-to-Distortion Ratio for Semantic Segmentation of Sound Scene with Same-Class Sources | Binh Thien Nguyen et.al. | 2601.22504 | translate | read | null |
| 2026-01-29 | BLO-Inst: Bi-Level Optimization Based Alignment of YOLO and SAM for Robust Instance Segmentation | Li Zhang et.al. | 2601.22061 | translate | read | null |
| 2026-01-29 | Just Noticeable Difference Modeling for Deep Visual Features | Rui Zhao et.al. | 2601.21933 | translate | read | null |
| 2026-01-29 | Depth-Aware Machine Learning Framework for Bubble Characterization in Two-Phase Flows | Chaitanya S Nayak et.al. | 2601.21175 | translate | read | null |
| 2026-01-29 | Bidirectional Cross-Perception for Open-Vocabulary Semantic Segmentation in Remote Sensing Imagery | Jianzheng Wang et.al. | 2601.21159 | translate | read | null |
| 2026-01-27 | NucFuseRank: Dataset Fusion and Performance Ranking for Nuclei Instance Segmentation | Nima Torbati et.al. | 2601.20104 | translate | read | null |
| 2026-01-27 | DiSa: Saliency-Aware Foreground-Background Disentangled Framework for Open-Vocabulary Semantic Segmentation | Zhen Yao et.al. | 2601.20064 | translate | read | null |
| 2026-01-27 | Enhancing Inverse Perspective Mapping for Automatic Vectorized Road Map Generation | Hongji Liu et.al. | 2601.19536 | translate | read | null |
| 2026-01-27 | Learned split-spectrum metalens for obstruction-free broadband imaging in the visible | Seungwoo Yoon et.al. | 2601.19403 | translate | read | null |
| 2026-01-27 | Instance-Guided Radar Depth Estimation for 3D Object Detection | Chen-Chou Lo et.al. | 2601.19314 | translate | read | null |
| 2026-01-23 | Domain-invariant Mixed-domain Semi-supervised Medical Image Segmentation with Clustered Maximum Mean Discrepancy Alignment | Ba-Thinh Lam et.al. | 2601.16954 | translate | read | null |
| 2026-01-23 | REL-SF4PASS: Panoramic Semantic Segmentation with REL Depth Representation and Spherical Fusion | Xuewei Li et.al. | 2601.16788 | translate | read | null |
| 2026-01-23 | PanopMamba: Vision State Space Modeling for Nuclei Panoptic Segmentation | Ming Kang et.al. | 2601.16631 | translate | read | null |
| 2026-01-23 | VISTA-PATH: An interactive foundation model for pathology image segmentation and quantitative analysis in computational pathology | Peixian Liang et.al. | 2601.16451 | translate | read | null |
| 2026-01-22 | SAMTok: Representing Any Mask with Two Words | Yikang Zhou et.al. | 2601.16093 | translate | read | link |
| 2026-01-22 | RadJEPA: Radiology Encoder for Chest X-Rays via Joint Embedding Predictive Architecture | Anas Anwarul Haq Khan et.al. | 2601.15891 | translate | read | null |
| 2026-01-22 | Enhanced LULC Segmentation via Lightweight Model Refinements on ALOS-2 SAR Data | Ali Caglayan et.al. | 2601.15705 | translate | read | null |
| 2026-01-21 | AI-Based Culvert-Sewer Inspection | Christina Thrainer et.al. | 2601.15366 | translate | read | null |
| 2026-01-21 | ZENITH: Automated Gradient Norm Informed Stochastic Optimization | Dhrubo Saha et.al. | 2601.15212 | translate | read | null |
| 2026-01-21 | Erosion Attack for Adversarial Training to Enhance Semantic Segmentation Robustness | Yufei Song et.al. | 2601.14950 | translate | read | null |
| 2026-01-21 | Context Patch Fusion With Class Token Enhancement for Weakly Supervised Semantic Segmentation | Yiyang Fu et.al. | 2601.14718 | translate | read | null |
| 2026-01-20 | XD-MAP: Cross-Modal Domain Adaptation using Semantic Parametric Mapping | Frank Bieder et.al. | 2601.14477 | translate | read | null |
| 2026-01-20 | Active Cross-Modal Visuo-Tactile Perception of Deformable Linear Objects | Raffaele Mazza et.al. | 2601.13979 | translate | read | null |
| 2026-01-20 | SUNSET – A Sensor-fUsioN based semantic SegmEnTation exemplar for ROS-based self-adaptation | Andreas Wiedholz et.al. | 2601.13732 | translate | read | null |
| 2026-01-19 | Deep Learning for Semantic Segmentation of 3D Ultrasound Data | Chenyu Liu et.al. | 2601.13263 | translate | read | null |
| 2026-01-19 | ObjectVisA-120: Object-based Visual Attention Prediction in Interactive Street-crossing Environments | Igor Vozniak et.al. | 2601.13218 | translate | read | null |
| 2026-01-19 | GridNet-HD: A High-Resolution Multi-Modal Dataset for LiDAR-Image Fusion on Power Line Infrastructure | Antoine Carreaud et.al. | 2601.13052 | translate | read | null |
| 2026-01-19 | Cross-Scale Pretraining: Enhancing Self-Supervised Learning for Low-Resolution Satellite Imagery for Semantic Segmentation | John Waithaka et.al. | 2601.12964 | translate | read | null |
| 2026-01-19 | Open Vocabulary Panoptic Segmentation With Retrieval Augmentation | Nafis Sadeq et.al. | 2601.12779 | translate | read | null |
| 2026-01-16 | ReScene4D: Temporally Consistent Semantic Instance Segmentation of Evolving Indoor 3D Scenes | Emily Steiner et.al. | 2601.11508 | translate | read | null |
| 2026-01-16 | Context-Aware Semantic Segmentation via Stage-Wise Attention | Antoine Carreaud et.al. | 2601.11310 | translate | read | null |
| 2026-01-16 | SAMannot: A Memory-Efficient, Local, Open-source Framework for Interactive Video Instance Segmentation based on SAM2 | Gergely Dinya et.al. | 2601.11301 | translate | read | null |
| 2026-01-16 | Graph Smoothing for Enhanced Local Geometry Learning in Point Cloud Analysis | Shangbo Yuan et.al. | 2601.11102 | translate | read | null |
| 2026-01-16 | Sparse Data Tree Canopy Segmentation: Fine-Tuning Leading Pretrained Models on Only 150 Images | David Szczecina et.al. | 2601.10931 | translate | read | null |
| 2026-01-15 | Urban Socio-Semantic Segmentation with Vision-Language Reasoning | Yu Wang et.al. | 2601.10477 | translate | read | link |
| 2026-01-15 | An effective interactive brain cytoarchitectonic parcellation framework using pretrained foundation model | Shiqi Zhang et.al. | 2601.10412 | translate | read | null |
| 2026-01-14 | MedVL-SAM2: A unified 3D medical vision-language model for multimodal reasoning and prompt-driven segmentation | Yang Xing et.al. | 2601.09879 | translate | read | null |
| 2026-01-14 | SAM-Aug: Leveraging SAM Priors for Few-Shot Parcel Segmentation in Satellite Time Series | Kai Hu et.al. | 2601.09110 | translate | read | null |
| 2026-01-14 | Vision Foundation Models for Domain Generalisable Cross-View Localisation in Planetary Ground-Aerial Robotic Teams | Lachlan Holden et.al. | 2601.09107 | translate | read | null |
| 2026-01-13 | Instance camera focus prediction for crystal agglomeration classification | Xiaoyu Ji et.al. | 2601.09004 | translate | read | null |
| 2026-01-13 | SAM-pose2seg: Pose-Guided Human Instance Segmentation in Crowds | Constantin Kolomiiets et.al. | 2601.08982 | translate | read | null |
| 2026-01-13 | 3AM: Segment Anything with Geometric Consistency in Videos | Yang-Che Sun et.al. | 2601.08831 | translate | read | null |
| 2026-01-13 | DentalX: Context-Aware Dental Disease Detection with Radiographs | Zhi Qin Tan et.al. | 2601.08797 | translate | read | link |
| 2026-01-13 | WaveFormer: Frequency-Time Decoupled Vision Modeling with Wave Equation | Zishan Shu et.al. | 2601.08602 | translate | read | null |
| 2026-01-13 | Edge-Optimized Multimodal Learning for UAV Video Understanding via BLIP-2 | Yizhan Feng et.al. | 2601.08408 | translate | read | null |
| 2026-01-13 | Source-Free Domain Adaptation for Geospatial Point Cloud Semantic Segmentation | Yuan Gao et.al. | 2601.08375 | translate | read | null |
| 2026-01-13 | Semantic Misalignment in Vision-Language Models under Perceptual Degradation | Guo Cheng et.al. | 2601.08355 | translate | read | null |
| 2026-01-13 | Human-inspired Global-to-Parallel Multi-scale Encoding for Lightweight Vision Models | Wei Xu et.al. | 2601.08190 | translate | read | null |
| 2026-01-13 | How Do Optical Flow and Textual Prompts Collaborate to Assist in Audio-Visual Semantic Segmentation? | Peng Gao et.al. | 2601.08133 | translate | read | null |
| 2026-01-12 | Exchange Is All You Need for Remote Sensing Change Detection | Sijun Dong et.al. | 2601.07805 | translate | read | null |
| 2026-01-12 | PanoSAMic: Panoramic Image Segmentation from SAM Feature Encoding and Dual View Fusion | Mahdi Chamseddine et.al. | 2601.07447 | translate | read | null |
| 2026-01-10 | Boosting Overlapping Organoid Instance Segmentation Using Pseudo-Label Unmixing and Synthesis-Assisted Learning | Gui Huang et.al. | 2601.06642 | translate | read | null |
| 2026-01-08 | When Imbalance Comes Twice: Active Learning under Simulated Class Imbalance and Label Shift in Binary Semantic Segmentation | Julien Combes et.al. | 2601.06209 | translate | read | null |
| 2026-01-09 | Adapting Vision Transformers to Ultra-High Resolution Semantic Segmentation with Relay Tokens | Yohann Perron et.al. | 2601.05927 | translate | read | null |
| 2026-01-08 | UniLiPs: Unified LiDAR Pseudo-Labeling with Geometry-Grounded Dynamic Scene Decomposition | Filippo Ghilotti et.al. | 2601.05105 | translate | read | null |
| 2026-01-08 | TEA: Temporal Adaptive Satellite Image Semantic Segmentation | Juyuan Kang et.al. | 2601.04956 | translate | read | null |
| 2026-01-07 | Systematic Evaluation of Depth Backbones and Semantic Cues for Monocular Pseudo-LiDAR 3D Detection | Samson Oseiwe Ajadalu et.al. | 2601.03617 | translate | read | null |
| 2026-01-07 | Physics-Constrained Cross-Resolution Enhancement Network for Optics-Guided Thermal UAV Image Super-Resolution | Zhicheng Zhao et.al. | 2601.03526 | translate | read | null |
| 2026-01-07 | A Vision-Language-Action Model with Visual Prompt for OFF-Road Autonomous Driving | Liangdong Zhang et.al. | 2601.03519 | translate | read | null |
| 2026-01-07 | G2P: Gaussian-to-Point Attribute Alignment for Boundary-Aware 3D Semantic Segmentation | Hojun Song et.al. | 2601.03510 | translate | read | null |
| 2026-01-06 | LSP-DETR: Efficient and Scalable Nuclei Segmentation in Whole Slide Images | Matěj Pekár et.al. | 2601.03163 | translate | read | link |
| 2026-01-06 | EarthVL: A Progressive Earth Vision-Language Understanding and Generation Framework | Junjue Wang et.al. | 2601.02783 | translate | read | null |
| 2026-01-06 | M-SEVIQ: A Multi-band Stereo Event Visual-Inertial Quadruped-based Dataset for Perception under Rapid Motion and Challenging Illumination | Jingcheng Cao et.al. | 2601.02777 | translate | read | null |
| 2026-01-05 | Joint Semantic and Rendering Enhancements in 3D Gaussian Modeling with Anisotropic Local Encoding | Jingming He et.al. | 2601.02339 | translate | read | null |
| 2026-01-05 | Prithvi-Complimentary Adaptive Fusion Encoder (CAFE): unlocking full-potential for flood inundation mapping | Saurabh Kaushik et.al. | 2601.02315 | translate | read | link |
| 2026-01-05 | TopoLoRA-SAM: Topology-Aware Parameter-Efficient Adaptation of Foundation Segmenters for Thin-Structure and Cross-Domain Binary Semantic Segmentation | Salim Khazem et.al. | 2601.02273 | translate | read | null |
| 2026-01-05 | Leveraging 2D-VLM for Label-Free 3D Segmentation in Large-Scale Outdoor Scene Understanding | Toshihiko Nishimura et.al. | 2601.02029 | translate | read | null |
| 2026-01-05 | Subimage Overlap Prediction: Task-Aligned Self-Supervised Pretraining For Semantic Segmentation In Remote Sensing Imagery | Lakshay Sharma et.al. | 2601.01781 | translate | read | null |
| 2026-01-04 | A Novel Deep Learning Method for Segmenting the Left Ventricle in Cardiac Cine MRI | Wenhui Chu et.al. | 2601.01512 | translate | read | null |
| 2026-01-04 | Rethinking Multimodal Few-Shot 3D Point Cloud Segmentation: From Fused Refinement to Decoupled Arbitration | Wentao Bian et.al. | 2601.01456 | translate | read | null |
| 2026-01-04 | In defense of the two-stage framework for open-set domain adaptive semantic segmentation | Wenqi Ren et.al. | 2601.01439 | translate | read | null |
| 2026-01-03 | Seamlessly Natural: Image Stitching with Natural Appearance Preservation | Gaetane Lorna N. Tchana et.al. | 2601.01257 | translate | read | null |
| 2026-01-03 | Cross-Layer Attentive Feature Upsampling for Low-latency Semantic Segmentation | Tianheng Cheng et.al. | 2601.01167 | translate | read | null |
| 2026-01-02 | Learning to Segment Liquids in Real-world Images | Jonas Li et.al. | 2601.00940 | translate | read | null |
| 2026-01-01 | MetaFormer-driven Encoding Network for Robust Medical Semantic Segmentation | Le-Anh Tran et.al. | 2601.00922 | translate | read | null |
| 2026-01-01 | Efficient Prediction of Dense Visual Embeddings via Distillation and RGB-D Transformers | Söhnke Benedikt Fischedick et.al. | 2601.00359 | translate | read | null |
| 2026-01-01 | CropNeRF: A Neural Radiance Field-Based Framework for Crop Counting | Md Ahmed Al Muzaddid et.al. | 2601.00207 | translate | read | null |
(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)