Semantic Segmentation - 2025-11
Semantic Segmentation - 2025-11
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2025-11-30 | Stronger is not better: Better Augmentations in Contrastive Learning for Medical Image Segmentation | Azeez Idris et.al. | 2512.05992 | translate | read | null |
| 2025-11-30 | Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation | An Yang et.al. | 2512.00944 | translate | read | null |
| 2025-11-30 | The Outline of Deception: Physical Adversarial Attacks on Traffic Signs Using Edge Patches | Haojie Ji et.al. | 2512.00765 | translate | read | null |
| 2025-11-30 | VFM-ISRefiner: Towards Better Adapting Vision Foundation Models for Interactive Segmentation of Remote Sensing Images | Deliang Wang et.al. | 2512.00718 | translate | read | null |
| 2025-11-29 | Doppler-Enhanced Deep Learning: Improving Thyroid Nodule Segmentation with YOLOv5 Instance Segmentation | Mahmoud El Hussieni et.al. | 2512.00639 | translate | read | null |
| 2025-11-29 | EZ-SP: Fast and Lightweight Superpoint-Based 3D Segmentation | Louis Geist et.al. | 2512.00385 | translate | read | null |
| 2025-11-29 | Breaking It Down: Domain-Aware Semantic Segmentation for Retrieval Augmented Generation | Aparajitha Allamraju et.al. | 2512.00367 | translate | read | null |
| 2025-11-29 | Towards aligned body representations in vision models | Andrey Gizdov et.al. | 2512.00365 | translate | read | null |
| 2025-11-24 | Satellite to Street : Disaster Impact Estimator | Sreesritha Sai et.al. | 2512.00065 | translate | read | null |
| 2025-11-28 | Learning to Predict Aboveground Biomass from RGB Images with 3D Synthetic Scenes | Silvia Zuffi et.al. | 2511.23249 | translate | read | null |
| 2025-11-28 | Taming the Light: Illumination-Invariant Semantic 3DGS-SLAM | Shouhe Zhang et.al. | 2511.22968 | translate | read | null |
| 2025-11-28 | Do We Need Perfect Data? Leveraging Noise for Domain Generalized Segmentation | Taeyeong Kim et.al. | 2511.22948 | translate | read | null |
| 2025-11-27 | GazeTrack: High-Precision Eye Tracking Based on Regularization and Spatial Computing | Xiaoyin Yang et.al. | 2511.22607 | translate | read | null |
| 2025-11-27 | 3D Affordance Keypoint Detection for Robotic Manipulation | Zhiyang Liu et.al. | 2511.22195 | translate | read | null |
| 2025-11-26 | OpenTwinMap: An Open-Source Digital Twin Generator for Urban Autonomous Driving | Alex Richardson et.al. | 2511.21925 | translate | read | null |
| 2025-11-26 | ReSAM: Refine, Requery, and Reinforce: Self-Prompting Point-Supervised Segmentation for Remote Sensing Images | M. Naseer Subhani et.al. | 2511.21606 | translate | read | null |
| 2025-11-26 | Shift-Equivariant Complex-Valued Convolutional Neural Networks | Quentin Gabot et.al. | 2511.21250 | translate | read | null |
| 2025-11-25 | Open Vocabulary Compositional Explanations for Neuron Alignment | Biagio La Rosa et.al. | 2511.20931 | translate | read | null |
| 2025-11-25 | Automated Monitoring of Cultural Heritage Artifacts Using Semantic Segmentation | Andrea Ranieri et.al. | 2511.20541 | translate | read | null |
| 2025-11-25 | CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation | Shilei Cao et.al. | 2511.20302 | translate | read | null |
| 2025-11-25 | SAM-MI: A Mask-Injected Framework for Enhancing Open-Vocabulary Semantic Segmentation with SAM | Lin Chen et.al. | 2511.20027 | translate | read | null |
| 2025-11-25 | Supervise Less, See More: Training-free Nuclear Instance Segmentation with Prototype-Guided Prompting | Wen Zhang et.al. | 2511.19953 | translate | read | null |
| 2025-11-24 | Lightweight Transformer Framework for Weakly Supervised Semantic Segmentation | Ali Torabi et.al. | 2511.19765 | translate | read | null |
| 2025-11-24 | RADSeg: Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models | Omar Alama et.al. | 2511.19704 | translate | read | null |
| 2025-11-24 | Studying Maps at Scale: A Digital Investigation of Cartography and the Evolution of Figuration | Remi Petitpierre et.al. | 2511.19538 | translate | read | null |
| 2025-11-24 | BackSplit: The Importance of Sub-dividing the Background in Biomedical Lesion Segmentation | Rachit Saluja et.al. | 2511.19394 | translate | read | null |
| 2025-11-24 | nnActive: A Framework for Evaluation of Active Learning in 3D Biomedical Segmentation | Carsten T. Lüth et.al. | 2511.19183 | translate | read | null |
| 2025-11-24 | DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detection | Hai Ci et.al. | 2511.19111 | translate | read | null |
| 2025-11-24 | SupLID: Geometrical Guidance for Out-of-Distribution Detection in Semantic Segmentation | Nimeshika Udayangani et.al. | 2511.18816 | translate | read | null |
| 2025-11-24 | PartDiffuser: Part-wise 3D Mesh Generation via Discrete Diffusion | Yichen Yang et.al. | 2511.18801 | translate | read | null |
| 2025-11-23 | SegSplat: Feed-forward Gaussian Splatting and Open-Set Semantic Segmentation | Peter Siegel et.al. | 2511.18386 | translate | read | null |
| 2025-11-23 | UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization | Siyi Li et.al. | 2511.18254 | translate | read | null |
| 2025-11-22 | Matching-Based Few-Shot Semantic Segmentation Models Are Interpretable by Design | Pasquale De Marinis et.al. | 2511.18163 | translate | read | null |
| 2025-11-22 | AdaPerceiver: Transformers with Adaptive Width, Depth, and Tokens | Purvish Jajal et.al. | 2511.18105 | translate | read | null |
| 2025-11-18 | HSMix: Hard and Soft Mixing Data Augmentation for Medical Image Segmentation | Danyang Sun et.al. | 2511.17614 | translate | read | null |
| 2025-11-21 | Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift | Björn Michele et.al. | 2511.17455 | translate | read | null |
| 2025-11-21 | REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing | Binger Chen et.al. | 2511.17442 | translate | read | null |
| 2025-11-21 | FisheyeGaussianLift: BEV Feature Lifting for Surround-View Fisheye Camera Perception | Shubham Sonarghare et.al. | 2511.17210 | translate | read | null |
| 2025-11-20 | Late-decoupled 3D Hierarchical Semantic Segmentation with Semantic Prototype Discrimination based Bi-branch Supervision | Shuyu Cao et.al. | 2511.16650 | translate | read | null |
| 2025-11-20 | Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling | Minseok Seo et.al. | 2511.16301 | translate | read | null |
| 2025-11-20 | Target Refocusing via Attention Redistribution for Open-Vocabulary Semantic Segmentation: An Explainability Perspective | Jiahao Li et.al. | 2511.16170 | translate | read | null |
| 2025-11-20 | InfoCLIP: Bridging Vision-Language Pretraining and Open-Vocabulary Semantic Segmentation via Information-Theoretic Alignment Transfer | Muyao Yuan et.al. | 2511.15967 | translate | read | null |
| 2025-11-19 | Automatic Uncertainty-Aware Synthetic Data Bootstrapping for Historical Map Segmentation | Lukas Arzoumanidis et.al. | 2511.15875 | translate | read | null |
| 2025-11-19 | GEO-Bench-2: From Performance to Capability, Rethinking Evaluation in Geospatial AI | Naomi Simumba et.al. | 2511.15658 | translate | read | null |
| 2025-11-19 | Multi-Text Guided Few-Shot Semantic Segmentation | Qiang Jiao et.al. | 2511.15515 | translate | read | null |
| 2025-11-19 | WarNav: An Autonomous Driving Benchmark for Segmentation of Navigable Zones in War Scenes | Marc-Emmanuel Coupvent des Graviers et.al. | 2511.15429 | translate | read | null |
| 2025-11-19 | Controlling False Positives in Image Segmentation via Conformal Prediction | Luca Mossina et.al. | 2511.15406 | translate | read | null |
| 2025-11-18 | EGSA-PT:Edge-Guided Spatial Attention with Progressive Training for Monocular Depth Estimation and Segmentation of Transparent Objects | Gbenga Omotara et.al. | 2511.14970 | translate | read | null |
| 2025-11-18 | FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding | Zhenshi Li et.al. | 2511.14901 | translate | read | null |
| 2025-11-18 | Segmentation-Aware Latent Diffusion for Satellite Image Super-Resolution: Enabling Smallholder Farm Boundary Delineation | Aditi Agarwal et.al. | 2511.14481 | translate | read | null |
| 2025-11-18 | Step by Step Network | Dongchen Han et.al. | 2511.14329 | translate | read | null |
| 2025-11-18 | Orion: A Unified Visual Agent for Multimodal Perception, Advanced Visual Reasoning and Execution | N Dinesh Reddy et.al. | 2511.14210 | translate | read | null |
| 2025-11-17 | Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting | Jiangnan Ye et.al. | 2511.13684 | translate | read | null |
| 2025-11-17 | Mapping the Vanishing and Transformation of Urban Villages in China | Wenyu Zhang et.al. | 2511.13507 | translate | read | null |
| 2025-11-17 | Delineate Anything Flow: Fast, Country-Level Field Boundary Detection from Any Source | Mykola Lavreniuk et.al. | 2511.13417 | translate | read | null |
| 2025-11-17 | DiffPixelFormer: Differential Pixel-Aware Transformer for RGB-D Indoor Scene Segmentation | Yan Gong et.al. | 2511.13047 | translate | read | null |
| 2025-11-15 | FaNe: Towards Fine-Grained Cross-Modal Contrast with False-Negative Reduction and Text-Conditioned Sparse Attention | Peng Zhang et.al. | 2511.12215 | translate | read | null |
| 2025-11-15 | Evaluation of Attention Mechanisms in U-Net Architectures for Semantic Segmentation of Brazilian Rock Art Petroglyphs | Leonardi Melo et.al. | 2511.11959 | translate | read | null |
| 2025-11-14 | Chain-of-Generation: Progressive Latent Diffusion for Text-Guided Molecular Design | Lingxiao Li et.al. | 2511.11894 | translate | read | null |
| 2025-11-14 | Advancing Annotat3D with Harpia: A CUDA-Accelerated Library For Large-Scale Volumetric Data Segmentation | Camila Machado de Araujo et.al. | 2511.11890 | translate | read | null |
| 2025-11-13 | AdaptFly: Prompt-Guided Adaptation of Foundation Models for Low-Altitude UAV Networks | Jiao Chen et.al. | 2511.11720 | translate | read | null |
| 2025-11-12 | Enhancing Reinforcement Learning in 3D Environments through Semantic Segmentation: A Case Study in ViZDoom | Hugo Huang et.al. | 2511.11703 | translate | read | null |
| 2025-11-12 | EPSegFZ: Efficient Point Cloud Semantic Segmentation for Few- and Zero-Shot Scenarios with Language Guidance | Jiahui Wang et.al. | 2511.11700 | translate | read | null |
| 2025-11-14 | Terrain Costmap Generation via Scaled Preference Conditioning | Luisa Mao et.al. | 2511.11529 | translate | read | null |
| 2025-11-13 | Histology-informed tiling of whole tissue sections improves the interpretability and predictability of cancer relapse and genetic alterations | Willem Bonnaffé et.al. | 2511.10432 | translate | read | null |
| 2025-11-13 | Domain Adaptation for Camera-Specific Image Characteristics using Shallow Discriminators | Maximiliane Gruber et.al. | 2511.10424 | translate | read | null |
| 2025-11-13 | DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Semantic Instance Segmentation | Xuexun Liu et.al. | 2511.10003 | translate | read | null |
| 2025-11-12 | Soiling detection for Advanced Driver Assistance Systems | Filip Beránek et.al. | 2511.09740 | translate | read | null |
| 2025-11-12 | OUGS: Active View Selection via Object-aware Uncertainty Estimation in 3DGS | Haiyi Li et.al. | 2511.09397 | translate | read | null |
| 2025-11-11 | Empowering DINO Representations for Underwater Instance Segmentation via Aligner and Prompter | Zhiyang Chen et.al. | 2511.08334 | translate | read | null |
| 2025-11-11 | Re-coding for Uncertainties: Edge-awareness Semantic Concordance for Resilient Event-RGB Segmentation | Nan Bao et.al. | 2511.08269 | translate | read | null |
| 2025-11-11 | NERVE: Neighbourhood & Entropy-guided Random-walk for training free open-Vocabulary sEgmentation | Kunal Mahatha et.al. | 2511.08248 | translate | read | null |
| 2025-11-10 | FlowFeat: Pixel-Dense Embedding of Motion Profiles | Nikita Araslanov et.al. | 2511.07696 | translate | read | null |
| 2025-11-10 | Glioma C6: A Novel Dataset for Training and Benchmarking Cell Segmentation | Roman Malashin et.al. | 2511.07286 | translate | read | null |
| 2025-11-10 | StreamKV: Streaming Video Question-Answering with Segment-based KV Cache Retrieval and Compression | Yilong Chen et.al. | 2511.07278 | translate | read | null |
| 2025-11-10 | HENet++: Hybrid Encoding and Multi-task Learning for 3D Perception and End-to-end Autonomous Driving | Zhongyu Xia et.al. | 2511.07106 | translate | read | null |
| 2025-11-10 | Metric Analysis for Spatial Semantic Segmentation of Sound Scenes | Mayank Mishra et.al. | 2511.07075 | translate | read | null |
| 2025-11-10 | TrueCity: Real and Simulated Urban Data for Cross-Domain 3D Scene Understanding | Duc Nguyen et.al. | 2511.07007 | translate | read | null |
| 2025-11-10 | Exploring the “Great Unseen” in Medieval Manuscripts: Instance-Level Labeling of Legacy Image Collections with Zero-Shot Models | Christofer Meinecke et.al. | 2511.07004 | translate | read | null |
| 2025-11-10 | Vision-Aided Online A* Path Planning for Efficient and Safe Navigation of Service Robots | Praveen Kumar et.al. | 2511.06801 | translate | read | null |
| 2025-11-09 | Video Dataset for Surgical Phase, Keypoint, and Instrument Recognition in Laparoscopic Surgery (PhaKIR) | Tobias Rueckert et.al. | 2511.06549 | translate | read | null |
| 2025-11-09 | EIDSeg: A Pixel-Level Semantic Segmentation Dataset for Post-Earthquake Damage Assessment from Social Media Images | Huili Huang et.al. | 2511.06456 | translate | read | null |
| 2025-11-09 | Label-Efficient 3D Forest Mapping: Self-Supervised and Transfer Learning for Individual, Structural, and Species Analysis | Aldino Rizaldy et.al. | 2511.06331 | translate | read | null |
| 2025-11-09 | Temporal-Guided Visual Foundation Models for Event-Based Vision | Ruihao Xia et.al. | 2511.06238 | translate | read | link |
| 2025-11-08 | Polymap: generating high definition map based on rasterized polygons | Shiyu Gao et.al. | 2511.05944 | translate | read | null |
| 2025-11-07 | CoT-X: An Adaptive Framework for Cross-Model Chain-of-Thought Transfer and Optimization | Ziqian Bi et.al. | 2511.05747 | translate | read | null |
| 2025-11-04 | Do Street View Imagery and Public Participation GIS align: Comparative Analysis of Urban Attractiveness | Milad Malekzadeh et.al. | 2511.05570 | translate | read | null |
| 2025-11-03 | Compressing Multi-Task Model for Autonomous Driving via Pruning and Knowledge Distillation | Jiayuan Wang et.al. | 2511.05557 | translate | read | null |
| 2025-11-07 | How Many Tokens Do 3D Point Cloud Transformer Architectures Really Need? | Tuan Anh Tran et.al. | 2511.05449 | translate | read | null |
| 2025-11-07 | Canonical Space Representation for 4D Panoptic Segmentation of Articulated Objects | Manuel Gomes et.al. | 2511.05356 | translate | read | null |
| 2025-11-07 | No Pose Estimation? No Problem: Pose-Agnostic and Instance-Aware Test-Time Adaptation for Monocular Depth Estimation | Mingyu Sung et.al. | 2511.05055 | translate | read | null |
| 2025-11-07 | LG-NuSegHop: A Local-to-Global Self-Supervised Pipeline For Nuclei Instance Segmentation | Vasileios Magoulianitis et.al. | 2511.04892 | translate | read | null |
| 2025-11-06 | An Active Learning Pipeline for Biomedical Image Instance Segmentation with Minimal Human Intervention | Shuo Zhao et.al. | 2511.04811 | translate | read | null |
| 2025-11-06 | Cambrian-S: Towards Spatial Supersensing in Video | Shusheng Yang et.al. | 2511.04670 | translate | read | null |
| 2025-11-06 | Vitessce Link: A Mixed Reality and 2D Display Hybrid Approach for Visual Analysis of 3D Tissue Maps | Eric Mörth et.al. | 2511.04262 | translate | read | null |
| 2025-11-06 | CaRF: Enhancing Multi-View Consistency in Referring 3D Gaussian Splatting Segmentation | Yuwen Tao et.al. | 2511.03992 | translate | read | null |
| 2025-11-05 | Laugh, Relate, Engage: Stylized Comment Generation for Short Videos | Xuan Ouyang et.al. | 2511.03757 | translate | read | null |
| 2025-11-05 | Computational Imaging Meets LLMs: Zero-Shot IDH Mutation Prediction in Brain Gliomas | Syed Muqeem Mahmood et.al. | 2511.03376 | translate | read | null |
| 2025-11-05 | Enhancing Medical Image Segmentation via Heat Conduction Equation | Rong Wu et.al. | 2511.03260 | translate | read | null |
| 2025-11-05 | Diffusion-Guided Mask-Consistent Paired Mixing for Endoscopic Image Segmentation | Pengyu Jie et.al. | 2511.03219 | translate | read | null |
| 2025-11-05 | Subsampled Randomized Fourier GaLore for Adapting Foundation Models in Depth-Driven Liver Landmark Segmentation | Yun-Chen Lin et.al. | 2511.03163 | translate | read | null |
| 2025-11-05 | Accelerating Physical Property Reasoning for Augmented Visual Cognition | Hongbo Lan et.al. | 2511.03126 | translate | read | null |
| 2025-11-04 | Learning with less: label-efficient land cover classification at very high spatial resolution using self-supervised deep learning | Dakota Hester et.al. | 2511.03004 | translate | read | null |
| 2025-11-04 | Comprehensive Assessment of LiDAR Evaluation Metrics: A Comparative Study Using Simulated and Real Data | Syed Mostaquim Ali et.al. | 2511.02994 | translate | read | null |
| 2025-11-04 | Digital Twin-Driven Pavement Health Monitoring and Maintenance Optimization Using Graph Neural Networks | Mohsin Mahmud Topu et.al. | 2511.02957 | translate | read | null |
| 2025-11-04 | Optimizing the nnU-Net model for brain tumor (Glioma) segmentation Using a BraTS Sub-Saharan Africa (SSA) dataset | Chukwuemeka Arua Kalu et.al. | 2511.02893 | translate | read | null |
| 2025-11-02 | Digitizing Spermatogenesis Lineage at Nanoscale Resolution In Tissue-Level Electron Microscopy | Li Xiao et.al. | 2511.02860 | translate | read | null |
| 2025-11-04 | Densemarks: Learning Canonical Embeddings for Human Heads Images via Point Tracks | Dmitrii Pozdeev et.al. | 2511.02830 | translate | read | null |
| 2025-11-04 | PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction & Editing | Antonio Oroz et.al. | 2511.02777 | translate | read | null |
| 2025-11-04 | Resource-efficient Automatic Refinement of Segmentations via Weak Supervision from Light Feedback | Alix de Langlais et.al. | 2511.02576 | translate | read | null |
| 2025-11-04 | ESA: Energy-Based Shot Assembly Optimization for Automatic Video Editing | Yaosen Chen et.al. | 2511.02505 | translate | read | null |
| 2025-11-04 | Synthetic Crop-Weed Image Generation and its Impact on Model Generalization | Garen Boyadjian et.al. | 2511.02417 | translate | read | null |
| 2025-11-04 | Revisiting put-that-there, context aware window interactions via LLMs | Riccardo Bovo et.al. | 2511.02378 | translate | read | null |
| 2025-11-04 | From Instance Segmentation to 3D Growth Trajectory Reconstruction in Planktonic Foraminifera | Huahua Lin et.al. | 2511.02142 | translate | read | null |
| 2025-11-03 | Terrain-Enhanced Resolution-aware Refinement Attention for Off-Road Segmentation | Seongkyu Choi et.al. | 2511.01434 | translate | read | null |
| 2025-11-03 | MIQ-SAM3D: From Single-Point Prompt to Multi-Instance Segmentation via Competitive Query Refinement | Jierui Qu et.al. | 2511.01345 | translate | read | null |
| 2025-11-03 | Source-Only Cross-Weather LiDAR via Geometry-Aware Point Drop | YoungJae Cheong et.al. | 2511.01250 | translate | read | null |
| 2025-11-03 | CenterMamba-SAM: Center-Prioritized Scanning and Temporal Prototypes for Brain Lesion Segmentation | Yu Tian et.al. | 2511.01243 | translate | read | null |
| 2025-11-03 | An Enhanced Proprioceptive Method for Soft Robots Integrating Bend Sensors and IMUs | Dong Heon Han et.al. | 2511.01165 | translate | read | null |
| 2025-11-03 | MicroAUNet: Boundary-Enhanced Multi-scale Fusion with Knowledge Distillation for Colonoscopy Polyp Image Segmentation | Ziyi Wang et.al. | 2511.01143 | translate | read | null |
| 2025-11-02 | URDF-Anything: Constructing Articulated Objects with 3D Multimodal Language Model | Zhe Li et.al. | 2511.00940 | translate | read | null |
| 2025-11-02 | TA-LSDiff:Topology-Aware Diffusion Guided by a Level Set Energy for Pancreas Segmentation | Yue Gou et.al. | 2511.00815 | translate | read | null |
| 2025-11-02 | Rhythm in the Air: Vision-based Real-Time Music Generation through Gestures | Barathi Subramanian et.al. | 2511.00793 | translate | read | null |
| 2025-11-02 | Class-agnostic 3D Segmentation by Granularity-Consistent Automatic 2D Mask Tracking | Juan Wang et.al. | 2511.00785 | translate | read | null |
| 2025-11-01 | Grounding Surgical Action Triplets with Instrument Instance Segmentation: A Dataset and Target-Aware Fusion Approach | Oluwatosin Alabi et.al. | 2511.00643 | translate | read | null |
| 2025-11-01 | Text-guided Fine-Grained Video Anomaly Detection | Jihao Gu et.al. | 2511.00524 | translate | read | null |
| 2025-11-01 | Optimization of continuous-flow over traffic networks with fundamental diagram constraints | Anqi Dong et.al. | 2511.00500 | translate | read | null |
| 2025-11-01 | HumanCrafter: Synergizing Generalizable Human Reconstruction and Semantic 3D Segmentation | Panwang Pan et.al. | 2511.00468 | translate | read | null |
| 2025-11-01 | Tree Training: Accelerating Agentic LLMs Training via Shared Prefix Reuse | Shaojie Wang et.al. | 2511.00413 | translate | read | null |
(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)