Semantic Segmentation - 2025-05
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2025-05-31 | BAGNet: A Boundary-Aware Graph Attention Network for 3D Point Cloud Semantic Segmentation | Wei Tao et.al. | 2506.00475 | translate | read | null |
| 2025-05-30 | Bi-Manual Joint Camera Calibration and Scene Representation | Haozhan Tang et.al. | 2505.24819 | translate | read | null |
| 2025-05-30 | SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds | Cheng Zeng et.al. | 2505.24475 | translate | read | null |
| 2025-05-30 | Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation | Roger Ferrod et.al. | 2505.24361 | translate | read | null |
| 2025-05-30 | Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors | Peiran Xu et.al. | 2505.24103 | translate | read | null |
| 2025-05-29 | MaskAdapt: Unsupervised Geometry-Aware Domain Adaptation Using Multimodal Contextual Learning and RGB-Depth Masking | Numair Nadeem et.al. | 2505.24026 | translate | read | null |
| 2025-05-29 | Semantics-Guided Generative Image Compression | Cheng-Lin Wu et.al. | 2505.24015 | translate | read | null |
| 2025-05-29 | Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts | Xuweiyi Chen et.al. | 2505.23926 | translate | read | null |
| 2025-05-29 | TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models | Yao Xiao et.al. | 2505.23769 | translate | read | link |
| 2025-05-29 | Bridging Classical and Modern Computer Vision: PerceptiveNet for Tree Crown Semantic Segmentation | Georgios Voulgaris et.al. | 2505.23597 | translate | read | null |
| 2025-05-29 | VITON-DRR: Details Retention Virtual Try-on via Non-rigid Registration | Ben Li et.al. | 2505.23439 | translate | read | link |
| 2025-05-29 | Adaptive Spatial Augmentation for Semi-supervised Semantic Segmentation | Lingyan Ran et.al. | 2505.23438 | translate | read | null |
| 2025-05-29 | Federated Unsupervised Semantic Segmentation | Evangelos Charalampakis et.al. | 2505.23292 | translate | read | null |
| 2025-05-29 | LeMoRe: Learn More Details for Lightweight Semantic Segmentation | Mian Muhammad Naeem Abid et.al. | 2505.23093 | translate | read | link |
| 2025-05-28 | ConfLUNet: Multiple sclerosis lesion instance segmentation in presence of confluent lesions | Maxence Wynen et.al. | 2505.22537 | translate | read | null |
| 2025-05-28 | Universal Domain Adaptation for Semantic Segmentation | Seun-An Choe et.al. | 2505.22458 | translate | read | null |
| 2025-05-28 | LiDAR Based Semantic Perception for Forklifts in Outdoor Environments | Benjamin Serfling et.al. | 2505.22258 | translate | read | null |
| 2025-05-29 | YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction | Mingzhuang Wang et.al. | 2505.22250 | translate | read | null |
| 2025-05-28 | Enjoying Information Dividend: Gaze Track-based Medical Weakly Supervised Segmentation | Zhisong Wang et.al. | 2505.22230 | translate | read | null |
| 2025-05-28 | A Survey on Training-free Open-Vocabulary Semantic Segmentation | Naomi Kombol et.al. | 2505.22209 | translate | read | null |
| 2025-05-28 | S2AFormer: Strip Self-Attention for Efficient Vision Transformer | Guoan Xu et.al. | 2505.22195 | translate | read | null |
| 2025-05-28 | LiDARDustX: A LiDAR Dataset for Dusty Unstructured Road Environments | Chenfeng Wei et.al. | 2505.21914 | translate | read | null |
| 2025-05-29 | CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation | Pardis Taghavi et.al. | 2505.21904 | translate | read | null |
| 2025-05-28 | Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation | Mehrdad Noori et.al. | 2505.21844 | translate | read | null |
| 2025-05-27 | Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO | Muzhi Zhu et.al. | 2505.21457 | translate | read | link |
| 2025-05-27 | Object-Centric Action-Enhanced Representations for Robot Visuo-Motor Policy Learning | Nikos Giannakakis et.al. | 2505.20962 | translate | read | null |
| 2025-05-27 | DSOcc: Leveraging Depth Awareness and Semantic Aid to Boost Camera-Based 3D Semantic Occupancy Prediction | Naiyu Fang et.al. | 2505.20951 | translate | read | null |
| 2025-05-26 | Vision-Based Risk Aware Emergency Landing for UAVs in Complex Urban Environments | Julio de la Torre-Vanegas et.al. | 2505.20423 | translate | read | null |
| 2025-05-26 | A fully automated urban PV parameterization framework for improved estimation of energy production profiles | Bowen Tian et.al. | 2505.19876 | translate | read | null |
| 2025-05-26 | Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation | Nagito Saito et.al. | 2505.19846 | translate | read | null |
| 2025-05-26 | The Missing Point in Vision Transformers for Universal Image Segmentation | Sajjad Shahabodini et.al. | 2505.19795 | translate | read | link |
| 2025-05-26 | ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting | Wenhua Wu et.al. | 2505.19420 | translate | read | null |
| 2025-05-25 | A Joint Learning Framework with Feature Reconstruction and Prediction for Incomplete Satellite Image Time Series in Agricultural Semantic Segmentation | Yuze Wang et.al. | 2505.19159 | translate | read | link |
| 2025-05-25 | SPARS: Self-Play Adversarial Reinforcement Learning for Segmentation of Liver Tumours | Catalina Tan et.al. | 2505.18989 | translate | read | link |
| 2025-05-25 | How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation | Yining Pan et.al. | 2505.18956 | translate | read | link |
| 2025-05-25 | LLM-Guided Taxonomy and Hierarchical Uncertainty for 3D Point Cloud Active Learning | Chenxi Li et.al. | 2505.18924 | translate | read | null |
| 2025-05-24 | ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts | Shiu-hong Kao et.al. | 2505.18561 | translate | read | null |
| 2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153 | translate | read | null |
| 2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015 | translate | read | null |
| 2025-05-23 | Semantic segmentation with reward | Xie Ting et.al. | 2505.17905 | translate | read | null |
| 2025-05-23 | Hephaestus Minicubes: A Global, Multi-Modal Dataset for Volcanic Unrest Monitoring | Nikolas Papadopoulos et.al. | 2505.17782 | translate | read | null |
| 2025-05-23 | EMRA-proxy: Enhancing Multi-Class Region Semantic Segmentation in Remote Sensing Images with Attention Proxy | Yichun Yu et.al. | 2505.17665 | translate | read | null |
| 2025-05-22 | Deep mineralogical segmentation of thin section images based on QEMSCAN maps | Jean Pablo Vieira de Mello et.al. | 2505.17008 | translate | read | link |
| 2025-05-22 | OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning | Zongyan Han et.al. | 2505.16974 | translate | read | link |
| 2025-05-22 | NovelSeek: When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification | NovelSeek Team et.al. | 2505.16938 | translate | read | link |
| 2025-05-22 | TextureSAM: Towards a Texture Aware Foundation Model for Segmentation | Inbal Cohen et.al. | 2505.16540 | translate | read | null |
| 2025-05-22 | Detailed Evaluation of Modern Machine Learning Approaches for Optic Plastics Sorting | Vaishali Maheshkar et.al. | 2505.16513 | translate | read | null |
| 2025-05-22 | Sketchy Bounding-box Supervision for 3D Instance Segmentation | Qian Deng et.al. | 2505.16399 | translate | read | null |
| 2025-05-22 | Style Transfer with Diffusion Models for Synthetic-to-Real Domain Adaptation | Estelle Chigot et.al. | 2505.16360 | translate | read | link |
| 2025-05-22 | RE-TRIP : Reflectivity Instance Augmented Triangle Descriptor for 3D Place Recognition | Yechan Park et.al. | 2505.16165 | translate | read | link |
| 2025-05-21 | VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation | Niccolo Avogaro et.al. | 2505.15592 | translate | read | null |
| 2025-05-21 | UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset | Hua Li et.al. | 2505.15581 | translate | read | link |
| 2025-05-21 | seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic Segmentation | Andrew Caunes et.al. | 2505.15545 | translate | read | link |
| 2025-05-21 | Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation | Ce Zhang et.al. | 2505.15491 | translate | read | null |
| 2025-05-21 | gen2seg: Generative Models Enable Generalizable Instance Segmentation | Om Khangaonkar et.al. | 2505.15263 | translate | read | link |
| 2025-05-21 | Zero-Shot Gaze-based Volumetric Medical Image Segmentation | Tatyana Shmykova et.al. | 2505.15256 | translate | read | null |
| 2025-05-21 | From Pixels to Images: Deep Learning Advances in Remote Sensing Image Semantic Segmentation | Quanwei Liu et.al. | 2505.15147 | translate | read | null |
| 2025-05-20 | Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning | Amine Elhafsi et.al. | 2505.14938 | translate | read | null |
| 2025-05-20 | Instance Segmentation for Point Sets | Abhimanyu Talwar et.al. | 2505.14583 | translate | read | null |
| 2025-05-20 | ReservoirTTA: Prolonged Test-time Adaptation for Evolving and Recurring Domains | Guillaume Vray et.al. | 2505.14511 | translate | read | link |
| 2025-05-20 | Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation | Bin-Bin Gao et.al. | 2505.14239 | translate | read | link |
| 2025-05-20 | Intra-class Patch Swap for Self-Distillation | Hongjun Choi et.al. | 2505.14124 | translate | read | link |
| 2025-05-20 | Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts | Xi Chen et.al. | 2505.14088 | translate | read | null |
| 2025-05-20 | Scaling Vision Mamba Across Resolutions via Fractal Traversal | Bo Li et.al. | 2505.14062 | translate | read | null |
| 2025-05-20 | EGFormer: Towards Efficient and Generalizable Multimodal Semantic Segmentation | Zelin Zhang et.al. | 2505.14014 | translate | read | null |
| 2025-05-19 | Self-Supervised Learning for Image Segmentation: A Comprehensive Survey | Thangarajah Akilan et.al. | 2505.13584 | translate | read | null |
| 2025-05-19 | FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching | Alp Eren Sari et.al. | 2505.13174 | translate | read | null |
| 2025-05-20 | Industrial Synthetic Segment Pre-training | Shinichi Mae et.al. | 2505.13099 | translate | read | null |
| 2025-05-19 | Robust Multimodal Segmentation with Representation Regularization and Hybrid Prototype Distillation | Jiaqi Tan et.al. | 2505.12861 | translate | read | link |
| 2025-05-19 | Enhancing Transformers Through Conditioned Embedded Tokens | Hemanth Saratchandran et.al. | 2505.12789 | translate | read | null |
| 2025-05-18 | Temporal-Spectral-Spatial Unified Remote Sensing Dense Prediction | Sijie Zhao et.al. | 2505.12280 | translate | read | link |
| 2025-05-17 | SoftPQ: Robust Instance Segmentation Evaluation via Soft Matching and Tunable Thresholds | Ranit Karmakar et.al. | 2505.12155 | translate | read | link |
| 2025-05-17 | EarthSynth: Generating Informative Earth Observation with Diffusion Models | Jiancheng Pan et.al. | 2505.12108 | translate | read | null |
| 2025-05-17 | iSegMan: Interactive Segment-and-Manipulate 3D Gaussians | Yian Zhao et.al. | 2505.11934 | translate | read | null |
| 2025-05-17 | Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Boosting Off-Road Segmentation via Photometric Distortion and Exponential Moving Average | Wonjune Kim et.al. | 2505.11769 | translate | read | null |
| 2025-05-16 | DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation | Ziyu Zhao et.al. | 2505.11676 | translate | read | null |
| 2025-05-16 | SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision | Utsav Rai et.al. | 2505.11439 | translate | read | null |
| 2025-05-16 | Pseudo-Label Quality Decoupling and Correction for Semi-Supervised Instance Segmentation | Jianghang Lin et.al. | 2505.11075 | translate | read | null |
| 2025-05-16 | Completely Weakly Supervised Class-Incremental Learning for Semantic Segmentation | David Minkwan Kim et.al. | 2505.10781 | translate | read | null |
| 2025-05-15 | Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis | Francisco Raverta Capua et.al. | 2505.10751 | translate | read | null |
| 2025-05-15 | TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation | Manthan Patel et.al. | 2505.10696 | translate | read | link |
| 2025-05-15 | SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity | Shihao Zou et.al. | 2505.10352 | translate | read | null |
| 2025-05-15 | APCoTTA: Continual Test-Time Adaptation for Semantic Segmentation of Airborne LiDAR Point Clouds | Yuan Gao et.al. | 2505.09971 | translate | read | link |
| 2025-05-14 | FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization | Xiaoyang Yu et.al. | 2505.09385 | translate | read | null |
| 2025-05-14 | MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning | Bin-Bin Gao et.al. | 2505.09265 | translate | read | link |
| 2025-05-13 | MESSI: A Multi-Elevation Semantic Segmentation Image Dataset of an Urban Environment | Barak Pinkovich et.al. | 2505.08589 | translate | read | null |
| 2025-05-14 | The RaspGrade Dataset: Towards Automatic Raspberry Ripeness Grading with Deep Learning | Mohamed Lamine Mekhalfi et.al. | 2505.08537 | translate | read | null |
| 2025-05-13 | Dynamic Snake Upsampling Operator and Boundary-Skeleton Weighted Loss for Tubular Structure Segmentation | Yiqi Chen et.al. | 2505.08525 | translate | read | null |
| 2025-05-13 | Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency | Adel Ammar et.al. | 2505.08445 | translate | read | null |
| 2025-05-13 | GNCAF: A GNN-based Neighboring Context Aggregation Framework for Tertiary Lymphoid Structures Semantic Segmentation in WSI | Lei Su et.al. | 2505.08430 | translate | read | null |
| 2025-05-12 | Vision Foundation Model Embedding-Based Semantic Anomaly Detection | Max Peter Ronecker et.al. | 2505.07998 | translate | read | null |
| 2025-05-12 | Privacy Risks of Robot Vision: A User Study on Image Modalities and Resolution | Xuying Huang et.al. | 2505.07766 | translate | read | null |
| 2025-05-12 | Feedback-Driven Pseudo-Label Reliability Assessment: Redefining Thresholding for Semi-Supervised Semantic Segmentation | Negin Ghamsarian et.al. | 2505.07691 | translate | read | null |
| 2025-05-12 | MAIS: Memory-Attention for Interactive Segmentation | Mauricio Orbes-Arteaga et.al. | 2505.07511 | translate | read | null |
| 2025-05-13 | TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset | Olaf Wysocki et.al. | 2505.07396 | translate | read | null |
| 2025-05-11 | Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution | Zihang Liu et.al. | 2505.07071 | translate | read | link |
| 2025-05-11 | Depth-Sensitive Soft Suppression with RGB-D Inter-Modal Stylization Flow for Domain Generalization Semantic Segmentation | Binbin Wei et.al. | 2505.07050 | translate | read | null |
| 2025-05-11 | Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Leveraging Color Shift Correction, RoPE-Swin Backbone, and Quantile-based Label Denoising Strategy for Robust Outdoor Scene Understanding | Chih-Chung Hsu et.al. | 2505.06991 | translate | read | null |
| 2025-05-11 | Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation | Seokjun Kwon et.al. | 2505.06951 | translate | read | null |
| 2025-05-10 | Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization | Xu Zheng et.al. | 2505.06635 | translate | read | null |
| 2025-05-10 | RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation | Zhiwen Zeng et.al. | 2505.06515 | translate | read | null |
| 2025-05-09 | Brain Hematoma Marker Recognition Using Multitask Learning: SwinTransformer and Swin-Unet | Kodai Hirata et.al. | 2505.06185 | translate | read | null |
| 2025-05-08 | CottonSim: Development of an autonomous visual-guided robotic cotton-picking system in the Gazebo | Thevathayarajh Thayananthan et.al. | 2505.05317 | translate | read | null |
| 2025-05-08 | RepSNet: A Nucleus Instance Segmentation model based on Boundary Regression and Structural Re-parameterization | Shengchun Xiong et.al. | 2505.05073 | translate | read | null |
| 2025-05-09 | UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model | Timo Kaiser et.al. | 2505.05049 | translate | read | link |
| 2025-05-08 | Split Matching for Inductive Zero-shot Semantic Segmentation | Jialei Chen et.al. | 2505.05023 | translate | read | null |
| 2025-05-08 | Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model | Navin Ranjan et.al. | 2505.04861 | translate | read | null |
| 2025-05-07 | Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions? | Shashank Agnihotri et.al. | 2505.04835 | translate | read | link |
| 2025-05-07 | Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer | Sainath Dey et.al. | 2505.04740 | translate | read | null |
| 2025-05-07 | DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception | Junjie Wang et.al. | 2505.04410 | translate | read | link |
| 2025-05-07 | MFSeg: Efficient Multi-frame 3D Semantic Segmentation | Chengjie Huang et.al. | 2505.04408 | translate | read | null |
| 2025-05-06 | Self-Supervised Learning for Robotic Leaf Manipulation: A Hybrid Geometric-Neural Approach | Srecharan Selvam et.al. | 2505.03702 | translate | read | null |
| 2025-05-06 | CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting | Huawei Sun et.al. | 2505.03679 | translate | read | null |
| 2025-05-06 | Panoramic Out-of-Distribution Segmentation | Mengfei Duan et.al. | 2505.03539 | translate | read | link |
| 2025-05-06 | 3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation | Andrew Caunes et.al. | 2505.03300 | translate | read | null |
| 2025-05-05 | Platelet enumeration in dense aggregates | H. Martin Gillis et.al. | 2505.02751 | translate | read | null |
| 2025-05-04 | Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation | Volodymyr Havrylov et.al. | 2505.02075 | translate | read | link |
| 2025-05-04 | Segment Any RGB-Thermal Model with Language-aided Distillation | Dong Xing et.al. | 2505.01950 | translate | read | null |
| 2025-05-03 | OODTE: A Differential Testing Engine for the ONNX Optimizer | Nikolaos Louloudakis et.al. | 2505.01892 | translate | read | null |
| 2025-05-03 | A Novel WaveInst-based Network for Tree Trunk Structure Extraction and Pattern Analysis in Forest Inventory | Chenyang Fan et.al. | 2505.01656 | translate | read | null |
| 2025-05-02 | A Sensor Agnostic Domain Generalization Framework for Leveraging Geospatial Foundation Models: Enhancing Semantic Segmentation via Synergistic Pseudo-Labeling and Generative Learning | Anan Yaghmour et.al. | 2505.01558 | translate | read | null |
| 2025-05-02 | Rethinking RGB-Event Semantic Segmentation with a Novel Bidirectional Motion-enhanced Event Representation | Zhen Yao et.al. | 2505.01548 | translate | read | link |
| 2025-05-02 | Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing | Fahong Zhang et.al. | 2505.01385 | translate | read | null |
| 2025-05-02 | GeloVec: Higher Dimensional Geometric Smoothing for Coherent Visual Feature Extraction in Image Segmentation | Boris Kriuk et.al. | 2505.01057 | translate | read | null |
| 2025-05-03 | Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook | Muyi Bao et.al. | 2505.00630 | translate | read | null |
| 2025-05-01 | Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation | Feng Xue et.al. | 2505.00378 | translate | read | null |
(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)