Semantic Segmentation - 2025-04
Semantic Segmentation - 2025-04
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2025-04-30 | MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection | Qiushi Yang et.al. | 2505.00739 | translate | read | null |
| 2025-04-30 | Common3D: Self-Supervised Learning of 3D Morphable Models for Common Objects in Neural Feature Space | Leonhard Sommer et.al. | 2504.21749 | translate | read | null |
| 2025-04-30 | Real Time Semantic Segmentation of High Resolution Automotive LiDAR Scans | Hannes Reichert et.al. | 2504.21602 | translate | read | null |
| 2025-04-30 | Make Both Ends Meet: A Synergistic Optimization Infrared Small Target Detection with Streamlined Computational Overhead | Yuxin Jing et.al. | 2504.21581 | translate | read | null |
| 2025-04-30 | ClassWise-CRF: Category-Specific Fusion for Enhanced Semantic Segmentation of Remote Sensing Imagery | Qinfeng Zhu et.al. | 2504.21491 | translate | read | null |
| 2025-04-29 | DeepVoid: A Deep Learning Void Detector | Sam Kumagai et.al. | 2504.21134 | translate | read | null |
| 2025-04-29 | Learning a General Model: Folding Clothing with Topological Dynamics | Yiming Liu et.al. | 2504.20720 | translate | read | null |
| 2025-04-29 | OG-HFYOLO :Orientation gradient guidance and heterogeneous feature fusion for deformation table cell instance segmentation | Long Liu et.al. | 2504.20682 | translate | read | link |
| 2025-04-28 | DeepAndes: A Self-Supervised Vision Foundation Model for Multi-Spectral Remote Sensing Imagery of the Andes | Junlin Guo et.al. | 2504.20303 | translate | read | null |
| 2025-04-28 | Learning Streaming Video Representation via Multitask Training | Yibin Yan et.al. | 2504.20041 | translate | read | null |
| 2025-04-28 | SRMF: A Data Augmentation and Multimodal Fusion Approach for Long-Tail UHR Satellite Image Segmentation | Yulong Guo et.al. | 2504.19839 | translate | read | null |
| 2025-04-28 | Open-set Anomaly Segmentation in Complex Scenarios | Song Xia et.al. | 2504.19706 | translate | read | null |
| 2025-04-28 | SubGrapher: Visual Fingerprinting of Chemical Structures | Lucas Morin et.al. | 2504.19695 | translate | read | null |
| 2025-04-28 | BARIS: Boundary-Aware Refinement with Environmental Degradation Priors for Robust Underwater Instance Segmentation | Pin-Chi Pan et.al. | 2504.19643 | translate | read | null |
| 2025-04-28 | Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding | Yan Wang et.al. | 2504.19500 | translate | read | null |
| 2025-04-28 | GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field | Zuxing Lu et.al. | 2504.19409 | translate | read | null |
| 2025-04-27 | OpenFusion++: An Open-vocabulary Real-time Scene Understanding System | Xiaofeng Jin et.al. | 2504.19266 | translate | read | null |
| 2025-04-27 | DeepSPG: Exploring Deep Semantic Prior Guidance for Low-light Image Enhancement with Multimodal Learning | Jialang Lu et.al. | 2504.19127 | translate | read | null |
| 2025-04-26 | VISUALCENT: Visual Human Analysis using Dynamic Centroid Representation | Niaz Ahmad et.al. | 2504.19032 | translate | read | null |
| 2025-04-25 | A Data-Centric Approach to 3D Semantic Segmentation of Railway Scenes | Nicolas Münger et.al. | 2504.18213 | translate | read | null |
| 2025-04-25 | Multi-Grained Compositional Visual Clue Learning for Image Intent Recognition | Yin Tang et.al. | 2504.18201 | translate | read | null |
| 2025-04-25 | What is the Added Value of UDA in the VFM Era? | Brunó B. Englert et.al. | 2504.18190 | translate | read | null |
| 2025-04-25 | Back to Fundamentals: Low-Level Visual Features Guided Progressive Token Pruning | Yuanbing Ouyang et.al. | 2504.17996 | translate | read | null |
| 2025-04-24 | Virtual Roads, Smarter Safety: A Digital Twin Framework for Mixed Autonomous Traffic Safety Analysis | Hao Zhang et.al. | 2504.17968 | translate | read | null |
| 2025-04-24 | Masked strategies for images with small objects | H. Martin Gillis et.al. | 2504.17935 | translate | read | null |
| 2025-04-24 | Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images | Zebo Huang et.al. | 2504.17582 | translate | read | null |
| 2025-04-23 | Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection | Jens Petersen et.al. | 2504.17076 | translate | read | null |
| 2025-04-23 | SemanticSugarBeets: A Multi-Task Framework and Dataset for Inspecting Harvest and Storage Characteristics of Sugar Beets | Gerardus Croonen et.al. | 2504.16684 | translate | read | null |
| 2025-04-23 | Federated EndoViT: Pretraining Vision Transformers via Federated Learning on Endoscopic Image Collections | Max Kirchner et.al. | 2504.16612 | translate | read | null |
| 2025-04-23 | SAIP-Net: Enhancing Remote Sensing Image Segmentation via Spectral Adaptive Information Propagation | Zhongtao Wang et.al. | 2504.16564 | translate | read | null |
| 2025-04-23 | Beyond Anonymization: Object Scrubbing for Privacy-Preserving 2D and 3D Vision Tasks | Murat Bilgehan Ertan et.al. | 2504.16557 | translate | read | null |
| 2025-04-22 | Efficient Adaptation of Deep Neural Networks for Semantic Segmentation in Space Applications | Leonardo Olivi et.al. | 2504.15991 | translate | read | null |
| 2025-04-22 | DINOv2-powered Few-Shot Semantic Segmentation: A Unified Framework via Cross-Model Distillation and 4D Correlation Mining | Wei Zhuo et.al. | 2504.15669 | translate | read | null |
| 2025-04-21 | Segmentation with Noisy Labels via Spatially Correlated Distributions | Ryu Tadokoro et.al. | 2504.14795 | translate | read | link |
| 2025-04-20 | NVSMask3D: Hard Visual Prompting with Camera Pose Interpolation for 3D Open Vocabulary Instance Segmentation | Junyuan Fang et.al. | 2504.14638 | translate | read | null |
| 2025-04-19 | Exploring Modality Guidance to Enhance VFM-based Feature Fusion for UDA in 3D Semantic Segmentation | Johannes Spoecklberger et.al. | 2504.14231 | translate | read | null |
| 2025-04-19 | Segment Any Crack: Deep Semantic Segmentation Adaptation for Crack Detection | Ghodsiyeh Rostami et.al. | 2504.14138 | translate | read | null |
| 2025-04-19 | Lightweight Road Environment Segmentation using Vector Quantization | Jiyong Kwag et.al. | 2504.14113 | translate | read | null |
| 2025-04-18 | Occlusion-Ordered Semantic Instance Segmentation | Soroosh Baselizadeh et.al. | 2504.14054 | translate | read | null |
| 2025-04-18 | HDBFormer: Efficient RGB-D Semantic Segmentation with A Heterogeneous Dual-Branch Framework | Shuobin Wei et.al. | 2504.13579 | translate | read | null |
| 2025-04-18 | Learning from Noisy Pseudo-labels for All-Weather Land Cover Mapping | Wang Liu et.al. | 2504.13458 | translate | read | link |
| 2025-04-18 | DADU: Dual Attention-based Deep Supervised UNet for Automated Semantic Segmentation of Cardiac Images | Racheal Mukisa et.al. | 2504.13415 | translate | read | null |
| 2025-04-18 | Cardiac MRI Semantic Segmentation for Ventricles and Myocardium using Deep Learning | Racheal Mukisa et.al. | 2504.13391 | translate | read | null |
| 2025-04-17 | SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling | Yasin Almalioglu et.al. | 2504.13310 | translate | read | null |
| 2025-04-17 | Digital Twin Generation from Visual Data: A Survey | Andrew Melnik et.al. | 2504.13159 | translate | read | null |
| 2025-04-17 | High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion | Libo Zhang et.al. | 2504.12844 | translate | read | null |
| 2025-04-17 | Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation | Siyu Chen et.al. | 2504.12753 | translate | read | link |
| 2025-04-17 | Parsimonious Dataset Construction for Laparoscopic Cholecystectomy Structure Segmentation | Yuning Zhou et.al. | 2504.12573 | translate | read | null |
| 2025-04-17 | Privacy-Preserving Operating Room Workflow Analysis using Digital Twins | Alejandra Perez et.al. | 2504.12552 | translate | read | null |
| 2025-04-16 | 3D-PointZshotS: Geometry-Aware 3D Point Cloud Zero-Shot Semantic Segmentation Narrowing the Visual-Semantic Gap | Minmin Yang et.al. | 2504.12442 | translate | read | null |
| 2025-04-16 | Remote sensing colour image semantic segmentation of trails created by large herbivorous Mammals | Jose Francisco Diez-Pastor et.al. | 2504.12121 | translate | read | null |
| 2025-04-17 | DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Mengshi Qi et.al. | 2504.12080 | translate | read | link |
| 2025-04-16 | Single-shot Star-convex Polygon-based Instance Segmentation for Spatially-correlated Biomedical Objects | Trina De et.al. | 2504.12078 | translate | read | null |
| 2025-04-16 | CAGS: Open-Vocabulary 3D Scene Understanding with Context-Aware Gaussian Splatting | Wei Sun et.al. | 2504.11893 | translate | read | null |
| 2025-04-15 | CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image | Jingshun Huang et.al. | 2504.11230 | translate | read | null |
| 2025-04-15 | Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation | Andrea Simonelli et.al. | 2504.11024 | translate | read | null |
| 2025-04-15 | PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation | Bo-Cheng Hu et.al. | 2504.10986 | translate | read | null |
| 2025-04-15 | LightFormer: A lightweight and efficient decoder for remote sensing image segmentation | Sihang Chen et.al. | 2504.10834 | translate | read | null |
| 2025-04-15 | OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding | Dianbing Xi et.al. | 2504.10825 | translate | read | null |
| 2025-04-15 | Efficient and Robust Remote Sensing Image Denoising Using Randomized Approximation of Geodesics’ Gramian on the Manifold Underlying the Patch Space | Kelum Gajamannage et.al. | 2504.10820 | translate | read | null |
| 2025-04-14 | Real-time Seafloor Segmentation and Mapping | Michele Grimaldi et.al. | 2504.10750 | translate | read | null |
| 2025-04-14 | FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation | Yasser Benigmim et.al. | 2504.10487 | translate | read | null |
| 2025-04-14 | The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer | Weixian Lei et.al. | 2504.10462 | translate | read | null |
| 2025-04-14 | M2S-RoAD: Multi-Modal Semantic Segmentation for Road Damage Using Camera and LiDAR Data | Tzu-Yun Tseng et.al. | 2504.10123 | translate | read | null |
| 2025-04-14 | DUDA: Distilled Unsupervised Domain Adaptation for Lightweight Semantic Segmentation | Beomseok Kang et.al. | 2504.09814 | translate | read | null |
| 2025-04-14 | IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme | Dinh Dai Quan Tran et.al. | 2504.09797 | translate | read | null |
| 2025-04-14 | Advancing RFI-Detection in Radio Astronomy with Liquid State Machines | Nicholas J Pritchard et.al. | 2504.09796 | translate | read | null |
| 2025-04-12 | Evolved Hierarchical Masking for Self-Supervised Learning | Zhanzhou Feng et.al. | 2504.09155 | translate | read | null |
| 2025-04-11 | Data-Importance-Aware Power Allocation for Adaptive Real-Time Communication in Computer Vision Applications | Chunmei Xu et.al. | 2504.08922 | translate | read | null |
| 2025-04-11 | Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing | Vinal Asodia et.al. | 2504.08704 | translate | read | null |
| 2025-04-11 | Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation | Bram Vanherle et.al. | 2504.08473 | translate | read | link |
| 2025-04-11 | SN-LiDAR: Semantic Neural Fields for Novel Space-time View LiDAR Synthesis | Yi Chen et.al. | 2504.08361 | translate | read | null |
| 2025-04-11 | DSM: Building A Diverse Semantic Map for 3D Visual Grounding | Qinghongbing Xie et.al. | 2504.08307 | translate | read | null |
| 2025-04-10 | ChildlikeSHAPES: Semantic Hierarchical Region Parsing for Animating Figure Drawings | Astitva Srivastava et.al. | 2504.08022 | translate | read | null |
| 2025-04-10 | P2Object: Single Point Supervised Object Detection and Instance Segmentation | Pengfei Chen et.al. | 2504.07813 | translate | read | null |
| 2025-04-10 | Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation | Yanglin Huang et.al. | 2504.07691 | translate | read | null |
| 2025-04-10 | SydneyScapes: Image Segmentation for Australian Environments | Hongyu Lyu et.al. | 2504.07542 | translate | read | null |
| 2025-04-10 | RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability | Jonggwon Park et.al. | 2504.07416 | translate | read | null |
| 2025-04-09 | RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration | Omar Alama et.al. | 2504.06994 | translate | read | null |
| 2025-04-09 | Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting | Daiwei Zhang et.al. | 2504.06978 | translate | read | null |
| 2025-04-09 | Domain Generalization through Attenuation of Domain-Specific Information | Reiji Saito et.al. | 2504.06781 | translate | read | null |
| 2025-04-08 | SemiDAViL: Semi-supervised Domain Adaptation with Vision-Language Guidance for Semantic Segmentation | Hritam Basak et.al. | 2504.06389 | translate | read | null |
| 2025-04-09 | Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation | Xiaoxing Hu et.al. | 2504.06220 | translate | read | null |
| 2025-04-08 | WoundAmbit: Bridging State-of-the-Art Semantic Segmentation and Real-World Wound Care | Vanessa Borst et.al. | 2504.06185 | translate | read | null |
| 2025-04-08 | Towards Varroa destructor mite detection using a narrow spectra illumination | Samuel Bielik et.al. | 2504.06099 | translate | read | null |
| 2025-04-08 | econSG: Efficient and Multi-view Consistent Open-Vocabulary 3D Semantic Gaussians | Can Zhang et.al. | 2504.06003 | translate | read | null |
| 2025-04-08 | Turin3D: Evaluating Adaptation Strategies under Label Scarcity in Urban LiDAR Segmentation with Semi-Supervised Techniques | Luca Barco et.al. | 2504.05882 | translate | read | null |
| 2025-04-08 | DefMamba: Deformable Visual State Space Model | Leiye Liu et.al. | 2504.05794 | translate | read | null |
| 2025-04-08 | Transferable Mask Transformer: Cross-domain Semantic Segmentation with Region-adaptive Transferability Estimation | Enming Zhang et.al. | 2504.05774 | translate | read | null |
| 2025-04-07 | S^4M: Boosting Semi-Supervised Instance Segmentation with SAM | Heeji Yoon et.al. | 2504.05301 | translate | read | null |
| 2025-04-07 | BoxSeg: Quality-Aware and Peer-Assisted Learning for Box-supervised Instance Segmentation | Jinxiang Lai et.al. | 2504.05137 | translate | read | null |
| 2025-04-07 | Balancing Robustness and Efficiency in Embedded DNNs Through Activation Function Selection | Jon Gutiérrez Zaballa et.al. | 2504.05119 | translate | read | null |
| 2025-04-07 | Prior2Former – Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation | Sebastian Schmidt et.al. | 2504.04841 | translate | read | null |
| 2025-04-07 | DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation | Bo-Wen Yin et.al. | 2504.04701 | translate | read | link |
| 2025-04-06 | Statistical Guarantees Of False Discovery Rate In Medical Instance Segmentation Tasks Based on Conformal Risk Control | Mengxia Dai et.al. | 2504.04482 | translate | read | null |
| 2025-04-06 | Evaluation framework for Image Segmentation Algorithms | Tatiana Merkulova et.al. | 2504.04435 | translate | read | null |
| 2025-04-05 | CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation | Kai Fang et.al. | 2504.04156 | translate | read | null |
| 2025-04-05 | DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning | Xiao-Hui Li et.al. | 2504.04085 | translate | read | null |
| 2025-04-04 | Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation | Xin Zhang et.al. | 2504.03193 | translate | read | null |
| 2025-04-03 | Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation | Feng Gao et.al. | 2504.02647 | translate | read | null |
| 2025-04-03 | Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results | Andrei Dumitriu et.al. | 2504.02558 | translate | read | null |
| 2025-04-03 | Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery | Mykola Lavreniuk et.al. | 2504.02534 | translate | read | null |
| 2025-04-03 | Semantic segmentation of forest stands using deep learning | Håkon Næss Sandum et.al. | 2504.02471 | translate | read | null |
| 2025-04-03 | Taylor Series-Inspired Local Structure Fitting Network for Few-shot Point Cloud Semantic Segmentation | Changshuo Wang et.al. | 2504.02454 | translate | read | null |
| 2025-04-03 | Benchmark of Segmentation Techniques for Pelvic Fracture in CT and X-ray: Summary of the PENGWIN 2024 Challenge | Yudi Sang et.al. | 2504.02382 | translate | read | null |
| 2025-04-03 | APSeg: Auto-Prompt Model with Acquired and Injected Knowledge for Nuclear Instance Segmentation and Classification | Liying Xu et.al. | 2504.02222 | translate | read | null |
| 2025-04-02 | Scene-Centric Unsupervised Panoptic Segmentation | Oliver Hahn et.al. | 2504.01955 | translate | read | link |
| 2025-04-02 | Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation | Junjie Chen et.al. | 2504.01668 | translate | read | null |
| 2025-04-03 | Robust Unsupervised Domain Adaptation for 3D Point Cloud Segmentation Under Source Adversarial Attacks | Haosheng Li et.al. | 2504.01659 | translate | read | null |
| 2025-04-02 | ProtoGuard-guided PROPEL: Class-Aware Prototype Enhancement and Progressive Labeling for Incremental 3D Point Cloud Segmentation | Haosheng Li et.al. | 2504.01648 | translate | read | null |
| 2025-04-02 | Benchmarking the Spatial Robustness of DNNs via Natural and Adversarial Localized Corruptions | Giulia Marchiori Pietrosanti et.al. | 2504.01632 | translate | read | null |
| 2025-04-02 | Instance Migration Diffusion for Nuclear Instance Segmentation in Pathology | Lirui Qi et.al. | 2504.01577 | translate | read | null |
| 2025-04-02 | Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training | Luca Ciampi et.al. | 2504.01547 | translate | read | null |
| 2025-04-02 | Beyond Nearest Neighbor Interpolation in Data Augmentation | Olivier Rukundo et.al. | 2504.01527 | translate | read | null |
| 2025-04-02 | Multimodal Point Cloud Semantic Segmentation With Virtual Point Enhancement | Zaipeng Duan et.al. | 2504.01449 | translate | read | null |
| 2025-04-02 | v-CLR: View-Consistent Learning for Open-World Instance Segmentation | Chang-Bin Zhang et.al. | 2504.01383 | translate | read | null |
(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)