Semantic Segmentation - 2025-02
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2025-02-28 | The Common Objects Underwater (COU) Dataset for Robust Underwater Object Detection | Rishi Mukherjee et.al. | 2502.20651 | translate | read | null |
| 2025-02-27 | Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds | Mohamed Abdelsamad et.al. | 2502.20316 | translate | read | null |
| 2025-02-27 | OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels | Meng Lou et.al. | 2502.20087 | translate | read | link |
| 2025-02-28 | SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird’s-Eye-View Segmentation | Zijie Zhou et.al. | 2502.20077 | translate | read | link |
| 2025-02-27 | Learning Mask Invariant Mutual Information for Masked Image Modeling | Tao Huang et.al. | 2502.19718 | translate | read | null |
| 2025-02-28 | You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving | Guangfeng Jiang et.al. | 2502.19698 | translate | read | null |
| 2025-02-26 | Knowledge Distillation for Semantic Segmentation: A Label Space Unification Approach | Anton Backhaus et.al. | 2502.19177 | translate | read | null |
| 2025-02-26 | Enhanced Neuromorphic Semantic Segmentation Latency through Stream Event | D. Hareb et.al. | 2502.18982 | translate | read | null |
| 2025-02-28 | OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation | Yunpeng Gao et.al. | 2502.18041 | translate | read | null |
| 2025-02-25 | CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems | Rui Liu et.al. | 2502.17821 | translate | read | null |
| 2025-02-24 | CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation | Vishal Thengane et.al. | 2502.17429 | translate | read | link |
| 2025-02-25 | DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks | Canyu Zhao et.al. | 2502.17157 | translate | read | link |
| 2025-02-24 | SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations | Wendi Liu et.al. | 2502.17056 | translate | read | null |
| 2025-02-25 | VPNeXt – Rethinking Dense Decoding for Plain Vision Transformer | Xikai Tang et.al. | 2502.16654 | translate | read | null |
| 2025-02-23 | Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration | Kim Jun-Seong et.al. | 2502.16652 | translate | read | null |
| 2025-02-23 | OpenVox: Real-time Instance-level Open-vocabulary Probabilistic Voxel Representation | Yinan Deng et.al. | 2502.16528 | translate | read | null |
| 2025-02-23 | Deep learning approaches to surgical video segmentation and object detection: A Scoping Review | Devanish N. Kamtam et.al. | 2502.16459 | translate | read | null |
| 2025-02-22 | Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field | Wenhao Hu et.al. | 2502.16303 | translate | read | null |
| 2025-02-22 | Importance-Aware Source-Channel Coding for Multi-Modal Task-Oriented Semantic Communication | Yi Ma et.al. | 2502.16194 | translate | read | null |
| 2025-02-22 | FeatSharp: Your Vision Model Features, Sharper | Mike Ranzinger et.al. | 2502.16025 | translate | read | link |
| 2025-02-21 | Aligning Task- and Reconstruction-Oriented Communications for Edge Intelligence | Yufeng Diao et.al. | 2502.15472 | translate | read | null |
| 2025-02-21 | DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation | Luzhou Ge et.al. | 2502.15309 | translate | read | link |
| 2025-02-21 | Confidence-Weighted Boundary-Aware Learning for Semi-Supervised Semantic Segmentation | Ebenezer Tarubinga et.al. | 2502.15152 | translate | read | link |
| 2025-02-20 | RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird’s Eye View Segmentation | Henrique Piñeiro Monteagudo et.al. | 2502.14792 | translate | read | null |
| 2025-02-20 | Multi-dataset synergistic in supervised learning to pre-label structural components in point clouds from shell construction scenes | Lukas Rauch et.al. | 2502.14721 | translate | read | null |
| 2025-02-20 | Reliable Explainability of Deep Learning Spatial-Spectral Classifiers for Improved Semantic Segmentation in Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2502.14416 | translate | read | null |
| 2025-02-20 | Bayesian SegNet for Semantic Segmentation with Improved Interpretation of Microstructural Evolution During Irradiation of Materials | Marjolein Oostrom et.al. | 2502.14184 | translate | read | null |
| 2025-02-19 | SegRet: An Efficient Design for Semantic Segmentation with Retentive Network | Zhiyuan Li et.al. | 2502.14014 | translate | read | link |
| 2025-02-19 | Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model | Huiying Shi et.al. | 2502.13990 | translate | read | null |
| 2025-02-19 | MGFI-Net: A Multi-Grained Feature Integration Network for Enhanced Medical Image Segmentation | Yucheng Zeng et.al. | 2502.13808 | translate | read | null |
| 2025-02-19 | CARE: Confidence-Aware Regression Estimation of building density fine-tuning EO Foundation Models | Nikolaos Dionelis et.al. | 2502.13734 | translate | read | null |
| 2025-02-18 | WeedsGalore: A Multispectral and Multitemporal UAV-based Dataset for Crop and Weed Segmentation in Agricultural Maize Fields | Ekin Celikkan et.al. | 2502.13103 | translate | read | link |
| 2025-02-18 | Enhancing Power Grid Inspections with Machine Learning | Diogo Lavado et.al. | 2502.13037 | translate | read | null |
| 2025-02-18 | DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Tanzhe Li et.al. | 2502.12627 | translate | read | null |
| 2025-02-17 | From Open-Vocabulary to Vocabulary-Free Semantic Segmentation | Klara Reichard et.al. | 2502.11891 | translate | read | null |
| 2025-02-16 | Leveraging Multimodal-LLMs Assisted by Instance Segmentation for Intelligent Traffic Monitoring | Murat Arda Onsu et.al. | 2502.11304 | translate | read | null |
| 2025-02-16 | Text-promptable Propagation for Referring Medical Image Sequence Segmentation | Runtian Yuan et.al. | 2502.11093 | translate | read | null |
| 2025-02-16 | Detecting Cadastral Boundary from Satellite Images Using U-Net model | Neda Rahimpour Anaraki et.al. | 2502.11044 | translate | read | null |
| 2025-02-15 | NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing | Shutong Zhang et.al. | 2502.10720 | translate | read | null |
| 2025-02-15 | Deep Learning for Wound Tissue Segmentation: A Comprehensive Evaluation using A Novel Dataset | Muhammad Ashad Kabir et.al. | 2502.10652 | translate | read | null |
| 2025-02-14 | Artificial Intelligence to Assess Dental Findings from Panoramic Radiographs – A Multinational Study | Yin-Chih Chelsea Wang et.al. | 2502.10277 | translate | read | null |
| 2025-02-14 | FrGNet: A fourier-guided weakly-supervised framework for nuclear instance segmentation | Peng Ling et.al. | 2502.09874 | translate | read | null |
| 2025-02-12 | Towards Fine-grained Interactive Segmentation in Images and Videos | Yuan Yao et.al. | 2502.09660 | translate | read | null |
| 2025-02-13 | Instance Segmentation of Scene Sketches Using Natural Image Priors | Mia Tang et.al. | 2502.09608 | translate | read | null |
| 2025-02-13 | SQ-GAN: Semantic Image Communications Using Masked Vector Quantization | Francesco Pezone et.al. | 2502.09520 | translate | read | null |
| 2025-02-13 | FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation | Bin Yang et.al. | 2502.09274 | translate | read | null |
| 2025-02-13 | Memory-based Ensemble Learning in CMR Semantic Segmentation | Yiwei Liu et.al. | 2502.09269 | translate | read | link |
| 2025-02-13 | Latents of latents to delineate pixels: hybrid Matryoshka autoencoder-to-U-Net pairing for segmenting large medical images in GPU-poor and low-data regimes | Tahir Syed et.al. | 2502.08988 | translate | read | null |
| 2025-02-12 | HistoSmith: Single-Stage Histology Image-Label Generation via Conditional Latent Diffusion for Enhanced Cell Segmentation and Classification | Valentina Vadori et.al. | 2502.08754 | translate | read | link |
| 2025-02-12 | Generalized Class Discovery in Instance Segmentation | Cuong Manh Hoang et.al. | 2502.08149 | translate | read | null |
| 2025-02-12 | Knowledge Swapping via Learning and Unlearning | Mingyu Xing et.al. | 2502.08075 | translate | read | null |
| 2025-02-11 | Efficient Continuous Group Convolutions for Local SE(3) Equivariance in 3D Point Clouds | Lisa Weijler et.al. | 2502.07505 | translate | read | link |
| 2025-02-11 | A Survey on Mamba Architecture for Vision Applications | Fady Ibrahim et.al. | 2502.07161 | translate | read | null |
| 2025-02-09 | A Comprehensive Review of U-Net and Its Variants: Advances and Applications in Medical Image Segmentation | Wang Jiangtao et.al. | 2502.06895 | translate | read | null |
| 2025-02-10 | SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement | Yuqi Lin et.al. | 2502.06756 | translate | read | null |
| 2025-02-10 | A Large-scale AI-generated Image Inpainting Benchmark | Paschalis Giakoumoglou et.al. | 2502.06593 | translate | read | link |
| 2025-02-11 | Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation | Emanuele Mule et.al. | 2502.06288 | translate | read | null |
| 2025-02-10 | Unsupervised deep learning for semantic segmentation of multispectral LiDAR forest point clouds | Lassi Ruoppa et.al. | 2502.06227 | translate | read | null |
| 2025-02-09 | Traveling Waves Integrate Spatial Information Into Spectral Representations | Mozes Jacobs et.al. | 2502.06034 | translate | read | null |
| 2025-02-11 | VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer | Xinyu Liu et.al. | 2502.05979 | translate | read | null |
| 2025-02-09 | LegalSeg: Unlocking the Structure of Indian Legal Judgments Through Rhetorical Role Classification | Shubham Kumar Nigam et.al. | 2502.05836 | translate | read | null |
| 2025-02-08 | Convolutional Neural Network Segmentation for Satellite Imagery Data to Identify Landforms Using U-Net Architecture | Mitul Goswami et.al. | 2502.05476 | translate | read | null |
| 2025-02-08 | LMS-Net: A Learned Mumford-Shah Network For Few-Shot Medical Image Segmentation | Shengdong Zhang et.al. | 2502.05473 | translate | read | null |
| 2025-02-08 | A Novel Convolutional-Free Method for 3D Medical Imaging Segmentation | Canxuan Gang et.al. | 2502.05396 | translate | read | null |
| 2025-02-07 | IPSeg: Image Posterior Mitigates Semantic Drift in Class-Incremental Segmentation | Xiao Yu et.al. | 2502.04870 | translate | read | null |
| 2025-02-07 | AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers | Runqing Jiang et.al. | 2502.04628 | translate | read | null |
| 2025-02-05 | DILLEMA: Diffusion and Large Language Models for Multi-Modal Augmentation | Luciano Baresi et.al. | 2502.04378 | translate | read | link |
| 2025-02-06 | Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation | Jiahao Lu et.al. | 2502.04139 | translate | read | null |
| 2025-02-06 | Adaptive Margin Contrastive Learning for Ambiguity-aware 3D Semantic Segmentation | Yang Chen et.al. | 2502.04111 | translate | read | null |
| 2025-02-06 | LeAP: Consistent multi-domain 3D labeling using Foundation Models | Simon Gebraad et.al. | 2502.03901 | translate | read | null |
| 2025-02-06 | Optimized Unet with Attention Mechanism for Multi-Scale Semantic Segmentation | Xuan Li et.al. | 2502.03813 | translate | read | null |
| 2025-02-05 | Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics | Indrashis Das et.al. | 2502.03654 | translate | read | link |
| 2025-02-05 | ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models | Ying Zhang et.al. | 2502.03266 | translate | read | link |
| 2025-02-05 | Disentangling CLIP Features for Enhanced Localized Understanding | Samyak Rawelekar et.al. | 2502.02977 | translate | read | null |
| 2025-02-05 | From DeepSense to Open RAN: AI/ML Advancements in Dynamic Spectrum Sensing and Their Applications | Ryan Barker et.al. | 2502.02889 | translate | read | null |
| 2025-02-04 | Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications | William O’Donnell et.al. | 2502.02624 | translate | read | null |
| 2025-02-04 | COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation | Xueqing Deng et.al. | 2502.02589 | translate | read | null |
| 2025-02-04 | Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation | Junha Lee et.al. | 2502.02548 | translate | read | null |
| 2025-02-04 | Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and Classification | Valentina Vadori et.al. | 2502.02471 | translate | read | null |
| 2025-02-04 | Transfer Risk Map: Mitigating Pixel-level Negative Transfer in Medical Segmentation | Shutong Duan et.al. | 2502.02340 | translate | read | null |
| 2025-02-04 | UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation | Tao Zhang et.al. | 2502.02257 | translate | read | link |
| 2025-02-04 | Deep Ensemble approach for Enhancing Brain Tumor Segmentation in Resource-Limited Settings | Jeremiah Fadugba et.al. | 2502.02179 | translate | read | null |
| 2025-02-04 | Memory Efficient Transformer Adapter for Dense Predictions | Dong Zhang et.al. | 2502.01962 | translate | read | null |
| 2025-02-03 | Deep Unfolding Multi-modal Image Fusion Network via Attribution Analysis | Haowen Bai et.al. | 2502.01467 | translate | read | null |
| 2025-02-03 | Temporal-consistent CAMs for Weakly Supervised Video Segmentation in Waste Sorting | Andrea Marelli et.al. | 2502.01455 | translate | read | null |
| 2025-02-03 | ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies | Costin F. Ciusdel et.al. | 2502.01335 | translate | read | null |
| 2025-02-03 | Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models | Hao Dong et.al. | 2501.18592 | translate | read | link |