Semantic Segmentation

Publish Date Title Authors PDF Code
2025-12-18 Next-Embedding Prediction Makes Strong Vision Learners Sihan Xu et.al. 2512.16922 null
2025-12-18 Task-Oriented Data Synthesis and Control-Rectify Sampling for Remote Sensing Semantic Segmentation Yunkai Yang et.al. 2512.16740 null
2025-12-18 Causal-Tune: Mining Causal Factors from Vision Foundation Models for Domain Generalized Semantic Segmentation Yin Zhang et.al. 2512.16567 null
2025-12-18 PixelArena: A benchmark for Pixel-Precision Visual Intelligence Feng Liang et.al. 2512.16303 null
2025-12-17 In Pursuit of Pixel Supervision for Visual Pre-training Lihe Yang et.al. 2512.15715 null
2025-12-17 MoonSeg3R: Monocular Online Zero-Shot Segment Anything in 3D with Reconstructive Foundation Priors Zhipeng Du et.al. 2512.15577 null
2025-12-17 SemanticBridge - A Dataset for 3D Semantic Segmentation of Bridges and Domain Gap Analysis Maximilian Kellner et.al. 2512.15369 null
2025-12-17 Vision-based module for accurately reading linear scales in a laboratory Parvesh Saini et.al. 2512.15327 null
2025-12-17 SynthSeg-Agents: Multi-Agent Synthetic Data Generation for Zero-Shot Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2512.15310 null
2025-12-16 Segmental Attention Decoding With Long Form Acoustic Encodings Pawel Swietojanski et.al. 2512.14652 null
2025-12-16 S2D: Sparse-To-Dense Keymask Distillation for Unsupervised Video Instance Segmentation Leon Sick et.al. 2512.14440 null
2025-12-16 DriverGaze360: OmniDirectional Driver Attention with Object-Level Guidance Shreedhar Govil et.al. 2512.14266 null
2025-12-16 Consistent Instance Field for Dynamic Scene Understanding Junyi Wu et.al. 2512.14126 null
2025-12-16 ChartAgent: A Chart Understanding Framework with Tool Integrated Reasoning Boran Wang et.al. 2512.14040 null
2025-12-16 Deep Learning Perspective of Scene Understanding in Autonomous Robots Afia Maham et.al. 2512.14020 null
2025-12-15 Seeing the Whole Picture: Distribution-Guided Data-Free Distillation for Semantic Segmentation Hongxuan Sun et.al. 2512.13175 null
2025-12-15 JoDiffusion: Jointly Diffusing Image with Pixel-Level Annotations for Semantic Segmentation Promotion Haoyu Wang et.al. 2512.13014 null
2025-12-15 TWLR: Text-Guided Weakly-Supervised Lesion Localization and Severity Regression for Explainable Diabetic Retinopathy Grading Xi Luo et.al. 2512.13008 null
2025-12-13 OMUDA: Omni-level Masking for Unsupervised Domain Adaptation in Semantic Segmentation Yang Ou et.al. 2512.12303 null
2025-12-12 Enhancing deep learning performance on burned area delineation from SPOT-6/7 imagery for emergency management Maria Rodriguez et.al. 2512.12056 null
2025-12-09 Generalization vs. Specialization: Evaluating Segment Anything Model (SAM3) Zero-Shot Segmentation Against Fine-Tuned YOLO Detectors Ranjan Sapkota et.al. 2512.11884 null
2025-12-07 Pseudo-Label Refinement for Robust Wheat Head Segmentation via Two-Stage Hybrid Training Jiahao Jiang et.al. 2512.11874 null
2025-12-12 Referring Change Detection in Remote Sensing Imagery Yilmaz Korkmaz et.al. 2512.11719 null
2025-12-12 DOS: Distilling Observable Softmaps of Zipfian Prototypes for Self-Supervised Point Representation Mohamed Abdelsamad et.al. 2512.11465 null
2025-12-12 Out-of-Distribution Segmentation via Wasserstein-Based Evidential Uncertainty Arnold Brosch et.al. 2512.11373 null
2025-12-12 VFMF: World Modeling by Forecasting Vision Foundation Model Features Gabrijel Boduljak et.al. 2512.11225 null
2025-12-11 Take a Peek: Efficient Encoder Adaptation for Few-Shot Semantic Segmentation via LoRA Pasquale De Marinis et.al. 2512.10521 null
2025-12-11 Hybrid Transformer-Mamba Architecture for Weakly Supervised Volumetric Medical Segmentation Yiheng Lyu et.al. 2512.10353 null
2025-12-11 ConStruct: Structural Distillation of Foundation Models for Prototype-Based Weakly Supervised Histopathology Segmentation Khang Le et.al. 2512.10316 null
2025-12-11 DualProtoSeg: Simple and Efficient Design with Text- and Image-Guided Prototype Learning for Weakly Supervised Histopathology Image Segmentation Anh M. Vu et.al. 2512.10314 null
2025-12-10 NordFKB: a fine-grained benchmark dataset for geospatial AI in Norway Sander Riisøen Jyhne et.al. 2512.09913 null
2025-12-10 ASSIST-3D: Adapted Scene Synthesis for Class-Agnostic 3D Instance Segmentation Shengchao Zhou et.al. 2512.09364 null
2025-12-10 ROI-Packing: Efficient Region-Based Compression for Machine Vision Md Eimran Hossain Eimon et.al. 2512.09258 null
2025-12-09 SIP: Site in Pieces- A Dataset of Disaggregated Construction-Phase 3D Scans for Semantic Segmentation and Scene Understanding Seongyong Kim et.al. 2512.09062 null
2025-12-09 Persistent Homology for Labeled Datasets: Gromov-Hausdorff Stability and Generalized Landscapes Yaoying Fu et.al. 2512.08794 null
2025-12-09 SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images Kaiyu Li et.al. 2512.08730 null
2025-12-09 Instance-Aware Test-Time Segmentation for Continual Domain Shifts Seunghwan Lee et.al. 2512.08569 null
2025-12-09 Query-aware Hub Prototype Learning for Few-Shot 3D Point Cloud Semantic Segmentation YiLin Zhou et.al. 2512.08253 null
2025-12-08 Restrictive Hierarchical Semantic Segmentation for Stratified Tooth Layer Detection Ryan Banks et.al. 2512.07984 null
2025-12-08 Structure-Aware Feature Rectification with Region Adjacency Graphs for Training-Free Open-Vocabulary Semantic Segmentation Qiming Huang et.al. 2512.07360 null
2025-12-08 Generalized Referring Expression Segmentation on Aerial Photos Luís Marnoto et.al. 2512.07338 null
2025-12-08 A graph generation pipeline for critical infrastructures based on heuristics, images and depth data Mike Diessner et.al. 2512.07269 null
2025-12-07 Power of Boundary and Reflection: Semantic Transparent Object Segmentation using Pyramid Vision Transformer with Transparent Cues Tuan-Anh Vu et.al. 2512.07034 null
2025-12-07 Selective Masking based Self-Supervised Learning for Image Semantic Segmentation Yuemin Wang et.al. 2512.06981 null
2025-12-07 Balanced Learning for Domain Adaptive Semantic Segmentation Wangkai Li et.al. 2512.06886 null
2025-12-07 Hierarchical Image-Guided 3D Point Cloud Segmentation in Industrial Scenes via Multi-View Bayesian Fusion Yu Zhu et.al. 2512.06882 null
2025-12-07 Towards Robust Pseudo-Label Learning in Semantic Segmentation: An Encoding Perspective Wangkai Li et.al. 2512.06870 null
2025-12-07 Boosting Unsupervised Video Instance Segmentation with Automatic Quality-Guided Self-Training Kaixuan Lu et.al. 2512.06864 null
2025-12-07 FedDSR: Federated Deep Supervision and Regularization Towards Autonomous Driving Wei-Bin Kou et.al. 2512.06676 null
2025-12-07 Statistic-Augmented, Decoupled MoE Routing and Aggregating in Autonomous Driving Wei-Bin Kou et.al. 2512.06664 null
2025-12-07 CoT4Det: A Chain-of-Thought Framework for Perception-Oriented Vision-Language Tasks Yu Qi et.al. 2512.06663 null
2025-12-06 Are AI-Generated Driving Videos Ready for Autonomous Driving? A Diagnostic Evaluation Framework Xinhao Xiang et.al. 2512.06376 null
2025-12-03 Fast and Flexible Robustness Certificates for Semantic Segmentation Thomas Massena et.al. 2512.06010 null
2025-11-30 Stronger is not better: Better Augmentations in Contrastive Learning for Medical Image Segmentation Azeez Idris et.al. 2512.05992 null
2025-12-05 LPD: Learnable Prototypes with Diversity Regularization for Weakly Supervised Histopathology Segmentation Khang Le et.al. 2512.05922 null
2025-12-05 Label-Efficient Point Cloud Segmentation with Active Learning Johannes Meyer et.al. 2512.05759 null
2025-12-05 DistillFSS: Synthesizing Few-Shot Knowledge into a Lightweight Segmentation Model Pasquale De Marinis et.al. 2512.05613 null
2025-12-01 FlowEO: Generative Unsupervised Domain Adaptation for Earth Observation Georges Le Bellier et.al. 2512.05140 null
2025-12-04 GeoPE:A Unified Geometric Positional Embedding for Structured Tensors Yupu Yao et.al. 2512.04963 null
2025-12-04 MT-Depth: Multi-task Instance feature analysis for the Depth Completion Abdul Haseeb Nizamani et.al. 2512.04734 null
2025-12-04 DuGI-MAE: Improving Infrared Mask Autoencoders via Dual-Domain Guidance Yinghui Xing et.al. 2512.04511 null
2025-12-03 A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection Shahid Ansari et.al. 2512.03684 null
2025-12-03 OpenTrack3D: Towards Accurate and Generalizable Open-Vocabulary 3D Instance Segmentation Zhishan Zhou et.al. 2512.03532 null
2025-12-03 Exploiting Domain Properties in Language-Driven Domain Generalization for Semantic Segmentation Seogkyu Jeon et.al. 2512.03508 null
2025-12-02 Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks Matthew Dutson et.al. 2512.03014 null
2025-12-02 Enhancing Floor Plan Recognition: A Hybrid Mix-Transformer and U-Net Approach for Precise Wall Segmentation Dmitriy Parashchuk et.al. 2512.02413 null
2025-12-02 Reproducing and Extending RaDelft 4D Radar with Camera-Assisted Labels Kejia Hu et.al. 2512.02394 null
2025-12-02 SAGE: Style-Adaptive Generalization for Privacy-Constrained Semantic Segmentation Across Domains Qingmei Li et.al. 2512.02369 null
2025-12-01 Multifractal Recalibration of Neural Networks for Medical Imaging Segmentation Miguel L. Martins et.al. 2512.02198 null
2025-12-01 Evaluating SAM2 for Video Semantic Segmentation Syed Hesham Syed Ariff et.al. 2512.01774 null
2025-12-01 SSR: Semantic and Spatial Rectification for CLIP-based Weakly Supervised Segmentation Xiuli Bi et.al. 2512.01701 null
2025-12-01 ViT$^3$: Unlocking Test-Time Training in Vision Dongchen Han et.al. 2512.01643 null
2025-12-01 Toward Content-based Indexing and Retrieval of Head and Neck CT with Abscess Segmentation Thao Thi Phuong Dao et.al. 2512.01589 null
2025-12-01 ELVIS: Enhance Low-Light for Video Instance Segmentation in the Dark Joanne Lin et.al. 2512.01495 null
2025-12-01 Panda: Self-distillation of Reusable Sensor-level Representations for High Energy Physics Samuel Young et.al. 2512.01324 null
2025-12-01 TabletopGen: Instance-Level Interactive 3D Tabletop Scene Generation from Text or Single Image Ziqian Wang et.al. 2512.01204 null
2025-11-30 Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation An Yang et.al. 2512.00944 null
2025-11-30 The Outline of Deception: Physical Adversarial Attacks on Traffic Signs Using Edge Patches Haojie Ji et.al. 2512.00765 null
2025-11-30 VFM-ISRefiner: Towards Better Adapting Vision Foundation Models for Interactive Segmentation of Remote Sensing Images Deliang Wang et.al. 2512.00718 null
2025-11-29 Doppler-Enhanced Deep Learning: Improving Thyroid Nodule Segmentation with YOLOv5 Instance Segmentation Mahmoud El Hussieni et.al. 2512.00639 null
2025-11-29 EZ-SP: Fast and Lightweight Superpoint-Based 3D Segmentation Louis Geist et.al. 2512.00385 null
2025-11-29 Breaking It Down: Domain-Aware Semantic Segmentation for Retrieval Augmented Generation Aparajitha Allamraju et.al. 2512.00367 null
2025-11-29 Towards aligned body representations in vision models Andrey Gizdov et.al. 2512.00365 null
2025-11-28 Learning to Predict Aboveground Biomass from RGB Images with 3D Synthetic Scenes Silvia Zuffi et.al. 2511.23249 null
2025-11-28 Taming the Light: Illumination-Invariant Semantic 3DGS-SLAM Shouhe Zhang et.al. 2511.22968 null
2025-11-28 Do We Need Perfect Data? Leveraging Noise for Domain Generalized Segmentation Taeyeong Kim et.al. 2511.22948 null
2025-11-27 GazeTrack: High-Precision Eye Tracking Based on Regularization and Spatial Computing Xiaoyin Yang et.al. 2511.22607 null
2025-11-27 3D Affordance Keypoint Detection for Robotic Manipulation Zhiyang Liu et.al. 2511.22195 null
2025-11-26 OpenTwinMap: An Open-Source Digital Twin Generator for Urban Autonomous Driving Alex Richardson et.al. 2511.21925 null
2025-11-26 ReSAM: Refine, Requery, and Reinforce: Self-Prompting Point-Supervised Segmentation for Remote Sensing Images M. Naseer Subhani et.al. 2511.21606 null
2025-11-26 Shift-Equivariant Complex-Valued Convolutional Neural Networks Quentin Gabot et.al. 2511.21250 null
2025-11-25 Open Vocabulary Compositional Explanations for Neuron Alignment Biagio La Rosa et.al. 2511.20931 null
2025-11-25 Automated Monitoring of Cultural Heritage Artifacts Using Semantic Segmentation Andrea Ranieri et.al. 2511.20541 null
2025-11-25 CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation Shilei Cao et.al. 2511.20302 null
2025-11-25 SAM-MI: A Mask-Injected Framework for Enhancing Open-Vocabulary Semantic Segmentation with SAM Lin Chen et.al. 2511.20027 null
2025-11-25 Supervise Less, See More: Training-free Nuclear Instance Segmentation with Prototype-Guided Prompting Wen Zhang et.al. 2511.19953 null
2025-11-24 Lightweight Transformer Framework for Weakly Supervised Semantic Segmentation Ali Torabi et.al. 2511.19765 null
2025-11-24 RADSeg: Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models Omar Alama et.al. 2511.19704 null
2025-11-24 Studying Maps at Scale: A Digital Investigation of Cartography and the Evolution of Figuration Remi Petitpierre et.al. 2511.19538 null
2025-11-24 BackSplit: The Importance of Sub-dividing the Background in Biomedical Lesion Segmentation Rachit Saluja et.al. 2511.19394 null
2025-11-24 nnActive: A Framework for Evaluation of Active Learning in 3D Biomedical Segmentation Carsten T. Lüth et.al. 2511.19183 null
2025-11-24 DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detection Hai Ci et.al. 2511.19111 null
2025-11-24 SupLID: Geometrical Guidance for Out-of-Distribution Detection in Semantic Segmentation Nimeshika Udayangani et.al. 2511.18816 null
2025-11-24 PartDiffuser: Part-wise 3D Mesh Generation via Discrete Diffusion Yichen Yang et.al. 2511.18801 null
2025-11-23 SegSplat: Feed-forward Gaussian Splatting and Open-Set Semantic Segmentation Peter Siegel et.al. 2511.18386 null
2025-11-23 UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization Siyi Li et.al. 2511.18254 null
2025-11-22 Matching-Based Few-Shot Semantic Segmentation Models Are Interpretable by Design Pasquale De Marinis et.al. 2511.18163 null
2025-11-22 AdaPerceiver: Transformers with Adaptive Width, Depth, and Tokens Purvish Jajal et.al. 2511.18105 null
2025-11-18 HSMix: Hard and Soft Mixing Data Augmentation for Medical Image Segmentation Danyang Sun et.al. 2511.17614 null
2025-11-21 Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift Björn Michele et.al. 2511.17455 null
2025-11-21 REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing Binger Chen et.al. 2511.17442 null
2025-11-21 FisheyeGaussianLift: BEV Feature Lifting for Surround-View Fisheye Camera Perception Shubham Sonarghare et.al. 2511.17210 null
2025-11-20 Late-decoupled 3D Hierarchical Semantic Segmentation with Semantic Prototype Discrimination based Bi-branch Supervision Shuyu Cao et.al. 2511.16650 null
2025-11-20 Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling Minseok Seo et.al. 2511.16301 null
2025-11-20 Target Refocusing via Attention Redistribution for Open-Vocabulary Semantic Segmentation: An Explainability Perspective Jiahao Li et.al. 2511.16170 null
2025-11-20 InfoCLIP: Bridging Vision-Language Pretraining and Open-Vocabulary Semantic Segmentation via Information-Theoretic Alignment Transfer Muyao Yuan et.al. 2511.15967 null
2025-11-19 Automatic Uncertainty-Aware Synthetic Data Bootstrapping for Historical Map Segmentation Lukas Arzoumanidis et.al. 2511.15875 null
2025-11-19 GEO-Bench-2: From Performance to Capability, Rethinking Evaluation in Geospatial AI Naomi Simumba et.al. 2511.15658 null
2025-11-19 Multi-Text Guided Few-Shot Semantic Segmentation Qiang Jiao et.al. 2511.15515 null
2025-11-19 WarNav: An Autonomous Driving Benchmark for Segmentation of Navigable Zones in War Scenes Marc-Emmanuel Coupvent des Graviers et.al. 2511.15429 null
2025-11-19 Controlling False Positives in Image Segmentation via Conformal Prediction Luca Mossina et.al. 2511.15406 null
2025-11-18 EGSA-PT:Edge-Guided Spatial Attention with Progressive Training for Monocular Depth Estimation and Segmentation of Transparent Objects Gbenga Omotara et.al. 2511.14970 null
2025-11-18 FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding Zhenshi Li et.al. 2511.14901 null
2025-11-18 Segmentation-Aware Latent Diffusion for Satellite Image Super-Resolution: Enabling Smallholder Farm Boundary Delineation Aditi Agarwal et.al. 2511.14481 null
2025-11-18 Step by Step Network Dongchen Han et.al. 2511.14329 null
2025-11-18 Orion: A Unified Visual Agent for Multimodal Perception, Advanced Visual Reasoning and Execution N Dinesh Reddy et.al. 2511.14210 null
2025-11-17 Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting Jiangnan Ye et.al. 2511.13684 null
2025-11-17 Mapping the Vanishing and Transformation of Urban Villages in China Wenyu Zhang et.al. 2511.13507 null
2025-11-17 Delineate Anything Flow: Fast, Country-Level Field Boundary Detection from Any Source Mykola Lavreniuk et.al. 2511.13417 null
2025-11-17 DiffPixelFormer: Differential Pixel-Aware Transformer for RGB-D Indoor Scene Segmentation Yan Gong et.al. 2511.13047 null
2025-11-15 FaNe: Towards Fine-Grained Cross-Modal Contrast with False-Negative Reduction and Text-Conditioned Sparse Attention Peng Zhang et.al. 2511.12215 null
2025-11-15 Evaluation of Attention Mechanisms in U-Net Architectures for Semantic Segmentation of Brazilian Rock Art Petroglyphs Leonardi Melo et.al. 2511.11959 null
2025-11-14 Chain-of-Generation: Progressive Latent Diffusion for Text-Guided Molecular Design Lingxiao Li et.al. 2511.11894 null
2025-11-14 Advancing Annotat3D with Harpia: A CUDA-Accelerated Library For Large-Scale Volumetric Data Segmentation Camila Machado de Araujo et.al. 2511.11890 null
2025-11-13 AdaptFly: Prompt-Guided Adaptation of Foundation Models for Low-Altitude UAV Networks Jiao Chen et.al. 2511.11720 null
2025-11-14 Terrain Costmap Generation via Scaled Preference Conditioning Luisa Mao et.al. 2511.11529 null
2025-11-13 Histology-informed tiling of whole tissue sections improves the interpretability and predictability of cancer relapse and genetic alterations Willem Bonnaffé et.al. 2511.10432 null
2025-11-13 Domain Adaptation for Camera-Specific Image Characteristics using Shallow Discriminators Maximiliane Gruber et.al. 2511.10424 null
2025-11-13 DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Semantic Instance Segmentation Xuexun Liu et.al. 2511.10003 null
2025-11-04 Do Street View Imagery and Public Participation GIS align: Comparative Analysis of Urban Attractiveness Milad Malekzadeh et.al. 2511.05570 null
2025-11-03 Compressing Multi-Task Model for Autonomous Driving via Pruning and Knowledge Distillation Jiayuan Wang et.al. 2511.05557 null
2025-11-06 An Active Learning Pipeline for Biomedical Image Instance Segmentation with Minimal Human Intervention Shuo Zhao et.al. 2511.04811 null
2025-11-06 Cambrian-S: Towards Spatial Supersensing in Video Shusheng Yang et.al. 2511.04670 null
2025-11-06 Vitessce Link: A Mixed Reality and 2D Display Hybrid Approach for Visual Analysis of 3D Tissue Maps Eric Mörth et.al. 2511.04262 null
2025-11-06 CaRF: Enhancing Multi-View Consistency in Referring 3D Gaussian Splatting Segmentation Yuwen Tao et.al. 2511.03992 null
2025-11-05 Laugh, Relate, Engage: Stylized Comment Generation for Short Videos Xuan Ouyang et.al. 2511.03757 null
2025-11-05 Computational Imaging Meets LLMs: Zero-Shot IDH Mutation Prediction in Brain Gliomas Syed Muqeem Mahmood et.al. 2511.03376 null
2025-11-05 Enhancing Medical Image Segmentation via Heat Conduction Equation Rong Wu et.al. 2511.03260 null
2025-11-05 Diffusion-Guided Mask-Consistent Paired Mixing for Endoscopic Image Segmentation Pengyu Jie et.al. 2511.03219 null
2025-11-05 Subsampled Randomized Fourier GaLore for Adapting Foundation Models in Depth-Driven Liver Landmark Segmentation Yun-Chen Lin et.al. 2511.03163 null
2025-11-05 Accelerating Physical Property Reasoning for Augmented Visual Cognition Hongbo Lan et.al. 2511.03126 null
2025-11-04 Learning with less: label-efficient land cover classification at very high spatial resolution using self-supervised deep learning Dakota Hester et.al. 2511.03004 null
2025-11-04 Comprehensive Assessment of LiDAR Evaluation Metrics: A Comparative Study Using Simulated and Real Data Syed Mostaquim Ali et.al. 2511.02994 null
2025-11-04 Digital Twin-Driven Pavement Health Monitoring and Maintenance Optimization Using Graph Neural Networks Mohsin Mahmud Topu et.al. 2511.02957 null
2025-11-04 Optimizing the nnU-Net model for brain tumor (Glioma) segmentation Using a BraTS Sub-Saharan Africa (SSA) dataset Chukwuemeka Arua Kalu et.al. 2511.02893 null
2025-11-02 Digitizing Spermatogenesis Lineage at Nanoscale Resolution In Tissue-Level Electron Microscopy Li Xiao et.al. 2511.02860 null
2025-11-04 Densemarks: Learning Canonical Embeddings for Human Heads Images via Point Tracks Dmitrii Pozdeev et.al. 2511.02830 null
2025-11-04 PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction & Editing Antonio Oroz et.al. 2511.02777 null
2025-11-04 Resource-efficient Automatic Refinement of Segmentations via Weak Supervision from Light Feedback Alix de Langlais et.al. 2511.02576 null
2025-11-04 ESA: Energy-Based Shot Assembly Optimization for Automatic Video Editing Yaosen Chen et.al. 2511.02505 null
2025-11-04 Synthetic Crop-Weed Image Generation and its Impact on Model Generalization Garen Boyadjian et.al. 2511.02417 null
2025-11-04 Revisiting put-that-there, context aware window interactions via LLMs Riccardo Bovo et.al. 2511.02378 null
2025-11-04 From Instance Segmentation to 3D Growth Trajectory Reconstruction in Planktonic Foraminifera Huahua Lin et.al. 2511.02142 null
2025-11-03 Terrain-Enhanced Resolution-aware Refinement Attention for Off-Road Segmentation Seongkyu Choi et.al. 2511.01434 null
2025-11-03 MIQ-SAM3D: From Single-Point Prompt to Multi-Instance Segmentation via Competitive Query Refinement Jierui Qu et.al. 2511.01345 null
2025-11-03 Source-Only Cross-Weather LiDAR via Geometry-Aware Point Drop YoungJae Cheong et.al. 2511.01250 null
2025-11-03 CenterMamba-SAM: Center-Prioritized Scanning and Temporal Prototypes for Brain Lesion Segmentation Yu Tian et.al. 2511.01243 null
2025-11-03 An Enhanced Proprioceptive Method for Soft Robots Integrating Bend Sensors and IMUs Dong Heon Han et.al. 2511.01165 null
2025-11-03 MicroAUNet: Boundary-Enhanced Multi-scale Fusion with Knowledge Distillation for Colonoscopy Polyp Image Segmentation Ziyi Wang et.al. 2511.01143 null
2025-11-02 URDF-Anything: Constructing Articulated Objects with 3D Multimodal Language Model Zhe Li et.al. 2511.00940 null
2025-11-02 TA-LSDiff:Topology-Aware Diffusion Guided by a Level Set Energy for Pancreas Segmentation Yue Gou et.al. 2511.00815 null
2025-11-02 Rhythm in the Air: Vision-based Real-Time Music Generation through Gestures Barathi Subramanian et.al. 2511.00793 null
2025-11-02 Class-agnostic 3D Segmentation by Granularity-Consistent Automatic 2D Mask Tracking Juan Wang et.al. 2511.00785 null
2025-11-01 Grounding Surgical Action Triplets with Instrument Instance Segmentation: A Dataset and Target-Aware Fusion Approach Oluwatosin Alabi et.al. 2511.00643 null
2025-11-01 Text-guided Fine-Grained Video Anomaly Detection Jihao Gu et.al. 2511.00524 null
2025-11-01 Optimization of continuous-flow over traffic networks with fundamental diagram constraints Anqi Dong et.al. 2511.00500 null
2025-11-01 HumanCrafter: Synergizing Generalizable Human Reconstruction and Semantic 3D Segmentation Panwang Pan et.al. 2511.00468 null
2025-11-01 Tree Training: Accelerating Agentic LLMs Training via Shared Prefix Reuse Shaojie Wang et.al. 2511.00413 null
2025-10-31 Predicting the spatial distribution and demographics of commercial swine farms in the United States Felipe E. Sanchez et.al. 2511.00132 null
2025-10-29 Habitat and Land Cover Change Detection in Alpine Protected Areas: A Comparison of AI Architectures Harald Kristen et.al. 2511.00073 null
2025-10-31 VessShape: Few-shot 2D blood vessel segmentation by leveraging shape priors from synthetic images Cesar H. Comin et.al. 2510.27646 null
2025-10-31 Context-Gated Cross-Modal Perception with Visual Mamba for PET-CT Lung Tumor Segmentation Elena Mulero Ayllón et.al. 2510.27508 null
2025-10-31 Mask-to-Height: A YOLOv11-Based Architecture for Joint Building Instance Segmentation and Height Classification from Satellite Imagery Mahmoud El Hussieni et.al. 2510.27224 null
2025-10-31 SpecAware: A Spectral-Content Aware Foundation Model for Unifying Multi-Sensor Learning in Hyperspectral Remote Sensing Mapping Renjie Ji et.al. 2510.27219 null
2025-10-31 MLPerf Automotive Radoyeh Shojaei et.al. 2510.27065 null
2025-10-30 AD-SAM: Fine-Tuning the Segment Anything Vision Foundation Model for Autonomous Driving Perception Mario Camarena et.al. 2510.27047 null
2025-10-30 Photometric Redshifts in JWST Deep Fields: A Pixel-Based Alternative with DeepDISC Grant Merz et.al. 2510.27032 null
2025-10-30 Surpassing state of the art on AMD area estimation from RGB fundus images through careful selection of U-Net architectures and loss functions for class imbalance Valentyna Starodub et.al. 2510.26778 null
2025-10-30 Revisiting Generative Infrared and Visible Image Fusion Based on Human Cognitive Laws Lin Guo et.al. 2510.26268 null
2025-10-29 BikeScenes: Online LiDAR Semantic Segmentation for Bicycles Denniz Goren et.al. 2510.25901 null
2025-10-29 StreamingCoT: A Dataset for Temporal Dynamics and Multimodal Chain-of-Thought Reasoning in Streaming VideoQA Yuhang Hu et.al. 2510.25332 null
2025-10-29 LangHOPS: Language Grounded Hierarchical Open-Vocabulary Part Segmentation Yang Miao et.al. 2510.25263 null
2025-10-29 Mapping and Classification of Trees Outside Forests using Deep Learning Moritz Lucas et.al. 2510.25239 null
2025-10-29 Classifier Enhancement Using Extended Context and Domain Experts for Semantic Segmentation Huadong Tang et.al. 2510.25174 null
2025-10-29 EA3D: Online Open-World 3D Object Extraction from Streaming Videos Xiaoyu Zhou et.al. 2510.25146 null
2025-10-29 Region-CAM: Towards Accurate Object Regions in Class Activation Maps for Weakly Supervised Learning Tasks Qingdong Cai et.al. 2510.25134 null
2025-10-28 A Critical Study towards the Detection of Parkinsons Disease using ML Technologies Vivek Chetia et.al. 2510.24456 null
2025-10-28 A Quantitative Evaluation Framework for Explainable AI in Semantic Segmentation Reem Hammoud et.al. 2510.24414 null
2025-10-27 Improving Visual Discriminability of CLIP for Training-Free Open-Vocabulary Semantic Segmentation Jinxin Zhou et.al. 2510.23894 null
2025-10-27 DPGLA: Bridging the Gap between Synthetic and Real Data for Unsupervised Domain Adaptation in 3D LiDAR Semantic Segmentation Wanmeng Li et.al. 2510.23525 null
2025-10-27 One-Timestep is Enough: Achieving High-performance ANN-to-SNN Conversion via Scale-and-Fire Neurons Qiuyang Chen et.al. 2510.23383 null
2025-10-27 Seq-DeepIPC: Sequential Sensing for End-to-End Control in Legged Robot Navigation Oskar Natan et.al. 2510.23057 null
2025-10-26 WaveMAE: Wavelet decomposition Masked Auto-Encoder for Remote Sensing Vittorio Bernuzzi et.al. 2510.22697 null
2025-10-26 A Critical Study on Tea Leaf Disease Detection using Deep Learning Techniques Nabajyoti Borah et.al. 2510.22647 null
2025-10-26 SABlock: Semantic-Aware KV Cache Eviction with Adaptive Compression Block Size Jinhan Chen et.al. 2510.22556 null
2025-10-25 Real-Time Semantic Segmentation on FPGA for Autonomous Vehicles Using LMIINet with the CGRA4ML Framework Amir Mohammad Khadem Hosseini et.al. 2510.22243 null
2025-10-25 Diffusion-Driven Two-Stage Active Learning for Low-Budget Semantic Segmentation Jeongin Kim et.al. 2510.22229 null
2025-10-25 Simplifying Knowledge Transfer in Pretrained Models Siddharth Jain et.al. 2510.22208 null
2025-10-25 Bridging Perception and Reasoning: Dual-Pipeline Neuro-Symbolic Landing for UAVs in Cluttered Environments Weixian Qian et.al. 2510.22204 null
2025-10-24 AURASeg: Attention Guided Upsampling with Residual Boundary-Assistive Refinement for Drivable-Area Segmentation Narendhiran Vijayakumar et.al. 2510.21536 null
2025-10-24 Unveiling the Spatial-temporal Effective Receptive Fields of Spiking Neural Networks Jieyuan Zhang et.al. 2510.21403 null
2025-10-24 Urban 3D Change Detection Using LiDAR Sensor for HD Map Maintenance and Smart Mobility Hezam Albagami et.al. 2510.21112 null
2025-10-24 WaveSeg: Enhancing Segmentation Precision via High-Frequency Prior and Mamba-Driven Spectrum Decomposition Guoan Xu et.al. 2510.21079 null
2025-10-23 ACS-SegNet: An Attention-Based CNN-SegFormer Segmentation Network for Tissue Segmentation in Histopathology Nima Torbati et.al. 2510.20754 null
2025-10-22 Uncertainty evaluation of segmentation models for Earth observation Melanie Rey et.al. 2510.19586 null
2025-10-22 Automated Morphological Analysis of Neurons in Fluorescence Microscopy Using YOLOv8 Banan Alnemri et.al. 2510.19455 null
2025-10-21 ε-Seg: Sparsely Supervised Semantic Segmentation of Microscopy Data Sheida Rahnamai Kordasiabi et.al. 2510.18637 null
2025-10-21 Learning to Navigate Under Imperfect Perception: Conformalised Segmentation for Safe Reinforcement Learning Daniel Bethell et.al. 2510.18485 null
2025-10-21 DART: A Structured Dataset of Regulatory Drug Documents in Italian for Clinical NLP Mariano Barone et.al. 2510.18475 null
2025-10-20 Accelerating Vision Transformers with Adaptive Patch Sizes Rohan Choudhury et.al. 2510.18091 link
2025-10-17 3D Weakly Supervised Semantic Segmentation via Class-Aware and Geometry-Guided Pseudo-Label Refinement Xiaoxu Xu et.al. 2510.17875 null
2025-10-20 4DSegStreamer: Streaming 4D Panoptic Segmentation via Dual Threads Ling Liu et.al. 2510.17664 null
2025-10-20 Expose Camouflage in the Water: Underwater Camouflaged Instance Segmentation and Dataset Chuhong Wang et.al. 2510.17585 null
2025-10-20 M2H: Multi-Task Learning with Efficient Window-Based Cross-Task Attention for Monocular Spatial Perception U. V. B. L Udugama et.al. 2510.17363 null
2025-10-20 Exploring Structural Degradation in Dense Representations for Self-supervised Learning Siran Dai et.al. 2510.17299 null
2025-10-19 ArmFormer: Lightweight Transformer Architecture for Real-Time Multi-Class Weapon Segmentation and Classification Akhila Kambhatla et.al. 2510.16854 null
2025-10-19 Needles in the Landscape: Semi-Supervised Pseudolabeling for Archaeological Site Discovery under Label Scarcity Simon Jaxy et.al. 2510.16814 null
2025-10-19 An Efficient Semantic Segmentation Decoder for In-Car or Distributed Applications Danish Nazir et.al. 2510.16747 null
2025-10-19 UKANFormer: Noise-Robust Semantic Segmentation for Coral Reef Mapping via a Kolmogorov-Arnold Network-Transformer Hybrid Tianyang Dou et.al. 2510.16730 null
2025-10-18 Self-Supervised Learning to Fly using Efficient Semantic Segmentation and Metric Depth Estimation for Low-Cost Autonomous UAVs Sebastian Mocanu et.al. 2510.16624 null
2025-10-18 Cataract-LMM: Large-Scale, Multi-Source, Multi-Task Benchmark for Deep Learning in Surgical Video Analysis Mohammad Javad Ahmadi et.al. 2510.16371 null
2025-10-17 Neuro-Symbolic Spatial Reasoning in Segmentation Jiayi Lin et.al. 2510.15841 null
2025-10-17 Semantic segmentation with coarse annotations Jort de Jong et.al. 2510.15756 null
2025-10-17 Semantic4Safety: Causal Insights from Zero-shot Street View Imagery Segmentation for Urban Road Safety Huan Chen et.al. 2510.15434 null
2025-10-17 MARIS: Marine Open-Vocabulary Instance Segmentation with Geometric Enhancement and Semantic Alignment Bingyu Li et.al. 2510.15398 null
2025-10-17 TranSimHub: A Unified Air-Ground Simulation Platform for Multi-Modal Perception and Decision-Making Maonan Wang et.al. 2510.15365 null
2025-10-17 RankSEG-RMA: An Efficient Segmentation Algorithm via Reciprocal Moment Approximation Zixun Wang et.al. 2510.15362 null
2025-10-17 Symmetric Entropy-Constrained Video Coding for Machines Yuxiao Sun et.al. 2510.15347 null
2025-10-16 Comprehensive language-image pre-training for 3D medical image understanding Tassilo Wald et.al. 2510.15042 null
2025-10-16 MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning Mattia Segu et.al. 2510.15026 null
2025-10-16 Multi-modal video data-pipelines for machine learning with minimal human supervision Mihai-Cristian Pîrvu et.al. 2510.14862 null
2025-10-15 PoissonNet: A Local-Global Approach for Learning on Surfaces Arman Maesumi et.al. 2510.14146 null
2025-10-15 Multi-Scale High-Resolution Logarithmic Grapher Module for Efficient Vision GNNs Mustafa Munir et.al. 2510.13740 null
2025-10-15 Dedelayed: Deleting remote inference delay via on-device correction Dan Jacobellis et.al. 2510.13714 null
2025-10-15 Novel Class Discovery for Point Cloud Segmentation via Joint Learning of Causal Representation and Reasoning Yang Li et.al. 2510.13307 null
2025-10-15 FlyAwareV2: A Multimodal Cross-Domain UAV Dataset for Urban Scene Understanding Francesco Barbato et.al. 2510.13243 null
2025-10-14 SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding Zhiliu Yang et.al. 2510.12749 null
2025-10-14 Multiplicative Loss for Enhancing Semantic Segmentation in Medical and Cellular Images Yuto Yokoi et.al. 2510.12258 null
2025-10-14 BEEP3D: Box-Supervised End-to-End Pseudo-Mask Generation for 3D Instance Segmentation Youngju Yoo et.al. 2510.12182 null
2025-10-13 A Framework for Low-Effort Training Data Generation for Urban Semantic Segmentation Denis Zavadski et.al. 2510.11567 null
2025-10-13 Building and Evaluating a Realistic Virtual World for Large Scale Urban Exploration from 360° Videos Mizuki Takenawa et.al. 2510.11447 null
2025-10-13 Uncertainty-Aware ControlNet: Bridging Domain Gaps with Synthetic Image Generation Joshua Niemeijer et.al. 2510.11346 null
2025-10-12 DAGLFNet: Deep Attention-Guided Global-Local Feature Fusion for Pseudo-Image Point Cloud Segmentation Chuang Chen et.al. 2510.10471 null
2025-10-11 MRI Brain Tumor Detection with Computer Vision Jack Krolik et.al. 2510.10250 null
2025-10-11 SparseUWSeg: Active Sparse Point-Label Augmentation for Underwater Semantic Segmentation César Borja et.al. 2510.10163 null
2025-10-11 An Unsupervised Time Series Anomaly Detection Approach for Efficient Online Process Monitoring of Additive Manufacturing Frida Cantu et.al. 2510.09977 null
2025-10-10 Cell Instance Segmentation: The Devil Is in the Boundaries Peixian Liang et.al. 2510.09848 null
2025-10-10 A methodology for clinically driven interactive segmentation evaluation Parhom Esmaeili et.al. 2510.09499 null
2025-10-10 SilvaScenes: Tree Segmentation and Species Classification from Under-Canopy Images in Natural Forests David-Alexandre Duclos et.al. 2510.09458 null
2025-10-10 Instance-Aware Robust Consistency Regularization for Semi-Supervised Nuclei Instance Segmentation Zenan Lin et.al. 2510.09329 null
2025-10-10 SOS: Synthetic Object Segments Improve Detection, Segmentation, and Grounding Weikai Huang et.al. 2510.09110 null
2025-10-10 Exploring Single Domain Generalization of LiDAR-based Semantic Segmentation under Imperfect Labels Weitong Kong et.al. 2510.09035 null
2025-10-10 Pinpointing crucial steps: Attribution-based Credit Assignment for Verifiable Reinforcement Learning Junxi Yin et.al. 2510.08899 null
2025-10-09 FOLK: Fast Open-Vocabulary 3D Instance Segmentation via Label-guided Knowledge Distillation Hongrui Wu et.al. 2510.08849 null
2025-10-08 Out-of-Distribution Detection in LiDAR Semantic Segmentation Using Epistemic Uncertainty from Hierarchical GMMs Hanieh Shojaei Miandashti et.al. 2510.08631 null
2025-10-08 HARP-NeXt: High-Speed and Accurate Range-Point Fusion Network for 3D LiDAR Semantic Segmentation Samir Abou Haidar et.al. 2510.06876 null
2025-10-08 Semantic Segmentation Algorithm Based on Light Field and LiDAR Fusion Jie Luo et.al. 2510.06687 null
2025-10-08 Through the Perspective of LiDAR: A Feature-Enriched and Uncertainty-Aware Annotation Pipeline for Terrestrial Point Cloud Segmentation Fei Zhang et.al. 2510.06582 null
2025-10-07 Dropping the D: RGB-D SLAM Without the Depth Sensor Mert Kiray et.al. 2510.06216 link
2025-10-07 Overlap-aware segmentation for topological reconstruction of obscured objects J. Schueler et.al. 2510.06194 null
2025-10-07 Shaken or Stirred? An Analysis of MetaFormer’s Token Mixing for Medical Imaging Ron Keuth et.al. 2510.05971 null
2025-10-07 ALISE: Annotation-Free LiDAR Instance Segmentation for Autonomous Driving Yongxuan Lyu et.al. 2510.05752 null
2025-07-25 Co-Win: Joint Object Detection and Instance Segmentation in LiDAR Point Clouds via Collaborative Window Processing Haichuan Li et.al. 2507.19691 null
2025-07-25 SurgPIS: Surgical-instrument-level Instances and Part-level Semantics for Weakly-supervised Part-aware Instance Segmentation Meng Wei et.al. 2507.19592 null
2025-07-24 HybridTM: Combining Transformer and Mamba for 3D Semantic Segmentation Xinyu Wang et.al. 2507.18575 null
2025-07-24 Synthetic Data Augmentation for Enhanced Chicken Carcass Instance Segmentation Yihong Feng et.al. 2507.18558 null
2025-07-24 Iwin Transformer: Hierarchical Vision Transformer using Interleaved Windows Simin Huo et.al. 2507.18405 link
2025-07-24 GVCCS: A Dataset for Contrail Identification and Tracking on Visible Whole Sky Camera Sequences Gabriel Jarry et.al. 2507.18330 null
2025-07-24 SemiSegECG: A Multi-Dataset Benchmark for Semi-Supervised Semantic Segmentation in ECG Delineation Minje Park et.al. 2507.18323 link
2025-07-24 Unsupervised Domain Adaptation for 3D LiDAR Semantic Segmentation Using Contrastive Learning and Multi-Model Pseudo Labeling Abhishek Kaushik et.al. 2507.18176 null
2025-07-23 AFRDA: Attentive Feature Refinement for Domain Adaptive Semantic Segmentation Md. Al-Masrur Khan et.al. 2507.17957 link
2025-07-23 Exploring Spatial Diversity for Region-based Active Learning Lile Cai et.al. 2507.17367 null
2025-07-23 Exploring Active Learning for Semiconductor Defect Segmentation Lile Cai et.al. 2507.17359 null
2025-07-23 Swin-TUNA: A Novel PEFT Approach for Accurate Food Image Segmentation Haotian Chen et.al. 2507.17347 null
2025-07-23 On Temporal Guidance and Iterative Refinement in Audio Source Separation Tobias Morocutti et.al. 2507.17297 null
2025-07-23 ScSAM: Debiasing Morphology and Distributional Variability in Subcellular Semantic Segmentation Bo Fang et.al. 2507.17149 null
2025-07-22 MultiTaskDeltaNet: Change Detection-based Image Segmentation for Operando ETEM with Application to Carbon Gasification Kinetics Yushuo Niu et.al. 2507.16803 null
2025-07-22 A2Mamba: Attention-augmented State Space Models for Visual Recognition Meng Lou et.al. 2507.16624 link
2025-07-22 Semantic Segmentation for Preoperative Planning in Transcatheter Aortic Valve Replacement Cedric Zöllner et.al. 2507.16573 null
2025-07-22 Comparative validation of surgical phase recognition, instrument keypoint estimation, and instrument instance segmentation in endoscopy: Results of the PhaKIR 2024 challenge Tobias Rueckert et.al. 2507.16559 null
2025-07-23 EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion Shang Liu et.al. 2507.16535 null
2025-07-22 Advancing Visual Large Language Model for Multi-granular Versatile Perception Wentao Xiang et.al. 2507.16213 null
2025-07-22 AMMNet: An Asymmetric Multi-Modal Network for Remote Sensing Semantic Segmentation Hui Ye et.al. 2507.16158 null
2025-07-21 Improved Semantic Segmentation from Ultra-Low-Resolution RGB Images Applied to Privacy-Preserving Object-Goal Navigation Xuying Huang et.al. 2507.16034 null
2025-07-21 ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction Danhui Chen et.al. 2507.15803 null
2025-07-21 ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting Ruijie Zhu et.al. 2507.15454 link
2025-07-21 Rethinking Occlusion in FER: A Semantic-Aware Perspective and Go Beyond Huiyu Zhai et.al. 2507.15401 null
2025-07-20 Towards Geometric and Textural Consistency 3D Scene Generation via Single Image-guided Model Generation and Layout Optimization Xiang Tang et.al. 2507.14841 null
2025-07-20 A Novel Downsampling Strategy Based on Information Complementarity for Medical Image Segmentation Wenbo Yue et.al. 2507.14790 null
2025-07-19 GTPBD: A Fine-Grained Global Terraced Parcel and Boundary Dataset Zhiwei Zhang et.al. 2507.14697 null
2025-07-19 Artificial Intelligence in the Food Industry: Food Waste Estimation based on Computer Vision, a Brief Case Study in a University Dining Hall Shayan Rokhva et.al. 2507.14662 null
2025-07-19 Multispectral State-Space Feature Fusion: Bridging Shared and Cross-Parametric Interactions for Object Detection Jifeng Shen et.al. 2507.14643 null
2025-07-19 DiSCO-3D: Discovering and segmenting Sub-Concepts from Open-vocabulary queries in NeRF Doriand Petit et.al. 2507.14596 null
2025-07-18 Semantic Segmentation based Scene Understanding in Autonomous Vehicles Ehsan Rassekh et.al. 2507.14303 null
2025-07-18 Leveraging Pathology Foundation Models for Panoptic Segmentation of Melanoma in H&E Images Jiaqi Lv et.al. 2507.13974 null
2025-07-17 SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation Shiqi Huang et.al. 2507.12857 null
2025-07-17 A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique Homare Sueyoshi et.al. 2507.12730 null
2025-07-16 VolSegGS: Segmentation and Tracking in Dynamic Volumetric Scenes via Deformable 3D Gaussians Siyuan Yao et.al. 2507.12667 null
2025-07-16 NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting Kuangshi Ai et.al. 2507.12621 null
2025-07-16 Out-of-distribution data supervision towards biomedical semantic segmentation Yiquan Gao et.al. 2507.12105 null
2025-07-16 Tree-SLAM: semantic object SLAM for efficient mapping of individual trees in orchards David Rapado-Rincon et.al. 2507.12093 null
2025-07-16 Frequency-Dynamic Attention Modulation for Dense Prediction Linwei Chen et.al. 2507.12006 null
2025-07-16 SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation Jun Yin et.al. 2507.11994 null
2025-07-16 Prototypical Progressive Alignment and Reweighting for Generalizable Semantic Segmentation Yuhang Zhang et.al. 2507.11955 null
2025-07-16 Spatial Frequency Modulation for Semantic Segmentation Linwei Chen et.al. 2507.11893 link
2025-07-15 SToFM: a Multi-scale Foundation Model for Spatial Transcriptomics Suyuan Zhao et.al. 2507.11588 null
2025-07-15 Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping Yujie Zhang et.al. 2507.11279 null
2025-07-15 Personalized OVSS: Understanding Personal Concept in Open-Vocabulary Semantic Segmentation Sunghyun Park et.al. 2507.11030 null
2025-07-15 Graph Aggregation Prototype Learning for Semantic Change Detection in Remote Sensing Zhengyi Xu et.al. 2507.10938 null
2025-07-14 Static or Temporal? Semantic Scene Simplification to Aid Wayfinding in Immersive Simulations of Bionic Vision Justin M. Kasowski et.al. 2507.10813 null
2025-07-14 rt-RISeg: Real-Time Model-Free Robot Interactive Segmentation for Active Instance-Level Object Understanding Howard H. Qian et.al. 2507.10776 null
2025-07-14 FGSSNet: Feature-Guided Semantic Segmentation of Real World Floorplans Hugo Norrby et.al. 2507.10343 null
2025-07-14 Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks Ben Hamscher et.al. 2507.10239 null
2025-07-14 Spatial Lifting for Dense Prediction Mingzhi Xu et.al. 2507.10222 null
2025-07-14 DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation Ivan Martinović et.al. 2507.10118 null
2025-07-13 MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression Ofir Gordon et.al. 2507.09616 null
2025-07-13 Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive Segmentation You Huang et.al. 2507.09612 null
2025-07-13 SegVec3D: A Method for Vector Embedding of 3D Objects Oriented Towards Robot manipulation Zhihan Kang et.al. 2507.09459 null
2025-07-11 Multimodal HD Mapping for Intersections by Intelligent Roadside Units Zhongzhang Chen et.al. 2507.08903 null
2025-07-11 Image Translation with Kernel Prediction Networks for Semantic Segmentation Cristina Mata et.al. 2507.08554 null
2025-07-11 From Enhancement to Understanding: Build a Generalized Bridge for Low-light Vision via Semantically Consistent Unsupervised Fine-tuning Sen Wang et.al. 2507.08380 null
2025-07-11 SurfDist: Interpretable Three-Dimensional Instance Segmentation Using Curved Surface Patches Jackson Borchardt et.al. 2507.08223 null
2025-07-10 RAPS-3D: Efficient interactive segmentation for 3D radiological imaging Théo Danielou et.al. 2507.07730 null
2025-07-10 LOSC: LiDAR Open-voc Segmentation Consolidator Nermin Samet et.al. 2507.07605 null
2025-07-10 Diffusion-Guided Knowledge Distillation for Weakly-Supervised Low-Light Semantic Segmentation Chunyan Wang et.al. 2507.07578 null
2025-07-10 Seg-Wild: Interactive Segmentation based on 3D Gaussian Splatting for Unconstrained Image Collections Yongtang Bao et.al. 2507.07395 null
2025-07-08 CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings Cristina Mata et.al. 2507.07125 null
2025-07-09 A multi-modal dataset for insect biodiversity with imagery and DNA at the trap and individual level Johanna Orsholm et.al. 2507.06972 null
2025-07-09 SemRaFiner: Panoptic Segmentation in Sparse and Noisy Radar Point Clouds Matthias Zeller et.al. 2507.06906 null
2025-07-09 Know Your Attention Maps: Class-specific Token Masking for Weakly Supervised Semantic Segmentation Joelle Hanna et.al. 2507.06848 null
2025-07-09 Ambiguity-aware Point Cloud Segmentation by Adaptive Margin Contrastive Learning Yang Chen et.al. 2507.06592 null
2025-07-08 Centralized Copy-Paste: Enhanced Data Augmentation Strategy for Wildland Fire Semantic Segmentation Joon Tai Kim et.al. 2507.06321 null
2025-07-08 FineGrasp: Towards Robust Grasping for Delicate Objects Yun Du et.al. 2507.05978 null
2025-07-08 Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation Quanzhu Niu et.al. 2507.05948 link
2025-07-08 I$^2$R: Inter and Intra-image Refinement in Few Shot Segmentation Ourui Fu et.al. 2507.05838 null
2025-07-09 Empowering Bridge Digital Twins by Bridging the Data Gap with a Unified Synthesis Framework Wang Wang et.al. 2507.05814 null
2025-07-08 SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning Xin Hu et.al. 2507.05798 null
2025-07-08 DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation Young Hun Kim et.al. 2507.05627 null
2025-07-07 OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts Shiting Xiao et.al. 2507.05427 null
2025-07-07 Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations Xiang Xu et.al. 2507.05260 null
2025-07-07 All in One: Visual-Description-Guided Unified Point Cloud Segmentation Zongyan Han et.al. 2507.05211 null
2025-07-07 RAM-W600: A Multi-Task Wrist Dataset and Benchmark for Rheumatoid Arthritis Songxiao Yang et.al. 2507.05193 null
2025-07-07 MOSU: Autonomous Long-range Robot Navigation with Multi-modal Scene Understanding Jing Liang et.al. 2507.04686 null
2025-07-06 Street design and driving behavior: evidence from a large-scale study in Milan, Amsterdam, and Dubai Giacomo Orsi et.al. 2507.04434 null
2025-07-06 CLIP-RL: Surgical Scene Segmentation Using Contrastive Language-Vision Pretraining & Reinforcement Learning Fatmaelzahraa Ali Ahmed et.al. 2507.04317 null
2025-07-06 Surg-SegFormer: A Dual Transformer-Based Model for Holistic Surgical Scene Segmentation Fatimaelzahraa Ahmed et.al. 2507.04304 null
2025-07-05 Differentiable High-Performance Ray Tracing-Based Simulation of Radio Propagation with Point Clouds Niklas Vaara et.al. 2507.04021 null
2025-07-05 NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World Models Siyu Li et.al. 2507.04002 null
2025-07-05 CoT-Segmenter: Enhancing OOD Detection in Dense Road Scenes via Chain-of-Thought Reasoning Jeonghyo Song et.al. 2507.03984 null
2025-07-03 LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion Fangfu Liu et.al. 2507.02813 link
2025-07-03 No time to train! Training-Free Reference-Based Instance Segmentation Miguel Espinosa et.al. 2507.02798 link
2025-07-03 From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images Danrong Zhang et.al. 2507.02781 null
2025-07-03 MedFormer: Hierarchical Medical Vision Transformer with Content-Aware Dual Sparse Selection Attention Zunhui Xia et.al. 2507.02488 null
2025-07-03 Continual Multiple Instance Learning with Enhanced Localization for Histopathological Whole Slide Image Analysis Byung Hyun Lee et.al. 2507.02395 null
2025-07-03 Perception Activator: An intuitive and portable framework for brain cognitive exploration Le Xu et.al. 2507.02311 null
2025-07-02 How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks Rahul Ramachandran et.al. 2507.01955 link
2025-07-02 3D Reconstruction and Information Fusion between Dormant and Canopy Seasons in Commercial Orchards Using Deep Learning and Fast GICP Ranjan Sapkota et.al. 2507.01912 null
2025-07-02 A Gift from the Integration of Discriminative and Diffusion-based Generative Learning: Boundary Refinement Remote Sensing Semantic Segmentation Hao Wang et.al. 2507.01573 null
2025-07-02 NOCTIS: Novel Object Cyclic Threshold based Instance Segmentation Max Gandyra et.al. 2507.01463 null
2025-07-01 Towards Open-World Human Action Segmentation Using Graph Convolutional Networks Hao Xing et.al. 2507.00756 null
2025-07-01 Rectifying Magnitude Neglect in Linear Attention Qihang Fan et.al. 2507.00698 link
2025-07-02 ExPaMoE: An Expandable Parallel Mixture of Experts for Continual Test-Time Adaptation JianChao Zhao et.al. 2507.00502 null
2025-07-01 Process-aware and high-fidelity microstructure generation using stable diffusion Hoang Cuong Phan et.al. 2507.00459 null
2025-07-01 PlantSegNeRF: A few-shot, cross-dataset method for plant 3D instance point cloud reconstruction via joint-channel NeRF with multi-view image instance matching Xin Yang et.al. 2507.00371 null
2025-06-30 SurgiSR4K: A High-Resolution Endoscopic Video Dataset for Robotic-Assisted Minimally Invasive Procedures Fengyi Jiang et.al. 2507.00209 null
2025-06-30 Controllable Reference-Based Real-World Remote Sensing Image Super-Resolution with Generative Diffusion Priors Ce Wang et.al. 2506.23801 null
2025-06-30 Deep Learning-Based Semantic Segmentation for Real-Time Kidney Imaging and Measurements with Augmented Reality-Assisted Ultrasound Gijs Luijten et.al. 2506.23721 null
2025-06-30 PGOV3D: Open-Vocabulary 3D Semantic Segmentation with Partial-to-Global Curriculum Shiqi Zhang et.al. 2506.23607 null
2025-06-30 Interactive Interface For Semantic Segmentation Dataset Synthesis Ngoc-Do Tran et.al. 2506.23470 null
2025-06-30 Contrastive Learning with Diffusion Features for Weakly Supervised Medical Image Segmentation Dewen Zeng et.al. 2506.23460 null
2025-06-29 Layer Decomposition and Morphological Reconstruction for Task-Oriented Infrared Image Enhancement Siyuan Chai et.al. 2506.23353 null
2025-06-29 FastSeg: Efficient Training-Free Open-Vocabulary Segmentation via Hierarchical Attention Refinement Method Quang-Huy Che et.al. 2506.23323 null
2025-06-29 BPD-Neo: An MRI Dataset for Lung-Trachea Segmentation with Clinical Data for Neonatal Bronchopulmonary Dysplasia Rachit Saluja et.al. 2506.23305 null
2025-06-29 High-quality Pseudo-labeling for Point Cloud Segmentation with Scene-level Annotation Lunhao Duan et.al. 2506.23227 null
2025-06-29 DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation Jihun Kim et.al. 2506.23104 null
2025-06-27 Partial CLIP is Enough: Chimera-Seg for Zero-shot Semantic Segmentation Jialei Chen et.al. 2506.22032 null
2025-06-27 TASeg: Text-aware RGB-T Semantic Segmentation based on Fine-tuning Vision Foundation Models Meng Yu et.al. 2506.21975 null
2025-06-27 SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images Naftaly Wambugu et.al. 2506.21945 null
2025-06-26 Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object Detection Tobias J. Riedlinger et.al. 2506.21486 null
2025-06-26 PanSt3R: Multi-view Consistent Panoptic Segmentation Lojze Zust et.al. 2506.21348 null
2025-06-26 HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation Diego Biagini et.al. 2506.21287 null
2025-06-27 ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation Xiwei Xuan et.al. 2506.21233 null
2025-06-26 Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4 Jongyeon Park et.al. 2506.21174 null
2025-06-27 DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation Wenzhou Lyu et.al. 2506.21034 null
2025-06-26 TSDASeg: A Two-Stage Model with Direct Alignment for Interactive Point Cloud Segmentation Chade Li et.al. 2506.20991 null
2025-06-26 Segment Anything in Pathology Images with Natural Language Zhixuan Chen et.al. 2506.20988 null
2025-06-25 A Deep Learning Approach to Identify Rock Bolts in Complex 3D Point Clouds of Underground Mines Captured Using Mobile Laser Scanners Dibyayan Patra et.al. 2506.20464 null
2025-06-26 Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition Man Duc Chuc et.al. 2506.20174 null
2025-06-24 A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects Shulan Ruan et.al. 2506.19769 null
2025-06-24 USIS16K: High-Quality Dataset for Underwater Salient Instance Segmentation Lin Hong et.al. 2506.19472 null
2025-06-24 A Global-Local Cross-Attention Network for Ultra-high Resolution Remote Sensing Image Semantic Segmentation Chen Yi et.al. 2506.19406 null
2025-06-25 AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation Ziyan Zhao et.al. 2506.19269 null
2025-06-23 Orthogonal Projection Subspace to Aggregate Online Prior-knowledge for Continual Test-time Adaptation Jinlong Li et.al. 2506.19022 null
2025-06-23 Multi-Scale Spectral Attention Module-based Hyperspectral Segmentation in Autonomous Driving Scenarios Imad Ali Shah et.al. 2506.18682 null
2025-06-23 SafeClick: Error-Tolerant Interactive Segmentation of Any Medical Volumes via Hierarchical Expert Consensus Yifan Gao et.al. 2506.18404 null
2025-06-23 Jet Reconstruction with Mamba Networks in Collider Events Jinmian Li et.al. 2506.18336 null
2025-06-22 OSDMamba: Enhancing Oil Spill Detection from Remote Sensing Images Using Selective State Space Model Shuaiyu Chen et.al. 2506.18006 null
2025-06-22 Relation3D: Enhancing Relation Modeling for Point Cloud Instance Segmentation Jiahao Lu et.al. 2506.17891 null
2025-06-22 Cross-modal State Space Modeling for Real-time RGB-thermal Wild Scene Semantic Segmentation Xiaodong Guo et.al. 2506.17869 null
2025-06-20 Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation Qing Xu et.al. 2506.17159 link
2025-06-20 ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds Binbin Xiang et.al. 2506.16991 link
2025-06-20 LunarLoc: Segment-Based Global Localization on the Moon Annika Thomas et.al. 2506.16940 link
2025-06-19 From Semantic To Instance: A Semi-Self-Supervised Learning Approach Keyhan Najafian et.al. 2506.16563 null
2025-06-19 Structured Semantic 3D Reconstruction (S23DR) Challenge 2025 – Winning solution Jan Skvrna et.al. 2506.16421 null
2025-06-19 LBMamba: Locally Bi-directional Mamba Jingwei Zhang et.al. 2506.15976 link
2025-06-19 Heterogeneous-Modal Unsupervised Domain Adaptation via Latent Space Bridging Jiawen Yang et.al. 2506.15971 null
2025-06-19 Polyline Path Masked Attention for Vision Transformer Zhongchen Zhao et.al. 2506.15940 link
2025-06-18 MapFM: Foundation Model-Driven HD Mapping with Multi-Task Contextual Learning Leonid Ivanov et.al. 2506.15313 link
2025-06-18 Enhancing point cloud analysis via neighbor aggregation correction based on cross-stage structure correlation Jiaqi Shi et.al. 2506.15160 link
2025-06-17 Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset Nikolaos Dionelis et.al. 2506.14765 null
2025-06-17 FocalClick-XL: Towards Unified and High-quality Interactive Segmentation Xi Chen et.al. 2506.14686 null
2025-06-17 VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy Zhuoyue Tan et.al. 2506.14525 null
2025-06-17 DepthSeg: Depth prompting in remote sensing semantic segmentation Ning Zhou et.al. 2506.14382 null
2025-06-17 Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment Weiming Zhang et.al. 2506.14271 null
2025-06-16 HierVL: Semi-Supervised Segmentation leveraging Hierarchical Vision-Language Synergy with Dynamic Text-Spatial Query Alignment Numair Nadeem et.al. 2506.13925 null
2025-06-16 A Comprehensive Survey on Video Scene Parsing: Advances, Challenges, and Prospects Guohuan Xie et.al. 2506.13552 null
2025-06-16 Open-Set LiDAR Panoptic Segmentation Guided by Uncertainty-Aware Learning Rohit Mohan et.al. 2506.13265 null
2025-06-16 ViewPCL: a point cloud based active learning method for multi-view segmentation Christian Hilaire et.al. 2506.13043 null
2025-06-15 A large-scale, physically-based synthetic dataset for satellite pose estimation Szabolcs Velkei et.al. 2506.12782 null
2025-06-15 Unleashing Diffusion and State Space Models for Medical Image Segmentation Rong Wu et.al. 2506.12747 null
2025-06-15 Combining Self-attention and Dilation Convolutional for Semantic Segmentation of Coal Maceral Groups Zhenghao Xi et.al. 2506.12712 null
2025-06-13 O2Former: Direction-Aware and Multi-Scale Query Enhancement for SAR Ship Instance Segmentation F. Gao et.al. 2506.11913 null
2025-06-13 Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling Yunhan Ren et.al. 2506.11661 null
2025-06-13 A$^2$LC: Active and Automated Label Correction for Semantic Segmentation Youjin Jeon et.al. 2506.11599 null
2025-06-13 OV-MAP: Open-Vocabulary Zero-Shot 3D Instance Segmentation Map for Robots Juno Kim et.al. 2506.11585 null
2025-06-12 GynSurg: A Comprehensive Gynecology Laparoscopic Surgery Dataset Sahar Nasirihaghighi et.al. 2506.11356 null
2025-06-12 Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes Masahiro Yasuda et.al. 2506.10676 link
2025-06-12 Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models Francisco Caetano et.al. 2506.10634 link
2025-06-12 Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration Jun Wang et.al. 2506.10573 null
2025-06-12 ALBERT: Advanced Localization and Bidirectional Encoder Representations from Transformers for Automotive Damage Evaluation Teerapong Panboonyuen et.al. 2506.10524 null
2025-06-12 Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation Shuyang Li et.al. 2506.10503 null
2025-06-12 Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success Che Wang et.al. 2506.10359 null
2025-06-11 Deep Semantic Segmentation for Multi-Source Localization Using Angle of Arrival Measurements Mustafa Atahan Nuhoglu et.al. 2506.10107 null
2025-06-11 Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation Siyu Chen et.al. 2506.09881 link
2025-06-11 Accurate and efficient zero-shot 6D pose estimation with frozen foundation models Andrea Caraffa et.al. 2506.09784 null
2025-06-11 The Four Color Theorem for Cell Instance Segmentation Ye Zhang et.al. 2506.09724 link
2025-06-11 Enhancing Human-Robot Collaboration: A Sim2Real Domain Adaptation Algorithm for Point Cloud Segmentation in Industrial Environments Fatemeh Mohammadi Amin et.al. 2506.09552 null
2025-06-12 Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20$^{th}$ century Urban Landscapes with Satellite Imageries Tianxiang Hao et.al. 2506.09476 null
2025-06-11 MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning Tong Wang et.al. 2506.09327 null
2025-06-10 WetCat: Automating Skill Assessment in Wetlab Cataract Surgery Videos Negin Ghamsarian et.al. 2506.08896 null
2025-06-11 RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation Jiayi Song et.al. 2506.08772 null
2025-06-10 ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction Juan Yeo et.al. 2506.08678 null
2025-06-10 ECMNet: Lightweight Semantic Segmentation with Efficient CNN-Mamba Network Feixiang Du et.al. 2506.08629 null
2025-06-09 LogoSP: Local-global Grouping of Superpoints for Unsupervised Semantic Segmentation of 3D Point Clouds Zihui Zhang et.al. 2506.07857 null
2025-06-09 SAM2Auto: Auto Annotation Using FLASH Arash Rocky et.al. 2506.07850 null
2025-06-09 F2Net: A Frequency-Fused Network for Ultra-High Resolution Remote Sensing Segmentation Hengzhi Chen et.al. 2506.07847 null
2025-06-09 Trend-Aware Fashion Recommendation with Visual Segmentation and Semantic Similarity Mohamed Djilani et.al. 2506.07773 null
2025-06-09 OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting Jens Piekenbrinck et.al. 2506.07697 null
2025-06-09 Adapter Naturally Serves as Decoupler for Cross-Domain Few-Shot Semantic Segmentation Jintao Tong et.al. 2506.07376 null
2025-06-09 Multiple Object Stitching for Unsupervised Representation Learning Chengchao Shen et.al. 2506.07364 link
2025-06-08 BRIGHT+: Upgrading the BRIGHT Benchmark with MARCUS, a Multi-Agent RAG Clean-Up Suite Liyang Chen et.al. 2506.07116 null
2025-06-08 Technical Report for ICRA 2025 GOOSE 3D Semantic Segmentation Challenge: Adaptive Point Cloud Understanding for Heterogeneous Robotic Systems Xiaoya Zhang et.al. 2506.06995 null
2025-06-07 Position Prediction Self-Supervised Learning for Multimodal Satellite Imagery Semantic Segmentation John Waithaka et.al. 2506.06852 null
2025-06-06 Rethinking Semi-supervised Segmentation Beyond Accuracy: Reliability and Robustness Steven Landgraf et.al. 2506.05917 null
2025-06-06 You Only Estimate Once: Unified, One-stage, Real-Time Category-level Articulated Object 6D Pose Estimation for Robotic Grasping Jingshun Huang et.al. 2506.05719 null
2025-06-05 FRAME: Pre-Training Video Feature Representations via Anticipation and Memory Sethuraman TV et.al. 2506.05543 null
2025-06-05 U-NetMN and SegNetMN: Modified U-Net and SegNet models for bimodal SAR image segmentation Marwane Kzadri et.al. 2506.05444 null
2025-06-05 Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting Alfred T. Christiansen et.al. 2506.05009 null
2025-06-05 Bringing SAM to new heights: Leveraging elevation data for tree crown segmentation from drone imagery Mélisande Teng et.al. 2506.04970 null
2025-06-05 CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx Lukas Picek et.al. 2506.04931 null
2025-06-05 OpenMaskDINO3D: Reasoning 3D Segmentation via Large Language Model Kunshen Zhang et.al. 2506.04837 null
2025-06-05 Gen-n-Val: Agentic Image Data Generation and Validation Jing-En Huang et.al. 2506.04676 null
2025-06-04 You Only Train Once Christos Sakaridis et.al. 2506.04349 null
2025-06-04 AetherVision-Bench: An Open-Vocabulary RGB-Infrared Benchmark for Multi-Angle Segmentation across Aerial and Ground Perspectives Aniruddh Sikdar et.al. 2506.03709 null
2025-06-04 OV-COAST: Cost Aggregation with Optimal Transport for Open-Vocabulary Semantic Segmentation Aditya Gandhamal et.al. 2506.03706 null
2025-06-04 BiXFormer: A Robust Framework for Maximizing Modality Effectiveness in Multi-Modal Semantic Segmentation Jialei Chen et.al. 2506.03675 null
2025-06-03 Cross-Modal Urban Sensing: Evaluating Sound-Vision Alignment Across Street-Level and Aerial Imagery Pengyu Chen et.al. 2506.03388 null
2025-06-03 Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding Weiqing Xiao et.al. 2506.03134 null
2025-06-03 GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal Shufan Qing et.al. 2506.02736 link
2025-06-03 Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather Longyu Yang et.al. 2506.02396 null
2025-06-04 SAB3R: Semantic-Augmented Backbone in 3D Reconstruction Xuweiyi Chen et.al. 2506.02112 null
2025-06-02 SEMNAV: A Semantic Segmentation-Driven Approach to Visual Semantic Navigation Rafael Flor-Rodríguez et.al. 2506.01418 null
2025-06-01 Perceptual Inductive Bias Is What You Need Before Contrastive Learning Tianqin Li et.al. 2506.01201 null
2025-06-01 GeoChain: Multimodal Chain-of-Thought for Geographic Reasoning Sahiti Yerramilli et.al. 2506.00785 null
2025-05-31 BAGNet: A Boundary-Aware Graph Attention Network for 3D Point Cloud Semantic Segmentation Wei Tao et.al. 2506.00475 null
2025-05-30 Bi-Manual Joint Camera Calibration and Scene Representation Haozhan Tang et.al. 2505.24819 null
2025-06-02 NUC-Net: Non-uniform Cylindrical Partition Network for Efficient LiDAR Semantic Segmentation Xuzhi Wang et.al. 2505.24634 null
2025-05-30 SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds Cheng Zeng et.al. 2505.24475 null
2025-05-30 Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation Roger Ferrod et.al. 2505.24361 null
2025-05-30 Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors Peiran Xu et.al. 2505.24103 null
2025-05-29 MaskAdapt: Unsupervised Geometry-Aware Domain Adaptation Using Multimodal Contextual Learning and RGB-Depth Masking Numair Nadeem et.al. 2505.24026 null
2025-05-29 Semantics-Guided Generative Image Compression Cheng-Lin Wu et.al. 2505.24015 null
2025-05-29 Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts Xuweiyi Chen et.al. 2505.23926 null
2025-05-29 TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models Yao Xiao et.al. 2505.23769 link
2025-05-29 Bridging Classical and Modern Computer Vision: PerceptiveNet for Tree Crown Semantic Segmentation Georgios Voulgaris et.al. 2505.23597 null
2025-05-29 VITON-DRR: Details Retention Virtual Try-on via Non-rigid Registration Ben Li et.al. 2505.23439 link
2025-05-29 Adaptive Spatial Augmentation for Semi-supervised Semantic Segmentation Lingyan Ran et.al. 2505.23438 null
2025-05-29 Federated Unsupervised Semantic Segmentation Evangelos Charalampakis et.al. 2505.23292 null
2025-05-29 LeMoRe: Learn More Details for Lightweight Semantic Segmentation Mian Muhammad Naeem Abid et.al. 2505.23093 link
2025-05-28 ConfLUNet: Multiple sclerosis lesion instance segmentation in presence of confluent lesions Maxence Wynen et.al. 2505.22537 null
2025-05-28 Universal Domain Adaptation for Semantic Segmentation Seun-An Choe et.al. 2505.22458 null
2025-05-28 LiDAR Based Semantic Perception for Forklifts in Outdoor Environments Benjamin Serfling et.al. 2505.22258 null
2025-05-29 YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction Mingzhuang Wang et.al. 2505.22250 null
2025-05-28 Enjoying Information Dividend: Gaze Track-based Medical Weakly Supervised Segmentation Zhisong Wang et.al. 2505.22230 null
2025-05-28 A Survey on Training-free Open-Vocabulary Semantic Segmentation Naomi Kombol et.al. 2505.22209 null
2025-05-28 S2AFormer: Strip Self-Attention for Efficient Vision Transformer Guoan Xu et.al. 2505.22195 null
2025-05-28 LiDARDustX: A LiDAR Dataset for Dusty Unstructured Road Environments Chenfeng Wei et.al. 2505.21914 null
2025-05-29 CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation Pardis Taghavi et.al. 2505.21904 null
2025-05-28 Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation Mehrdad Noori et.al. 2505.21844 null
2025-05-27 Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO Muzhi Zhu et.al. 2505.21457 null
2025-05-27 Object-Centric Action-Enhanced Representations for Robot Visuo-Motor Policy Learning Nikos Giannakakis et.al. 2505.20962 null
2025-05-27 DSOcc: Leveraging Depth Awareness and Semantic Aid to Boost Camera-Based 3D Semantic Occupancy Prediction Naiyu Fang et.al. 2505.20951 null
2025-05-26 Vision-Based Risk Aware Emergency Landing for UAVs in Complex Urban Environments Julio de la Torre-Vanegas et.al. 2505.20423 null
2025-05-26 A fully automated urban PV parameterization framework for improved estimation of energy production profiles Bowen Tian et.al. 2505.19876 null
2025-05-26 Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation Nagito Saito et.al. 2505.19846 null
2025-05-26 The Missing Point in Vision Transformers for Universal Image Segmentation Sajjad Shahabodini et.al. 2505.19795 null
2025-05-26 ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting Wenhua Wu et.al. 2505.19420 null
2025-05-25 A Joint Learning Framework with Feature Reconstruction and Prediction for Incomplete Satellite Image Time Series in Agricultural Semantic Segmentation Yuze Wang et.al. 2505.19159 link
2025-05-25 SPARS: Self-Play Adversarial Reinforcement Learning for Segmentation of Liver Tumours Catalina Tan et.al. 2505.18989 link
2025-05-25 How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation Yining Pan et.al. 2505.18956 null
2025-05-25 LLM-Guided Taxonomy and Hierarchical Uncertainty for 3D Point Cloud Active Learning Chenxi Li et.al. 2505.18924 null
2025-05-24 ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts Shiu-hong Kao et.al. 2505.18561 null
2025-05-23 REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders Savya Khosla et.al. 2505.18153 null
2025-05-23 SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification Shashank Agnihotri et.al. 2505.18015 null
2025-05-23 Semantic segmentation with reward Xie Ting et.al. 2505.17905 null
2025-05-23 Hephaestus Minicubes: A Global, Multi-Modal Dataset for Volcanic Unrest Monitoring Nikolas Papadopoulos et.al. 2505.17782 null
2025-05-23 EMRA-proxy: Enhancing Multi-Class Region Semantic Segmentation in Remote Sensing Images with Attention Proxy Yichun Yu et.al. 2505.17665 null
2025-05-22 Deep mineralogical segmentation of thin section images based on QEMSCAN maps Jean Pablo Vieira de Mello et.al. 2505.17008 link
2025-05-22 OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning Zongyan Han et.al. 2505.16974 link
2025-05-22 NovelSeek: When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification NovelSeek Team et.al. 2505.16938 link
2025-05-22 TextureSAM: Towards a Texture Aware Foundation Model for Segmentation Inbal Cohen et.al. 2505.16540 null
2025-05-22 Detailed Evaluation of Modern Machine Learning Approaches for Optic Plastics Sorting Vaishali Maheshkar et.al. 2505.16513 null
2025-05-22 Sketchy Bounding-box Supervision for 3D Instance Segmentation Qian Deng et.al. 2505.16399 null
2025-05-22 Style Transfer with Diffusion Models for Synthetic-to-Real Domain Adaptation Estelle Chigot et.al. 2505.16360 link
2025-05-22 RE-TRIP: Reflectivity Instance Augmented Triangle Descriptor for 3D Place Recognition Yechan Park et.al. 2505.16165 link
2025-05-21 VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation Niccolo Avogaro et.al. 2505.15592 null
2025-05-21 UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset Hua Li et.al. 2505.15581 link
2025-05-21 seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic Segmentation Andrew Caunes et.al. 2505.15545 link
2025-05-21 Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation Ce Zhang et.al. 2505.15491 null
2025-05-21 gen2seg: Generative Models Enable Generalizable Instance Segmentation Om Khangaonkar et.al. 2505.15263 null
2025-05-21 Zero-Shot Gaze-based Volumetric Medical Image Segmentation Tatyana Shmykova et.al. 2505.15256 null
2025-05-21 From Pixels to Images: Deep Learning Advances in Remote Sensing Image Semantic Segmentation Quanwei Liu et.al. 2505.15147 null
2025-05-20 Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning Amine Elhafsi et.al. 2505.14938 null
2025-05-20 Instance Segmentation for Point Sets Abhimanyu Talwar et.al. 2505.14583 null
2025-05-20 ReservoirTTA: Prolonged Test-time Adaptation for Evolving and Recurring Domains Guillaume Vray et.al. 2505.14511 link
2025-05-20 Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation Bin-Bin Gao et.al. 2505.14239 link
2025-05-20 Intra-class Patch Swap for Self-Distillation Hongjun Choi et.al. 2505.14124 link
2025-05-20 Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts Xi Chen et.al. 2505.14088 null
2025-05-20 Scaling Vision Mamba Across Resolutions via Fractal Traversal Bo Li et.al. 2505.14062 null
2025-05-20 EGFormer: Towards Efficient and Generalizable Multimodal Semantic Segmentation Zelin Zhang et.al. 2505.14014 null
2025-05-19 Self-Supervised Learning for Image Segmentation: A Comprehensive Survey Thangarajah Akilan et.al. 2505.13584 null
2025-05-19 FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching Alp Eren Sari et.al. 2505.13174 null
2025-05-20 Industrial Synthetic Segment Pre-training Shinichi Mae et.al. 2505.13099 null
2025-05-19 Robust Multimodal Segmentation with Representation Regularization and Hybrid Prototype Distillation Jiaqi Tan et.al. 2505.12861 link
2025-05-19 Enhancing Transformers Through Conditioned Embedded Tokens Hemanth Saratchandran et.al. 2505.12789 null
2025-05-18 Temporal-Spectral-Spatial Unified Remote Sensing Dense Prediction Sijie Zhao et.al. 2505.12280 link
2025-05-17 SoftPQ: Robust Instance Segmentation Evaluation via Soft Matching and Tunable Thresholds Ranit Karmakar et.al. 2505.12155 link
2025-05-17 EarthSynth: Generating Informative Earth Observation with Diffusion Models Jiancheng Pan et.al. 2505.12108 null
2025-05-17 iSegMan: Interactive Segment-and-Manipulate 3D Gaussians Yian Zhao et.al. 2505.11934 null
2025-05-17 Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Boosting Off-Road Segmentation via Photometric Distortion and Exponential Moving Average Wonjune Kim et.al. 2505.11769 null
2025-05-16 DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation Ziyu Zhao et.al. 2505.11676 null
2025-05-16 SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision Utsav Rai et.al. 2505.11439 null
2025-05-16 Pseudo-Label Quality Decoupling and Correction for Semi-Supervised Instance Segmentation Jianghang Lin et.al. 2505.11075 null
2025-05-16 Completely Weakly Supervised Class-Incremental Learning for Semantic Segmentation David Minkwan Kim et.al. 2505.10781 null
2025-05-15 Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis Francisco Raverta Capua et.al. 2505.10751 null
2025-05-15 TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation Manthan Patel et.al. 2505.10696 null
2025-05-15 SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity Shihao Zou et.al. 2505.10352 null
2025-05-15 APCoTTA: Continual Test-Time Adaptation for Semantic Segmentation of Airborne LiDAR Point Clouds Yuan Gao et.al. 2505.09971 link
2025-05-14 FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization Xiaoyang Yu et.al. 2505.09385 null
2025-05-14 MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning Bin-Bin Gao et.al. 2505.09265 link
2025-05-13 MESSI: A Multi-Elevation Semantic Segmentation Image Dataset of an Urban Environment Barak Pinkovich et.al. 2505.08589 null
2025-05-14 The RaspGrade Dataset: Towards Automatic Raspberry Ripeness Grading with Deep Learning Mohamed Lamine Mekhalfi et.al. 2505.08537 null
2025-05-13 Dynamic Snake Upsampling Operator and Boundary-Skeleton Weighted Loss for Tubular Structure Segmentation Yiqi Chen et.al. 2505.08525 null
2025-05-13 Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency Adel Ammar et.al. 2505.08445 null
2025-05-13 GNCAF: A GNN-based Neighboring Context Aggregation Framework for Tertiary Lymphoid Structures Semantic Segmentation in WSI Lei Su et.al. 2505.08430 null
2025-05-12 Vision Foundation Model Embedding-Based Semantic Anomaly Detection Max Peter Ronecker et.al. 2505.07998 null
2025-05-12 Privacy Risks of Robot Vision: A User Study on Image Modalities and Resolution Xuying Huang et.al. 2505.07766 null
2025-05-12 Feedback-Driven Pseudo-Label Reliability Assessment: Redefining Thresholding for Semi-Supervised Semantic Segmentation Negin Ghamsarian et.al. 2505.07691 null
2025-05-12 MAIS: Memory-Attention for Interactive Segmentation Mauricio Orbes-Arteaga et.al. 2505.07511 null
2025-05-13 TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset Olaf Wysocki et.al. 2505.07396 null
2025-05-11 Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution Zihang Liu et.al. 2505.07071 link
2025-05-11 Depth-Sensitive Soft Suppression with RGB-D Inter-Modal Stylization Flow for Domain Generalization Semantic Segmentation Binbin Wei et.al. 2505.07050 null
2025-05-11 Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Leveraging Color Shift Correction, RoPE-Swin Backbone, and Quantile-based Label Denoising Strategy for Robust Outdoor Scene Understanding Chih-Chung Hsu et.al. 2505.06991 null
2025-05-11 Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation Seokjun Kwon et.al. 2505.06951 null
2025-05-10 Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization Xu Zheng et.al. 2505.06635 null
2025-05-10 RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation Zhiwen Zeng et.al. 2505.06515 null
2025-05-09 Brain Hematoma Marker Recognition Using Multitask Learning: SwinTransformer and Swin-Unet Kodai Hirata et.al. 2505.06185 null
2025-05-08 CottonSim: Development of an autonomous visual-guided robotic cotton-picking system in the Gazebo Thevathayarajh Thayananthan et.al. 2505.05317 null
2025-05-08 RepSNet: A Nucleus Instance Segmentation model based on Boundary Regression and Structural Re-parameterization Shengchun Xiong et.al. 2505.05073 null
2025-05-09 UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model Timo Kaiser et.al. 2505.05049 link
2025-05-08 Split Matching for Inductive Zero-shot Semantic Segmentation Jialei Chen et.al. 2505.05023 null
2025-05-08 Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model Navin Ranjan et.al. 2505.04861 null
2025-05-07 Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions? Shashank Agnihotri et.al. 2505.04835 link
2025-05-07 Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer Sainath Dey et.al. 2505.04740 null
2025-05-07 DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception Junjie Wang et.al. 2505.04410 link
2025-05-07 MFSeg: Efficient Multi-frame 3D Semantic Segmentation Chengjie Huang et.al. 2505.04408 null
2025-05-06 Self-Supervised Learning for Robotic Leaf Manipulation: A Hybrid Geometric-Neural Approach Srecharan Selvam et.al. 2505.03702 null
2025-05-06 CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting Huawei Sun et.al. 2505.03679 null
2025-05-06 Panoramic Out-of-Distribution Segmentation Mengfei Duan et.al. 2505.03539 link
2025-05-06 3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation Andrew Caunes et.al. 2505.03300 null
2025-05-05 Platelet enumeration in dense aggregates H. Martin Gillis et.al. 2505.02751 null
2025-05-04 Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation Volodymyr Havrylov et.al. 2505.02075 link
2025-05-04 Segment Any RGB-Thermal Model with Language-aided Distillation Dong Xing et.al. 2505.01950 null
2025-05-03 OODTE: A Differential Testing Engine for the ONNX Optimizer Nikolaos Louloudakis et.al. 2505.01892 null
2025-05-03 A Novel WaveInst-based Network for Tree Trunk Structure Extraction and Pattern Analysis in Forest Inventory Chenyang Fan et.al. 2505.01656 null
2025-05-02 A Sensor Agnostic Domain Generalization Framework for Leveraging Geospatial Foundation Models: Enhancing Semantic Segmentation via Synergistic Pseudo-Labeling and Generative Learning Anan Yaghmour et.al. 2505.01558 null
2025-05-02 Rethinking RGB-Event Semantic Segmentation with a Novel Bidirectional Motion-enhanced Event Representation Zhen Yao et.al. 2505.01548 link
2025-05-02 Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing Fahong Zhang et.al. 2505.01385 null
2025-05-02 GeloVec: Higher Dimensional Geometric Smoothing for Coherent Visual Feature Extraction in Image Segmentation Boris Kriuk et.al. 2505.01057 null
2025-04-30 MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection Qiushi Yang et.al. 2505.00739 null
2025-05-03 Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook Muyi Bao et.al. 2505.00630 null
2025-05-01 Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation Feng Xue et.al. 2505.00378 null
2025-04-30 Common3D: Self-Supervised Learning of 3D Morphable Models for Common Objects in Neural Feature Space Leonhard Sommer et.al. 2504.21749 null
2025-04-30 Real Time Semantic Segmentation of High Resolution Automotive LiDAR Scans Hannes Reichert et.al. 2504.21602 null
2025-04-30 Make Both Ends Meet: A Synergistic Optimization Infrared Small Target Detection with Streamlined Computational Overhead Yuxin Jing et.al. 2504.21581 null
2025-04-30 ClassWise-CRF: Category-Specific Fusion for Enhanced Semantic Segmentation of Remote Sensing Imagery Qinfeng Zhu et.al. 2504.21491 null
2025-04-29 DeepVoid: A Deep Learning Void Detector Sam Kumagai et.al. 2504.21134 null
2025-04-29 Learning a General Model: Folding Clothing with Topological Dynamics Yiming Liu et.al. 2504.20720 null
2025-04-29 OG-HFYOLO: Orientation gradient guidance and heterogeneous feature fusion for deformation table cell instance segmentation Long Liu et.al. 2504.20682 link
2025-04-28 DeepAndes: A Self-Supervised Vision Foundation Model for Multi-Spectral Remote Sensing Imagery of the Andes Junlin Guo et.al. 2504.20303 null
2025-04-28 Learning Streaming Video Representation via Multitask Training Yibin Yan et.al. 2504.20041 null
2025-04-28 SRMF: A Data Augmentation and Multimodal Fusion Approach for Long-Tail UHR Satellite Image Segmentation Yulong Guo et.al. 2504.19839 null
2025-04-28 Open-set Anomaly Segmentation in Complex Scenarios Song Xia et.al. 2504.19706 null
2025-04-28 SubGrapher: Visual Fingerprinting of Chemical Structures Lucas Morin et.al. 2504.19695 null
2025-04-28 BARIS: Boundary-Aware Refinement with Environmental Degradation Priors for Robust Underwater Instance Segmentation Pin-Chi Pan et.al. 2504.19643 null
2025-04-28 Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding Yan Wang et.al. 2504.19500 null
2025-04-28 GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field Zuxing Lu et.al. 2504.19409 null
2025-04-27 OpenFusion++: An Open-vocabulary Real-time Scene Understanding System Xiaofeng Jin et.al. 2504.19266 null
2025-04-27 DeepSPG: Exploring Deep Semantic Prior Guidance for Low-light Image Enhancement with Multimodal Learning Jialang Lu et.al. 2504.19127 null
2025-04-26 VISUALCENT: Visual Human Analysis using Dynamic Centroid Representation Niaz Ahmad et.al. 2504.19032 null
2025-04-25 A Data-Centric Approach to 3D Semantic Segmentation of Railway Scenes Nicolas Münger et.al. 2504.18213 null
2025-04-25 Multi-Grained Compositional Visual Clue Learning for Image Intent Recognition Yin Tang et.al. 2504.18201 null
2025-04-25 What is the Added Value of UDA in the VFM Era? Brunó B. Englert et.al. 2504.18190 null
2025-04-25 Back to Fundamentals: Low-Level Visual Features Guided Progressive Token Pruning Yuanbing Ouyang et.al. 2504.17996 null
2025-04-24 Virtual Roads, Smarter Safety: A Digital Twin Framework for Mixed Autonomous Traffic Safety Analysis Hao Zhang et.al. 2504.17968 null
2025-04-24 Masked strategies for images with small objects H. Martin Gillis et.al. 2504.17935 null
2025-04-24 Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images Zebo Huang et.al. 2504.17582 null
2025-04-23 Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection Jens Petersen et.al. 2504.17076 null
2025-04-23 SemanticSugarBeets: A Multi-Task Framework and Dataset for Inspecting Harvest and Storage Characteristics of Sugar Beets Gerardus Croonen et.al. 2504.16684 null
2025-04-23 Federated EndoViT: Pretraining Vision Transformers via Federated Learning on Endoscopic Image Collections Max Kirchner et.al. 2504.16612 null
2025-04-23 SAIP-Net: Enhancing Remote Sensing Image Segmentation via Spectral Adaptive Information Propagation Zhongtao Wang et.al. 2504.16564 null
2025-04-23 Beyond Anonymization: Object Scrubbing for Privacy-Preserving 2D and 3D Vision Tasks Murat Bilgehan Ertan et.al. 2504.16557 null
2025-04-22 Efficient Adaptation of Deep Neural Networks for Semantic Segmentation in Space Applications Leonardo Olivi et.al. 2504.15991 null
2025-04-22 DINOv2-powered Few-Shot Semantic Segmentation: A Unified Framework via Cross-Model Distillation and 4D Correlation Mining Wei Zhuo et.al. 2504.15669 null
2025-04-21 Segmentation with Noisy Labels via Spatially Correlated Distributions Ryu Tadokoro et.al. 2504.14795 link
2025-04-20 NVSMask3D: Hard Visual Prompting with Camera Pose Interpolation for 3D Open Vocabulary Instance Segmentation Junyuan Fang et.al. 2504.14638 null
2025-04-19 Exploring Modality Guidance to Enhance VFM-based Feature Fusion for UDA in 3D Semantic Segmentation Johannes Spoecklberger et.al. 2504.14231 null
2025-04-19 Segment Any Crack: Deep Semantic Segmentation Adaptation for Crack Detection Ghodsiyeh Rostami et.al. 2504.14138 null
2025-04-19 Lightweight Road Environment Segmentation using Vector Quantization Jiyong Kwag et.al. 2504.14113 null
2025-04-18 Occlusion-Ordered Semantic Instance Segmentation Soroosh Baselizadeh et.al. 2504.14054 null
2025-04-18 HDBFormer: Efficient RGB-D Semantic Segmentation with A Heterogeneous Dual-Branch Framework Shuobin Wei et.al. 2504.13579 null
2025-04-18 Learning from Noisy Pseudo-labels for All-Weather Land Cover Mapping Wang Liu et.al. 2504.13458 link
2025-04-18 DADU: Dual Attention-based Deep Supervised UNet for Automated Semantic Segmentation of Cardiac Images Racheal Mukisa et.al. 2504.13415 null
2025-04-18 Cardiac MRI Semantic Segmentation for Ventricles and Myocardium using Deep Learning Racheal Mukisa et.al. 2504.13391 null
2025-04-17 SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling Yasin Almalioglu et.al. 2504.13310 null
2025-04-17 Digital Twin Generation from Visual Data: A Survey Andrew Melnik et.al. 2504.13159 null
2025-04-17 High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion Libo Zhang et.al. 2504.12844 null
2025-04-17 Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation Siyu Chen et.al. 2504.12753 link
2025-04-17 Parsimonious Dataset Construction for Laparoscopic Cholecystectomy Structure Segmentation Yuning Zhou et.al. 2504.12573 null
2025-04-17 Privacy-Preserving Operating Room Workflow Analysis using Digital Twins Alejandra Perez et.al. 2504.12552 null
2025-04-16 3D-PointZshotS: Geometry-Aware 3D Point Cloud Zero-Shot Semantic Segmentation Narrowing the Visual-Semantic Gap Minmin Yang et.al. 2504.12442 null
2025-04-16 Remote sensing colour image semantic segmentation of trails created by large herbivorous Mammals Jose Francisco Diez-Pastor et.al. 2504.12121 null
2025-04-17 DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency Mengshi Qi et.al. 2504.12080 link
2025-04-16 Single-shot Star-convex Polygon-based Instance Segmentation for Spatially-correlated Biomedical Objects Trina De et.al. 2504.12078 null
2025-04-16 CAGS: Open-Vocabulary 3D Scene Understanding with Context-Aware Gaussian Splatting Wei Sun et.al. 2504.11893 null
2025-04-15 CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image Jingshun Huang et.al. 2504.11230 null
2025-04-15 Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation Andrea Simonelli et.al. 2504.11024 null
2025-04-15 PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation Bo-Cheng Hu et.al. 2504.10986 null
2025-04-15 LightFormer: A lightweight and efficient decoder for remote sensing image segmentation Sihang Chen et.al. 2504.10834 null
2025-04-15 OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding Dianbing Xi et.al. 2504.10825 null
2025-04-15 Efficient and Robust Remote Sensing Image Denoising Using Randomized Approximation of Geodesics’ Gramian on the Manifold Underlying the Patch Space Kelum Gajamannage et.al. 2504.10820 null
2025-04-14 Real-time Seafloor Segmentation and Mapping Michele Grimaldi et.al. 2504.10750 null
2025-04-14 FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation Yasser Benigmim et.al. 2504.10487 null
2025-04-14 The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer Weixian Lei et.al. 2504.10462 null
2025-04-14 M2S-RoAD: Multi-Modal Semantic Segmentation for Road Damage Using Camera and LiDAR Data Tzu-Yun Tseng et.al. 2504.10123 null
2025-04-14 DUDA: Distilled Unsupervised Domain Adaptation for Lightweight Semantic Segmentation Beomseok Kang et.al. 2504.09814 null
2025-04-14 IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme Dinh Dai Quan Tran et.al. 2504.09797 null
2025-04-14 Advancing RFI-Detection in Radio Astronomy with Liquid State Machines Nicholas J Pritchard et.al. 2504.09796 null
2025-04-12 Evolved Hierarchical Masking for Self-Supervised Learning Zhanzhou Feng et.al. 2504.09155 null
2025-04-11 Data-Importance-Aware Power Allocation for Adaptive Real-Time Communication in Computer Vision Applications Chunmei Xu et.al. 2504.08922 null
2025-04-11 Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing Vinal Asodia et.al. 2504.08704 null
2025-04-11 Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation Bram Vanherle et.al. 2504.08473 link
2025-04-11 SN-LiDAR: Semantic Neural Fields for Novel Space-time View LiDAR Synthesis Yi Chen et.al. 2504.08361 null
2025-04-11 DSM: Building A Diverse Semantic Map for 3D Visual Grounding Qinghongbing Xie et.al. 2504.08307 null
2025-04-10 ChildlikeSHAPES: Semantic Hierarchical Region Parsing for Animating Figure Drawings Astitva Srivastava et.al. 2504.08022 null
2025-04-10 P2Object: Single Point Supervised Object Detection and Instance Segmentation Pengfei Chen et.al. 2504.07813 null
2025-04-10 Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation Yanglin Huang et.al. 2504.07691 null
2025-04-10 SydneyScapes: Image Segmentation for Australian Environments Hongyu Lyu et.al. 2504.07542 null
2025-04-10 RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability Jonggwon Park et.al. 2504.07416 null
2025-04-09 RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration Omar Alama et.al. 2504.06994 null
2025-04-09 Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting Daiwei Zhang et.al. 2504.06978 null
2025-04-09 Domain Generalization through Attenuation of Domain-Specific Information Reiji Saito et.al. 2504.06781 null
2025-04-08 SemiDAViL: Semi-supervised Domain Adaptation with Vision-Language Guidance for Semantic Segmentation Hritam Basak et.al. 2504.06389 null
2025-04-09 Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation Xiaoxing Hu et.al. 2504.06220 null
2025-04-08 WoundAmbit: Bridging State-of-the-Art Semantic Segmentation and Real-World Wound Care Vanessa Borst et.al. 2504.06185 null
2025-04-08 Towards Varroa destructor mite detection using a narrow spectra illumination Samuel Bielik et.al. 2504.06099 null
2025-04-08 econSG: Efficient and Multi-view Consistent Open-Vocabulary 3D Semantic Gaussians Can Zhang et.al. 2504.06003 null
2025-04-08 Turin3D: Evaluating Adaptation Strategies under Label Scarcity in Urban LiDAR Segmentation with Semi-Supervised Techniques Luca Barco et.al. 2504.05882 null
2025-04-08 DefMamba: Deformable Visual State Space Model Leiye Liu et.al. 2504.05794 null
2025-04-08 Transferable Mask Transformer: Cross-domain Semantic Segmentation with Region-adaptive Transferability Estimation Enming Zhang et.al. 2504.05774 null
2025-04-07 S^4M: Boosting Semi-Supervised Instance Segmentation with SAM Heeji Yoon et.al. 2504.05301 null
2025-04-07 BoxSeg: Quality-Aware and Peer-Assisted Learning for Box-supervised Instance Segmentation Jinxiang Lai et.al. 2504.05137 null
2025-04-07 Balancing Robustness and Efficiency in Embedded DNNs Through Activation Function Selection Jon Gutiérrez Zaballa et.al. 2504.05119 null
2025-04-07 Prior2Former – Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation Sebastian Schmidt et.al. 2504.04841 null
2025-04-07 DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation Bo-Wen Yin et.al. 2504.04701 link
2025-04-06 Statistical Guarantees Of False Discovery Rate In Medical Instance Segmentation Tasks Based on Conformal Risk Control Mengxia Dai et.al. 2504.04482 null
2025-04-06 Evaluation framework for Image Segmentation Algorithms Tatiana Merkulova et.al. 2504.04435 null
2025-04-05 CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation Kai Fang et.al. 2504.04156 null
2025-04-05 DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning Xiao-Hui Li et.al. 2504.04085 null
2025-04-04 Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation Xin Zhang et.al. 2504.03193 null
2025-04-03 Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation Feng Gao et.al. 2504.02647 null
2025-04-03 Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results Andrei Dumitriu et.al. 2504.02558 null
2025-04-03 Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery Mykola Lavreniuk et.al. 2504.02534 null
2025-04-03 Semantic segmentation of forest stands using deep learning Håkon Næss Sandum et.al. 2504.02471 null
2025-04-03 Taylor Series-Inspired Local Structure Fitting Network for Few-shot Point Cloud Semantic Segmentation Changshuo Wang et.al. 2504.02454 null
2025-04-03 Benchmark of Segmentation Techniques for Pelvic Fracture in CT and X-ray: Summary of the PENGWIN 2024 Challenge Yudi Sang et.al. 2504.02382 null
2025-04-03 APSeg: Auto-Prompt Model with Acquired and Injected Knowledge for Nuclear Instance Segmentation and Classification Liying Xu et.al. 2504.02222 null
2025-04-02 Scene-Centric Unsupervised Panoptic Segmentation Oliver Hahn et.al. 2504.01955 link
2025-04-02 Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation Junjie Chen et.al. 2504.01668 null
2025-04-03 Robust Unsupervised Domain Adaptation for 3D Point Cloud Segmentation Under Source Adversarial Attacks Haosheng Li et.al. 2504.01659 null
2025-04-02 ProtoGuard-guided PROPEL: Class-Aware Prototype Enhancement and Progressive Labeling for Incremental 3D Point Cloud Segmentation Haosheng Li et.al. 2504.01648 null
2025-04-02 Benchmarking the Spatial Robustness of DNNs via Natural and Adversarial Localized Corruptions Giulia Marchiori Pietrosanti et.al. 2504.01632 null
2025-04-02 Instance Migration Diffusion for Nuclear Instance Segmentation in Pathology Lirui Qi et.al. 2504.01577 null
2025-04-02 Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training Luca Ciampi et.al. 2504.01547 null
2025-04-02 Beyond Nearest Neighbor Interpolation in Data Augmentation Olivier Rukundo et.al. 2504.01527 null
2025-04-02 Multimodal Point Cloud Semantic Segmentation With Virtual Point Enhancement Zaipeng Duan et.al. 2504.01449 null
2025-04-02 v-CLR: View-Consistent Learning for Open-World Instance Segmentation Chang-Bin Zhang et.al. 2504.01383 null
2025-03-31 Pre-training with 3D Synthetic Data: Learning 3D Point Cloud Instance Segmentation from 3D Synthetic Scenes Daichi Otsuka et.al. 2503.24229 null
2025-03-31 Spectral-Adaptive Modulation Networks for Visual Perception Guhnoo Yun et.al. 2503.23947 null
2025-03-31 Bridge the Gap Between Visual and Linguistic Comprehension for Generalized Zero-shot Semantic Segmentation Xiaoqing Guo et.al. 2503.23806 null
2025-03-31 Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks Yu Zhou et.al. 2503.23751 null
2025-03-31 Semantic Packet Aggregation and Repeated Transmission for Text-to-Image Generation Seunghun Lee et.al. 2503.23734 null
2025-03-31 CrossFormer: Cross-Segment Semantic Fusion for Document Segmentation Tongke Ni et.al. 2503.23671 null
2025-03-30 BoundMatch: Boundary detection applied to semi-supervised segmentation for urban-driving scenes Haruya Ishikawa et.al. 2503.23519 null
2025-03-30 Improving underwater semantic segmentation with underwater image quality attention and muti-scale aggregation attention Xin Zuo et.al. 2503.23422 null
2025-03-29 Open-Vocabulary Semantic Segmentation with Uncertainty Alignment for Robotic Scene Understanding in Indoor Building Environments Yifan Xu et.al. 2503.23105 null
2025-03-28 Enhancing DeepLabV3+ to Fuse Aerial and Satellite Images for Semantic Segmentation Anas Berka et.al. 2503.22909 null
2025-03-28 KEVS: Enhancing Segmentation of Visceral Adipose Tissue in Pre-Cystectomy CT with Gaussian Kernel Density Estimation Thomas Boucher et.al. 2503.22592 null
2025-03-28 A Dataset for Semantic Segmentation in the Presence of Unknowns Zakaria Laskar et.al. 2503.22309 null
2025-03-28 Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation Minho Park et.al. 2503.22172 null
2025-03-28 Beyond Background Shift: Rethinking Instance Replay in Continual Semantic Segmentation Hongmei Yin et.al. 2503.22136 null
2025-03-28 Semantic segmentation for building houses from wooden cubes Ivan Beleacov et.al. 2503.22125 null
2025-03-28 Baseline Systems and Evaluation Metrics for Spatial Semantic Segmentation of Sound Scenes Binh Thien Nguyen et.al. 2503.22088 null
2025-03-28 A Deep Learning Framework for Boundary-Aware Semantic Segmentation Tai An et.al. 2503.22050 null
2025-03-27 Foveated Instance Segmentation Hongyi Zeng et.al. 2503.21854 null
2025-03-27 Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation Reza Qorbani et.al. 2503.21780 link
2025-03-27 A Unified Image-Dense Annotation Generation Model for Underwater Scenes Hongkai Lin et.al. 2503.21771 link
2025-03-27 Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving Lucas Nunes et.al. 2503.21449 link
2025-03-26 Exploring CLIP’s Dense Knowledge for Weakly Supervised Semantic Segmentation Zhiwei Yang et.al. 2503.20826 null
2025-03-26 Exploiting Temporal State Space Sharing for Video Semantic Segmentation Syed Ariff Syed Hesham et.al. 2503.20824 null
2025-03-26 Assessing SAM for Tree Crown Instance Segmentation from Drone Imagery Mélisande Teng et.al. 2503.20199 null
2025-03-25 Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception Luke Chen et.al. 2503.20011 null
2025-03-25 The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs Jonathan Sauder et.al. 2503.20000 null
2025-03-25 LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation Vladan Stojnić et.al. 2503.19777 link
2025-03-25 OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations Christina Kassab et.al. 2503.19764 null
2025-03-25 Show or Tell? Effectively prompting Vision-Language Models for semantic segmentation Niccolo Avogaro et.al. 2503.19647 null
2025-03-25 Exploring Textual Semantics Diversity for Image Transmission in Semantic Communication Systems using Visual Language Model Peishan Huang et.al. 2503.19386 null
2025-03-25 BIMII-Net: Brain-Inspired Multi-Iterative Interactive Network for RGB-T Road Scene Semantic Segmentation Hanshuo Qiu et.al. 2503.19303 null
2025-03-25 Multiscale Feature Importance-based Bit Allocation for End-to-End Feature Coding for Machines Junle Liu et.al. 2503.19278 null
2025-03-25 Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications Ben Rahman et.al. 2503.19276 null
2025-03-24 DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation Karim Abou Zeid et.al. 2503.18944 link
2025-03-24 Exploring the Integration of Key-Value Attention Into Pure and Hybrid Transformers for Semantic Segmentation DeShin Hwa et.al. 2503.18862 null
2025-03-24 EgoSurgery-HTS: A Dataset for Egocentric Hand-Tool Segmentation in Open Surgery Videos Nathan Darjana et.al. 2503.18755 null
2025-03-24 HiRes-FusedMIM: A High-Resolution RGB-DSM Pre-trained Model for Building-Level Remote Sensing Applications Guneet Mutreja et.al. 2503.18540 null
2025-03-24 Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality Robustness Chenfei Liao et.al. 2503.18445 null
2025-03-24 PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes Xinhua Xu et.al. 2503.18393 null
2025-03-24 MaSS13K: A Matting-level Semantic Segmentation Benchmark Chenxi Xie et.al. 2503.18364 null
2025-03-23 PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding Hongjia Zhai et.al. 2503.18107 null
2025-03-23 Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images Yara AlaaEldin et.al. 2503.17982 null
2025-03-23 FisherTune: Fisher-Guided Robust Tuning of Vision Foundation Models for Domain Generalized Segmentation Dong Zhao et.al. 2503.17940 null
2025-03-21 Center-guided Classifier for Semantic Segmentation of Remote Sensing Images Wei Zhang et.al. 2503.16963 null
2025-03-21 Seg2Box: 3D Object Detection by Point-Wise Semantics Supervision Maoji Zheng et.al. 2503.16811 null
2025-03-20 SAGE: Semantic-Driven Adaptive Gaussian Splatting in Extended Reality Chiara Schiavo et.al. 2503.16747 null
2025-03-20 Panoptic-CUDAL Technical Report: Rural Australia Point Cloud Dataset in Rainy Conditions Tzu-Yun Tseng et.al. 2503.16378 null
2025-03-20 M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation Markus Karmann et.al. 2503.16254 null
2025-03-20 Controllable Segmentation-Based Text-Guided Style Editing Jingwen Li et.al. 2503.16129 null
2025-03-20 No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather Junsung Park et.al. 2503.15910 null
2025-03-19 High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight Cédric Vincent et.al. 2503.15676 link
2025-03-19 Transport-Related Surface Detection with Machine Learning: Analyzing Temporal Trends in Madrid and Vienna Miguel Ureña Pliego et.al. 2503.15653 link
2025-03-19 CAM-Seg: A Continuous-valued Embedding Approach for Semantic Image Generation Masud Ahmed et.al. 2503.15617 null
2025-03-19 SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes Weixiao Gao et.al. 2503.15300 null
2025-03-19 Semantic Segmentation of Transparent and Opaque Drinking Glasses with the Help of Zero-shot Learning Annalena Blänsdorf et.al. 2503.15004 null
2025-03-19 USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network Joseph Emmanuel DL Dayo et.al. 2503.14950 null
2025-03-19 SemanticFlow: A Self-Supervised Framework for Joint Scene Flow Prediction and Instance Segmentation in Dynamic Environments Yinqi Chen et.al. 2503.14837 null
2025-03-18 Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting Runsong Zhu et.al. 2503.14029 link
2025-03-18 PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds Barza Nisar et.al. 2503.13914 null
2025-03-18 Exploiting Inherent Class Label: Towards Robust Scribble Supervised Semantic Segmentation Xinliang Zhang et.al. 2503.13895 link
2025-03-17 SED-MVS: Segmentation-Driven and Edge-Aligned Deformation Multi-View Stereo with Depth Restoration and Occlusion Constraint Zhenlong Yuan et.al. 2503.13721 null
2025-03-17 Let Synthetic Data Shine: Domain Reassembly and Soft-Fusion for Single Domain Generalization Hao Li et.al. 2503.13617 null
2025-03-17 Clustering is back: Reaching state-of-the-art LiDAR instance segmentation without training Corentin Sautier et.al. 2503.13203 null
2025-03-17 3D Hierarchical Panoptic Segmentation in Real Orchard Environments Across Different Sensors Matteo Sodano et.al. 2503.13188 null
2025-03-17 DehazeMamba: SAR-guided Optical Remote Sensing Image Dehazing with Adaptive State Space Model Zhicheng Zhao et.al. 2503.13073 null
2025-03-17 Adaptive Transformer Attention and Multi-Scale Fusion for Spine 3D Segmentation Yanlin Xiang et.al. 2503.12853 null
2025-03-17 LangDA: Building Context-Awareness via Language for Domain Adaptive Semantic Segmentation Chang Liu et.al. 2503.12780 null
2025-03-17 TransDiff: Diffusion-Based Method for Manipulating Transparent Objects Using a Single RGB-D Image Haoxiao Wang et.al. 2503.12779 null
2025-03-16 Point Cloud Based Scene Segmentation: A Survey Dan Halperin et.al. 2503.12595 null
2025-03-16 BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature Analysis Weiguang Zhao et.al. 2503.12539 null
2025-03-16 SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs Guibiao Liao et.al. 2503.12535 null
2025-03-16 Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation Edgar Heinert et.al. 2503.12453 null
2025-03-14 COIN: Confidence Score-Guided Distillation for Annotation-Free Cell Segmentation Sanghyun Jo et.al. 2503.11439 null
2025-03-14 CyclePose – Leveraging Cycle-Consistency for Annotation-Free Nuclei Segmentation in Fluorescence Microscopy Jonas Utz et.al. 2503.11266 null
2025-03-14 SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets Hao Liu et.al. 2503.11133 null
2025-03-14 A Novel Decomposed Feature-Oriented Framework for Open-Set Semantic Segmentation on LiDAR Data Wenbang Deng et.al. 2503.11097 null
2025-03-12 Knowledge Consultation for Semi-Supervised Semantic Segmentation Thuan Than et.al. 2503.10693 null
2025-03-13 RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing Fengxiang Wang et.al. 2503.10392 link
2025-03-13 OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions Maxim Popov et.al. 2503.10331 null
2025-03-12 CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation Hariprasath Govindarajan et.al. 2503.09878 null
2025-03-12 Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets Hannah Kniesel et.al. 2503.09221 null
2025-03-12 Learning Appearance and Motion Cues for Panoptic Tracking Juana Valeria Hurtado et.al. 2503.09191 null
2025-03-11 SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories Muzhi Zhu et.al. 2503.08625 null
2025-03-11 SAS: Segment Any 3D Scene with Integrated 2D Priors Zhuoyuan Li et.al. 2503.08512 null
2025-03-11 WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images Yansong Guo et.al. 2503.08407 null
2025-03-11 nnInteractive: Redefining 3D Promptable Segmentation Fabian Isensee et.al. 2503.08373 link
2025-03-11 SegDesicNet: Lightweight Semantic Segmentation in Remote Sensing with Geo-Coordinate Embeddings for Domain Adaptation Sachin Verma et.al. 2503.08290 null
2025-03-11 Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation Deyi Ji et.al. 2503.08043 null
2025-03-11 DiffEGG: Diffusion-Driven Edge Generation as a Pixel-Annotation-Free Alternative for Instance Annotation Sanghyun Jo et.al. 2503.07982 null
2025-03-10 Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models? Yuru Jia et.al. 2503.07890 null
2025-03-10 REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding Yan Tai et.al. 2503.07413 link
2025-03-10 Semantic Communications with Computer Vision Sensing for Edge Video Transmission Yubo Peng et.al. 2503.07252 null
2025-03-10 OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation Ding Zhong et.al. 2503.07098 null
2025-03-10 Approximate Size Targets Are Sufficient for Accurate Semantic Segmentation Xingye Fan et.al. 2503.06954 null
2025-03-10 Aligning Instance-Semantic Sparse Representation towards Unsupervised Object Segmentation and Shape Abstraction with Repeatable Primitives Jiaxin Li et.al. 2503.06947 null
2025-03-10 HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors Siyu Li et.al. 2503.06821 null
2025-03-09 CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving Rui Song et.al. 2503.06744 null
2025-03-09 Continuous Online Adaptation Driven by User Interaction for Medical Image Segmentation Wentian Xu et.al. 2503.06717 null
2025-03-09 MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation Chenfei Liao et.al. 2503.06700 null
2025-03-09 Asymmetric Decision-Making in Online Knowledge Distillation: Unifying Consensus and Divergence Zhaowei Chen et.al. 2503.06685 null
2025-03-07 Joint 3D Point Cloud Segmentation using Real-Sim Loop: From Panels to Trees and Branches Tian Qiu et.al. 2503.05630 null
2025-03-07 TomatoScanner: phenotyping tomato fruit based on only RGB image Xiaobei Zhao et.al. 2503.05568 null
2025-03-07 S4M: Segment Anything with 4 Extreme Points Adrien Meyer et.al. 2503.05534 null
2025-03-07 Kaiwu: A Multimodal Manipulation Dataset and Framework for Robot Learning and Human-Robot Interaction Shuo Jiang et.al. 2503.05231 null
2025-03-06 EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images Rohit Menon et.al. 2503.04441 null
2025-03-06 PointsToWood: A deep learning framework for complete canopy leaf-wood segmentation of TLS data across diverse European forests Harry J. F. Owen et.al. 2503.04420 null
2025-03-06 Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes Hui Zhang et.al. 2503.04235 null
2025-03-06 MASTER: Multimodal Segmentation with Text Prompts Fuyang Liu et.al. 2503.04199 null
2025-03-06 Towards Intelligent Transportation with Pedestrians and Vehicles In-the-Loop: A Surveillance Video-Assisted Federated Digital Twin Framework Xiaolong Li et.al. 2503.04170 null
2025-03-06 H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision Yunxiao Shi et.al. 2503.04059 null
2025-03-06 GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding Xihan Wang et.al. 2503.04034 null
2025-03-06 DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation Amin Karimi et.al. 2503.04006 null
2025-03-05 COARSE: Collaborative Pseudo-Labeling with Coarse Real Labels for Off-Road Semantic Segmentation Aurelio Noca et.al. 2503.03947 null
2025-03-05 SurgiSAM2: Fine-tuning a foundational model for surgical video anatomy segmentation and detection Devanish N. Kamtam et.al. 2503.03942 null
2025-03-05 Automatic Drywall Analysis for Progress Tracking and Quality Control in Construction Mariusz Trzeciakiewicz et.al. 2503.03422 null
2025-03-05 Golden Cudgel Network for Real-Time Semantic Segmentation Guoyu Yang et.al. 2503.03325 null
2025-03-05 Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters Julia Hindel et.al. 2503.03299 null
2025-03-05 Interactive Segmentation and Report Generation for CT Images Yannian Gu et.al. 2503.03294 null
2025-03-05 Car-STAGE: Automated framework for large-scale high-dimensional simulated time-series data generation based on user-defined criteria Asma A. Almutairi et.al. 2503.03100 null
2025-03-05 AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model Wenlun Zhang et.al. 2503.03088 null
2025-03-04 Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance Jiayi Zhao et.al. 2503.02581 link
2025-03-04 MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments Ege Özsoy et.al. 2503.02579 link
2025-03-04 TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping Xinying Hong et.al. 2503.02578 null
2025-03-04 Exploring Token-Level Augmentation in Vision Transformer for Semi-Supervised Semantic Segmentation Dengke Zhang et.al. 2503.02459 null
2025-03-04 Label-Efficient LiDAR Panoptic Segmentation Ahmet Selim Çanakçı et.al. 2503.02372 null
2025-03-03 SAGE: A Framework of Precise Retrieval for RAG Jintao Zhang et.al. 2503.01713 null
2025-03-04 UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface Hao Tang et.al. 2503.01342 link
2025-03-03 OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging Yijie Tang et.al. 2503.01309 null
2025-03-03 Convex Hull-based Algebraic Constraint for Visual Quadric SLAM Xiaolong Yu et.al. 2503.01254 link
2025-03-03 Identity documents recognition and detection using semantic segmentation with convolutional neural network Mykola Kozlenko et.al. 2503.01085 null
2025-02-28 The Common Objects Underwater (COU) Dataset for Robust Underwater Object Detection Rishi Mukherjee et.al. 2502.20651 null
2025-02-27 Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds Mohamed Abdelsamad et.al. 2502.20316 null
2025-02-27 OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels Meng Lou et.al. 2502.20087 link
2025-02-28 SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird’s-Eye-View Segmentation Zijie Zhou et.al. 2502.20077 link
2025-03-03 3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds Hengshuo Chu et.al. 2502.20041 null
2025-02-27 Learning Mask Invariant Mutual Information for Masked Image Modeling Tao Huang et.al. 2502.19718 null
2025-02-28 You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving Guangfeng Jiang et.al. 2502.19698 null
2025-02-26 Knowledge Distillation for Semantic Segmentation: A Label Space Unification Approach Anton Backhaus et.al. 2502.19177 null
2025-02-26 Enhanced Neuromorphic Semantic Segmentation Latency through Stream Event D. Hareb et.al. 2502.18982 null
2025-02-28 OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation Yunpeng Gao et.al. 2502.18041 null
2025-02-25 CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems Rui Liu et.al. 2502.17821 null
2025-02-24 CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation Vishal Thengane et.al. 2502.17429 link
2025-02-25 DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks Canyu Zhao et.al. 2502.17157 link
2025-02-24 SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations Wendi Liu et.al. 2502.17056 null
2025-02-25 VPNeXt – Rethinking Dense Decoding for Plain Vision Transformer Xikai Tang et.al. 2502.16654 null
2025-02-23 Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration Kim Jun-Seong et.al. 2502.16652 null
2025-02-23 OpenVox: Real-time Instance-level Open-vocabulary Probabilistic Voxel Representation Yinan Deng et.al. 2502.16528 null
2025-02-23 Deep learning approaches to surgical video segmentation and object detection: A Scoping Review Devanish N. Kamtam et.al. 2502.16459 null
2025-02-22 Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field Wenhao Hu et.al. 2502.16303 null
2025-02-22 Importance-Aware Source-Channel Coding for Multi-Modal Task-Oriented Semantic Communication Yi Ma et.al. 2502.16194 null
2025-02-22 FeatSharp: Your Vision Model Features, Sharper Mike Ranzinger et.al. 2502.16025 link
2025-02-21 Aligning Task- and Reconstruction-Oriented Communications for Edge Intelligence Yufeng Diao et.al. 2502.15472 null
2025-02-21 DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation Luzhou Ge et.al. 2502.15309 link
2025-02-21 Confidence-Weighted Boundary-Aware Learning for Semi-Supervised Semantic Segmentation Ebenezer Tarubinga et.al. 2502.15152 link
2025-02-20 RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird’s Eye View Segmentation Henrique Piñeiro Monteagudo et.al. 2502.14792 null
2025-02-20 Multi-dataset synergistic in supervised learning to pre-label structural components in point clouds from shell construction scenes Lukas Rauch et.al. 2502.14721 null
2025-02-20 Reliable Explainability of Deep Learning Spatial-Spectral Classifiers for Improved Semantic Segmentation in Autonomous Driving Jon Gutiérrez-Zaballa et.al. 2502.14416 null
2025-02-20 Bayesian SegNet for Semantic Segmentation with Improved Interpretation of Microstructural Evolution During Irradiation of Materials Marjolein Oostrom et.al. 2502.14184 null
2025-02-19 SegRet: An Efficient Design for Semantic Segmentation with Retentive Network Zhiyuan Li et.al. 2502.14014 link
2025-02-19 Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model Huiying Shi et.al. 2502.13990 null
2025-02-19 MGFI-Net: A Multi-Grained Feature Integration Network for Enhanced Medical Image Segmentation Yucheng Zeng et.al. 2502.13808 null
2025-02-19 CARE: Confidence-Aware Regression Estimation of building density fine-tuning EO Foundation Models Nikolaos Dionelis et.al. 2502.13734 null
2025-02-18 WeedsGalore: A Multispectral and Multitemporal UAV-based Dataset for Crop and Weed Segmentation in Agricultural Maize Fields Ekin Celikkan et.al. 2502.13103 link
2025-02-18 Enhancing Power Grid Inspections with Machine Learning Diogo Lavado et.al. 2502.13037 null
2025-02-18 DAMamba: Vision State Space Model with Dynamic Adaptive Scan Tanzhe Li et.al. 2502.12627 null
2025-02-17 From Open-Vocabulary to Vocabulary-Free Semantic Segmentation Klara Reichard et.al. 2502.11891 null
2025-02-16 Leveraging Multimodal-LLMs Assisted by Instance Segmentation for Intelligent Traffic Monitoring Murat Arda Onsu et.al. 2502.11304 null
2025-02-16 Text-promptable Propagation for Referring Medical Image Sequence Segmentation Runtian Yuan et.al. 2502.11093 null
2025-02-16 Detecting Cadastral Boundary from Satellite Images Using U-Net model Neda Rahimpour Anaraki et.al. 2502.11044 null
2025-02-15 NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing Shutong Zhang et.al. 2502.10720 null
2025-02-15 Deep Learning for Wound Tissue Segmentation: A Comprehensive Evaluation using A Novel Dataset Muhammad Ashad Kabir et.al. 2502.10652 null
2025-02-14 Artificial Intelligence to Assess Dental Findings from Panoramic Radiographs – A Multinational Study Yin-Chih Chelsea Wang et.al. 2502.10277 null
2025-02-14 FrGNet: A fourier-guided weakly-supervised framework for nuclear instance segmentation Peng Ling et.al. 2502.09874 null
2025-02-12 Towards Fine-grained Interactive Segmentation in Images and Videos Yuan Yao et.al. 2502.09660 null
2025-02-13 Instance Segmentation of Scene Sketches Using Natural Image Priors Mia Tang et.al. 2502.09608 null
2025-02-13 SQ-GAN: Semantic Image Communications Using Masked Vector Quantization Francesco Pezone et.al. 2502.09520 null
2025-02-13 FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation Bin Yang et.al. 2502.09274 null
2025-02-13 Memory-based Ensemble Learning in CMR Semantic Segmentation Yiwei Liu et.al. 2502.09269 link
2025-02-13 Latents of latents to delineate pixels: hybrid Matryoshka autoencoder-to-U-Net pairing for segmenting large medical images in GPU-poor and low-data regimes Tahir Syed et.al. 2502.08988 null
2025-02-12 HistoSmith: Single-Stage Histology Image-Label Generation via Conditional Latent Diffusion for Enhanced Cell Segmentation and Classification Valentina Vadori et.al. 2502.08754 link
2025-02-12 Generalized Class Discovery in Instance Segmentation Cuong Manh Hoang et.al. 2502.08149 null
2025-02-12 Knowledge Swapping via Learning and Unlearning Mingyu Xing et.al. 2502.08075 null
2025-02-11 Efficient Continuous Group Convolutions for Local SE(3) Equivariance in 3D Point Clouds Lisa Weijler et.al. 2502.07505 link
2025-02-11 A Survey on Mamba Architecture for Vision Applications Fady Ibrahim et.al. 2502.07161 null
2025-02-09 A Comprehensive Review of U-Net and Its Variants: Advances and Applications in Medical Image Segmentation Wang Jiangtao et.al. 2502.06895 null
2025-02-10 SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement Yuqi Lin et.al. 2502.06756 null
2025-02-10 A Large-scale AI-generated Image Inpainting Benchmark Paschalis Giakoumoglou et.al. 2502.06593 link
2025-02-11 Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation Emanuele Mule et.al. 2502.06288 null
2025-02-10 Unsupervised deep learning for semantic segmentation of multispectral LiDAR forest point clouds Lassi Ruoppa et.al. 2502.06227 null
2025-02-09 Traveling Waves Integrate Spatial Information Into Spectral Representations Mozes Jacobs et.al. 2502.06034 null
2025-02-11 VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer Xinyu Liu et.al. 2502.05979 null
2025-02-09 LegalSeg: Unlocking the Structure of Indian Legal Judgments Through Rhetorical Role Classification Shubham Kumar Nigam et.al. 2502.05836 null
2025-02-08 Convolutional Neural Network Segmentation for Satellite Imagery Data to Identify Landforms Using U-Net Architecture Mitul Goswami et.al. 2502.05476 null
2025-02-08 LMS-Net: A Learned Mumford-Shah Network For Few-Shot Medical Image Segmentation Shengdong Zhang et.al. 2502.05473 null
2025-02-08 A Novel Convolutional-Free Method for 3D Medical Imaging Segmentation Canxuan Gang et.al. 2502.05396 null
2025-02-07 IPSeg: Image Posterior Mitigates Semantic Drift in Class-Incremental Segmentation Xiao Yu et.al. 2502.04870 null
2025-02-07 AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers Runqing Jiang et.al. 2502.04628 null
2025-02-05 DILLEMA: Diffusion and Large Language Models for Multi-Modal Augmentation Luciano Baresi et.al. 2502.04378 link
2025-02-06 Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation Jiahao Lu et.al. 2502.04139 null
2025-02-06 Adaptive Margin Contrastive Learning for Ambiguity-aware 3D Semantic Segmentation Yang Chen et.al. 2502.04111 null
2025-02-06 LeAP: Consistent multi-domain 3D labeling using Foundation Models Simon Gebraad et.al. 2502.03901 null
2025-02-06 Optimized Unet with Attention Mechanism for Multi-Scale Semantic Segmentation Xuan Li et.al. 2502.03813 null
2025-02-05 Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics Indrashis Das et.al. 2502.03654 link
2025-02-05 ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models Ying Zhang et.al. 2502.03266 link
2025-02-05 Disentangling CLIP Features for Enhanced Localized Understanding Samyak Rawelekar et.al. 2502.02977 null
2025-02-05 From DeepSense to Open RAN: AI/ML Advancements in Dynamic Spectrum Sensing and Their Applications Ryan Barker et.al. 2502.02889 null
2025-02-04 Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications William O’Donnell et.al. 2502.02624 null
2025-02-04 COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation Xueqing Deng et.al. 2502.02589 null
2025-02-04 Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation Junha Lee et.al. 2502.02548 null
2025-02-04 Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and Classification Valentina Vadori et.al. 2502.02471 null
2025-02-04 Transfer Risk Map: Mitigating Pixel-level Negative Transfer in Medical Segmentation Shutong Duan et.al. 2502.02340 null
2025-02-04 UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation Tao Zhang et.al. 2502.02257 link
2025-02-04 Deep Ensemble approach for Enhancing Brain Tumor Segmentation in Resource-Limited Settings Jeremiah Fadugba et.al. 2502.02179 null
2025-02-04 Memory Efficient Transformer Adapter for Dense Predictions Dong Zhang et.al. 2502.01962 null
2025-02-03 Deep Unfolding Multi-modal Image Fusion Network via Attribution Analysis Haowen Bai et.al. 2502.01467 null
2025-02-03 Temporal-consistent CAMs for Weakly Supervised Video Segmentation in Waste Sorting Andrea Marelli et.al. 2502.01455 null
2025-02-03 ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies Costin F. Ciusdel et.al. 2502.01335 null
2025-01-31 Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches Ying Zang et.al. 2501.19329 null
2025-01-31 GO: The Great Outdoors Multimodal Dataset Peng Jiang et.al. 2501.19274 null
2025-01-31 Medical Semantic Segmentation with Diffusion Pretrain David Li et.al. 2501.19265 null
2025-01-31 ContextFormer: Redefining Efficiency in Semantic Segmentation Mian Muhammad Naeem Abid et.al. 2501.19255 null
2025-01-31 Integrating Semi-Supervised and Active Learning for Semantic Segmentation Wanli Ma et.al. 2501.19227 null
2025-01-31 Improving vision-language alignment with graph spiking hybrid Networks Siyu Zhang et.al. 2501.19069 null
2025-01-31 SynthmanticLiDAR: A Synthetic Dataset for Semantic Segmentation on LiDAR Imaging Javier Montalvo et.al. 2501.19035 null
2025-01-31 Project-and-Fuse: Improving RGB-D Semantic Segmentation via Graph Convolution Networks Xiaoyan Jiang et.al. 2501.18851 null
2025-01-30 INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation Jian Hu et.al. 2501.18753 null
2025-02-03 Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models Hao Dong et.al. 2501.18592 link
2025-01-30 Tuning Vision Foundation Model via Test-Time Prompt-Guided Training for VFSS Segmentations Chengxi Zeng et.al. 2501.18474 null
2025-01-30 Ground Awareness in Deep Learning for Large Outdoor Point Cloud Segmentation Kevin Qiu et.al. 2501.18246 null
2025-01-30 ContourFormer: Real-Time Contour-Based End-to-End Instance Segmentation Transformer Weiwei Yao et.al. 2501.17688 null
2025-01-29 Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation Lin Chen et.al. 2501.17642 null
2025-01-29 3DSES: an indoor Lidar point cloud segmentation dataset with real and pseudo-labels from a 3D model Maxime Mérizette et.al. 2501.17534 null
2025-01-29 Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models Muhammad Atta ur Rahman et.al. 2501.16769 null
2025-01-28 AdaSemSeg: An Adaptive Few-shot Semantic Segmentation of Seismic Facies Surojit Saha et.al. 2501.16760 null
2025-01-28 SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios Yinqi Chen et.al. 2501.16754 null
2025-01-27 Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation Philip Hughes et.al. 2501.16467 null
2025-01-27 DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation Han Sun et.al. 2501.16410 null
2025-01-27 The Linear Attention Resurrection in Vision Transformer Chuanyang Zheng et.al. 2501.16182 null
2025-01-27 D-PLS: Decoupled Semantic Segmentation for 4D-Panoptic-LiDAR-Segmentation Maik Steinhauser et.al. 2501.15870 null
2025-01-26 iFormer: Integrating ConvNet and Transformer for Mobile Application Chuanyang Zheng et.al. 2501.15369 link
2025-01-25 A Training-free Synthetic Data Selection Method for Semantic Segmentation Hao Tang et.al. 2501.15201 null
2025-01-24 3DLabelProp: Geometric-Driven Domain Generalization for LiDAR Semantic Segmentation in Autonomous Driving Jules Sanchez et.al. 2501.14605 link
2025-01-24 Effective Defect Detection Using Instance Segmentation for NDI Ashiqur Rahman et.al. 2501.14149 null
2025-01-23 ME-CPT: Multi-Task Enhanced Cross-Temporal Point Transformer for Urban 3D Change Detection Luqi Zhang et.al. 2501.14004 link
2025-01-23 IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models Jiayi Lei et.al. 2501.13920 null
2025-01-23 Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning Zuyao You et.al. 2501.13893 link
2025-01-23 Where Do You Go? Pedestrian Trajectory Prediction using Scene Features Mohammad Ali Rezaei et.al. 2501.13848 null
2025-01-23 Overcoming Support Dilution for Robust Few-shot Semantic Segmentation Wailing Tang et.al. 2501.13529 null
2025-01-22 Revisiting Data Augmentation for Ultrasound Images Adam Tupper et.al. 2501.13193 link
2025-01-22 A Novel Scene Coupling Semantic Mask Network for Remote Sensing Image Segmentation Xiaowen Ma et.al. 2501.13130 link
2025-01-22 Hybridization of Attention UNet with Repeated Atrous Spatial Pyramid Pooling for Improved Brain Tumour Segmentation Satyaki Roy Chowdhury et.al. 2501.13129 null
2025-01-22 Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks Alessio Quercia et.al. 2501.12824 null
2025-01-19 Comparative Analysis of Hand-Crafted and Machine-Driven Histopathological Features for Prostate Cancer Classification and Segmentation Feda Bolus Al Baqain et.al. 2501.12415 null
2025-01-21 Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems Stefano Carlo Lambertenghi et.al. 2501.12269 null
2025-01-21 A margin-based replacement for cross-entropy loss Michael W. Spratling et.al. 2501.12191 null
2025-01-21 Foreign object segmentation in chest x-rays through anatomy-guided shape insertion Constantin Seibold et.al. 2501.12022 null
2025-01-21 Data-driven Detection and Evaluation of Damages in Concrete Structures: Using Deep Learning and Computer Vision Saeid Ataei et.al. 2501.11836 null
2025-01-20 MedicoSAM: Towards foundation models for medical image segmentation Anwai Archit et.al. 2501.11734 link
2025-01-20 Automatic Labelling & Semantic Segmentation with 4D Radar Tensors Botao Sun et.al. 2501.11351 null
2025-01-20 Enhancing Uncertainty Estimation in Semantic Segmentation via Monte-Carlo Frequency Dropout Tal Zeevi et.al. 2501.11258 link
2025-01-20 Advancing Oyster Phenotype Segmentation with Multi-Network Ensemble and Multi-Scale mechanism Wenli Yang et.al. 2501.11203 null
2025-01-19 Rethinking Early-Fusion Strategies for Improved Multimodal Image Segmentation Zhengwen Shen et.al. 2501.10958 null
2025-01-18 OpenEarthMap-SAR: A Benchmark Synthetic Aperture Radar Dataset for Global High-Resolution Land Cover Mapping Junshi Xia et.al. 2501.10891 null
2025-01-17 Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks Michael Schwingshackl et.al. 2501.10080 link
2025-01-17 Robust Change Captioning in Remote Sensing: SECOND-CC Dataset and MModalCC Framework Ali Can Karaca et.al. 2501.10075 link
2025-01-17 One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression Keita Miwa et.al. 2501.10064 null
2025-01-17 LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks Wei Lu et.al. 2501.10040 link
2025-01-16 The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning Wonjun Jo et.al. 2501.09485 null
2025-01-16 Scaling up self-supervised learning for improved surgical foundation models Tim J. M. Jaspers et.al. 2501.09436 link
2025-01-16 SVIA: A Street View Image Anonymization Framework for Self-Driving Applications Dongyu Liu et.al. 2501.09393 link
2025-01-15 UNIR-Net: A Novel Approach for Restoring Underwater Images with Non-Uniform Illumination Using Synthetic Data Ezequiel Perez-Zarate et.al. 2501.09053 link
2025-01-15 Pseudolabel guided pixels contrast for domain adaptive semantic segmentation Jianzi Xiang et.al. 2501.09040 link
2025-01-14 FLAVARS: A Multimodal Foundational Language and Vision Alignment Model for Remote Sensing Isaac Corley et.al. 2501.08490 null
2025-01-14 Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers Efstathios Karypidis et.al. 2501.08303 link
2025-01-14 SmartEraser: Remove Anything from Images using Masked-Region Guidance Longtao Jiang et.al. 2501.08279 null
2025-01-14 A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation Steven Landgraf et.al. 2501.08188 null
2025-01-14 Threshold Attention Network for Semantic Segmentation of Remote Sensing Images Wei Long et.al. 2501.07984 null
2025-01-14 SkipClick: Combining Quick Responses and Low-Level Features for Interactive Segmentation in Winter Sports Contexts Robin Schön et.al. 2501.07960 null
2025-01-14 Balance Divergence for Knowledge Distillation Yafei Qi et.al. 2501.07804 null
2025-01-13 Kolmogorov-Arnold Network for Remote Sensing Image Semantic Segmentation Xianping Ma et.al. 2501.07390 link
2025-01-13 TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations Daniel Steininger et.al. 2501.07360 link
2025-01-13 Toward Realistic Camouflaged Object Detection: Benchmarks and Method Zhimeng Xin et.al. 2501.07297 link
2025-01-13 Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion Li Liang et.al. 2501.07260 link
2025-01-12 LarvSeg: Exploring Image Classification Data For Large Vocabulary Semantic Segmentation via Category-wise Attentive Classifier Haojun Yu et.al. 2501.06862 link
2025-01-12 SAM-DA: Decoder Adapter for Efficient Medical Domain Adaptation Javier Gamazo Tejero et.al. 2501.06836 null
2025-01-12 Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation Zhenyang Feng et.al. 2501.06749 null
2025-01-11 Parking Space Detection in the City of Granada Crespo-Orti Luis et.al. 2501.06651 link
2025-01-06 The 2nd Place Solution from the 3D Semantic Segmentation Track in the 2024 Waymo Open Dataset Challenge Qing Wu et.al. 2501.05472 null
2025-01-09 Domain-Incremental Semantic Segmentation for Autonomous Driving under Adverse Driving Conditions Shishir Muralidhara et.al. 2501.05246 null
2025-01-09 Advancing ALS Applications with Large-Scale Pre-training: Dataset Development and Downstream Assessment Haoyi Xiu et.al. 2501.05095 null
2025-01-08 Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation Ulindu De Silva et.al. 2501.04696 link
2025-01-08 Rapid Automated Mapping of Clouds on Titan With Instance Segmentation Zachary Yahn et.al. 2501.04459 link
2025-01-07 Superpixel Boundary Correction for Weakly-Supervised Semantic Segmentation on Histopathology Images Hongyi Wu et.al. 2501.03891 null
2025-01-07 AutoFish: Dataset and Benchmark for Fine-grained Analysis of Fish Stefan Hein Bengtson et.al. 2501.03767 null
2025-01-07 Image Segmentation: Inducing graph-based learning Aryan Singh et.al. 2501.03765 link
2025-01-06 4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation Jiexi Zhong et.al. 2501.02937 null
2025-01-08 GLoG-CSUnet: Enhancing Vision Transformers with Adaptable Radiomic Features for Medical Image Segmentation Niloufar Eghbali et.al. 2501.02788 link
2025-01-04 Unsupervised Class Generation to Expand Semantic Segmentation Datasets Javier Montalvo et.al. 2501.02264 null
2025-01-03 DreamMask: Boosting Open-vocabulary Panoptic Segmentation with Synthetic Data Yuanpeng Tu et.al. 2501.02048 null
2025-01-03 Semantic Segmentation for Sequential Historical Maps by Learning from Only One Map Yunshuang Yuan et.al. 2501.01845 null
2025-01-03 Dedicated Inference Engine and Binary-Weight Neural Networks for Lightweight Instance Segmentation Tse-Wei Chen et.al. 2501.01841 null
2025-01-03 IAM: Enhancing RGB-D Instance Segmentation with New Benchmarks Aecheon Jung et.al. 2501.01685 link
2025-01-03 Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation Rini Smita Thakur et.al. 2501.01640 null
2025-01-02 A Multi-task Supervised Compression Model for Split Computing Yoshitomo Matsubara et.al. 2501.01420 link
2025-01-02 Leverage Cross-Attention for End-to-End Open-Vocabulary Panoptic Reconstruction Xuan Yu et.al. 2501.01119 null
2025-01-02 Evidential Calibrated Uncertainty-Guided Interactive Segmentation paradigm for Ultrasound Images Jiang Shang et.al. 2501.01072 null
2025-01-02 Efficient Connectivity-Preserving Instance Segmentation with Supervoxel-Based Loss Function Anna Grim et.al. 2501.01022 link
2025-01-03 FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation Bingyu Li et.al. 2501.00877 link
2024-12-31 Exploiting Boundary Loss for the Hierarchical Panoptic Segmentation of Plants and Leaves Madeleine Darbyshire et.al. 2501.00527 link
2024-12-31 H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters Pedram Fekri et.al. 2501.00514 null
2024-12-31 A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images Dawen Yu et.al. 2501.00360 null
2024-12-31 PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM Runnan Chen et.al. 2501.00352 null
2024-12-31 OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies Runnan Chen et.al. 2501.00326 link
2024-12-30 HisynSeg: Weakly-Supervised Histopathological Image Segmentation via Image-Mixing Synthesis and Consistency Regularization Zijie Fang et.al. 2412.20924 link
2024-12-30 LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training Fardin Ayar et.al. 2412.20881 null
2024-12-29 Image Augmentation Agent for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2412.20439 null
2024-12-27 Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP Zhongxing Xu et.al. 2412.19650 null
2024-12-27 An Actionable Hierarchical Scene Representation Enhancing Autonomous Inspection Missions in Unknown Environments Vignesh Kottayam Viswanathan et.al. 2412.19582 null
2024-12-27 Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation Chengyang Ye et.al. 2412.19492 link
2024-12-26 Impact of color and mixing proportion of synthetic point clouds on semantic segmentation Shaojie Zhou et.al. 2412.19145 null
2024-12-25 Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model Yi-Chia Chen et.al. 2412.18917 link
2024-12-24 AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction Pufan Zou et.al. 2412.18255 null
2024-12-25 VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis Shicheng Yin et.al. 2412.18178 link
2024-12-24 UniPLV: Towards Label-Efficient Open-World 3D Scene Understanding by Regional Visual Language Supervision Yuru Wang et.al. 2412.18131 null
2024-12-24 LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding Hao Li et.al. 2412.17635 null
2024-12-25 AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation Jiaqi Ma et.al. 2412.17601 link
2024-12-24 Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation Jianjian Yin et.al. 2412.17331 link
2024-12-22 Multi-Scale Foreground-Background Confidence for Out-of-Distribution Segmentation Samuel Marschall et.al. 2412.16990 null
2024-12-22 Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection Yuhang Gan et.al. 2412.16918 null
2024-12-22 MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection Xu Zheng et.al. 2412.16876 null
2024-12-22 Adversarial Diffusion Model for Unsupervised Domain-Adaptive Semantic Segmentation Jongmin Yu et.al. 2412.16859 null
2024-12-21 A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection Shahid Ansari et.al. 2412.16755 null
2024-12-21 IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks Yaming Zhang et.al. 2412.16654 link
2024-12-21 V"Mean"ba: Visual State Space Models only need 1 hidden dimension Tien-Yu Chi et.al. 2412.16602 null
2024-12-20 SegCol Challenge: Semantic Segmentation for Tools and Fold Edges in Colonoscopy data Xinwei Ju et.al. 2412.16078 null
2024-12-20 Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer Xinyue Chen et.al. 2412.15835 link
2024-12-19 MultiverSeg: Scalable Interactive Segmentation of Biomedical Imaging Datasets with In-Context Guidance Hallee E. Wong et.al. 2412.15058 link
2024-12-19 GIRAFE: Glottal Imaging Dataset for Advanced Segmentation, Analysis, and Facilitative Playbacks Evaluation G. Andrade-Miranda et.al. 2412.15054 link
2024-12-19 PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation Shoumeng Qiu et.al. 2412.14821 link
2024-12-19 Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers Rui Ding et.al. 2412.14633 null
2024-12-19 Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation Zhenxin Lei et.al. 2412.14587 null
2024-12-18 Split Learning in Computer Vision for Semantic Segmentation Delay Minimization Nikos G. Evgenidis et.al. 2412.14272 null
2024-12-18 Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation Jianyu Zhang et.al. 2412.14145 null
2024-12-18 Prompt Categories Cluster for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2412.13823 null
2024-12-18 Federated Source-free Domain Adaptation for Classification: Weighted Cluster Aggregation for Unlabeled Data Junki Mori et.al. 2412.13757 null
2024-12-18 Optical aberrations in autonomous driving: Physics-informed parameterized temperature scaling for neural network uncertainty calibration Dominik Werner Wolf et.al. 2412.13695 null
2024-12-18 GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting Yuning Peng et.al. 2412.13654 link
2024-12-18 RelationField: Relate Anything in Radiance Fields Sebastian Koch et.al. 2412.13652 null
2024-12-17 S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging Yimu Pan et.al. 2412.13156 null
2024-12-17 Efficient Event-based Semantic Segmentation with Spike-driven Lightweight Transformer-based Networks Xiaxin Zhu et.al. 2412.12843 null
2024-12-17 ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation Shiqi Huang et.al. 2412.12798 link
2024-12-17 Open-World Panoptic Segmentation Matteo Sodano et.al. 2412.12740 null
2024-12-17 SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing Chen Chen et.al. 2412.12685 link
2024-12-17 Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation Dongyue Wu et.al. 2412.12672 link
2024-12-17 Adaptive Prototype Replay for Class Incremental Semantic Segmentation Guilin Zhu et.al. 2412.12669 null
2024-12-17 SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation Shuangping Huang et.al. 2412.12660 null
2024-12-16 Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation Hongwei Niu et.al. 2412.12050 link
2024-12-16 SAMIC: Segment Anything with In-Context Spatial Prompt Engineering Savinay Nagendra et.al. 2412.11998 null
2024-12-16 SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation Yunxiang Fu et.al. 2412.11890 link
2024-12-16 Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic Segmentation Svetlana Pavlitska et.al. 2412.11608 null
2024-12-16 PyPotteryLens: An Open-Source Deep Learning Framework for Automated Digitisation of Archaeological Pottery Documentation Lorenzo Cardarelli et.al. 2412.11574 null
2024-12-15 Volumetric Mapping with Panoptic Refinement via Kernel Density Estimation for Mobile Robots Khang Nguyen et.al. 2412.11241 link
2024-12-15 MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation Zhiwei Yang et.al. 2412.11076 link
2024-12-15 Classification Drives Geographic Bias in Street Scene Segmentation Rahul Nair et.al. 2412.11061 null
2024-12-15 SAM-IF: Leveraging SAM for Incremental Few-Shot Instance Segmentation Xudong Zhou et.al. 2412.11034 null
2024-12-14 RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone Mustafa Munir et.al. 2412.10995 link
2024-12-13 A Universal Degradation-based Bridging Technique for Domain Adaptive Semantic Segmentation Wangkai Li et.al. 2412.10339 null
2024-12-13 SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians Siyun Liang et.al. 2412.10231 null
2024-12-13 SPT: Sequence Prompt Transformer for Interactive Image Segmentation Senlin Cheng et.al. 2412.10224 null
2024-12-13 TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views Liang Zhao et.al. 2412.10051 null
2024-12-13 Object-Focused Data Selection for Dense Prediction Tasks Niclas Popp et.al. 2412.10032 null
2024-12-12 MaskTerial: A Foundation Model for Automated 2D Material Flake Detection Jan-Lucas Uslu et.al. 2412.09333 null
2024-12-12 Towards Open-Vocabulary Video Semantic Segmentation Xinhao Li et.al. 2412.09329 null
2024-12-12 FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation Yuntian Bo et.al. 2412.09319 link
2024-12-12 VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation Roberto Alcover-Couso et.al. 2412.09240 null
2024-12-12 STEAM: Squeeze and Transform Enhanced Attention Module Rishabh Sabharwal et.al. 2412.09023 null
2024-12-11 SegFace: Face Segmentation of Long-Tail Classes Kartik Narayan et.al. 2412.08647 link
2024-12-11 EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation Hongwei Niu et.al. 2412.08628 null
2024-12-12 Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning Fan Lu et.al. 2412.08614 link
2024-12-11 Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion Bingzhi Shen et.al. 2412.08315 null
2024-12-11 Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction Bohan Li et.al. 2412.08243 null
2024-12-11 THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots Zeshun Li et.al. 2412.08096 null
2024-12-11 Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation Zhigang Cen et.al. 2412.08034 null
2024-12-10 Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation Kurt H. W. Stolle et.al. 2412.07966 link
2024-12-11 CADSpotting: Robust Panoptic Symbol Spotting on Large-Scale CAD Drawings Jiazuo Mu et.al. 2412.07377 null
2024-12-09 SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception Yaniv Benny et.al. 2412.06968 null
2024-12-10 ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet Andrei-Robert Alexandrescu et.al. 2412.06742 null
2024-12-09 Active Learning with Context Sampling and One-vs-Rest Entropy for Semantic Segmentation Fei Wu et.al. 2412.06470 null
2024-12-09 Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework Jiuyi Xu et.al. 2412.06268 null
2024-12-09 GCUNet: A GNN-Based Contextual Learning Network for Tertiary Lymphoid Structure Semantic Segmentation in Whole Slide Image Lei Su et.al. 2412.06129 null
2024-12-08 Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation Zipeng Qi et.al. 2412.05969 null
2024-12-08 CSG: A Context-Semantic Guided Diffusion Approach in De Novo Musculoskeletal Ultrasound Image Generation Elay Dahan et.al. 2412.05833 null
2024-12-07 Integrating YOLO11 and Convolution Block Attention Module for Multi-Season Segmentation of Tree Trunks and Branches in Commercial Apple Orchards Ranjan Sapkota et.al. 2412.05728 null
2024-12-10 RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts Xu Liu et.al. 2412.05679 link
2024-12-06 FogROS2-FT: Fault Tolerant Cloud Robotics Kaiyuan Chen et.al. 2412.05408 null
2024-12-06 DreamColour: Controllable Video Colour Editing without Training Chaitat Utintu et.al. 2412.05180 null
2024-12-05 Assessing and Learning Alignment of Unimodal Vision and Language Models Le Zhang et.al. 2412.04616 link
2024-12-05 Towards Real-Time Open-Vocabulary Video Instance Segmentation Bin Yan et.al. 2412.04434 null
2024-12-05 A Hitchhiker’s Guide to Understanding Performances of Two-Class Classifiers Anaïs Halin et.al. 2412.04377 null
2024-12-05 Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts Chenyang Zhu et.al. 2412.04220 null
2024-12-05 Text Change Detection in Multilingual Documents Using Image Comparison Doyoung Park et.al. 2412.04137 null
2024-12-05 SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning Seokju Yun et.al. 2412.04077 null
2024-12-05 Quality Control in Open-Ended Crowdsourcing: A Survey Lei Chai et.al. 2412.03991 null
2024-12-05 Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation Hao Zhu et.al. 2412.03968 link
2024-12-05 LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model Yuan Xue et.al. 2412.03841 null
2024-12-04 Designing DNNs for a trade-off between robustness and processing performance in embedded devices Jon Gutiérrez-Zaballa et.al. 2412.03682 null
2024-12-04 FLAIR: VLM with Fine-grained Language-informed Image Representations Rui Xiao et.al. 2412.03561 link
2024-12-04 Benchmarking Pretrained Attention-based Models for Real-Time Recognition in Robot-Assisted Esophagectomy Ronald L. P. D. de Jong et.al. 2412.03401 null
2024-12-04 Task-driven Image Fusion with Learnable Fusion Loss Haowen Bai et.al. 2412.03240 null
2024-12-04 Biologically-inspired Semi-supervised Semantic Segmentation for Biomedical Imaging Luca Ciampi et.al. 2412.03192 null
2024-12-04 Is Foreground Prototype Sufficient? Few-Shot Medical Image Segmentation with Background-Fused Prototype Song Tang et.al. 2412.02983 null
2024-12-04 Progressive Vision-Language Prompt for Multi-Organ Multi-Class Cell Semantic Segmentation with Single Branch Qing Zhang et.al. 2412.02978 null
2024-12-04 Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution Jiahua Xiao et.al. 2412.02960 null
2024-12-04 Panoptic Diffusion Models: co-generation of images and segmentation maps Yinghan Long et.al. 2412.02929 null
2024-12-03 SJTU: Spatial judgments in multimodal models towards unified segmentation through coordinate detection Joongwon Chae et.al. 2412.02565 null
2024-12-03 Multi-scale and Multi-path Cascaded Convolutional Network for Semantic Segmentation of Colorectal Polyps Malik Abdul Manan et.al. 2412.02443 null
2024-12-03 AH-OCDA: Amplitude-based Curriculum Learning and Hopfield Segmentation Model for Open Compound Domain Adaptation Jaehyun Choi et.al. 2412.02280 null
2024-12-03 Vision Transformers for Weakly-Supervised Microorganism Enumeration Javier Ureña Santiago et.al. 2412.02250 link
2024-12-03 Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance Jing Zeng et.al. 2412.02249 null
2024-12-02 INSIGHT: Explainable Weakly-Supervised Medical Image Analysis Wenbo Zhang et.al. 2412.02012 null
2024-12-02 Global Average Feature Augmentation for Robust Semantic Segmentation with Transformers Alberto Gonzalo Rodriguez Salgado et.al. 2412.01941 null
2024-12-02 COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training Sanghwan Kim et.al. 2412.01814 null
2024-12-02 Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior Yi Yu et.al. 2412.01646 null
2024-12-02 Epipolar Attention Field Transformers for Bird’s Eye View Semantic Segmentation Christian Witte et.al. 2412.01595 null
2024-11-29 LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention Zewen Du et.al. 2411.19585 link
2024-11-29 Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding Wenbo Zhang et.al. 2411.19551 null
2024-11-29 Retrieval-guided Cross-view Image Synthesis Hongji Yang et.al. 2411.19510 null
2024-11-29 Adaptive Interactive Segmentation for Multimodal Medical Imaging via Selection Engine Zhi Li et.al. 2411.19447 link
2024-11-28 GMS-VINS: Multi-category Dynamic Objects Semantic Segmentation for Enhanced Visual-Inertial Odometry Using a Promptable Foundation Model Rui Zhou et.al. 2411.19289 null
2024-11-28 InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception Haijie Li et.al. 2411.19235 null
2024-11-28 MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers Jongseong Bae et.al. 2411.18995 null
2024-11-28 Textured As-Is BIM via GIS-informed Point Cloud Segmentation Mohamed S. H. Alabassy et.al. 2411.18898 null
2024-11-27 The Last Mile to Supervised Performance: Semi-Supervised Domain Adaptation for Semantic Segmentation Daniel Morales-Brotons et.al. 2411.18728 null
2024-11-27 HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior Li-Yuan Tsao et.al. 2411.18662 link
2024-11-26 Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation Sudarshan Rajagopalan et.al. 2411.17814 null
2024-11-26 Efficient Multi-modal Large Language Models via Visual Token Grouping Minbin Huang et.al. 2411.17773 null
2024-11-26 Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation Niharika Hegde et.al. 2411.17610 null
2024-11-26 A Bilayer Segmentation-Recombination Network for Accurate Segmentation of Overlapping C. elegans Mengqian Dinga et.al. 2411.17557 null
2024-11-26 Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving Jon Gutiérrez-Zaballa et.al. 2411.17543 null
2024-11-26 Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning Hoàng-Ân Lê et.al. 2411.17536 link
2024-11-26 TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba Xiaowen Ma et.al. 2411.17473 link
2024-11-26 Self-supervised Video Instance Segmentation Can Boost Geographic Entity Alignment in Historical Maps Xue Xia et.al. 2411.17425 null
2024-11-26 MRIFE: A Mask-Recovering and Interactive-Feature-Enhancing Semantic Segmentation Network For Relic Landslide Detection Juefei He et.al. 2411.17167 null
2024-11-26 Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation Chanyoung Kim et.al. 2411.17150 null
2024-11-26 ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction Chang Li et.al. 2411.17088 null
2024-11-26 SCASeg: Strip Cross-Attention for Efficient Semantic Segmentation Guoan Xu et.al. 2411.17061 null
2024-11-25 Deformable Mamba for Wide Field of View Segmentation Jie Hu et.al. 2411.16481 link
2024-11-25 A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models Manuel Schwonberg et.al. 2411.16407 null
2024-11-25 CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation Leon Sick et.al. 2411.16319 null
2024-11-25 An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models Wentao Qu et.al. 2411.16308 null
2024-11-25 A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads Rafael S. Toledo et.al. 2411.16295 null
2024-11-25 Weakly supervised image segmentation for defect-based grading of fresh produce Manuel Knott et.al. 2411.16219 null
2024-11-25 Learn from Foundation Model: Fruit Detection Model without Manual Annotation Yanan Wang et.al. 2411.16196 null
2024-11-25 Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking Phuc Nguyen et.al. 2411.16183 null
2024-11-25 Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training Man Yao et.al. 2411.16061 link
2024-11-24 Deep Learning for automated multi-scale functional field boundaries extraction using multi-date Sentinel-2 and PlanetScope imagery: Case Study of Netherlands and Pakistan Saba Zahid et.al. 2411.15923 null
2024-11-22 Effective SAM Combination for Open-Vocabulary Semantic Segmentation Minhyeok Lee et.al. 2411.14723 null
2024-11-21 Revisiting the Integration of Convolution and Attention for Vision Backbone Lei Zhu et.al. 2411.14429 link
2024-11-21 CompetitorFormer: Competitor Transformer for 3D Instance Segmentation Duanchu Wang et.al. 2411.14179 null
2024-11-21 CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation Lin Sun et.al. 2411.13836 link
2024-11-21 Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals Hussni Mohd Zakir et.al. 2411.13774 null
2024-11-20 FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting Ola Shorinwa et.al. 2411.13753 null
2024-11-20 DIS-Mine: Instance Segmentation for Disaster-Awareness in Poor-Light Condition in Underground Mines Mizanur Rahman Jewel et.al. 2411.13544 null
2024-11-21 Entropy Bootstrapping for Weakly Supervised Nuclei Detection James Willoughby et.al. 2411.13528 null
2024-11-20 BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation Umamaheswaran Raman Kumar et.al. 2411.13251 null
2024-11-20 XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation Ziyi Wang et.al. 2411.13243 link
2024-11-20 Automating Sonologists USG Commands with AI and Voice Interface Emad Mohamed et.al. 2411.13006 null
2024-11-19 Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline Junlong Cheng et.al. 2411.12814 link
2024-11-19 A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation Jiaqi Yang et.al. 2411.12615 link
2024-11-19 SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation Ron Keuth et.al. 2411.12602 link
2024-11-19 ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator Xiao Jiang et.al. 2411.12250 null
2024-11-18 ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements M. Arda Aydın et.al. 2411.12044 link
2024-11-18 Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation Hanieh Shojaei Miandashti et.al. 2411.11935 null
2024-11-18 MGNiceNet: Unified Monocular Geometric Scene Understanding Markus Schön et.al. 2411.11466 null
2024-11-18 MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models Harshita Sharma et.al. 2411.11362 null
2024-11-18 Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications Scarlett Raine et.al. 2411.11287 null
2024-11-18 Zero-Shot Automatic Annotation and Instance Segmentation using LLM-Generated Datasets: Eliminating Field Imaging and Manual Annotation for Deep Learning Model Development Ranjan Sapkota et.al. 2411.11285 null
2024-11-16 Attention-based U-Net Method for Autonomous Lane Detection Mohammadhamed Tangestanizadeh et.al. 2411.10902 null
2024-11-16 Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation Jaisidh Singh et.al. 2411.10845 null
2024-11-16 Diffusion-Based Semantic Segmentation of Lumbar Spine MRI Scans of Lower Back Pain Patients Maria Monzon et.al. 2411.10755 null
2024-11-15 Repurposing Stable Diffusion Attention for Training-Free Unsupervised Interactive Segmentation Markus Karmann et.al. 2411.10411 null
2024-11-15 Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images Ammar Qammaz et.al. 2411.10334 null
2024-11-15 RETR: Multi-View Radar Detection Transformer for Indoor Perception Ryoma Yataka et.al. 2411.10293 null
2024-11-15 CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation Dengke Zhang et.al. 2411.10086 link
2024-11-14 OneNet: A Channel-Wise 1D Convolutional U-Net Sanghyun Byun et.al. 2411.09838 link
2024-11-14 Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks Zengyi Yang et.al. 2411.09387 null
2024-11-14 Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation Yuheng Shi et.al. 2411.09219 link
2024-11-14 Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery Ashim Dahal et.al. 2411.09101 link
2024-11-13 CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation Xuming Zhang et.al. 2411.09023 null
2024-11-14 Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation Yangyang Li et.al. 2411.08756 null
2024-11-13 Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model Jun Xie et.al. 2411.08592 null
2024-11-13 UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation Chengyuan Zhang et.al. 2411.08569 null
2024-11-13 Detection and classification of radio sources with deep learning S. Riggi et.al. 2411.08519 null
2024-11-12 Isometric Transformations for Image Augmentation in Mueller Matrix Polarimetry Christopher Hahne et.al. 2411.07918 link
2024-11-12 INTRABENCH: Interactive Radiological Benchmark Constantin Ulrich et.al. 2411.07885 null
2024-11-12 Horticultural Temporal Fruit Monitoring via 3D Instance Segmentation and Re-Identification using Point Clouds Daniel Fusaro et.al. 2411.07799 link
2024-11-12 Semantic segmentation on multi-resolution optical and microwave data using deep learning Jai G Singla et.al. 2411.07581 null
2024-11-12 GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting Umangi Jain et.al. 2411.07555 null
2024-11-11 Data-Centric Learning Framework for Real-Time Detection of Aiming Beam in Fluorescence Lifetime Imaging Guided Surgery Mohamed Abul Hassan et.al. 2411.07395 null
2024-11-11 SAMPart3D: Segment Any Part in 3D Objects Yunhan Yang et.al. 2411.07184 link
2024-11-11 SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation Jiale Chen et.al. 2411.06991 null
2024-11-11 Fast and Efficient Transformer-based Method for Bird’s Eye View Instance Prediction Miguel Antunes-García et.al. 2411.06851 link
2024-11-11 Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision Yueyang Cang et.al. 2411.06727 null
2024-11-10 Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments Deegan Atha et.al. 2411.06632 null
2024-11-09 Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing Kaixuan Lu et.al. 2411.06091 null
2024-11-08 Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model Shuchang Lyu et.al. 2411.05878 link
2024-11-08 Agricultural Landscape Understanding At Country-Scale Radhika Dua et.al. 2411.05359 null
2024-11-08 Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation Sien Li et.al. 2411.05307 link
2024-11-07 In the Era of Prompt Learning with Vision-Language Models Ankit Jha et.al. 2411.04892 null
2024-11-08 ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset Olaf Wysocki et.al. 2411.04865 link
2024-11-06 Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts Zhitong Gao et.al. 2411.03829 link
2024-11-06 SA3DIP: Segment Any 3D Instance with Potential 3D Priors Xi Yang et.al. 2411.03819 link
2024-11-06 Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model Yansong Qu et.al. 2411.03672 null
2024-11-05 Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation Zhiling Yue et.al. 2411.03551 null
2024-11-05 SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture Andrew Heschl et.al. 2411.03505 link
2024-11-05 Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need Qishuai Wen et.al. 2411.03033 link
2024-11-05 Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation Xavier Timoneda et.al. 2411.02969 null
2024-11-05 Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery Mohammad Kakooei et.al. 2411.02935 null
2024-11-05 CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation Jinchao Ge et.al. 2411.02715 null
2024-11-04 Deep Learning on 3D Semantic Segmentation: A Detailed Review Thodoris Betsas et.al. 2411.02104 null
2024-11-04 Tree level change detection over Ahmedabad city using very high resolution satellite images and Deep Learning Jai G Singla et.al. 2411.02009 null
2024-11-04 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925 null
2024-11-04 DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability Bo Gao et.al. 2411.01819 null
2024-11-04 Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations Thanh Nguyen Canh et.al. 2411.01816 null
2024-11-05 MSTA3D: Multi-scale Twin-attention for 3D Instance Segmentation Duc Dang Trung Tran et.al. 2411.01781 null
2024-11-03 PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation Xinyu Xu et.al. 2411.01624 null
2024-11-01 Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions Lixiao Yang et.al. 2411.01039 null
2024-11-01 Event-guided Low-light Video Semantic Segmentation Zhen Yao et.al. 2411.00639 null
2024-11-01 Automated Classification of Cell Shapes: A Comparative Evaluation of Shape Descriptors Valentina Vadori et.al. 2411.00561 null
2024-10-31 Federated Black-Box Adaptation for Semantic Segmentation Jay N. Paranjape et.al. 2410.24181 null
2024-10-31 COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes Muhammad Ali et.al. 2410.24139 link
2024-10-31 Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model Hao Zhang et.al. 2410.23905 link
2024-10-30 S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving Maciej K. Wozniak et.al. 2410.23085 null
2024-10-31 CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation Ziyang Gong et.al. 2410.22629 link
2024-10-29 Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation Zhaochong An et.al. 2410.22489 link
2024-10-29 Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation Jintao Tong et.al. 2410.22135 null
2024-10-29 Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models Imad Ali Shah et.al. 2410.22101 null
2024-10-29 Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation Ruihao Xia et.al. 2410.21708 link
2024-10-28 Domain Adaptation with a Single Vision-Language Embedding Mohammad Fahes et.al. 2410.21361 null
2024-10-28 IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks Manjunath D et.al. 2410.20953 null
2024-10-27 A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models Camilo Espinosa-Curilem et.al. 2410.20595 link
2024-10-27 Unlocking Comics: The AI4VA Dataset for Visual Understanding Peter Grönquist et.al. 2410.20459 link
2024-10-27 Historical Test-time Prompt Tuning for Vision Foundation Models Jingyi Zhang et.al. 2410.20346 null
2024-10-25 OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery Philipe Dias et.al. 2410.19965 null
2024-10-25 IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation Kaixian Qu et.al. 2410.19697 null
2024-10-25 Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation Yao Wu et.al. 2410.19446 link
2024-10-25 Context-Based Visual-Language Place Recognition Soojin Woo et.al. 2410.19341 link
2024-10-24 Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks Alexander Jaus et.al. 2410.18684 null
2024-10-24 Unsupervised semantic segmentation of urban high-density multispectral point clouds Oona Oinonen et.al. 2410.18520 null
2024-10-26 CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator Stefanos Pasios et.al. 2410.18238 link
2024-10-23 Towards Safer Planetary Exploration: A Hybrid Architecture for Terrain Traversability Analysis in Mars Rovers Achille Chiuchiarelli et.al. 2410.17738 null
2024-10-23 YOLOv11: An Overview of the Key Architectural Enhancements Rahima Khanam et.al. 2410.17725 null
2024-10-23 PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting Yu Wang et.al. 2410.17505 null
2024-10-22 EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding Zhiyi Pan et.al. 2410.17207 null
2024-10-22 LIMIS: Towards Language-based Interactive Medical Image Segmentation Lena Heinemann et.al. 2410.16939 null
2024-10-22 DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model Zhixiong Nan et.al. 2410.16707 null
2024-10-22 SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments Jumman Hossain et.al. 2410.16686 null
2024-10-22 NucleiMix: Realistic Data Augmentation for Nuclei Instance Segmentation Jiamu Wang et.al. 2410.16671 null
2024-10-21 PlaneSAM: Multimodal Plane Instance Segmentation Using the Segment Anything Model Zhongchen Deng et.al. 2410.16545 null
2024-10-21 TIPS: Text-Image Pretraining with Spatial Awareness Kevis-Kokitsi Maninis et.al. 2410.16512 link
2024-10-21 GenGMM: Generalized Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation Nazanin Moradinasab et.al. 2410.16485 null
2024-10-21 Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation Ruting Chi et.al. 2410.16063 null
2024-10-21 LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training Thomas Kreutz et.al. 2410.15833 link
2024-10-21 TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight Hyun-Kurl Jang et.al. 2410.15674 link
2024-10-21 Deep Learning and Machine Learning – Object Detection and Semantic Segmentation: From Theory to Applications Jintao Ren et.al. 2410.15584 null
2024-10-20 Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation Fnu Neha et.al. 2410.15472 null
2024-10-20 Improving 3D Medical Image Segmentation at Boundary Regions using Local Self-attention and Global Volume Mixing Daniya Najiha Abdul Kareem et.al. 2410.15360 null
2024-10-18 On the Influence of Shape, Texture and Color for Learning Semantic Segmentation Annika Mütze et.al. 2410.14878 null
2024-10-18 Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+ Arpan Mahara et.al. 2410.14836 null
2024-10-18 Impact of imperfect annotations on CNN training and performance for instance segmentation and classification in digital pathology Laura Gálvez Jiménez et.al. 2410.14365 null
2024-10-17 ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding Guangda Ji et.al. 2410.13924 link
2024-10-17 Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks Clément Playout et.al. 2410.13822 link
2024-10-18 Enhanced Prompt-leveraged Weakly Supervised Cancer Segmentation based on Segment Anything Joonhyeon Song et.al. 2410.13621 link
2024-10-17 Day-Night Adaptation: An Innovative Source-free Adaptation Framework for Medical Image Segmentation Ziyang Chen et.al. 2410.13472 null
2024-10-17 SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation in Remote Sensing Bin Wang et.al. 2410.13471 link
2024-10-17 Railway LiDAR semantic segmentation based on intelligent semi-automated data annotation Florian Wulff et.al. 2410.13383 null
2024-10-17 LESS: Label-Efficient and Single-Stage Referring 3D Segmentation Xuexun Liu et.al. 2410.13294 link
2024-10-17 Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation Houze Liu et.al. 2410.13099 null
2024-10-16 Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation Wenbo Xu et.al. 2410.13094 null
2024-10-16 Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation Anthony Opipari et.al. 2410.12995 null
2024-10-16 Risk Assessment for Autonomous Landing in Urban Environments using Semantic Segmentation Jesús Alejandro Loera-Ponce et.al. 2410.12988 null
2024-10-16 VividMed: Vision Language Model with Versatile Visual Grounding for Medicine Lingxiao Luo et.al. 2410.12694 null
2024-10-16 Cascade learning in multi-task encoder-decoder networks for concurrent bone segmentation and glenohumeral joint assessment in shoulder CT scans Luca Marsilio et.al. 2410.12641 null
2024-10-16 Order-Aware Interactive Segmentation Bin Wang et.al. 2410.12214 null
2024-10-16 SAM-Guided Masked Token Prediction for 3D Scene Understanding Zhimin Chen et.al. 2410.12158 null
2024-10-15 WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation Chenghao Qian et.al. 2410.12075 link
2024-10-15 Development and Testing of a Wood Panels Bark Removal Equipment Based on Deep Learning Rijun Wang et.al. 2410.11913 null
2024-10-15 Fractal Calibration for long-tailed object detection Konstantinos Panagiotis Alexandridis et.al. 2410.11774 link
2024-10-15 RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation Anton Antonov et.al. 2410.11722 link
2024-10-15 InvSeg: Test-Time Prompt Inversion for Semantic Segmentation Jiayi Lin et.al. 2410.11473 null
2024-10-15 MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation Xianping Ma et.al. 2410.11160 link
2024-10-14 Locality Alignment Improves Vision-Language Models Ian Covert et.al. 2410.11087 null
2024-10-14 Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes Tim Broedermann et.al. 2410.10791 null
2024-10-14 UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation Lihe Yang et.al. 2410.10777 link
2024-10-14 PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion Runsong Zhu et.al. 2410.10659 link
2024-10-14 Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation Daniel Fusaro et.al. 2410.10510 link
2024-10-14 LKASeg: Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections Xuezhi Xiang et.al. 2410.10433 null
2024-10-14 V2M: Visual 2-Dimensional Mamba for Image Representation Learning Chengkun Wang et.al. 2410.10382 link
2024-10-14 GlobalMamba: Global Image Serialization for Vision Mamba Chengkun Wang et.al. 2410.10316 link
2024-10-13 UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation Ye Sun et.al. 2410.09909 null
2024-10-13 AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model Yuchen Li et.al. 2410.09714 null
2024-10-12 An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation Wei Liang et.al. 2410.09443 null
2024-10-11 Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation Varduhi Yeghiazaryan et.al. 2410.08946 null
2024-10-11 Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation Hanieh Shojaei et.al. 2410.08687 null
2024-10-11 DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention Nguyen Huu Bao Long et.al. 2410.08582 link
2024-10-10 Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? Samir Abou Haidar et.al. 2410.08365 null
2024-10-10 Interactive4D: Interactive 4D LiDAR Segmentation Ilya Fradlin et.al. 2410.08206 link
2024-10-10 Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation Zhiyi Pan et.al. 2410.08091 null
2024-10-10 Shift and matching queries for video semantic segmentation Tsubasa Mizuno et.al. 2410.07635 null
2024-10-10 3D Vision-Language Gaussian Splatting Qucheng Peng et.al. 2410.07577 null
2024-10-09 Segmenting objects with Bayesian fusion of active contour models and convnet priors Przemyslaw Polewski et.al. 2410.07421 null
2024-10-11 Bridge the Points: Graph-based Few-shot Segment Anything Semantically Anqi Zhang et.al. 2410.06964 null
2024-10-09 Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation Seungho Lee et.al. 2410.06893 null
2024-10-09 Rethinking the Evaluation of Visible and Infrared Image Fusion Dayan Guan et.al. 2410.06811 link
2024-10-10 QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model Fei Xie et.al. 2410.06806 link
2024-10-09 Transesophageal Echocardiography Generation using Anatomical Models Emmanuel Oladokun et.al. 2410.06781 null
2024-10-09 Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy Qinfeng Zhu et.al. 2410.06725 null
2024-10-09 Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments Meng Yu et.al. 2410.06626 null
2024-10-09 Towards Natural Image Matting in the Wild via Real-Scenario Prior Ruihao Xia et.al. 2410.06593 link
2024-10-08 Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions Mateus Karvat et.al. 2410.06380 link
2024-10-08 Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts Zhiwei Lin et.al. 2410.05963 null
2024-10-07 Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation Vince Zhu et.al. 2410.04689 null
2024-10-06 In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding Shenghao Li et.al. 2410.04529 null
2024-10-05 ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments Lorenzo Terenzi et.al. 2410.04250 null
2024-10-04 SpecSAR-Former: A Lightweight Transformer-based Network for Global LULC Mapping Using Integrated Sentinel-1 and Sentinel-2 Hao Yu et.al. 2410.03962 null
2024-10-04 Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features Benyuan Meng et.al. 2410.03558 link
2024-10-04 Semantic Segmentation Based Quality Control of Histopathology Whole Slide Images Abhijeet Patil et.al. 2410.03289 link
2024-10-04 HRVMamba: High-Resolution Visual State Space Model for Dense Prediction Hao Zhang et.al. 2410.03174 null
2024-10-03 HiFiSeg: High-Frequency Information Enhanced Polyp Segmentation with Global-Local Vision Transformer Jingjing Ren et.al. 2410.02528 null
2024-10-06 SynCo: Synthetic Hard Negatives in Contrastive Learning for Better Unsupervised Visual Representations Nikolaos Giakoumoglou et.al. 2410.02401 link
2024-10-04 Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation Muzhi Zhu et.al. 2410.02369 null
2024-10-03 ProtoSeg: A Prototype-Based Point Cloud Instance Segmentation Method Remco Royen et.al. 2410.02352 null
2024-10-03 RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds Remco Royen et.al. 2410.02323 null
2024-10-03 Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network Yangyang Qiu et.al. 2410.02224 null
2024-10-03 Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images Qingyuan Liu et.al. 2410.02207 null
2024-10-02 SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images Kaiyu Li et.al. 2410.01768 link
2024-10-02 One-Shot Robust Imitation Learning for Long-Horizon Visuomotor Tasks from Unsegmented Demonstrations Shaokang Wu et.al. 2410.01630 null
2024-10-02 Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation Zhaofeng Shi et.al. 2410.01341 null
2024-10-02 VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings Andrea Carrara et.al. 2410.01336 null
2024-10-01 RobustEMD: Domain Robust Matching for Cross-domain Few-shot Medical Image Segmentation Yazhou Zhu et.al. 2410.01110 null
2024-10-01 Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer Vlatko Spasev et.al. 2410.01092 null
2024-10-01 Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time Chiao-An Yang et.al. 2410.01083 link
2024-10-01 DeepAerialMapper: Deep Learning-based Semi-automatic HD Map Creation for Highly Automated Vehicles Robert Krajewski et.al. 2410.00769 null
2024-10-01 Optimizing Drug Delivery in Smart Pharmacies: A Novel Framework of Multi-Stage Grasping Network Combined with Adaptive Robotics Mechanism Rui Tang et.al. 2410.00753 null
2024-10-01 Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection Pengxi Zeng et.al. 2410.00582 null
2024-09-30 AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation Boyu Han et.al. 2409.20398 null
2024-09-30 Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation Tillmann Rheude et.al. 2409.20287 link
2024-09-30 Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model Fulong Ma et.al. 2409.20164 null
2024-09-30 Segmenting Wood Rot using Computer Vision Models Roland Kammerbauer et.al. 2409.20137 null
2024-09-30 Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels Heeseong Shin et.al. 2409.19846 null
2024-09-27 ProMerge: Prompt and Merge for Unsupervised Instance Segmentation Dylan Li et.al. 2409.18961 null
2024-09-27 Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation Raphael Hagmanns et.al. 2409.18788 null
2024-09-27 Learning from Pattern Completion: Self-supervised Controllable Generation Zhiqiang Chen et.al. 2409.18694 link
2024-09-27 Reducing Semantic Ambiguity In Domain Adaptive Semantic Segmentation Via Probabilistic Prototypical Pixel Contrast Xiaoke Hao et.al. 2409.18543 link
2024-10-01 Get It For Free: Radar Segmentation without Expert Labels and Its Application in Odometry and Localization Siru Li et.al. 2409.18434 null
2024-09-27 Search3D: Hierarchical Open-Vocabulary 3D Segmentation Ayca Takmaz et.al. 2409.18431 null
2024-09-26 Efficient Microscopic Image Instance Segmentation for Food Crystal Quality Control Xiaoyu Ji et.al. 2409.18291 null
2024-09-26 Amodal Instance Segmentation with Diffusion Shape Prior Estimation Minh Tran et.al. 2409.18256 null
2024-09-26 Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning Siyi Lu et.al. 2409.17659 null
2024-09-26 Global-Local Medical SAM Adaptor Based on Full Adaption Meng Wang et.al. 2409.17486 null
2024-09-25 VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection Liangyu Zhong et.al. 2409.17330 null
2024-09-25 2024 BRAVO Challenge Track 1 1st Place Report: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation Tommie Kerssies et.al. 2409.17208 link
2024-09-25 WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks Alberto Bacchin et.al. 2409.16999 link
2024-09-25 Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis Illia Tsiporenko et.al. 2409.16940 null
2024-09-24 A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation Avisha Kumar et.al. 2409.16441 null
2024-09-24 Instance Segmentation of Reinforced Concrete Bridges with Synthetic Point Clouds Asad Ur Rahman et.al. 2409.16381 null
2024-09-24 Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation Yong Xien Chng et.al. 2409.16278 null
2024-09-24 Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation Hannah Kerner et.al. 2409.16252 link
2024-09-24 Deep Learning for Precision Agriculture: Post-Spraying Evaluation and Deposition Estimation Harry Rogers et.al. 2409.16213 link
2024-09-24 Potential Field as Scene Affordance for Behavior Change-Based Visual Risk Object Identification Pang-Yuan Pao et.al. 2409.15846 null
2024-09-24 Layer-wise Model Merging for Unsupervised Domain Adaptation in Segmentation Tasks Roberto Alcover-Couso et.al. 2409.15813 null
2024-09-24 DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation Soojin Jang et.al. 2409.15801 null
2024-09-24 Autonomous Hiking Trail Navigation via Semantic Segmentation and Geometric Analysis Camndon Reed et.al. 2409.15671 null
2024-09-23 Adapting Segment Anything Model for Unseen Object Instance Segmentation Rui Cao et.al. 2409.15481 null
2024-09-23 ZeroSCD: Zero-Shot Street Scene Change Detection Shyam Sundar Kannan et.al. 2409.15255 null
2024-09-23 Diffusion-based RGB-D Semantic Segmentation with Deformable Attention Transformer Minh Bui et.al. 2409.15117 null
2024-09-18 Applications of Knowledge Distillation in Remote Sensing: A Survey Yassine Himeur et.al. 2409.12111 null
2024-09-18 Panoptic-Depth Forecasting Juana Valeria Hurtado et.al. 2409.12008 null
2024-09-18 Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments Gang Chen et.al. 2409.11975 null
2024-09-17 Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks Edgar Heinert et.al. 2409.11373 null
2024-09-17 MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping Amirreza Fateh et.al. 2409.11316 link
2024-09-17 Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark Clifford Broni-Bediako et.al. 2409.11227 link
2024-09-17 HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios Nick Theisen et.al. 2409.11205 link
2024-09-16 Are Deep Learning Models Robust to Partial Object Occlusion in Visual Recognition Tasks? Kaleb Kassaw et.al. 2409.10775 null
2024-09-16 Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning Amin Karimi Monsefi et.al. 2409.10362 null
2024-09-16 BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images Wentao Wang et.al. 2409.10269 null
2024-09-15 Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation Zhanteng Xie et.al. 2409.09899 null
2024-09-15 Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation Qilong Zhangli et.al. 2409.09893 null
2024-09-15 High Definition Map Mapping and Update: A General Overview and Future Directions Benny Wijaya et.al. 2409.09726 null
2024-09-14 One missing piece in Vision and Language: A Survey on Comics Understanding Emanuele Vivoli et.al. 2409.09502 link
2024-09-14 Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation Hugo Porta et.al. 2409.09497 null
2024-09-14 LACOSTE: Exploiting stereo and temporal contexts for surgical instrument segmentation Qiyuan Wang et.al. 2409.09360 null
2024-09-16 QueryCAD: Grounded Question Answering for CAD Models Claudius Kienle et.al. 2409.08704 null
2024-09-13 AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation Zechao Sun et.al. 2409.08516 null
2024-09-13 VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation Ezra MacDonald et.al. 2409.08461 link
2024-09-12 Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding Hongyu Li et.al. 2409.08251 null
2024-09-12 Bayesian Self-Training for Semi-Supervised 3D Segmentation Ozan Unal et.al. 2409.08102 null
2024-09-12 Depth Matters: Exploring Deep Interactions of RGB-D for Semantic Segmentation in Traffic Scenes Siyu Chen et.al. 2409.07995 null
2024-09-12 UNIT: Unsupervised Online Instance Segmentation through Time Corentin Sautier et.al. 2409.07887 null
2024-09-12 SURGIVID: Annotation-Efficient Surgical Video Object Discovery Çağhan Köksal et.al. 2409.07801 null
2024-09-12 Lagrange Duality and Compound Multi-Attention Transformer for Semi-Supervised Medical Image Segmentation Fuchen Zheng et.al. 2409.07793 link
2024-09-12 ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation Fuchen Zheng et.al. 2409.07779 link
2024-09-12 Open-Vocabulary Remote Sensing Image Semantic Segmentation Qinglong Cao et.al. 2409.07683 null
2024-09-11 Token Turing Machines are Efficient Vision Models Purvish Jajal et.al. 2409.07613 null
2024-09-11 AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution Wangduo Xie et.al. 2409.07171 null
2024-09-11 Insight Any Instance: Promptable Instance Segmentation for Remote Sensing Images Xuexue Li et.al. 2409.07022 null
2024-09-11 Brain-Inspired Stepwise Patch Merging for Vision Transformers Yonghao Yu et.al. 2409.06963 null
2024-09-10 Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds Mu Cai et.al. 2409.06827 link
2024-09-10 A Semantic Segmentation Approach on Sweet Orange Leaf Diseases Detection Utilizing YOLO Sabit Ahamed Preanto et.al. 2409.06671 null
2024-09-10 Towards Localizing Structural Elements: Merging Geometrical Detection with Semantic Verification in RGB-D Data Ali Tourani et.al. 2409.06625 null
2024-09-10 PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation Yin Hu et.al. 2409.06309 null
2024-09-10 EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation Nischal Khanal et.al. 2409.06183 link
2024-09-09 SVS-GAN: Leveraging GANs for Semantic Video Synthesis Khaled M. Seyam et.al. 2409.06074 null
2024-09-09 Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance Quang-Huy Che et.al. 2409.06002 null
2024-09-09 Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features Jacob Gildenblat et.al. 2409.05697 null
2024-09-09 ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions Furqan Ahmed Shaik et.al. 2409.05327 null
2024-09-08 RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network Zhiwei Lin et.al. 2409.04979 null
2024-09-06 Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation Björn Michele et.al. 2409.04409 link
2024-09-06 Advancing SEM Based Nano-Scale Defect Analysis in Semiconductor Manufacturing for Advanced IC Nodes Bappaditya Dey et.al. 2409.04310 null
2024-09-06 CISCA and CytoDArk0: a Cell Instance Segmentation and Classification method for histo(patho)logical image Analyses and a new, open, Nissl-stained dataset for brain cytoarchitecture studies Valentina Vadori et.al. 2409.04175 null
2024-09-05 Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution Marga Don et.al. 2409.03754 link
2024-09-05 MaskVal: Simple but Effective Uncertainty Quantification for 6D Pose Estimation Philipp Quentin et.al. 2409.03556 null
2024-09-05 LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones Moritz Nottebaum et.al. 2409.03460 link
2024-09-05 Automatic occlusion removal from 3D maps for maritime situational awareness Felix Sattler et.al. 2409.03451 null
2024-09-05 Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications Tong Bu et.al. 2409.03368 null
2024-09-05 MouseSIS: A Frames-and-Events Dataset for Space-Time Instance Segmentation of Mice Friedhelm Hamann et.al. 2409.03358 null
2024-09-05 UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking Md. Mahfuzur Rahman et.al. 2409.03245 null
2024-09-05 Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation Xixi Jiang et.al. 2409.03228 link
2024-09-05 iSeg: An Iterative Refinement-based Framework for Training-free Segmentation Lin Sun et.al. 2409.03209 link
2024-09-04 iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation Hayeon Jo et.al. 2409.02838 null
2024-09-04 CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation Minhee Cho et.al. 2409.02699 null
2024-09-04 Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation Tiantian Zhang et.al. 2409.02567 null
2024-09-04 SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction Sumin Son et.al. 2409.02513 null
2024-09-03 K-Origins: Better Colour Quantification for Neural Networks Lewis Mason et.al. 2409.02281 null
2024-09-03 AllWeatherNet: Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions Chenghao Qian et.al. 2409.02045 null
2024-09-03 MetaFood3D: Large 3D Food Object Dataset with Nutrition Values Yuhao Chen et.al. 2409.01966 null
2024-09-03 Segmenting Object Affordances: Reproducibility and Sensitivity to Scale Tommaso Apicella et.al. 2409.01814 link
2024-09-03 Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation Haodong Wang et.al. 2409.01662 null
2024-09-02 Semantic Segmentation from Image Labels by Reconstruction from Structured Decomposition Xuanrui Zeng et.al. 2409.01472 link
2024-08-30 Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes Li Zhang et.al. 2408.17421 link
2024-08-30 Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations Ahmed Hammam et.al. 2408.17311 null
2024-08-30 Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training Zizheng Huang et.al. 2408.17081 link
2024-08-30 Transient Fault Tolerant Semantic Segmentation for Autonomous Driving Leonardo Iurada et.al. 2408.16952 link
2024-08-29 Eigen-Cluster VIS: Improving Weakly-supervised Video Instance Segmentation by Leveraging Spatio-temporal Consistency Farnoosh Arefi et.al. 2408.16661 link
2024-08-29 SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection Rohit Venkata Sai Dulam et.al. 2408.16645 null
2024-08-29 A Simple and Generalist Approach for Panoptic Segmentation Nedyalko Prisadnikov et.al. 2408.16504 null
2024-08-29 MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation Linyan Yang et.al. 2408.16478 null
2024-08-29 Multi-source Domain Adaptation for Panoramic Semantic Segmentation Jing Jiang et.al. 2408.16469 null
2024-08-29 EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More Kanghao Chen et.al. 2408.16254 null
2024-08-28 InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentation Thibaut Goldsborough et.al. 2408.15954 link
2024-08-28 SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors Zhiqing Zhang et.al. 2408.15887 null
2024-08-28 DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries Yu Yang et.al. 2408.15813 null
2024-08-28 TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation Junbao Zhou et.al. 2408.15657 link
2024-08-27 Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images Silvia Seidlitz et.al. 2408.15373 link
2024-08-27 An Investigation on The Position Encoding in Vision-Based Dynamics Prediction Jiageng Zhu et.al. 2408.15201 null
2024-08-27 Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation Elona Shatri et.al. 2408.15002 null
2024-08-27 Applying ViT in Generalized Few-shot Semantic Segmentation Liyuan Geng et.al. 2408.14957 link
2024-08-27 Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack Naufal Suryanto et.al. 2408.14879 null
2024-08-27 MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation Yuanbing Zhu et.al. 2408.14776 null
2024-08-26 Physically Feasible Semantic Segmentation Shamik Basu et.al. 2408.14672 link
2024-08-26 A Survey of Camouflaged Object Detection and Beyond Fengyang Xiao et.al. 2408.14562 null
2024-08-26 Satellite Sunroof: High-res Digital Surface Models and Roof Segmentation for Global Solar Mapping Vishal Batchu et.al. 2408.14400 null
2024-08-25 OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation Muhammad Rameez ur Rahman et.al. 2408.13936 link
2024-08-25 Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation Yuwen Pan et.al. 2408.13838 null
2024-08-25 TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather Xiongwei Zhao et.al. 2408.13802 link
2024-08-25 ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation Xin Zhang et.al. 2408.13771 null
2024-08-25 Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation Zhaoyang Li et.al. 2408.13752 null
2024-08-24 ESA: Annotation-Efficient Active Learning for Semantic Segmentation Jinchao Ge et.al. 2408.13491 link
2024-08-23 Accuracy Improvement of Cell Image Segmentation Using Feedback Former Hinako Mitsuoka et.al. 2408.12974 null
2024-08-23 Image Segmentation in Foundation Model Era: A Survey Tianfei Zhou et.al. 2408.12957 null
2024-08-23 Symmetric masking strategy enhances the performance of Masked Image Modeling Khanh-Binh Nguyen et.al. 2408.12772 null
2024-08-22 Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets Wolfgang Boettcher et.al. 2408.12489 null
2024-08-22 The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation Tuyen Tran et.al. 2408.12447 null
2024-08-22 ISETHDR: A Physics-based Synthetic Radiance Dataset for High Dynamic Range Driving Scenes Zhenyi Liu et.al. 2408.12048 link
2024-08-21 EmbodiedSAM: Online Segment Any 3D Thing in Real Time Xiuwei Xu et.al. 2408.11811 null
2024-08-21 NuSegDG: Integration of Heterogeneous Space and Gaussian Kernel for Domain-Generalized Nuclei Segmentation Zhenye Lou et.al. 2408.11787 link
2024-08-21 Open-Ended 3D Point Cloud Instance Segmentation Phuc D. A. Nguyen et.al. 2408.11747 null
2024-08-21 UNetMamba: Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images Enze Zhu et.al. 2408.11545 null
2024-08-22 SAM-REF: Rethinking Image-Prompt Synergy for Refinement in Segment Anything Chongkai Yu et.al. 2408.11535 null
2024-08-21 Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation Chuandong Liu et.al. 2408.11280 null
2024-08-20 An Interpretable Deep Learning Approach for Morphological Script Type Analysis Malamatenia Vlachou-Efstathiou et.al. 2408.11150 null
2024-08-20 NeCo: Improving DINOv2’s spatial representations in 19 GPU hours with Patch Neighbor Consistency Valentinos Pariza et.al. 2408.11054 null
2024-08-20 CO2Wounds-V2: Extended Chronic Wounds Dataset From Leprosy Patients Karen Sanchez et.al. 2408.10827 null
2024-08-20 Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant Guofeng Mei et.al. 2408.10652 null
2024-08-20 Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended? Chen Liang et.al. 2408.10627 null
2024-08-20 Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation Jiawei Han et.al. 2408.10537 link
2024-08-21 LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS Xinyu Liu et.al. 2408.10469 null
2024-08-19 Leveraging Superfluous Information in Contrastive Representation Learning Xuechu Yu et.al. 2408.10292 null
2024-08-19 Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network Rasha Alshawi et.al. 2408.10181 null
2024-08-19 Dynamic Label Injection for Imbalanced Industrial Defect Segmentation Emanuele Caruso et.al. 2408.10031 link
2024-08-19 Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis Kira Maag et.al. 2408.10021 null
2024-08-19 DiscoNeRF: Class-Agnostic Object Field for 3D Object Discovery Corentin Dumery et.al. 2408.09928 null
2024-08-19 3D-Aware Instance Segmentation and Tracking in Egocentric Videos Yash Bhalgat et.al. 2408.09860 null
2024-08-19 Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving Jun Yan et.al. 2408.09839 link
2024-08-18 OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras Muhammad Rameez Ur Rahman et.al. 2408.09424 link
2024-08-18 VrdONE: One-stage Video Visual Relation Detection Xinjie Jiang et.al. 2408.09408 link
2024-08-18 Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration Hao Ai et.al. 2408.09336 null
2024-08-17 Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology Junchao Zhu et.al. 2408.09278 link
2024-08-16 Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation Tri Ton et.al. 2408.08591 null
2024-08-16 Tuning a SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation Linghao Zheng et.al. 2408.08576 null
2024-08-16 Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs Jinming Liu et.al. 2408.08575 null
2024-08-15 5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks Dongshuo Yin et.al. 2408.08345 link
2024-08-14 MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis Nimeesha Chan et.al. 2408.07773 link
2024-08-15 MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation Beoungwoo Kang et.al. 2408.07576 link
2024-08-15 MagicFace: Training-free Universal-Style Human Image Customized Synthesis Yibin Wang et.al. 2408.07433 null
2024-08-14 Segment Using Just One Example Pratik Vora et.al. 2408.07393 null
2024-08-14 Ensemble architecture in polyp segmentation Hao-Yun Hsu et.al. 2408.07262 link
2024-08-14 Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks Raghavendra Singh et.al. 2408.07243 null
2024-08-14 Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training Ethan Kou et.al. 2408.07239 null
2024-08-13 ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation Jingyun Wang et.al. 2408.06747 link
2024-08-10 Dilated Convolution with Learnable Spacings Ismail Khalfaoui-Hassani et.al. 2408.06383 null
2024-08-12 Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images Siladittya Manna et.al. 2408.06235 null
2024-08-12 A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting Felix Assion et.al. 2408.06071 null
2024-08-13 ClickAttention: Click Region Similarity Guided Interactive Segmentation Long Xu et.al. 2408.06021 null
2024-08-12 Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning Xinrong Hu et.al. 2408.05889 null
2024-08-11 Seg-CycleGAN: SAR-to-optical image translation guided by a downstream task Hannuo Zhang et.al. 2408.05777 null
2024-08-11 MacFormer: Semantic Segmentation with Fine Object Boundaries Guoan Xu et.al. 2408.05699 null
2024-08-13 Performance Evaluation of YOLOv8 Model Configurations, for Instance Segmentation of Strawberry Fruit Development Stages in an Open Field Environment Abdul-Razak Alhassan Gamani et.al. 2408.05661 null
2024-08-10 Multimodal generative semantic communication based on latent diffusion model Weiqi Fu et.al. 2408.05455 null
2024-08-09 PRISM Lite: A lightweight model for interactive 3D placenta segmentation in ultrasound Hao Li et.al. 2408.05372 link
2024-08-09 In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation Dahyun Kang et.al. 2408.04961 link
2024-08-09 ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation Mengcheng Lan et.al. 2408.04883 link
2024-08-09 Extracting Signal Electron Trajectories in the COMET Phase-I Cylindrical Drift Chamber Using Deep Learning Fumihiro Kaneko et.al. 2408.04795 null
2024-08-08 Embodied Uncertainty-Aware Object Segmentation Xiaolin Fang et.al. 2408.04760 null
2024-08-08 SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation Jieming Yu et.al. 2408.04593 null
2024-08-08 Robust Approximate Characterization of Single-Cell Heterogeneity in Microbial Growth Richard D. Paul et.al. 2408.04501 link
2024-08-08 SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios Sriram Mandalika et.al. 2408.04482 null
2024-08-08 What could go wrong? Discovering and describing failure modes in computer vision Gabriela Csurka et.al. 2408.04471 null
2024-08-07 Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation Yiqing Shen et.al. 2408.04098 null
2024-08-07 CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications Tianfang Zhang et.al. 2408.03703 link
2024-08-07 SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology Mingya Zhang et.al. 2408.03651 link
2024-08-06 Post-Mortem Human Iris Segmentation Analysis with Deep Learning Afzal Hossain et.al. 2408.03448 null
2024-08-06 Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression Jonas Schmitt et.al. 2408.03046 link
2024-08-06 Evaluation of Segment Anything Model 2: The Role of SAM2 in the Underwater Environment Shijie Lian et.al. 2408.02924 link
2024-08-05 Scribble-Based Interactive Segmentation of Medical Hyperspectral Images Zhonghao Wang et.al. 2408.02708 null
2024-08-05 Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation Sai Prasanna et.al. 2408.02297 null
2024-08-05 Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs Jeongkee Lim et.al. 2408.02261 null
2024-08-05 Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders Muhammad Abdullah Jamal et.al. 2408.02245 null
2024-08-04 Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation Ye Du et.al. 2408.02039 null
2024-08-03 NuLite – Lightweight and Fast Model for Nuclei Instance Segmentation and Classification Cristian Tommasino et.al. 2408.01797 null
2024-08-03 Bayesian Active Learning for Semantic Segmentation Sima Didari et.al. 2408.01694 null
2024-08-03 A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection Omkar Oak et.al. 2408.01692 null
2024-08-03 Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation Balázs Opra et.al. 2408.01640 null
2024-08-02 Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans Lukas Kratochvila et.al. 2408.01526 null
2024-08-02 Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation Yuanzhi Su et.al. 2408.01356 null
2024-08-02 StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation Bingyu Li et.al. 2408.01343 null
2024-08-02 Amodal Segmentation for Laparoscopic Surgery Video Instruments Ruohua Shi et.al. 2408.01067 null
2024-08-02 Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach Yabin Zhu et.al. 2408.00969 null
2024-08-01 Medical SAM 2: Segment medical images as video via Segment Anything Model 2 Jiayuan Zhu et.al. 2408.00874 link
2024-08-01 Leaf Angle Estimation using Mask R-CNN and LETR Vision Transformer Venkat Margapuri et.al. 2408.00749 null
2024-08-01 Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation Siyu Jiao et.al. 2408.00744 link
2024-08-01 Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function Matias Oscar Volman Stern et.al. 2408.00707 null
2024-08-01 AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation Asbjørn Munk et.al. 2408.00640 null
2024-08-01 SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation Shengbo Tan et.al. 2408.00496 null
2024-08-01 A Simple Background Augmentation Method for Object Detection with Diffusion Model Yuhang Li et.al. 2408.00350 null
2024-07-31 Con4m: Context-aware Consistency Learning Framework for Segmented Time Series Classification Junru Chen et.al. 2408.00041 null
2024-07-31 Open-Vocabulary Audio-Visual Semantic Segmentation Ruohao Guo et.al. 2407.21721 link
2024-07-31 MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment Anurag Das et.al. 2407.21654 null
2024-07-31 MaskUno: Switch-Split Block For Enhancing Instance Segmentation Jawad Haidar et.al. 2407.21498 null
2024-07-31 Small Object Few-shot Segmentation for Vision-based Industrial Inspection Zilong Zhang et.al. 2407.21351 null
2024-07-31 On-the-fly Point Feature Representation for Point Clouds Analysis Jiangyi Wang et.al. 2407.21335 null
2024-07-31 Fine-grained Metrics for Point Cloud Semantic Segmentation Zhuheng Lu et.al. 2407.21289 null
2024-07-30 PLANesT-3D: A new annotated dataset for segmentation of 3D plant point clouds Kerem Mertoğlu et.al. 2407.21150 null
2024-07-30 Learning Ordinality in Semantic Segmentation Rafael Cristino et.al. 2407.20959 null
2024-07-29 Improving 2D Feature Representations by 3D-Aware Fine-Tuning Yuanwen Yue et.al. 2407.20229 link
2024-07-29 Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection With Sky-Annotated Dataset Yimian Dai et.al. 2407.20078 link
2024-07-29 Language-driven Grasp Detection with Mask-guided Attention Tuan Van Vo et.al. 2407.19877 null
2024-07-29 Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets Muhammad Abdullah Jamal et.al. 2407.19714 null
2024-07-29 ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement Ezequiel Perez-Zarate et.al. 2407.19708 link
2024-07-28 ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding Zhen Chen et.al. 2407.19435 link
2024-07-28 Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets Tianxiao Zhang et.al. 2407.19394 link
2024-07-27 Ensembling convolutional neural networks for human skin segmentation Patryk Kuban et.al. 2407.19310 null
2024-07-27 Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network Gang Pan et.al. 2407.19271 null
2024-07-26 Sparse Refinement for Efficient High-Resolution Semantic Segmentation Zhijian Liu et.al. 2407.19014 null
2024-07-26 A Survey on Cell Nuclei Instance Segmentation and Classification: Leveraging Context and Attention João D. Nunes et.al. 2407.18673 null
2024-07-26 Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation Jingjun Yi et.al. 2407.18568 null
2024-07-25 Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception Julia Hindel et.al. 2407.18145 null
2024-07-25 LKCell: Efficient Cell Nuclei Instance Segmentation with Large Convolution Kernels Ziwei Cui et.al. 2407.18054 link
2024-07-25 TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework Guanfeng Tang et.al. 2407.18038 null
2024-07-25 Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions Jan Nikolas Morshuis et.al. 2407.18026 link
2024-07-26 Quality Assured: Rethinking Annotation Strategies in Imaging AI Tim Rädsch et.al. 2407.17596 null
2024-07-24 Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation Hyunwoo Yu et.al. 2407.17261 link
2024-07-24 Trans2Unet: Neural fusion for Nuclei Semantic Segmentation Dinh-Phu Tran et.al. 2407.17181 null
2024-07-24 PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning Mu Chen et.al. 2407.17101 null
2024-07-25 Enhancing Environmental Monitoring through Multispectral Imaging: The WasteMS Dataset for Semantic Segmentation of Lakeside Waste Qinfeng Zhu et.al. 2407.17028 link
2024-07-24 Progressive Query Refinement Framework for Bird’s-Eye-View Semantic Segmentation from Surrounding Images Dooseop Choi et.al. 2407.17003 link
2024-07-24 McGAN: Generating Manufacturable Designs by Embedding Manufacturing Rules into Conditional Generative Adversarial Network Zhichao Wang et.al. 2407.16943 null
2024-07-23 SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation Pengfei Chen et.al. 2407.16682 null
2024-07-23 Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving Anam Manzoor et.al. 2407.16647 null
2024-07-23 Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging Daniela L. Ramos et.al. 2407.16608 null
2024-07-23 Strike a Balance in Continual Panoptic Segmentation Jinpeng Chen et.al. 2407.16354 link
2024-07-23 Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision Aditya Krishnan et.al. 2407.16102 null
2024-07-22 Enhancing Cell Instance Segmentation in Scanning Electron Microscopy Images via a Deep Contour Closing Operator Florian Robert et.al. 2407.15817 null
2024-07-22 MILAN: Milli-Annotations for Lidar Semantic Segmentation Nermin Samet et.al. 2407.15797 null
2024-07-22 Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond Silvio Galesso et.al. 2407.15739 link
2024-07-22 MSSPlace: Multi-Sensor Place Recognition with Visual and Text Semantics Alexander Melekhin et.al. 2407.15663 link
2024-07-22 Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling Bo Yuan et.al. 2407.15429 link
2024-07-22 Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data Junha Song et.al. 2407.15383 null
2024-07-21 Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation Xiaoyang Wu et.al. 2407.15282 null
2024-07-20 Downstream-Pretext Domain Knowledge Traceback for Active Learning Beichen Zhang et.al. 2407.14720 null
2024-07-19 Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model Kun Zhao et.al. 2407.14326 null
2024-07-19 Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation Zhengyuan Xie et.al. 2407.14142 link
2024-07-19 MC-PanDA: Mask Confidence for Panoptic Domain Adaptation Ivan Martinović et.al. 2407.14110 link
2024-07-19 GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation Florian Chabot et.al. 2407.14108 null
2024-07-19 Scale Disparity of Instances in Interactive Point Cloud Segmentation Chenrui Han et.al. 2407.14009 null
2024-07-18 Many Perception Tasks are Highly Redundant Functions of their Input Data Rahul Ramesh et.al. 2407.13841 null
2024-07-18 GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model Abdelrahman Shaker et.al. 2407.13772 link
2024-07-18 SegPoint: Segment Any Point Cloud via Large Language Model Shuting He et.al. 2407.13761 null
2024-07-18 MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis Ziming Zhong et.al. 2407.13675 link
2024-07-18 Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models Xiaoyu Zhu et.al. 2407.13642 null
2024-07-18 FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures Hao Lu et.al. 2407.13500 null
2024-07-18 FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions Sohyun Lee et.al. 2407.13437 null
2024-07-18 Lightweight Uncertainty Quantification with Simplex Semantic Segmentation for Terrain Traversability Judith Dijk et.al. 2407.13392 null
2024-07-18 Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation Chang Liu et.al. 2407.13363 null
2024-07-18 Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation Shoumeng Qiu et.al. 2407.13254 link
2024-07-18 OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird’s-eye-view Vehicle Semantic Segmentation Jian Sun et.al. 2407.13137 null
2024-07-17 FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification Yiqing Shen et.al. 2407.12658 null
2024-07-17 Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation Prantik Howlader et.al. 2407.12630 link
2024-07-17 Instance-wise Uncertainty for Class Imbalance in Semantic Segmentation Luís Almeida et.al. 2407.12609 null
2024-07-17 Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks Antoni Kowalczuk et.al. 2407.12588 link
2024-07-17 Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation Ruijie Xu et.al. 2407.12489 link
2024-07-17 Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation Hyun Seok Seong et.al. 2407.12463 null
2024-07-17 Close the Sim2real Gap via Physically-based Structured Light Synthetic Data Simulation Kaixin Bai et.al. 2407.12449 null
2024-07-17 ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference Mengcheng Lan et.al. 2407.12442 null
2024-07-17 Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model Tao Wang et.al. 2407.12319 null
2024-07-16 FoodMem: Near Real-time and Precise Food Video Segmentation Ahmad AlMughrabi et.al. 2407.12121 null
2024-07-16 Mitigating Background Shift in Class-Incremental Semantic Segmentation Gilhan Park et.al. 2407.11859 link
2024-07-16 Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation Juncheng Ma et.al. 2407.11820 null
2024-07-16 Click-Gaussian: Interactive Segmentation to Any 3D Gaussians Seokhun Choi et.al. 2407.11793 null
2024-07-16 XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach Truong Thanh Hung Nguyen et.al. 2407.11771 null
2024-07-16 OAM-TCD: A globally diverse dataset of high-resolution tree cover maps Josh Veitch-Michaelis et.al. 2407.11743 link
2024-07-16 SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds Yanbo Wang et.al. 2407.11569 link
2024-07-16 SGIFormer: Semantic-guided and Geometric-enhanced Interleaving Transformer for 3D Instance Segmentation Lei Yao et.al. 2407.11564 link
2024-07-16 Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes Zhi Cai et.al. 2407.11464 link
2024-07-16 Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations Yunya Gao et.al. 2407.11381 link
2024-07-16 Generative AI Driven Task-Oriented Adaptive Semantic Communications Yuzhou Fu et.al. 2407.11354 null
2024-07-15 No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations Walter Simoncini et.al. 2407.10964 link
2024-07-15 APC: Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2407.10649 null
2024-07-15 Automated Label Unification for Multi-Dataset Semantic Segmentation with GNNs Rong Ma et.al. 2407.10534 null
2024-07-14 Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data Tuo Feng et.al. 2407.10200 link
2024-07-14 RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation Li Li et.al. 2407.10159 link
2024-07-14 Part2Object: Hierarchical Unsupervised 3D Instance Segmentation Cheng Shi et.al. 2407.10084 link
2024-07-14 HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation Chengjie Jiang et.al. 2407.10047 null
2024-07-13 Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation Anqi Zhang et.al. 2407.09838 null
2024-07-13 Enhancing Semantic Segmentation with Adaptive Focal Loss: A Novel Approach Md Rakibul Islam et.al. 2407.09828 null
2024-07-13 3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance Xiaoxu Xu et.al. 2407.09826 link
2024-07-12 FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background Muhammad Ali et.al. 2407.09379 link
2024-07-12 WSESeg: Introducing a Dataset for the Segmentation of Winter Sports Equipment with a Baseline for Interactive Segmentation Robin Schön et.al. 2407.09288 null
2024-07-12 A Fair Ranking and New Model for Panoptic Scene Graph Generation Julian Lorenz et.al. 2407.09216 link
2024-07-12 Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy Julian Wyatt et.al. 2407.09192 null
2024-07-12 From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation Hanrong Shi et.al. 2407.09191 null
2024-07-12 Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off Levente Halmosi et.al. 2407.09150 link
2024-07-12 Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation Wei Cong et.al. 2407.09047 null
2024-07-12 Textual Query-Driven Mask Transformer for Domain Generalized Segmentation Byeonghyun Pak et.al. 2407.09033 link
2024-07-12 Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation Zihao Li et.al. 2407.08994 null
2024-07-11 SLoRD: Structural Low-Rank Descriptors for Shape Consistency in Vertebrae Segmentation Xin You et.al. 2407.08555 null
2024-07-11 Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation Tong Shao et.al. 2407.08268 link
2024-07-11 Enrich the content of the image Using Context-Aware Copy Paste Qiushi Guo et.al. 2407.08151 null
2024-07-10 MambaVision: A Hybrid Mamba-Transformer Vision Backbone Ali Hatamizadeh et.al. 2407.08083 link
2024-07-10 Interactive Segmentation Model for Placenta Segmentation from 3D Ultrasound images Hao Li et.al. 2407.08020 link
2024-07-10 Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift Elliot Vincent et.al. 2407.07616 link
2024-07-10 H-FCBFormer Hierarchical Fully Convolutional Branch Transformer for Occlusal Contact Segmentation with Articulating Paper Ryan Banks et.al. 2407.07604 link
2024-07-11 Trainable Highly-expressive Activation Functions Irit Chelly et.al. 2407.07564 null
2024-07-10 Panoptic Segmentation of Galactic Structures in LSB Images Felix Richards et.al. 2407.07494 null
2024-07-10 Deformable-Heatmap-Segmentation for Automobile Visual Perception Hongyu Jin et.al. 2407.07493 null
2024-07-10 Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining Tianfang Sun et.al. 2407.07465 null
2024-07-11 HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation Guoan Xu et.al. 2407.07441 null
2024-07-10 Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation Hao Fang et.al. 2407.07427 link
2024-07-09 ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation Yuyuan Liu et.al. 2407.07171 link
2024-07-09 Improved Block Merging for 3D Point Cloud Instance Segmentation Leon Denis et.al. 2407.06991 null
2024-07-09 Joint prototype and coefficient prediction for 3D instance segmentation Remco Royen et.al. 2407.06958 null
2024-07-08 Training-free CryoET Tomogram Segmentation Yizhou Zhao et.al. 2407.06833 link
2024-07-09 CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM Aditya Murali et.al. 2407.06795 null
2024-07-09 LuSNAR: A Lunar Segmentation, Navigation and Reconstruction Dataset based on Multi-sensor for Autonomous Exploration Jiayi Liu et.al. 2407.06512 link
2024-07-08 Leveraging image captions for selective whole slide image annotation Jingna Qiu et.al. 2407.06363 null
2024-07-08 Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots Siva Krishna Ravipati et.al. 2407.06077 null
2024-07-08 Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts Puzuo Wang et.al. 2407.06043 null
2024-07-08 RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation Sarah Elmahdy et.al. 2407.06016 link
2024-07-07 Semantic Segmentation for Real-World and Synthetic Vehicle’s Forward-Facing Camera Images Tuan T. Nguyen et.al. 2407.05452 null
2024-07-07 Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness Idris Hamoud et.al. 2407.05448 null
2024-07-06 A Study of Test-time Contrastive Concepts for Open-world, Open-vocabulary Semantic Segmentation Monika Wysoczańska et.al. 2407.05061 null
2024-07-06 BlessemFlood21: Advancing Flood Analysis with a High-Resolution Georeferenced Dataset for Humanitarian Aid Support Vladyslav Polushko et.al. 2407.05007 null
2024-07-05 Explainable Metric Learning for Deflating Data Bias Emma Andrews et.al. 2407.04866 null
2024-07-05 Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge Yuanze Lin et.al. 2407.04681 null
2024-07-05 LMSeg: A deep graph message-passing network for efficient and accurate semantic segmentation of large-scale 3D landscape meshes Zexian Huang et.al. 2407.04326 null
2024-07-04 Slice-100K: A Multimodal Dataset for Extrusion-based 3D Printing Anushrut Jignasu et.al. 2407.04180 link
2024-07-04 Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier Prantik Howlader et.al. 2407.04036 link
2024-07-04 Performance of Medical Image Fusion in High-level Analysis Tasks: A Mutual Enhancement Framework for Unaligned PAT and MRI Image Fusion Yutian Zhong et.al. 2407.03992 link
2024-07-04 Relative Difficulty Distillation for Semantic Segmentation Dong Liang et.al. 2407.03719 null
2024-07-04 POSTURE: Pose Guided Unsupervised Domain Adaptation for Human Body Part Segmentation Arindam Dutta et.al. 2407.03549 null
2024-07-03 A Unified Framework for 3D Scene Understanding Wei Xu et.al. 2407.03263 null
2024-07-03 ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation Chang Li et.al. 2407.03033 null
2024-07-03 Context-Aware Video Instance Segmentation Seunghun Lee et.al. 2407.03010 link
2024-07-03 ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation Yipin Guo et.al. 2407.02881 null
2024-07-03 Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation Tao Chen et.al. 2407.02768 null
2024-07-03 ADFQ-ViT: Activation-Distribution-Friendly Post-Training Quantization for Vision Transformers Yanfeng Jiang et.al. 2407.02763 null
2024-07-02 Open Panoramic Segmentation Junwei Zheng et.al. 2407.02685 link
2024-07-02 Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction Tinghuai Wang et.al. 2407.02639 null
2024-07-02 Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather Junsung Park et.al. 2407.02286 link
2024-07-02 MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders Baijiong Lin et.al. 2407.02228 link
2024-07-02 Occlusion-Aware Seamless Segmentation Yihong Cao et.al. 2407.02182 link
2024-07-02 VRBiom: A New Periocular Dataset for Biometric Applications of HMD Ketan Kotwal et.al. 2407.02150 null
2024-07-02 HRSAM: Efficiently Segment Anything in High-Resolution Images You Huang et.al. 2407.02109 null
2024-07-02 Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts Pasquale De Marinis et.al. 2407.02075 link
2024-07-02 LiDAR-based HD Map Localization using Semantic Generalized ICP with Road Marking Detection Yansong Gong et.al. 2407.02061 null
2024-07-02 Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Chengchao Shen et.al. 2407.02014 link
2024-07-01 Label-free Neural Semantic Image Synthesis Jiayi Wang et.al. 2407.01790 null
2024-07-01 PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction Xuan Yu et.al. 2407.01349 null
2024-06-28 EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model Yuxuan Zhang et.al. 2406.20076 link
2024-07-01 Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding Yifan Tang et.al. 2406.19791 null
2024-06-28 PM-VIS+: High-Performance Video Instance Segmentation without Video Annotation Zhangjing Yang et.al. 2406.19665 link
2024-06-28 Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation Junsung Park et.al. 2406.19638 link
2024-06-28 PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation Deyi Ji et.al. 2406.19632 null
2024-06-27 Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model Haobo Yuan et.al. 2406.19369 null
2024-06-27 ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation Nazanin Moradinasab et.al. 2406.19225 null
2024-06-30 Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO Fuseini Mumuni et.al. 2406.19057 null
2024-06-27 Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation Tao Lian et.al. 2406.18809 null
2024-07-01 3D Feature Distillation with Object-Centric Priors Georgios Tziafas et.al. 2406.18742 null
2024-06-26 CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data Nikolaos Dionelis et.al. 2406.18279 null
2024-06-26 CoDA: Interactive Segmentation and Morphological Analysis of Dendroid Structures Exemplified on Stony Cold-Water Corals Kira Schmitt et.al. 2406.18236 link
2024-06-26 The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval Meinardus Boris et.al. 2406.18113 link
2024-06-26 Few-Shot Medical Image Segmentation with High-Fidelity Prototypes Song Tang et.al. 2406.18074 link
2024-06-25 Semi-supervised classification of dental conditions in panoramic radiographs using large language model and instance segmentation: A real-world dataset evaluation Bernardo Silva et.al. 2406.17915 null
2024-06-25 Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation Xuming Zhang et.al. 2406.17679 null
2024-06-25 DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation Ahmad Mohammadshirazi et.al. 2406.17591 link
2024-06-25 Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation Felix Stillger et.al. 2406.17541 null
2024-06-25 Investigating Self-Supervised Methods for Label-Efficient Learning Srinivasa Rao Nandam et.al. 2406.17460 null
2024-06-25 Pseudo Labelling for Enhanced Masked Autoencoders Srinivasa Rao Nandam et.al. 2406.17450 null
2024-06-25 Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model Zhuoyuan Li et.al. 2406.17442 null
2024-06-25 Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes Qi Ma et.al. 2406.17438 null
2024-06-25 Depth-Guided Semi-Supervised Instance Segmentation Xin Chen et.al. 2406.17413 null
2024-06-25 XAMI – A Benchmark Dataset for Artefact Detection in XMM-Newton Optical Images Elisabeta-Iulia Dima et.al. 2406.17323 link
2024-06-24 GMT: Guided Mask Transformer for Leaf Instance Segmentation Feng Chen et.al. 2406.17109 null
2024-06-24 Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation Yizheng Wu et.al. 2406.16776 link
2024-06-24 μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation Pierangela Bruno et.al. 2406.16724 null
2024-06-24 GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection Harnaik Dhami et.al. 2406.16625 null
2024-06-24 LOGCAN++: Local-global class-aware network for semantic segmentation of remote sensing images Xiaowen Ma et.al. 2406.16502 link
2024-06-24 Cascade Reward Sampling for Efficient Decoding-Time Alignment Bolian Li et.al. 2406.16306 link
2024-06-24 SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments Neng Wang et.al. 2406.16279 link
2024-06-23 UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery Pengfei Zhang et.al. 2406.16129 null
2024-06-23 CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery Oluwatosin Alabi et.al. 2406.16039 null
2024-06-22 Fine-grained Background Representation for Weakly Supervised Semantic Segmentation Xu Yin et.al. 2406.15755 null
2024-06-21 TraceNet: Segment one thing efficiently Mingyuan Wu et.al. 2406.14874 null
2024-06-19 3D Instance Segmentation Using Deep Learning on RGB-D Indoor Data Siddiqui Muhammad Yasir et.al. 2406.14581 null
2024-06-20 Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery Ilham Adi Panuntun et.al. 2406.14220 null
2024-06-20 Trusting Semantic Segmentation Networks Samik Some et.al. 2406.14201 null
2024-06-20 EvSegSNN: Neuromorphic Semantic Segmentation for Event Data Dalia Hareb et.al. 2406.14178 null
2024-06-20 Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images Qinfeng Zhu et.al. 2406.14086 link
2024-06-20 2nd Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation Bin Cao et.al. 2406.13939 null
2024-06-19 Search-based DNN Testing and Retraining with GAN-enhanced Simulations Mohammed Oualid Attaoui et.al. 2406.13359 null
2024-06-19 Deep Learning-Based 3D Instance and Semantic Segmentation: A Review Siddiqui Muhammad Yasir et.al. 2406.13308 null
2024-06-18 Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation Guoyu Yang et.al. 2406.12496 link
2024-06-18 Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines Honglei Zhang et.al. 2406.12367 null
2024-06-18 Agriculture-Vision Challenge 2024 – The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble Wang Liu et.al. 2406.12271 null
2024-06-17 OoDIS: Anomaly Instance Segmentation Benchmark Alexey Nekrasov et.al. 2406.11835 link
2024-06-17 Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT Maximilian E. Tschuchnig et.al. 2406.11650 null
2024-06-17 Learning from Exemplars for Interactive Image Segmentation Kun Li et.al. 2406.11472 null
2024-06-17 SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation Zhenchao Lin et.al. 2406.11441 link
2024-06-17 Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding Yunsong Wang et.al. 2406.11283 null
2024-06-17 Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation Bingfeng Zhang et.al. 2406.11189 null
2024-06-16 $α$-SSC: Uncertainty-Aware Camera-based 3D Semantic Scene Completion Sanbao Su et.al. 2406.11021 null
2024-06-16 Benchmarking Label Noise in Instance Segmentation: Spatial Noise Matters Moshe Kimhi et.al. 2406.10891 link
2024-06-16 PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery Libo Wang et.al. 2406.10828 link
2024-06-15 GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR Bharat Singh et.al. 2406.10722 null
2024-06-14 Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations Daan de Geus et.al. 2406.10114 null
2024-06-14 ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers Narges Norouzi et.al. 2406.09936 null
2024-06-14 Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions Aldi Piroli et.al. 2406.09906 null
2024-06-14 Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation Brunó B. Englert et.al. 2406.09896 link
2024-06-14 Open-Vocabulary Semantic Segmentation with Image Embedding Balancing Xiangheng Shan et.al. 2406.09829 link
2024-06-14 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities Roman Bachmann et.al. 2406.09406 null
2024-06-13 Instance-level quantitative saliency in multiple sclerosis lesion segmentation Federico Spagnolo et.al. 2406.09335 null
2024-06-13 APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation Weizhao He et.al. 2406.08372 null
2024-06-12 Dataset Enhancement with Instance-Level Augmentations Orest Kupyn et.al. 2406.08249 link
2024-06-12 2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation Zhensong Xu et.al. 2406.08192 null
2024-06-13 A$^{2}$-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder Lixian Zhang et.al. 2406.08079 null
2024-06-12 OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding Yinan Deng et.al. 2406.08009 link
2024-06-12 SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation Chanda Grover Kamra et.al. 2406.07986 link
2024-06-12 Small Scale Data-Free Knowledge Distillation He Liu et.al. 2406.07876 link
2024-06-11 Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph Sergey Linok et.al. 2406.07113 null
2024-06-11 PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving Yining Shi et.al. 2406.07037 null
2024-06-11 RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks Zhechao Wang et.al. 2406.07032 null
2024-06-12 LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection Jiahua Xu et.al. 2406.07023 null
2024-06-11 Dual Thinking and Perceptual Analysis of Deep Learning Models using Human Adversarial Examples Kailas Dayanandan et.al. 2406.06967 link
2024-06-11 UVIS: Unsupervised Video Instance Segmentation Shuaiyi Huang et.al. 2406.06908 null
2024-06-10 Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation Dong Zhao et.al. 2406.06813 null
2024-06-10 Merlin: A Vision Language Foundation Model for 3D Computed Tomography Louis Blankemeier et.al. 2406.06512 link
2024-06-10 UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving Daniel Bogdoll et.al. 2406.06370 null
2024-06-10 Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset Shijie Lian et.al. 2406.06039 link
2024-06-09 Scaling Graph Convolutions for Mobile Vision William Avery et.al. 2406.05850 link
2024-06-09 Solution for CVPR 2024 UG2+ Challenge Track on All Weather Semantic Segmentation Jun Yu et.al. 2406.05837 null
2024-06-09 Convolution and Attention-Free Mamba-based Cardiac Image Segmentation Abbas Khan et.al. 2406.05786 null
2024-06-09 Separating the “Chirp” from the “Chat”: Self-supervised Visual Grounding of Sound and Language Mark Hamilton et.al. 2406.05629 link
2024-06-08 A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+ Jianzhao Wang et.al. 2406.05513 null
2024-06-08 Layered Image Vectorization via Semantic Simplification Zhenyu Wang et.al. 2406.05404 null
2024-06-08 1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR’24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation Qingfeng Liu et.al. 2406.05352 null
2024-06-07 Semantic Segmentation on VSPW Dataset through Masked Video Consistency Chen Liang et.al. 2406.04979 null
2024-06-07 Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment Venkanna Babu Guthula et.al. 2406.04949 null
2024-06-06 Characterizing segregation in blast rock piles: a deep-learning approach leveraging aerial image analysis Chengeng Liu et.al. 2406.04149 null
2024-06-07 3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation Ruipu Wu et.al. 2406.04002 null
2024-06-06 Frequency-based Matcher for Long-tailed Semantic Segmentation Shan Li et.al. 2406.03917 link
2024-06-07 Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge Nan Zhang et.al. 2406.03799 link
2024-06-06 Instance Segmentation and Teeth Classification in Panoramic X-rays Devichand Budagam et.al. 2406.03747 link
2024-06-06 DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation Zilu Guo et.al. 2406.03702 link
2024-06-05 Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation Maximilian Zenk et.al. 2406.03323 null
2024-06-05 Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy Yunho Kim et.al. 2406.02989 null
2024-06-04 W-RIZZ: A Weakly-Supervised Framework for Relative Traversability Estimation in Mobile Robotics Andre Schreiber et.al. 2406.02822 link
2024-06-04 Window to Wall Ratio Detection using SegFormer Zoe De Simone et.al. 2406.02706 link
2024-06-04 Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation Mohamed El Amine Boudjoghra et.al. 2406.02548 link
2024-06-04 Generative Active Learning for Long-tailed Instance Segmentation Muzhi Zhu et.al. 2406.02435 link
2024-06-04 Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning Heather Doig et.al. 2406.01932 null
2024-06-03 MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild Zeren Jiang et.al. 2406.01595 null
2024-06-03 Towards Flexible Interactive Reflection Removal with Human Guidance Xiao Chen et.al. 2406.01555 link
2024-06-03 EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding Thanh-Dat Truong et.al. 2406.01429 null
2024-06-03 An expert-driven data generation pipeline for histological images Roberto Basla et.al. 2406.01403 link
2024-06-03 TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation Antonio Santo et.al. 2406.01395 link
2024-06-03 MP-PolarMask: A Faster and Finer Instance Segmentation for Concave Images Ke-Lei Wang et.al. 2406.01356 null
2024-06-03 ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds Ka Lung Cheung et.al. 2406.01337 link
2024-05-31 Uncertainty Quantification for Bird’s Eye View Semantic Segmentation: Methods and Benchmarks Linlin Yu et.al. 2405.20986 null
2024-05-31 Extreme Point Supervised Instance Segmentation Hyeonjun Lee et.al. 2405.20729 null
2024-05-31 Revisiting and Maximizing Temporal Knowledge in Semi-supervised Semantic Segmentation Wooseok Shin et.al. 2405.20610 link
2024-05-30 P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation Qi Zhang et.al. 2405.20443 null
2024-05-30 SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow Chaoyang Wang et.al. 2405.20282 link
2024-05-30 MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion Angel Villar-Corrales et.al. 2405.19921 link
2024-05-30 Open-Set Domain Adaptation for Semantic Segmentation Seun-An Choe et.al. 2405.19899 link
2024-05-30 DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation Ron Keuth et.al. 2405.19746 link
2024-05-30 Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes Yong-Qiang Mao et.al. 2405.19735 null
2024-05-30 CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation Ankush Gajanan Arudkar et.al. 2405.19672 null
2024-05-29 Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation Lianlei Shan et.al. 2405.19568 null
2024-05-29 Enabling Visual Recognition at Radio Frequency Haowen Lai et.al. 2405.19516 null
2024-05-29 Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models Tianrun Chen et.al. 2405.19326 null
2024-05-29 A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation Niclas Vödisch et.al. 2405.19035 link
2024-05-29 Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation Zelin Peng et.al. 2405.18840 null
2024-05-29 FocSAM: Delving Deeply into Focused Objects in Segmenting Anything You Huang et.al. 2405.18706 null
2024-05-28 Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation JuneHyoung Kwon et.al. 2405.18148 null
2024-05-28 Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images Lianlei Shan et.al. 2405.18078 null
2024-05-28 RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields Mihnea-Bogdan Jurca et.al. 2405.18033 null
2024-05-28 DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture Shentong Mo et.al. 2405.17995 link
2024-05-28 Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation Yangxiao Lu et.al. 2405.17859 link
2024-05-28 The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention Xingyu Ding et.al. 2405.17776 null
2024-05-27 Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation Steven Landgraf et.al. 2405.17097 null
2024-05-27 DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking Hongtao Wang et.al. 2405.16980 null
2024-05-27 Collective Perception Datasets for Autonomous Driving: A Comprehensive Review Sven Teufel et.al. 2405.16973 null
2024-05-27 Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models Qian Wang et.al. 2405.16947 null
2024-05-27 A re-calibration method for object detection with multi-modal alignment bias in autonomous driving Zhihang Song et.al. 2405.16848 null
2024-05-26 Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning Neha Kalibhat et.al. 2405.16401 null
2024-05-25 Video Prediction Models as General Visual Encoders James Maier et.al. 2405.16382 null
2024-05-25 BOLD: Boolean Logic Deep Learning Van Minh Nguyen et.al. 2405.16339 null
2024-05-25 Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation Huizhou Chen et.al. 2405.16099 null
2024-05-25 Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality Hakim Ikebayashi et.al. 2405.16008 null
2024-05-24 Visualize and Paint GAN Activations Rudolf Herdt et.al. 2405.15636 null
2024-05-24 Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets Hoàng-Ân Lê et.al. 2405.15394 null
2024-05-24 Autonomous Quilt Spreading for Caregiving Robots Yuchun Guo et.al. 2405.15373 null
2024-05-24 U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation Bingyu Li et.al. 2405.15365 link
2024-05-24 Cross-Domain Few-Shot Semantic Segmentation via Doubly Matching Transformation Jiayi Chen et.al. 2405.15265 null
2024-05-23 Mamba-R: Vision Mamba ALSO Needs Registers Feng Wang et.al. 2405.14858 null
2024-05-23 Efficient Robot Learning for Perception and Mapping Niclas Vödisch et.al. 2405.14688 null
2024-05-23 Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation Daniel Kienzle et.al. 2405.14467 null
2024-05-23 MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models Jiuming Liu et.al. 2405.14338 null
2024-05-23 Tuning-free Universally-Supervised Semantic Segmentation Xiaobo Yang et.al. 2405.14294 null
2024-05-23 SCMix: Stochastic Compound Mixing for Open Compound Domain Adaptation in Semantic Segmentation Kai Yao et.al. 2405.14278 null
2024-05-23 Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations Mohammed Baharoon et.al. 2405.14239 null
2024-05-23 Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification Taylor Archibald et.al. 2405.14162 null
2024-05-23 Skip-SCAR: A Modular Approach to ObjectGoal Navigation with Sparsity and Adaptive Skips Yaotian Liu et.al. 2405.14154 null
2024-05-22 TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System Diogo Lavado et.al. 2405.13989 null
2024-05-21 Transparency Distortion Robustness for SOTA Image Segmentation Tasks Volker Knauthe et.al. 2405.12864 null
2024-05-20 A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation Sushmita Sarker et.al. 2405.11903 null
2024-05-20 Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments Jooyong Park et.al. 2405.11855 null
2024-05-20 Improving the Explain-Any-Concept by Introducing Nonlinearity to the Trainable Surrogate Model Mounes Zaval et.al. 2405.11837 null
2024-05-20 Universal Organizer of SAM for Unsupervised Semantic Segmentation Tingting Li et.al. 2405.11742 null
2024-05-19 Interpreting a Semantic Segmentation Model for Coastline Detection Conor O’Sullivan et.al. 2405.11500 null
2024-05-19 Unifying 3D Vision-Language Understanding via Promptable Queries Ziyu Zhu et.al. 2405.11442 null
2024-05-18 PS6D: Point Cloud Based Symmetry-Aware 6D Object Pose Estimation in Robot Bin-Picking Yifan Yang et.al. 2405.11257 null
2024-05-17 CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation Mushui Liu et.al. 2405.10530 link
2024-05-16 4D Panoptic Scene Graph Generation Jingkang Yang et.al. 2405.10305 link
2024-05-16 Towards Task-Compatible Compressible Representations Anderson de Andrade et.al. 2405.10244 link
2024-05-16 DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data Chengxiang Fan et.al. 2405.10185 link
2024-05-16 An Integrated Framework for Multi-Granular Explanation of Video Summarization Konstantinos Tsigos et.al. 2405.10082 null
2024-05-16 A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance Andrea Matteazzi et.al. 2405.10046 null
2024-05-16 Towards Realistic Incremental Scenario in Class Incremental Semantic Segmentation Jihwan Kwak et.al. 2405.09858 null
2024-05-15 Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation Guo Yachan et.al. 2405.09682 null
2024-05-14 CLIP with Quality Captions: A Strong Pretraining for Vision Tasks Pavan Kumar Anasosalu Vasu et.al. 2405.08911 null
2024-05-14 Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study Qinfeng Zhu et.al. 2405.08493 null
2024-05-14 TEDNet: Twin Encoder Decoder Neural Network for 2D Camera and LiDAR Road Detection Martín Bayón-Gutiérrez et.al. 2405.08429 link
2024-05-13 IMAFD: An Interpretable Multi-stage Approach to Flood Detection from time series Multispectral Data Ziyang Zhang et.al. 2405.07916 null
2024-05-13 PLUTO: Pathology-Universal Transformer Dinkar Juyal et.al. 2405.07905 null
2024-05-12 PotatoGANs: Utilizing Generative Adversarial Networks, Instance Segmentation, and Explainable AI for Enhanced Potato Disease Identification and Classification Mohammad Shafiul Alam et.al. 2405.07332 link
2024-05-12 Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception Haoming Chen et.al. 2405.07201 null
2024-05-11 Global Motion Understanding in Large-Scale Video Object Segmentation Volodymyr Fedynyak et.al. 2405.07031 null
2024-05-10 GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs Mustafa Munir et.al. 2405.06849 link
2024-05-10 Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach Elham Ravanbakhsh et.al. 2405.06586 null
2024-05-10 Semantic and Spatial Adaptive Pixel-level Classifier for Semantic Segmentation Xiaowen Ma et.al. 2405.06525 link
2024-05-10 Multi-Target Unsupervised Domain Adaptation for Semantic Segmentation without External Data Yonghao Xu et.al. 2405.06502 null
2024-05-10 Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data Rongyu Zhang et.al. 2405.06413 null
2024-05-10 Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation Zhenliang Ni et.al. 2405.06228 link
2024-05-10 Zero-shot Degree of Ill-posedness Estimation for Active Small Object Change Detection Koji Takeda et.al. 2405.06185 null
2024-05-10 Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging Zhuchen Shao et.al. 2405.06175 null
2024-05-09 Mask-TS Net: Mask Temperature Scaling Uncertainty Calibration for Polyp Segmentation Yudian Zhang et.al. 2405.05830 null
2024-05-09 CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks Nick et.al. 2405.05755 null
2024-05-08 OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies Lingdong Kong et.al. 2405.05259 link
2024-05-08 Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving Lingdong Kong et.al. 2405.05258 link
2024-05-08 Weakly-supervised Semantic Segmentation via Dual-stream Contrastive Learning of Cross-image Contextual Information Qi Lai et.al. 2405.04913 null
2024-05-08 DeepDamageNet: A two-step deep-learning model for multi-disaster building damage segmentation and classification using satellite imagery Irene Alisjahbana et.al. 2405.04800 null
2024-05-07 A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images László Kopácsi et.al. 2405.04650 null
2024-05-07 FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse Landscapes Charles Gaydon et.al. 2405.04634 link
2024-05-07 AugmenTory: A Fast and Flexible Polygon Augmentation Library Tanaz Ghahremani et.al. 2405.04442 null
2024-05-07 A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields Raiyan Rahman et.al. 2405.04305 null
2024-05-07 ELiTe: Efficient Image-to-LiDAR Knowledge Transfer for Semantic Segmentation Zhibo Zhang et.al. 2405.04121 null
2024-05-07 Structured Click Control in Transformer-based Interactive Segmentation Long Xu et.al. 2405.04009 link
2024-05-06 PTQ4SAM: Post-Training Quantization for Segment Anything Chengtao Lv et.al. 2405.03144 link
2024-05-04 MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning Vishal Nedungadi et.al. 2405.02771 null
2024-05-04 Few-Shot Fruit Segmentation via Transfer Learning Jordan A. James et.al. 2405.02556 null
2024-05-03 Panoptic-SLAM: Visual SLAM in Dynamic Environments using Panoptic Segmentation Gabriel Fischer Abati et.al. 2405.02177 null
2024-05-03 Towards general deep-learning-based tree instance segmentation models Jonathan Henrich et.al. 2405.02061 null
2024-05-03 DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model Peijin Jia et.al. 2405.02008 null
2024-05-02 Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey Guoping Xu et.al. 2405.01725 link
2024-05-02 Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey Rokas Gipiškis et.al. 2405.01636 null
2024-05-02 CromSS: Cross-modal pre-training with noisy labels for remote sensing image segmentation Chenying Liu et.al. 2405.01217 null
2024-05-02 Uncertainty-aware self-training with expectation maximization basis transformation Zijia Wang et.al. 2405.01175 null
2024-05-01 GraCo: Granularity-Controllable Interactive Segmentation Yian Zhao et.al. 2405.00587 null
2024-05-01 Exploring Self-Supervised Vision Transformers for Deepfake Detection: A Comparative Analysis Huy H. Nguyen et.al. 2405.00355 null
2024-04-30 Masked Multi-Query Slot Attention for Unsupervised Object Discovery Rishav Pramanik et.al. 2404.19654 link
2024-04-30 UniFS: Universal Few-shot Instance Perception with Point Representations Sheng Jin et.al. 2404.19401 null
2024-04-30 DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical Documents Taylor Archibald et.al. 2404.19259 null
2024-04-29 Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing Leonardo Rossi et.al. 2404.18924 null
2024-04-29 IPixMatch: Boost Semi-supervised Semantic Segmentation with Inter-Pixel Relation Kebin Wu et.al. 2404.18891 null
2024-04-29 From Density to Geometry: YOLOv8 Instance Segmentation for Reverse Engineering of Optimized Structures Thomas Rochefort-Beaudoin et.al. 2404.18763 null
2024-04-29 Towards Long-term Robotics in the Wild Stephen Hausler et.al. 2404.18477 null
2024-04-29 Clicks2Line: Using Lines for Interactive Image Segmentation Chaewon Lee et.al. 2404.18461 null
2024-04-29 MFP: Making Full Use of Probability Maps for Interactive Image Segmentation Chaewon Lee et.al. 2404.18448 null
2024-04-28 Panoptic Segmentation and Labelling of Lumbar Spine Vertebrae using Modified Attention Unet Rikathi Pal et.al. 2404.18291 null
2024-04-28 Garbage Segmentation and Attribute Analysis by Robotic Dogs Nuo Xu et.al. 2404.18112 null
2024-04-27 Multi-Stream Cellular Test-Time Adaptation of Real-Time Models Evolving in Dynamic Environments Benoît Gérin et.al. 2404.17930 link
2024-04-27 GLIMS: Attention-Guided Lightweight Multi-Scale Hybrid Network for Volumetric Semantic Segmentation Ziya Ata Yazıcı et.al. 2404.17854 link
2024-04-26 Optimizing Universal Lesion Segmentation: State Space Model-Guided Hierarchical Networks with Feature Importance Adjustment Kazi Shahriar Sanjid et.al. 2404.17235 null
2024-04-25 Calculation of Femur Caput Collum Diaphyseal angle for X-Rays images using Semantic Segmentation Deepak Bhatia et.al. 2404.17083 null
2024-04-25 Boosting Unsupervised Semantic Segmentation with Principal Mask Proposals Oliver Hahn et.al. 2404.16818 link
2024-04-25 Self-Balanced R-CNN for Instance Segmentation Leonardo Rossi et.al. 2404.16633 link
2024-04-26 Multi-Scale Representations by Varying Window Attention for Semantic Segmentation Haotian Yan et.al. 2404.16573 link
2024-04-25 360SFUDA++: Towards Source-free UDA for Panoramic Segmentation by Learning Reliable Category Prototypes Xu Zheng et.al. 2404.16501 null
2024-04-25 Semantic Segmentation Refiner for Ultrasound Applications with Zero-Shot Foundation Models Hedda Cohen Indelman et.al. 2404.16325 null
2024-04-25 Style Adaptation for Domain-adaptive Semantic Segmentation Ting Li et.al. 2404.16301 null
2024-04-25 A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation Yifan Zhao et.al. 2404.16266 link
2024-04-24 Does SAM dream of EIG? Characterizing Interactive Segmenter Performance using Expected Information Gain Kuan-I Chung et.al. 2404.16155 null
2024-04-24 3D Freehand Ultrasound using Visual Inertial and Deep Inertial Odometry for Measuring Patellar Tracking Russell Buchanan et.al. 2404.15847 null
2024-04-24 Vision Transformer-based Adversarial Domain Adaptation Yahan Li et.al. 2404.15817 link
2024-04-23 PRISM: A Promptable and Robust Interactive Segmentation Model with Visual Prompts Hao Li et.al. 2404.15028 link
2024-04-23 Unknown Object Grasping for Assistive Robotics Elle Miller et.al. 2404.15001 null
2024-04-22 Surgical-DeSAM: Decoupling SAM for Instrument Segmentation in Robotic Surgery Yuyang Sheng et.al. 2404.14040 link
2024-04-22 OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks Sophia Sirko-Galouchenko et.al. 2404.14027 null
2024-04-22 PM-VIS: High-Performance Box-Supervised Video Instance Segmentation Zhangjing Yang et.al. 2404.13863 null
2024-04-21 Semantic-Rearrangement-Based Multi-Level Alignment for Domain Generalized Segmentation Guanlong Jiao et.al. 2404.13701 null
2024-04-21 PV-S3: Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images Abhishek Jha et.al. 2404.13693 null
2024-04-21 A Complete System for Automated 3D Semantic-Geometric Mapping of Corrosion in Industrial Environments Rui Pimentel de Figueiredo et.al. 2404.13691 null
2024-04-21 LMFNet: An Efficient Multimodal Fusion Approach for Semantic Segmentation in High-Resolution Remote Sensing Tong Wang et.al. 2404.13659 null
2024-04-21 Towards Unified Representation of Multi-Modal Pre-training for 3D Understanding via Differentiable Rendering Ben Fei et.al. 2404.13619 null
2024-04-20 FisheyeDetNet: Object Detection on Fisheye Surround View Camera Systems for Automated Driving Ganesh Sistu et.al. 2404.13443 null
2024-04-20 AMMUNet: Multi-Scale Attention Map Merging for Remote Sensing Image Segmentation Yang Yang et.al. 2404.13408 null
2024-04-19 Nuclei Instance Segmentation of Cryosectioned H&E Stained Histological Images using Triple U-Net Architecture Zarif Ahmed et.al. 2404.12986 null
2024-04-19 FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving Xingtai Gui et.al. 2404.12867 null
2024-04-19 Foundation Model assisted Weakly Supervised LiDAR Semantic Segmentation Yilong Chen et.al. 2404.12861 null
2024-04-19 COIN: Counterfactual inpainting for weakly supervised semantic segmentation for medical images Dmytro Shvetsov et.al. 2404.12832 link
2024-04-19 A Point-Based Approach to Efficient LiDAR Multi-Task Perception Christopher Lang et.al. 2404.12798 null
2024-04-19 Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework Zhuohong Li et.al. 2404.12721 link
2024-04-19 Improving Prediction Accuracy of Semantic Segmentation Methods Using Convolutional Autoencoder Based Pre-processing Layers Hisashi Shimodaira et.al. 2404.12718 null
2024-04-19 Show and Grasp: Few-shot Semantic Segmentation for Robot Grasping through Zero-shot Foundation Models Leonardo Barcellona et.al. 2404.12717 null
2024-04-18 Spot-Compose: A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds Oliver Lemke et.al. 2404.12440 null
2024-04-18 A Perspective on Deep Vision Performance with Standard Image and Video Codecs Christoph Reich et.al. 2404.12330 null
2024-04-18 Performance Evaluation of Segment Anything Model with Variational Prompting for Application to Non-Visible Spectrum Imagery Yona Falinie A. Gaus et.al. 2404.12285 null
2024-04-18 Deep Gaussian mixture model for unsupervised image segmentation Matthias Schwab et.al. 2404.12252 null
2024-04-18 Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training Jin Gao et.al. 2404.12210 link
2024-04-18 How to Benchmark Vision Foundation Models for Semantic Segmentation? Tommie Kerssies et.al. 2404.12172 null
2024-04-17 Mushroom Segmentation and 3D Pose Estimation from Point Clouds using Fully Convolutional Geometric Features and Implicit Pose Encoding George Retsinas et.al. 2404.12144 link
2024-04-18 Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation Chongjie Si et.al. 2404.11981 null
2024-04-18 The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models Cheng Shi et.al. 2404.11957 link
2024-04-18 Group-On: Boosting One-Shot Segmentation with Supportive Query Hanjing Zhou et.al. 2404.11871 null
2024-04-17 Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach Mir Rayat Imtiaz Hossain et.al. 2404.11732 null
2024-04-17 A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching Francesco Pro et.al. 2404.11302 link
2024-04-17 Learning from Unlabelled Data with Transformers: Domain Adaptation for Semantic Segmentation of High Resolution Aerial Images Nikolaos Dionelis et.al. 2404.11299 link
2024-04-17 Criteria for Uncertainty-based Corner Cases Detection in Instance Segmentation Florian Heidecker et.al. 2404.11266 null
2024-04-16 A Concise Tiling Strategy for Preserving Spatial Context in Earth Observation Imagery Ellianna Abrahams et.al. 2404.10927 link
2024-04-16 Vocabulary-free Image Classification and Semantic Segmentation Alessandro Conti et.al. 2404.10864 link
2024-04-16 Gasformer: A Transformer-based Architecture for Segmenting Methane Emissions from Livestock in Optical Gas Imaging Toqi Tahamid Sarker et.al. 2404.10841 link
2024-04-16 Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark Jiangning Zhang et.al. 2404.10760 null
2024-04-16 ECLAIR: A High-Fidelity Aerial LiDAR Dataset for Semantic Segmentation Iaroslav Melekhov et.al. 2404.10699 null
2024-04-16 Contextrast: Contextual Contrastive Learning for Semantic Segmentation Changki Sung et.al. 2404.10633 null
2024-04-16 Label merge-and-split: A graph-colouring approach for memory-efficient brain parcellation Aaron Kujawa et.al. 2404.10572 null
2024-04-16 LAECIPS: Large Vision Model Assisted Adaptive Edge-Cloud Collaboration for IoT-based Perception System Shijing Hu et.al. 2404.10498 null
2024-04-16 Adversarial Identity Injection for Semantic Face Image Synthesis Giuseppe Tarollo et.al. 2404.10408 null
2024-04-16 Domain-Rectifying Adapter for Cross-Domain Few-Shot Segmentation Jiapeng Su et.al. 2404.10322 null
2024-04-16 Learnable Prompt for Few-Shot Semantic Segmentation in Remote Sensing Domain Steve Andreas Immanuel et.al. 2404.10307 link
2024-04-15 NOISe: Nuclei-Aware Osteoclast Instance Segmentation for Mouse-to-Human Domain Transfer Sai Kumar Reddy Manne et.al. 2404.10130 link
2024-04-15 Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL Fangwei Zhong et.al. 2404.09857 null
2024-04-15 In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation Han Xue et.al. 2404.09633 null
2024-04-15 The revenge of BiSeNet: Efficient Multi-Task Image Segmentation Gabriele Rosi et.al. 2404.09570 null
2024-04-15 kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies Zhongrui Gui et.al. 2404.09447 null
2024-04-15 Human-in-the-Loop Segmentation of Multi-species Coral Imagery Scarlett Raine et.al. 2404.09406 null
2024-04-14 Bridging Data Islands: Geographic Heterogeneity-Aware Federated Learning for Collaborative Remote Sensing Semantic Segmentation Jieyi Tan et.al. 2404.09292 null
2024-04-12 Structured Model Pruning for Efficient Inference in Computational Pathology Mohammed Adnan et.al. 2404.08831 null
2024-04-12 COCONut: Modernizing COCO Segmentation Xueqing Deng et.al. 2404.08639 null
2024-04-12 Benchmarking the Cell Image Segmentation Models Robustness under the Microscope Optical Aberrations Boyuan Peng et.al. 2404.08549 null
2024-04-12 Analyzing Decades-Long Environmental Changes in Namibia Using Archival Aerial Photography and Deep Learning Girmaw Abebe Tadesse et.al. 2404.08544 null
2024-04-12 LaSagnA: Language-based Segmentation Assistant for Complex Queries Cong Wei et.al. 2404.08506 link
2024-04-12 Adapting the Segment Anything Model During Usage in Novel Situations Robin Schön et.al. 2404.08421 null
2024-04-12 Let It Flow: Simultaneous Optimization of 3D Flow and Object Clustering Patrik Vacek et.al. 2404.08363 null
2024-04-12 AdaContour: Adaptive Contour Descriptor with Hierarchical Representation Tianyu Ding et.al. 2404.08292 null
2024-04-12 Tackling Ambiguity from Perspective of Uncertainty Inference and Affinity Diversification for Weakly Supervised Semantic Segmentation Zhiwei Yang et.al. 2404.08195 link
2024-04-12 Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation Sina Hajimiri et.al. 2404.08181 link
2024-04-11 Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification Ricardo Pereira et.al. 2404.07739 null
2024-04-11 OpenTrench3D: A Photogrammetric 3D Point Cloud Dataset for Semantic Segmentation of Underground Utilities Lasse H. Hansen et.al. 2404.07711 link
2024-04-11 ViM-UNet: Vision Mamba for Biomedical Segmentation Anwai Archit et.al. 2404.07705 link
2024-04-11 Implicit and Explicit Language Guidance for Diffusion-based Visual Perception Hefeng Wang et.al. 2404.07600 null
2024-04-11 Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling Sourajit Saha et.al. 2404.07410 null
2024-04-10 AI-Guided Defect Detection Techniques to Model Single Crystal Diamond Growth Rohan Reddy Mekala et.al. 2404.07306 null
2024-04-10 RESSCAL3D: Resolution Scalable 3D Semantic Segmentation of Point Clouds Remco Royen et.al. 2404.06863 null
2024-04-10 O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation Muer Tie et.al. 2404.06836 null
2024-04-10 Convolution-based Probability Gradient Loss for Semantic Segmentation Guohang Shan et.al. 2404.06704 null
2024-04-09 Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation Luca Barsellotti et.al. 2404.06542 null
2024-04-09 QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding Yash Mehan et.al. 2404.06442 null
2024-04-09 DaF-BEVSeg: Distortion-aware Fisheye Camera based Bird’s Eye View Segmentation with Occlusion Reasoning Senthil Yogamani et.al. 2404.06352 null
2024-04-09 Automated National Urban Map Extraction Hasan Nasrallah et.al. 2404.06202 null
2024-04-09 Hierarchical Insights: Exploiting Structural Similarities for Reliable 3D Semantic Segmentation Mariella Dreissig et.al. 2404.06124 null
2024-04-09 Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation Zong-Wei Hong et.al. 2404.06029 null
2024-04-08 Evaluating the Efficacy of Cut-and-Paste Data Augmentation in Semantic Segmentation for Satellite Imagery Ionut M. Motoi et.al. 2404.05693 null
2024-04-08 AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation Jiannan Ge et.al. 2404.05667 null
2024-04-08 Impact of LiDAR visualisations on semantic segmentation of archaeological objects Raveerat Jaturapitpornchai et.al. 2404.05512 null
2024-04-08 Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance Dazhong Shen et.al. 2404.05384 link
2024-04-08 GPS-free Autonomous Navigation in Cluttered Tree Rows with Deep Semantic Segmentation Alessandro Navone et.al. 2404.05338 null
2024-04-08 Human Detection from 4D Radar Data in Low-Visibility Field Conditions Mikael Skog et.al. 2404.05307 null
2024-04-08 iVPT: Improving Task-relevant Information Sharing in Visual Prompt Tuning by Cross-layer Dynamic Connection Nan Zhou et.al. 2404.05207 null
2024-04-08 UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather Haimei Zhao et.al. 2404.05145 null
2024-04-07 D2SL: Decouple Defogging and Semantic Learning for Foggy Domain-Adaptive Segmentation Xuan Sun et.al. 2404.04807 null
2024-04-06 HawkDrive: A Transformer-driven Visual Perception System for Autonomous Driving in Night Scene Ziang Guo et.al. 2404.04653 link
2024-04-05 Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation Zifu Wan et.al. 2404.04256 link
2024-04-05 Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation Ji-Jia Wu et.al. 2404.04231 link
2024-04-05 MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor and Connector Junbo Li et.al. 2404.04155 null
2024-04-04 Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation Elham Amin Mansour et.al. 2404.03799 null
2024-04-04 Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball Simon Weber et.al. 2404.03778 null
2024-04-04 OW-VISCap: Open-World Video Instance Segmentation and Captioning Anwesa Choudhuri et.al. 2404.03657 null
2024-04-04 Background Noise Reduction of Attention Map for Weakly Supervised Semantic Segmentation Izumi Fujimori et.al. 2404.03394 null
2024-04-04 iSeg: Interactive 3D Segmentation via Interactive Attention Itai Lang et.al. 2404.03219 null
2024-04-04 CORP: A Multi-Modal Dataset for Campus-Oriented Roadside Perception Tasks Beibei Wang et.al. 2404.03191 null
2024-04-03 GPU-Accelerated RSF Level Set Evolution for Large-Scale Microvascular Segmentation Meher Niger et.al. 2404.02813 null
2024-04-03 RS-Mamba for Large Remote Sensing Image Dense Prediction Sijie Zhao et.al. 2404.02668 link
2024-04-03 A Satellite Band Selection Framework for Amazon Forest Deforestation Detection Task Eduardo Neto et.al. 2404.02659 null
2024-04-03 SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation Junyan Ye et.al. 2404.02638 link
2024-04-03 Active learning for efficient annotation in precision agriculture: a use-case on crop-weed semantic segmentation Bart M. van Marrewijk et.al. 2404.02580 null
2024-04-03 HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras Zhongyu Xia et.al. 2404.02517 link
2024-04-03 Optimizing traffic signs and lights visibility for the teleoperation of autonomous vehicles through ROI compression I. Dror et.al. 2404.02481 null
2024-04-03 RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation Xianping Ma et.al. 2404.02457 link
2024-04-02 Constrained Robotic Navigation on Preferred Terrains Using LLMs and Speech Instruction: Exploiting the Power of Adverbs Faraz Lotfi et.al. 2404.02294 null
2024-04-02 Segment Any 3D Object with Language Seungjun Lee et.al. 2404.02157 link
2024-04-02 Multi-Level Label Correction by Distilling Proximate Patterns for Semi-supervised Semantic Segmentation Hui Xiao et.al. 2404.02065 null
2024-04-01 What is Point Supervision Worth in Video Instance Segmentation? Shuaiyi Huang et.al. 2404.01990 null
2024-04-02 Synthetic Data for Robust Stroke Segmentation Liam Chalcroft et.al. 2404.01946 link
2024-04-02 Improving Bird’s Eye View Semantic Segmentation by Task Decomposition Tianhao Zhao et.al. 2404.01925 null
2024-04-02 Rethinking Annotator Simulation: Realistic Evaluation of Whole-Body PET Lesion Interactive Segmentation Methods Zdravko Marinov et.al. 2404.01816 null
2024-04-02 Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model Qinfeng Zhu et.al. 2404.01705 link
2024-04-02 Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss Jaeha Kim et.al. 2404.01692 null
2024-04-02 JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments Duy-Tho Le et.al. 2404.01686 null
2024-04-01 SUGAR: Pre-training 3D Visual Representations for Robotics Shizhe Chen et.al. 2404.01491 null
2024-03-29 ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning Beomyoung Kim et.al. 2403.20126 link
2024-03-29 Modeling Weather Uncertainty for Multi-weather Co-Presence Estimation Qi Bi et.al. 2403.20092 null
2024-03-29 Using Images as Covariates: Measuring Curb Appeal with Deep Learning Ardyn Nordstrom et.al. 2403.19915 null
2024-03-29 MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection Ali Behrouz et.al. 2403.19888 link
2024-03-28 Segmentation Re-thinking Uncertainty Estimation Metrics for Semantic Segmentation Qitian Ma et.al. 2403.19826 null
2024-04-01 Efficient 3D Instance Mapping and Localization with Neural Fields George Tang et.al. 2403.19797 null
2024-03-28 ENet-21: An Optimized light CNN Structure for Lane Detection Seyed Rasoul Hosseini et.al. 2403.19782 null
2024-03-29 Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers Pingcheng Dong et.al. 2403.19591 link
2024-03-28 DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs Donghyun Kim et.al. 2403.19588 link
2024-03-28 Learning Multiple Representations with Inconsistency-Guided Detail Regularization for Mask-Guided Matting Weihao Jiang et.al. 2403.19213 null
2024-03-27 Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D Mukund Varma T et.al. 2403.18922 null
2024-03-27 Annolid: Annotate, Segment, and Track Anything You Need Chen Yang et.al. 2403.18690 null
2024-03-27 I2CKD: Intra- and Inter-Class Knowledge Distillation for Semantic Segmentation Ayoub Karine et.al. 2403.18490 null
2024-03-28 ViTAR: Vision Transformer with Any Resolution Qihang Fan et.al. 2403.18361 null
2024-03-27 Generating Diverse Agricultural Data for Vision-Based Farming Applications Mikolaj Cieslak et.al. 2403.18351 null
2024-03-27 Road Obstacle Detection based on Unknown Objectness Scores Chihiro Noguchi et.al. 2403.18207 null
2024-03-26 Spectral Convolutional Transformer: Harmonizing Real vs. Complex Multi-View Spectral Operators for Vision Transformer Badri N. Patro et.al. 2403.18063 link
2024-03-26 The Need for Speed: Pruning Transformers with One Recipe Samir Khaki et.al. 2403.17921 link
2024-03-26 Compressed Multi-task embeddings for Data-Efficient Downstream training and inference in Earth Observation Carlos Gomes et.al. 2403.17886 null
2024-03-26 PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition Chenhongyi Yang et.al. 2403.17695 link
2024-03-26 Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion Kazi Shahriar Sanjid et.al. 2403.17432 null
2024-03-25 Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions Ye Li et.al. 2403.17009 link
2024-03-25 DreamLIP: Language-Image Pre-training with Long Captions Kecheng Zheng et.al. 2403.17007 link
2024-03-25 TwinLiteNetPlus: A Stronger Model for Real-time Drivable Area and Lane Segmentation Quang-Huy Che et.al. 2403.16958 null
2024-03-25 HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation Linglin Jing et.al. 2403.16788 null
2024-03-25 Clustering Propagation for Universal Medical Image Segmentation Yuhang Ding et.al. 2403.16646 null
2024-03-25 SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation Aysim Toker et.al. 2403.16605 null
2024-03-25 Self-Supervised Learning for Medical Image Data with Anatomy-Oriented Imaging Planes Tianwei Zhang et.al. 2403.16499 null
2024-03-25 GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation Weiming Zhang et.al. 2403.16370 null
2024-03-24 AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D Scans Cedric Perauer et.al. 2403.16318 null
2024-03-24 Dual-modal Prior Semantic Guided Infrared and Visible Image Fusion for Intelligent Transportation System Jing Li et.al. 2403.16227 null
2024-03-24 Segment Anything Model for Road Network Graph Extraction Congrui Hetang et.al. 2403.16051 link
2024-03-24 SM2C: Boost the Semi-supervised Segmentation for Medical Image by using Meta Pseudo Labels and Mixed Images Yifei Wang et.al. 2403.16009 null
2024-03-22 Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting Jun Guo et.al. 2403.15624 null
2024-03-22 A2DMN: Anatomy-Aware Dilated Multiscale Network for Breast Ultrasound Semantic Segmentation Kyle Lucke et.al. 2403.15560 null
2024-03-22 InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Yi Wang et.al. 2403.15377 link
2024-03-22 Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations Pranav Kulkarni et.al. 2403.15218 null
2024-03-22 Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion Sofia Casarin et.al. 2403.15194 null
2024-03-22 IFSENet: Harnessing Sparse Iterations for Interactive Few-shot Segmentation Excellence Shreyas Chandgothia et.al. 2403.15089 null
2024-03-22 Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model using 3D Whole-body CT Scans Heng Guo et.al. 2403.15063 null
2024-03-22 BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation Jiahao Lu et.al. 2403.15019 null
2024-03-22 Improve Cross-domain Mixed Sampling with Guidance Training for Adaptive Segmentation Wenlve Zhou et.al. 2403.14995 null
2024-03-21 WeatherProof: Leveraging Language Guidance for Semantic Segmentation in Adverse Weather Blake Gella et.al. 2403.14874 null
2024-03-21 PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model Zheng Zhang et.al. 2403.14598 link
2024-03-21 Learning to Project for Cross-Task Knowledge Distillation Dylan Auty et.al. 2403.14494 null
2024-03-21 OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation Bohao Peng et.al. 2403.14418 link
2024-03-21 Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models Pablo Marcos-Manchón et.al. 2403.14291 link
2024-03-21 OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation Kwanyoung Kim et.al. 2403.14183 null
2024-03-21 Evidential Semantic Mapping in Off-road Environments with Uncertainty-aware Bayesian Kernel Inference Junyoung Kim et.al. 2403.14138 null
2024-03-21 Soft Masked Transformer for Point Cloud Processing with Skip Attention-Based Upsampling Yong He et.al. 2403.14124 null
2024-03-21 Semantics from Space: Satellite-Guided Thermal Semantic Segmentation Annotation for Aerial Field Robots Connor Lee et.al. 2403.14056 null
2024-03-20 When Cars meet Drones: Hyperbolic Federated Learning for Source-Free Domain Adaptation in Adverse Weather Giulia Rizzoli et.al. 2403.13762 null
2024-03-20 Next day fire prediction via semantic segmentation Konstantinos Alexis et.al. 2403.13545 null
2024-03-20 MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining Di Wang et.al. 2403.13430 link
2024-03-20 AMCO: Adaptive Multimodal Coupling of Vision and Proprioception for Quadruped Robot Navigation in Outdoor Environments Mohamed Elnoor et.al. 2403.13235 null
2024-03-20 Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation Linshan Wu et.al. 2403.13225 link
2024-03-19 Reflectivity Is All You Need!: Advancing LiDAR Semantic Segmentation Kasi Viswanath et.al. 2403.13188 null
2024-03-19 As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks? Anjun Hu et.al. 2403.12693 null
2024-03-19 PCT: Perspective Cue Training Framework for Multi-Camera BEV Segmentation Haruya Ishikawa et.al. 2403.12530 null
2024-03-19 Semantics, Distortion, and Style Matter: Towards Source-free UDA for Panoramic Segmentation Xu Zheng et.al. 2403.12505 null
2024-03-19 CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation Wenqi Zhu et.al. 2403.12455 link
2024-03-19 Multi-Object RANSAC: Efficient Plane Clustering Method in a Clutter Seunghyeon Lim et.al. 2403.12449 null
2024-03-18 EffiPerception: an Efficient Framework for Various Perception Tasks Xinhao Xiang et.al. 2403.12317 null
2024-03-18 Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery Yuqi Zhang et.al. 2403.11812 null
2024-03-18 Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation Wangbo Zhao et.al. 2403.11808 link
2024-03-18 LSKNet: A Foundation Lightweight Backbone for Remote Sensing Yuxuan Li et.al. 2403.11735 null
2024-03-18 TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models Lisa Weijler et.al. 2403.11691 null
2024-03-18 Better (pseudo-)labels for semi-supervised instance segmentation François Porcher et.al. 2403.11675 null
2024-03-18 Synthesizing multi-log grasp poses Arvid Fälldin et.al. 2403.11623 null
2024-03-18 OurDB: Ouroboric Domain Bridging for Multi-Target Domain Adaptive Semantic Segmentation Seungbeom Woo et.al. 2403.11582 null
2024-03-18 MISS: Memory-efficient Instance Segmentation Framework By Visual Inductive Priors Flow Propagation Chih-Chung Hsu et.al. 2403.11576 null
2024-03-18 Augment Before Copy-Paste: Data and Memory Efficiency-Oriented Instance Segmentation Framework for Sport-scenes Chih-Chung Hsu et.al. 2403.11572 null
2024-03-18 Circle Representation for Medical Instance Object Segmentation Juming Xiong et.al. 2403.11507 link
2024-03-18 MCD: Diverse Large-Scale Multi-Campus Dataset for Robot Perception Thien-Minh Nguyen et.al. 2403.11496 null
2024-03-18 Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting Mingkui Tan et.al. 2403.11491 null
2024-03-18 ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation Minh Tran et.al. 2403.11376 null
2024-03-14 PosSAM: Panoptic Open-vocabulary Segment Anything Vibashan VS et.al. 2403.09620 link
2024-03-14 WeakSurg: Weakly supervised surgical instrument segmentation using temporal equivariance and semantic continuity Qiyuan Wang et.al. 2403.09551 null
2024-03-14 Annotation Free Semantic Segmentation with Vision Foundation Models Soroush Seifi et.al. 2403.09307 null
2024-03-14 StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images Robert Jewsbury et.al. 2403.09302 link
2024-03-14 Customizing Segmentation Foundation Model via Prompt Learning for Instance Segmentation Hyung-Il Kim et.al. 2403.09199 null
2024-03-14 When Semantic Segmentation Meets Frequency Aliasing Linwei Chen et.al. 2403.09065 link
2024-03-13 CART: Caltech Aerial RGB-Thermal Dataset in the Wild Connor Lee et.al. 2403.08997 link
2024-03-13 SLCF-Net: Sequential LiDAR-Camera Fusion for Semantic Scene Completion using a 3D Recurrent U-Net Helin Cao et.al. 2403.08885 null
2024-03-13 Segmentation of Knee Bones for Osteoarthritis Assessment: A Comparative Analysis of Supervised, Few-Shot, and Zero-Shot Learning Approaches Yun Xin Teoh et.al. 2403.08761 null
2024-03-13 Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution Samuel Sze et.al. 2403.08748 null
2024-03-13 Semantic Segmentation of Solar Radio Spikes at Low Frequencies Pearse C. Murphy et.al. 2403.08546 null
2024-03-13 Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation Zicheng Zhang et.al. 2403.08426 null
2024-03-13 LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving Sicen Guo et.al. 2403.08215 null
2024-03-13 Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks Fuzhi Wu et.al. 2403.08157 link
2024-03-12 Mitigating the Impact of Attribute Editing on Face Recognition Sudipta Banerjee et.al. 2403.08092 null
2024-03-12 Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation Feilong Tang et.al. 2403.07630 link
2024-03-12 PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution Honghao Chen et.al. 2403.07589 null
2024-03-12 Open-World Semantic Segmentation Including Class Similarity Matteo Sodano et.al. 2403.07532 null
2024-03-11 Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation Theodore Barfoot et.al. 2403.06759 link
2024-03-11 Forest Inspection Dataset for Aerial Semantic Segmentation and Depth Estimation Bianca-Cerasela-Zelia Blaga et.al. 2403.06621 link
2024-03-11 OMH: Structured Sparsity via Optimally Matched Hierarchy for Unsupervised Semantic Segmentation Baran Ozaydin et.al. 2403.06546 null
2024-03-11 3D Semantic Segmentation-Driven Representations for 3D Object Detection Hayeon O et.al. 2403.06501 link
2024-03-11 Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy Jiuming Liu et.al. 2403.06467 link
2024-03-11 Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation Xiaoyang Wang et.al. 2403.06462 null
2024-03-11 Refining Segmentation On-the-Fly: An Interactive Framework for Point Cloud Semantic Segmentation Peng Zhang et.al. 2403.06401 null
2024-03-10 Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning Woo-Jin Ahn et.al. 2403.06122 link
2024-03-09 Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation Hairong Shi et.al. 2403.05912 null
2024-03-09 Segmentation Guided Sparse Transformer for Under-Display Camera Image Restoration Jingyun Xue et.al. 2403.05906 null
2024-03-08 Attention-guided Feature Distillation for Semantic Segmentation Amir M. Mansourian et.al. 2403.05451 link
2024-03-08 Generalized Correspondence Matching via Flexible Hierarchical Refinement and Patch Descriptor Distillation Yu Han et.al. 2403.05388 null
2024-03-08 Frequency-Adaptive Dilated Convolution for Semantic Segmentation Linwei Chen et.al. 2403.05369 link
2024-03-08 Embedded Deployment of Semantic Segmentation in Medicine through Low-Resolution Inputs Erik Ostrowski et.al. 2403.05340 null
2024-03-08 LVIC: Multi-modality segmentation by Lifting Visual Info as Cue Zichao Dong et.al. 2403.05159 null
2024-03-07 SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising Tao Zhou et.al. 2403.04194 link
2024-03-06 ECAP: Extensive Cut-and-Paste Augmentation for Unsupervised Domain Adaptive Semantic Segmentation Erik Brorsson et.al. 2403.03854 link
2024-03-06 Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision Yajie Liu et.al. 2403.03707 null
2024-03-06 Causal Prototype-inspired Contrast Adaptation for Unsupervised Domain Adaptive Semantic Segmentation of High-resolution Remote Sensing Imagery Jingru Zhu et.al. 2403.03704 null
2024-03-06 GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding Zi-Ting Chou et.al. 2403.03608 null
2024-03-06 Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator Wonhyeok Choi et.al. 2403.03468 null
2024-03-05 CenterDisks: Real-time instance segmentation with disk covering Katia Jodogne-Del Litto et.al. 2403.03296 link
2024-03-05 Improved LiDAR Odometry and Mapping using Deep Semantic Segmentation and Novel Outliers Detection Mohamed Afifi et.al. 2403.03111 null
2024-03-05 ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving Han Lu et.al. 2403.02877 null
2024-03-05 DDF: A Novel Dual-Domain Image Fusion Strategy for Remote Sensing Image Semantic Segmentation with Unsupervised Domain Adaptation Lingyan Ran et.al. 2403.02784 null
2024-03-05 Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels Zhuohong Li et.al. 2403.02746 null
2024-03-05 FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird’s-Eye View and Perspective View Jiawei Hou et.al. 2403.02710 null
2024-03-05 Deep Common Feature Mining for Efficient Video Semantic Segmentation Yaoyan Zheng et.al. 2403.02689 null
2024-03-04 Self-Supervised Facial Representation Learning with Facial Region Awareness Zheng Gao et.al. 2403.02138 null
2024-03-04 Semi-Supervised Semantic Segmentation Based on Pseudo-Labels: A Survey Lingyan Ran et.al. 2403.01909 null
2024-03-04 Map-aided annotation for pole base detection Benjamin Missaoui et.al. 2403.01868 null
2024-03-04 AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation Haonan Wang et.al. 2403.01818 link
2024-03-02 Benchmarking Segmentation Models with Mask-Preserved Attribute Editing Zijin Yin et.al. 2403.01231 link
2024-03-02 Boosting Box-supervised Instance Segmentation with Pseudo Depth Xinyi Yu et.al. 2403.01214 null
2024-03-02 Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation Lian Xu et.al. 2403.01156 null
2024-03-01 Rethinking Few-shot 3D Point Cloud Semantic Segmentation Zhaochong An et.al. 2403.00592 link
2024-03-01 Small, Versatile and Mighty: A Range-View Perception Framework Qiang Meng et.al. 2403.00325 null
2024-03-01 YOLO-MED : Multi-Task Interaction Network for Biomedical Images Suizhi Huang et.al. 2403.00245 null
2024-02-29 FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything Safouane El Ghazouali et.al. 2403.00175 link
2024-02-29 Leveraging AI Predicted and Expert Revised Annotations in Interactive Segmentation: Continual Tuning or Full Training? Tiezheng Zhang et.al. 2402.19423 null
2024-03-01 PEM: Prototype-based Efficient MaskFormer for Image Segmentation Niccolò Cavagnero et.al. 2402.19422 link
2024-02-29 RSAM-Seg: A SAM-based Approach with Prior Knowledge Integration for Remote Sensing Image Semantic Segmentation Jie Zhang et.al. 2402.19004 null
2024-02-28 Spatial Coherence Loss for Salient and Camouflaged Object Detection and Beyond Ziyun Yang et.al. 2402.18698 null
2024-02-29 Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation Zhiwei Yang et.al. 2402.18467 link
2024-02-29 A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation Francesco Barbato et.al. 2402.18402 null
2024-02-28 Enhancing Roadway Safety: LiDAR-based Tree Clearance Analysis Miriam Louise Carnot et.al. 2402.18309 null
2024-02-28 Feature Denoising For Low-Light Instance Segmentation Using Weighted Non-Local Blocks Joanne Lin et.al. 2402.18307 null
2024-02-28 Self-Supervised Learning in Electron Microscopy: Towards a Foundation Model for Advanced Image Analysis Bashir Kazimi et.al. 2402.18286 null
2024-02-28 PRCL: Probabilistic Representation Contrastive Learning for Semi-Supervised Semantic Segmentation Haoyu Xie et.al. 2402.18117 null
2024-02-28 Spannotation: Enhancing Semantic Segmentation for Autonomous Navigation with Efficient Image Annotation Samuel O. Folorunsho et.al. 2402.18084 link
2024-02-27 Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation Xinyu Yang et.al. 2402.17891 link
2024-02-27 Mitigating Distributional Shift in Semantic Segmentation via Uncertainty Estimation from Unlabelled Data David S. W. Williams et.al. 2402.17653 null
2024-02-27 Masked Gamma-SSL: Learning Uncertainty Estimation via Masked Image Modeling David S. W. Williams et.al. 2402.17622 null
