Semantic Segmentation
| Publish Date | Title | Authors | arXiv ID | Code |
|---|---|---|---|---|
| 2025-12-18 | Next-Embedding Prediction Makes Strong Vision Learners | Sihan Xu et.al. | 2512.16922 | null |
| 2025-12-18 | Task-Oriented Data Synthesis and Control-Rectify Sampling for Remote Sensing Semantic Segmentation | Yunkai Yang et.al. | 2512.16740 | null |
| 2025-12-18 | Causal-Tune: Mining Causal Factors from Vision Foundation Models for Domain Generalized Semantic Segmentation | Yin Zhang et.al. | 2512.16567 | null |
| 2025-12-18 | PixelArena: A benchmark for Pixel-Precision Visual Intelligence | Feng Liang et.al. | 2512.16303 | null |
| 2025-12-17 | In Pursuit of Pixel Supervision for Visual Pre-training | Lihe Yang et.al. | 2512.15715 | null |
| 2025-12-17 | MoonSeg3R: Monocular Online Zero-Shot Segment Anything in 3D with Reconstructive Foundation Priors | Zhipeng Du et.al. | 2512.15577 | null |
| 2025-12-17 | SemanticBridge - A Dataset for 3D Semantic Segmentation of Bridges and Domain Gap Analysis | Maximilian Kellner et.al. | 2512.15369 | null |
| 2025-12-17 | Vision-based module for accurately reading linear scales in a laboratory | Parvesh Saini et.al. | 2512.15327 | null |
| 2025-12-17 | SynthSeg-Agents: Multi-Agent Synthetic Data Generation for Zero-Shot Weakly Supervised Semantic Segmentation | Wangyu Wu et.al. | 2512.15310 | null |
| 2025-12-16 | Segmental Attention Decoding With Long Form Acoustic Encodings | Pawel Swietojanski et.al. | 2512.14652 | null |
| 2025-12-16 | S2D: Sparse-To-Dense Keymask Distillation for Unsupervised Video Instance Segmentation | Leon Sick et.al. | 2512.14440 | null |
| 2025-12-16 | DriverGaze360: OmniDirectional Driver Attention with Object-Level Guidance | Shreedhar Govil et.al. | 2512.14266 | null |
| 2025-12-16 | Consistent Instance Field for Dynamic Scene Understanding | Junyi Wu et.al. | 2512.14126 | null |
| 2025-12-16 | ChartAgent: A Chart Understanding Framework with Tool Integrated Reasoning | Boran Wang et.al. | 2512.14040 | null |
| 2025-12-16 | Deep Learning Perspective of Scene Understanding in Autonomous Robots | Afia Maham et.al. | 2512.14020 | null |
| 2025-12-15 | Seeing the Whole Picture: Distribution-Guided Data-Free Distillation for Semantic Segmentation | Hongxuan Sun et.al. | 2512.13175 | null |
| 2025-12-15 | JoDiffusion: Jointly Diffusing Image with Pixel-Level Annotations for Semantic Segmentation Promotion | Haoyu Wang et.al. | 2512.13014 | null |
| 2025-12-15 | TWLR: Text-Guided Weakly-Supervised Lesion Localization and Severity Regression for Explainable Diabetic Retinopathy Grading | Xi Luo et.al. | 2512.13008 | null |
| 2025-12-13 | OMUDA: Omni-level Masking for Unsupervised Domain Adaptation in Semantic Segmentation | Yang Ou et.al. | 2512.12303 | null |
| 2025-12-12 | Enhancing deep learning performance on burned area delineation from SPOT-6/7 imagery for emergency management | Maria Rodriguez et.al. | 2512.12056 | null |
| 2025-12-09 | Generalization vs. Specialization: Evaluating Segment Anything Model (SAM3) Zero-Shot Segmentation Against Fine-Tuned YOLO Detectors | Ranjan Sapkota et.al. | 2512.11884 | null |
| 2025-12-07 | Pseudo-Label Refinement for Robust Wheat Head Segmentation via Two-Stage Hybrid Training | Jiahao Jiang et.al. | 2512.11874 | null |
| 2025-12-12 | Referring Change Detection in Remote Sensing Imagery | Yilmaz Korkmaz et.al. | 2512.11719 | null |
| 2025-12-12 | DOS: Distilling Observable Softmaps of Zipfian Prototypes for Self-Supervised Point Representation | Mohamed Abdelsamad et.al. | 2512.11465 | null |
| 2025-12-12 | Out-of-Distribution Segmentation via Wasserstein-Based Evidential Uncertainty | Arnold Brosch et.al. | 2512.11373 | null |
| 2025-12-12 | VFMF: World Modeling by Forecasting Vision Foundation Model Features | Gabrijel Boduljak et.al. | 2512.11225 | null |
| 2025-12-11 | Take a Peek: Efficient Encoder Adaptation for Few-Shot Semantic Segmentation via LoRA | Pasquale De Marinis et.al. | 2512.10521 | null |
| 2025-12-11 | Hybrid Transformer-Mamba Architecture for Weakly Supervised Volumetric Medical Segmentation | Yiheng Lyu et.al. | 2512.10353 | null |
| 2025-12-11 | ConStruct: Structural Distillation of Foundation Models for Prototype-Based Weakly Supervised Histopathology Segmentation | Khang Le et.al. | 2512.10316 | null |
| 2025-12-11 | DualProtoSeg: Simple and Efficient Design with Text- and Image-Guided Prototype Learning for Weakly Supervised Histopathology Image Segmentation | Anh M. Vu et.al. | 2512.10314 | null |
| 2025-12-10 | NordFKB: a fine-grained benchmark dataset for geospatial AI in Norway | Sander Riisøen Jyhne et.al. | 2512.09913 | null |
| 2025-12-10 | ASSIST-3D: Adapted Scene Synthesis for Class-Agnostic 3D Instance Segmentation | Shengchao Zhou et.al. | 2512.09364 | null |
| 2025-12-10 | ROI-Packing: Efficient Region-Based Compression for Machine Vision | Md Eimran Hossain Eimon et.al. | 2512.09258 | null |
| 2025-12-09 | SIP: Site in Pieces- A Dataset of Disaggregated Construction-Phase 3D Scans for Semantic Segmentation and Scene Understanding | Seongyong Kim et.al. | 2512.09062 | null |
| 2025-12-09 | Persistent Homology for Labeled Datasets: Gromov-Hausdorff Stability and Generalized Landscapes | Yaoying Fu et.al. | 2512.08794 | null |
| 2025-12-09 | SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images | Kaiyu Li et.al. | 2512.08730 | null |
| 2025-12-09 | Instance-Aware Test-Time Segmentation for Continual Domain Shifts | Seunghwan Lee et.al. | 2512.08569 | null |
| 2025-12-09 | Query-aware Hub Prototype Learning for Few-Shot 3D Point Cloud Semantic Segmentation | YiLin Zhou et.al. | 2512.08253 | null |
| 2025-12-08 | Restrictive Hierarchical Semantic Segmentation for Stratified Tooth Layer Detection | Ryan Banks et.al. | 2512.07984 | null |
| 2025-12-08 | Structure-Aware Feature Rectification with Region Adjacency Graphs for Training-Free Open-Vocabulary Semantic Segmentation | Qiming Huang et.al. | 2512.07360 | null |
| 2025-12-08 | Generalized Referring Expression Segmentation on Aerial Photos | Luís Marnoto et.al. | 2512.07338 | null |
| 2025-12-08 | A graph generation pipeline for critical infrastructures based on heuristics, images and depth data | Mike Diessner et.al. | 2512.07269 | null |
| 2025-12-07 | Power of Boundary and Reflection: Semantic Transparent Object Segmentation using Pyramid Vision Transformer with Transparent Cues | Tuan-Anh Vu et.al. | 2512.07034 | null |
| 2025-12-07 | Selective Masking based Self-Supervised Learning for Image Semantic Segmentation | Yuemin Wang et.al. | 2512.06981 | null |
| 2025-12-07 | Balanced Learning for Domain Adaptive Semantic Segmentation | Wangkai Li et.al. | 2512.06886 | null |
| 2025-12-07 | Hierarchical Image-Guided 3D Point Cloud Segmentation in Industrial Scenes via Multi-View Bayesian Fusion | Yu Zhu et.al. | 2512.06882 | null |
| 2025-12-07 | Towards Robust Pseudo-Label Learning in Semantic Segmentation: An Encoding Perspective | Wangkai Li et.al. | 2512.06870 | null |
| 2025-12-07 | Boosting Unsupervised Video Instance Segmentation with Automatic Quality-Guided Self-Training | Kaixuan Lu et.al. | 2512.06864 | null |
| 2025-12-07 | FedDSR: Federated Deep Supervision and Regularization Towards Autonomous Driving | Wei-Bin Kou et.al. | 2512.06676 | null |
| 2025-12-07 | Statistic-Augmented, Decoupled MoE Routing and Aggregating in Autonomous Driving | Wei-Bin Kou et.al. | 2512.06664 | null |
| 2025-12-07 | CoT4Det: A Chain-of-Thought Framework for Perception-Oriented Vision-Language Tasks | Yu Qi et.al. | 2512.06663 | null |
| 2025-12-06 | Are AI-Generated Driving Videos Ready for Autonomous Driving? A Diagnostic Evaluation Framework | Xinhao Xiang et.al. | 2512.06376 | null |
| 2025-12-03 | Fast and Flexible Robustness Certificates for Semantic Segmentation | Thomas Massena et.al. | 2512.06010 | null |
| 2025-11-30 | Stronger is not better: Better Augmentations in Contrastive Learning for Medical Image Segmentation | Azeez Idris et.al. | 2512.05992 | null |
| 2025-12-05 | LPD: Learnable Prototypes with Diversity Regularization for Weakly Supervised Histopathology Segmentation | Khang Le et.al. | 2512.05922 | null |
| 2025-12-05 | Label-Efficient Point Cloud Segmentation with Active Learning | Johannes Meyer et.al. | 2512.05759 | null |
| 2025-12-05 | DistillFSS: Synthesizing Few-Shot Knowledge into a Lightweight Segmentation Model | Pasquale De Marinis et.al. | 2512.05613 | null |
| 2025-12-01 | FlowEO: Generative Unsupervised Domain Adaptation for Earth Observation | Georges Le Bellier et.al. | 2512.05140 | null |
| 2025-12-04 | GeoPE:A Unified Geometric Positional Embedding for Structured Tensors | Yupu Yao et.al. | 2512.04963 | null |
| 2025-12-04 | MT-Depth: Multi-task Instance feature analysis for the Depth Completion | Abdul Haseeb Nizamani et.al. | 2512.04734 | null |
| 2025-12-04 | DuGI-MAE: Improving Infrared Mask Autoencoders via Dual-Domain Guidance | Yinghui Xing et.al. | 2512.04511 | null |
| 2025-12-03 | A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection | Shahid Ansari et.al. | 2512.03684 | null |
| 2025-12-03 | OpenTrack3D: Towards Accurate and Generalizable Open-Vocabulary 3D Instance Segmentation | Zhishan Zhou et.al. | 2512.03532 | null |
| 2025-12-03 | Exploiting Domain Properties in Language-Driven Domain Generalization for Semantic Segmentation | Seogkyu Jeon et.al. | 2512.03508 | null |
| 2025-12-02 | Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks | Matthew Dutson et.al. | 2512.03014 | null |
| 2025-12-02 | Enhancing Floor Plan Recognition: A Hybrid Mix-Transformer and U-Net Approach for Precise Wall Segmentation | Dmitriy Parashchuk et.al. | 2512.02413 | null |
| 2025-12-02 | Reproducing and Extending RaDelft 4D Radar with Camera-Assisted Labels | Kejia Hu et.al. | 2512.02394 | null |
| 2025-12-02 | SAGE: Style-Adaptive Generalization for Privacy-Constrained Semantic Segmentation Across Domains | Qingmei Li et.al. | 2512.02369 | null |
| 2025-12-01 | Multifractal Recalibration of Neural Networks for Medical Imaging Segmentation | Miguel L. Martins et.al. | 2512.02198 | null |
| 2025-12-01 | Evaluating SAM2 for Video Semantic Segmentation | Syed Hesham Syed Ariff et.al. | 2512.01774 | null |
| 2025-12-01 | SSR: Semantic and Spatial Rectification for CLIP-based Weakly Supervised Segmentation | Xiuli Bi et.al. | 2512.01701 | null |
| 2025-12-01 | ViT$^3$: Unlocking Test-Time Training in Vision | Dongchen Han et.al. | 2512.01643 | null |
| 2025-12-01 | Toward Content-based Indexing and Retrieval of Head and Neck CT with Abscess Segmentation | Thao Thi Phuong Dao et.al. | 2512.01589 | null |
| 2025-12-01 | ELVIS: Enhance Low-Light for Video Instance Segmentation in the Dark | Joanne Lin et.al. | 2512.01495 | null |
| 2025-12-01 | Panda: Self-distillation of Reusable Sensor-level Representations for High Energy Physics | Samuel Young et.al. | 2512.01324 | null |
| 2025-12-01 | TabletopGen: Instance-Level Interactive 3D Tabletop Scene Generation from Text or Single Image | Ziqian Wang et.al. | 2512.01204 | null |
| 2025-11-30 | Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation | An Yang et.al. | 2512.00944 | null |
| 2025-11-30 | The Outline of Deception: Physical Adversarial Attacks on Traffic Signs Using Edge Patches | Haojie Ji et.al. | 2512.00765 | null |
| 2025-11-30 | VFM-ISRefiner: Towards Better Adapting Vision Foundation Models for Interactive Segmentation of Remote Sensing Images | Deliang Wang et.al. | 2512.00718 | null |
| 2025-11-29 | Doppler-Enhanced Deep Learning: Improving Thyroid Nodule Segmentation with YOLOv5 Instance Segmentation | Mahmoud El Hussieni et.al. | 2512.00639 | null |
| 2025-11-29 | EZ-SP: Fast and Lightweight Superpoint-Based 3D Segmentation | Louis Geist et.al. | 2512.00385 | null |
| 2025-11-29 | Breaking It Down: Domain-Aware Semantic Segmentation for Retrieval Augmented Generation | Aparajitha Allamraju et.al. | 2512.00367 | null |
| 2025-11-29 | Towards aligned body representations in vision models | Andrey Gizdov et.al. | 2512.00365 | null |
| 2025-11-28 | Learning to Predict Aboveground Biomass from RGB Images with 3D Synthetic Scenes | Silvia Zuffi et.al. | 2511.23249 | null |
| 2025-11-28 | Taming the Light: Illumination-Invariant Semantic 3DGS-SLAM | Shouhe Zhang et.al. | 2511.22968 | null |
| 2025-11-28 | Do We Need Perfect Data? Leveraging Noise for Domain Generalized Segmentation | Taeyeong Kim et.al. | 2511.22948 | null |
| 2025-11-27 | GazeTrack: High-Precision Eye Tracking Based on Regularization and Spatial Computing | Xiaoyin Yang et.al. | 2511.22607 | null |
| 2025-11-27 | 3D Affordance Keypoint Detection for Robotic Manipulation | Zhiyang Liu et.al. | 2511.22195 | null |
| 2025-11-26 | OpenTwinMap: An Open-Source Digital Twin Generator for Urban Autonomous Driving | Alex Richardson et.al. | 2511.21925 | null |
| 2025-11-26 | ReSAM: Refine, Requery, and Reinforce: Self-Prompting Point-Supervised Segmentation for Remote Sensing Images | M. Naseer Subhani et.al. | 2511.21606 | null |
| 2025-11-26 | Shift-Equivariant Complex-Valued Convolutional Neural Networks | Quentin Gabot et.al. | 2511.21250 | null |
| 2025-11-25 | Open Vocabulary Compositional Explanations for Neuron Alignment | Biagio La Rosa et.al. | 2511.20931 | null |
| 2025-11-25 | Automated Monitoring of Cultural Heritage Artifacts Using Semantic Segmentation | Andrea Ranieri et.al. | 2511.20541 | null |
| 2025-11-25 | CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation | Shilei Cao et.al. | 2511.20302 | null |
| 2025-11-25 | SAM-MI: A Mask-Injected Framework for Enhancing Open-Vocabulary Semantic Segmentation with SAM | Lin Chen et.al. | 2511.20027 | null |
| 2025-11-25 | Supervise Less, See More: Training-free Nuclear Instance Segmentation with Prototype-Guided Prompting | Wen Zhang et.al. | 2511.19953 | null |
| 2025-11-24 | Lightweight Transformer Framework for Weakly Supervised Semantic Segmentation | Ali Torabi et.al. | 2511.19765 | null |
| 2025-11-24 | RADSeg: Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models | Omar Alama et.al. | 2511.19704 | null |
| 2025-11-24 | Studying Maps at Scale: A Digital Investigation of Cartography and the Evolution of Figuration | Remi Petitpierre et.al. | 2511.19538 | null |
| 2025-11-24 | BackSplit: The Importance of Sub-dividing the Background in Biomedical Lesion Segmentation | Rachit Saluja et.al. | 2511.19394 | null |
| 2025-11-24 | nnActive: A Framework for Evaluation of Active Learning in 3D Biomedical Segmentation | Carsten T. Lüth et.al. | 2511.19183 | null |
| 2025-11-24 | DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detection | Hai Ci et.al. | 2511.19111 | null |
| 2025-11-24 | SupLID: Geometrical Guidance for Out-of-Distribution Detection in Semantic Segmentation | Nimeshika Udayangani et.al. | 2511.18816 | null |
| 2025-11-24 | PartDiffuser: Part-wise 3D Mesh Generation via Discrete Diffusion | Yichen Yang et.al. | 2511.18801 | null |
| 2025-11-23 | SegSplat: Feed-forward Gaussian Splatting and Open-Set Semantic Segmentation | Peter Siegel et.al. | 2511.18386 | null |
| 2025-11-23 | UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization | Siyi Li et.al. | 2511.18254 | null |
| 2025-11-22 | Matching-Based Few-Shot Semantic Segmentation Models Are Interpretable by Design | Pasquale De Marinis et.al. | 2511.18163 | null |
| 2025-11-22 | AdaPerceiver: Transformers with Adaptive Width, Depth, and Tokens | Purvish Jajal et.al. | 2511.18105 | null |
| 2025-11-18 | HSMix: Hard and Soft Mixing Data Augmentation for Medical Image Segmentation | Danyang Sun et.al. | 2511.17614 | null |
| 2025-11-21 | Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift | Björn Michele et.al. | 2511.17455 | null |
| 2025-11-21 | REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing | Binger Chen et.al. | 2511.17442 | null |
| 2025-11-21 | FisheyeGaussianLift: BEV Feature Lifting for Surround-View Fisheye Camera Perception | Shubham Sonarghare et.al. | 2511.17210 | null |
| 2025-11-20 | Late-decoupled 3D Hierarchical Semantic Segmentation with Semantic Prototype Discrimination based Bi-branch Supervision | Shuyu Cao et.al. | 2511.16650 | null |
| 2025-11-20 | Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling | Minseok Seo et.al. | 2511.16301 | null |
| 2025-11-20 | Target Refocusing via Attention Redistribution for Open-Vocabulary Semantic Segmentation: An Explainability Perspective | Jiahao Li et.al. | 2511.16170 | null |
| 2025-11-20 | InfoCLIP: Bridging Vision-Language Pretraining and Open-Vocabulary Semantic Segmentation via Information-Theoretic Alignment Transfer | Muyao Yuan et.al. | 2511.15967 | null |
| 2025-11-19 | Automatic Uncertainty-Aware Synthetic Data Bootstrapping for Historical Map Segmentation | Lukas Arzoumanidis et.al. | 2511.15875 | null |
| 2025-11-19 | GEO-Bench-2: From Performance to Capability, Rethinking Evaluation in Geospatial AI | Naomi Simumba et.al. | 2511.15658 | null |
| 2025-11-19 | Multi-Text Guided Few-Shot Semantic Segmentation | Qiang Jiao et.al. | 2511.15515 | null |
| 2025-11-19 | WarNav: An Autonomous Driving Benchmark for Segmentation of Navigable Zones in War Scenes | Marc-Emmanuel Coupvent des Graviers et.al. | 2511.15429 | null |
| 2025-11-19 | Controlling False Positives in Image Segmentation via Conformal Prediction | Luca Mossina et.al. | 2511.15406 | null |
| 2025-11-18 | EGSA-PT:Edge-Guided Spatial Attention with Progressive Training for Monocular Depth Estimation and Segmentation of Transparent Objects | Gbenga Omotara et.al. | 2511.14970 | null |
| 2025-11-18 | FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding | Zhenshi Li et.al. | 2511.14901 | null |
| 2025-11-18 | Segmentation-Aware Latent Diffusion for Satellite Image Super-Resolution: Enabling Smallholder Farm Boundary Delineation | Aditi Agarwal et.al. | 2511.14481 | null |
| 2025-11-18 | Step by Step Network | Dongchen Han et.al. | 2511.14329 | null |
| 2025-11-18 | Orion: A Unified Visual Agent for Multimodal Perception, Advanced Visual Reasoning and Execution | N Dinesh Reddy et.al. | 2511.14210 | null |
| 2025-11-17 | Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting | Jiangnan Ye et.al. | 2511.13684 | null |
| 2025-11-17 | Mapping the Vanishing and Transformation of Urban Villages in China | Wenyu Zhang et.al. | 2511.13507 | null |
| 2025-11-17 | Delineate Anything Flow: Fast, Country-Level Field Boundary Detection from Any Source | Mykola Lavreniuk et.al. | 2511.13417 | null |
| 2025-11-17 | DiffPixelFormer: Differential Pixel-Aware Transformer for RGB-D Indoor Scene Segmentation | Yan Gong et.al. | 2511.13047 | null |
| 2025-11-15 | FaNe: Towards Fine-Grained Cross-Modal Contrast with False-Negative Reduction and Text-Conditioned Sparse Attention | Peng Zhang et.al. | 2511.12215 | null |
| 2025-11-15 | Evaluation of Attention Mechanisms in U-Net Architectures for Semantic Segmentation of Brazilian Rock Art Petroglyphs | Leonardi Melo et.al. | 2511.11959 | null |
| 2025-11-14 | Chain-of-Generation: Progressive Latent Diffusion for Text-Guided Molecular Design | Lingxiao Li et.al. | 2511.11894 | null |
| 2025-11-14 | Advancing Annotat3D with Harpia: A CUDA-Accelerated Library For Large-Scale Volumetric Data Segmentation | Camila Machado de Araujo et.al. | 2511.11890 | null |
| 2025-11-13 | AdaptFly: Prompt-Guided Adaptation of Foundation Models for Low-Altitude UAV Networks | Jiao Chen et.al. | 2511.11720 | null |
| 2025-11-14 | Terrain Costmap Generation via Scaled Preference Conditioning | Luisa Mao et.al. | 2511.11529 | null |
| 2025-11-13 | Histology-informed tiling of whole tissue sections improves the interpretability and predictability of cancer relapse and genetic alterations | Willem Bonnaffé et.al. | 2511.10432 | null |
| 2025-11-13 | Domain Adaptation for Camera-Specific Image Characteristics using Shallow Discriminators | Maximiliane Gruber et.al. | 2511.10424 | null |
| 2025-11-13 | DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Semantic Instance Segmentation | Xuexun Liu et.al. | 2511.10003 | null |
| 2025-11-04 | Do Street View Imagery and Public Participation GIS align: Comparative Analysis of Urban Attractiveness | Milad Malekzadeh et.al. | 2511.05570 | null |
| 2025-11-03 | Compressing Multi-Task Model for Autonomous Driving via Pruning and Knowledge Distillation | Jiayuan Wang et.al. | 2511.05557 | null |
| 2025-11-06 | An Active Learning Pipeline for Biomedical Image Instance Segmentation with Minimal Human Intervention | Shuo Zhao et.al. | 2511.04811 | null |
| 2025-11-06 | Cambrian-S: Towards Spatial Supersensing in Video | Shusheng Yang et.al. | 2511.04670 | null |
| 2025-11-06 | Vitessce Link: A Mixed Reality and 2D Display Hybrid Approach for Visual Analysis of 3D Tissue Maps | Eric Mörth et.al. | 2511.04262 | null |
| 2025-11-06 | CaRF: Enhancing Multi-View Consistency in Referring 3D Gaussian Splatting Segmentation | Yuwen Tao et.al. | 2511.03992 | null |
| 2025-11-05 | Laugh, Relate, Engage: Stylized Comment Generation for Short Videos | Xuan Ouyang et.al. | 2511.03757 | null |
| 2025-11-05 | Computational Imaging Meets LLMs: Zero-Shot IDH Mutation Prediction in Brain Gliomas | Syed Muqeem Mahmood et.al. | 2511.03376 | null |
| 2025-11-05 | Enhancing Medical Image Segmentation via Heat Conduction Equation | Rong Wu et.al. | 2511.03260 | null |
| 2025-11-05 | Diffusion-Guided Mask-Consistent Paired Mixing for Endoscopic Image Segmentation | Pengyu Jie et.al. | 2511.03219 | null |
| 2025-11-05 | Subsampled Randomized Fourier GaLore for Adapting Foundation Models in Depth-Driven Liver Landmark Segmentation | Yun-Chen Lin et.al. | 2511.03163 | null |
| 2025-11-05 | Accelerating Physical Property Reasoning for Augmented Visual Cognition | Hongbo Lan et.al. | 2511.03126 | null |
| 2025-11-04 | Learning with less: label-efficient land cover classification at very high spatial resolution using self-supervised deep learning | Dakota Hester et.al. | 2511.03004 | null |
| 2025-11-04 | Comprehensive Assessment of LiDAR Evaluation Metrics: A Comparative Study Using Simulated and Real Data | Syed Mostaquim Ali et.al. | 2511.02994 | null |
| 2025-11-04 | Digital Twin-Driven Pavement Health Monitoring and Maintenance Optimization Using Graph Neural Networks | Mohsin Mahmud Topu et.al. | 2511.02957 | null |
| 2025-11-04 | Optimizing the nnU-Net model for brain tumor (Glioma) segmentation Using a BraTS Sub-Saharan Africa (SSA) dataset | Chukwuemeka Arua Kalu et.al. | 2511.02893 | null |
| 2025-11-02 | Digitizing Spermatogenesis Lineage at Nanoscale Resolution In Tissue-Level Electron Microscopy | Li Xiao et.al. | 2511.02860 | null |
| 2025-11-04 | Densemarks: Learning Canonical Embeddings for Human Heads Images via Point Tracks | Dmitrii Pozdeev et.al. | 2511.02830 | null |
| 2025-11-04 | PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction & Editing | Antonio Oroz et.al. | 2511.02777 | null |
| 2025-11-04 | Resource-efficient Automatic Refinement of Segmentations via Weak Supervision from Light Feedback | Alix de Langlais et.al. | 2511.02576 | null |
| 2025-11-04 | ESA: Energy-Based Shot Assembly Optimization for Automatic Video Editing | Yaosen Chen et.al. | 2511.02505 | null |
| 2025-11-04 | Synthetic Crop-Weed Image Generation and its Impact on Model Generalization | Garen Boyadjian et.al. | 2511.02417 | null |
| 2025-11-04 | Revisiting put-that-there, context aware window interactions via LLMs | Riccardo Bovo et.al. | 2511.02378 | null |
| 2025-11-04 | From Instance Segmentation to 3D Growth Trajectory Reconstruction in Planktonic Foraminifera | Huahua Lin et.al. | 2511.02142 | null |
| 2025-11-03 | Terrain-Enhanced Resolution-aware Refinement Attention for Off-Road Segmentation | Seongkyu Choi et.al. | 2511.01434 | null |
| 2025-11-03 | MIQ-SAM3D: From Single-Point Prompt to Multi-Instance Segmentation via Competitive Query Refinement | Jierui Qu et.al. | 2511.01345 | null |
| 2025-11-03 | Source-Only Cross-Weather LiDAR via Geometry-Aware Point Drop | YoungJae Cheong et.al. | 2511.01250 | null |
| 2025-11-03 | CenterMamba-SAM: Center-Prioritized Scanning and Temporal Prototypes for Brain Lesion Segmentation | Yu Tian et.al. | 2511.01243 | null |
| 2025-11-03 | An Enhanced Proprioceptive Method for Soft Robots Integrating Bend Sensors and IMUs | Dong Heon Han et.al. | 2511.01165 | null |
| 2025-11-03 | MicroAUNet: Boundary-Enhanced Multi-scale Fusion with Knowledge Distillation for Colonoscopy Polyp Image Segmentation | Ziyi Wang et.al. | 2511.01143 | null |
| 2025-11-02 | URDF-Anything: Constructing Articulated Objects with 3D Multimodal Language Model | Zhe Li et.al. | 2511.00940 | null |
| 2025-11-02 | TA-LSDiff:Topology-Aware Diffusion Guided by a Level Set Energy for Pancreas Segmentation | Yue Gou et.al. | 2511.00815 | null |
| 2025-11-02 | Rhythm in the Air: Vision-based Real-Time Music Generation through Gestures | Barathi Subramanian et.al. | 2511.00793 | null |
| 2025-11-02 | Class-agnostic 3D Segmentation by Granularity-Consistent Automatic 2D Mask Tracking | Juan Wang et.al. | 2511.00785 | null |
| 2025-11-01 | Grounding Surgical Action Triplets with Instrument Instance Segmentation: A Dataset and Target-Aware Fusion Approach | Oluwatosin Alabi et.al. | 2511.00643 | null |
| 2025-11-01 | Text-guided Fine-Grained Video Anomaly Detection | Jihao Gu et.al. | 2511.00524 | null |
| 2025-11-01 | Optimization of continuous-flow over traffic networks with fundamental diagram constraints | Anqi Dong et.al. | 2511.00500 | null |
| 2025-11-01 | HumanCrafter: Synergizing Generalizable Human Reconstruction and Semantic 3D Segmentation | Panwang Pan et.al. | 2511.00468 | null |
| 2025-11-01 | Tree Training: Accelerating Agentic LLMs Training via Shared Prefix Reuse | Shaojie Wang et.al. | 2511.00413 | null |
| 2025-10-31 | Predicting the spatial distribution and demographics of commercial swine farms in the United States | Felipe E. Sanchez et.al. | 2511.00132 | null |
| 2025-10-29 | Habitat and Land Cover Change Detection in Alpine Protected Areas: A Comparison of AI Architectures | Harald Kristen et.al. | 2511.00073 | null |
| 2025-10-31 | VessShape: Few-shot 2D blood vessel segmentation by leveraging shape priors from synthetic images | Cesar H. Comin et.al. | 2510.27646 | null |
| 2025-10-31 | Context-Gated Cross-Modal Perception with Visual Mamba for PET-CT Lung Tumor Segmentation | Elena Mulero Ayllón et.al. | 2510.27508 | null |
| 2025-10-31 | Mask-to-Height: A YOLOv11-Based Architecture for Joint Building Instance Segmentation and Height Classification from Satellite Imagery | Mahmoud El Hussieni et.al. | 2510.27224 | null |
| 2025-10-31 | SpecAware: A Spectral-Content Aware Foundation Model for Unifying Multi-Sensor Learning in Hyperspectral Remote Sensing Mapping | Renjie Ji et.al. | 2510.27219 | null |
| 2025-10-31 | MLPerf Automotive | Radoyeh Shojaei et.al. | 2510.27065 | null |
| 2025-10-30 | AD-SAM: Fine-Tuning the Segment Anything Vision Foundation Model for Autonomous Driving Perception | Mario Camarena et.al. | 2510.27047 | null |
| 2025-10-30 | Photometric Redshifts in JWST Deep Fields: A Pixel-Based Alternative with DeepDISC | Grant Merz et.al. | 2510.27032 | null |
| 2025-10-30 | Surpassing state of the art on AMD area estimation from RGB fundus images through careful selection of U-Net architectures and loss functions for class imbalance | Valentyna Starodub et.al. | 2510.26778 | null |
| 2025-10-30 | Revisiting Generative Infrared and Visible Image Fusion Based on Human Cognitive Laws | Lin Guo et.al. | 2510.26268 | null |
| 2025-10-29 | BikeScenes: Online LiDAR Semantic Segmentation for Bicycles | Denniz Goren et.al. | 2510.25901 | null |
| 2025-10-29 | StreamingCoT: A Dataset for Temporal Dynamics and Multimodal Chain-of-Thought Reasoning in Streaming VideoQA | Yuhang Hu et.al. | 2510.25332 | null |
| 2025-10-29 | LangHOPS: Language Grounded Hierarchical Open-Vocabulary Part Segmentation | Yang Miao et.al. | 2510.25263 | null |
| 2025-10-29 | Mapping and Classification of Trees Outside Forests using Deep Learning | Moritz Lucas et.al. | 2510.25239 | null |
| 2025-10-29 | Classifier Enhancement Using Extended Context and Domain Experts for Semantic Segmentation | Huadong Tang et.al. | 2510.25174 | null |
| 2025-10-29 | EA3D: Online Open-World 3D Object Extraction from Streaming Videos | Xiaoyu Zhou et.al. | 2510.25146 | null |
| 2025-10-29 | Region-CAM: Towards Accurate Object Regions in Class Activation Maps for Weakly Supervised Learning Tasks | Qingdong Cai et.al. | 2510.25134 | null |
| 2025-10-28 | A Critical Study towards the Detection of Parkinsons Disease using ML Technologies | Vivek Chetia et.al. | 2510.24456 | null |
| 2025-10-28 | A Quantitative Evaluation Framework for Explainable AI in Semantic Segmentation | Reem Hammoud et.al. | 2510.24414 | null |
| 2025-10-27 | Improving Visual Discriminability of CLIP for Training-Free Open-Vocabulary Semantic Segmentation | Jinxin Zhou et.al. | 2510.23894 | null |
| 2025-10-27 | DPGLA: Bridging the Gap between Synthetic and Real Data for Unsupervised Domain Adaptation in 3D LiDAR Semantic Segmentation | Wanmeng Li et.al. | 2510.23525 | null |
| 2025-10-27 | One-Timestep is Enough: Achieving High-performance ANN-to-SNN Conversion via Scale-and-Fire Neurons | Qiuyang Chen et.al. | 2510.23383 | null |
| 2025-10-27 | Seq-DeepIPC: Sequential Sensing for End-to-End Control in Legged Robot Navigation | Oskar Natan et.al. | 2510.23057 | null |
| 2025-10-26 | WaveMAE: Wavelet decomposition Masked Auto-Encoder for Remote Sensing | Vittorio Bernuzzi et.al. | 2510.22697 | null |
| 2025-10-26 | A Critical Study on Tea Leaf Disease Detection using Deep Learning Techniques | Nabajyoti Borah et.al. | 2510.22647 | null |
| 2025-10-26 | SABlock: Semantic-Aware KV Cache Eviction with Adaptive Compression Block Size | Jinhan Chen et.al. | 2510.22556 | null |
| 2025-10-25 | Real-Time Semantic Segmentation on FPGA for Autonomous Vehicles Using LMIINet with the CGRA4ML Framework | Amir Mohammad Khadem Hosseini et.al. | 2510.22243 | null |
| 2025-10-25 | Diffusion-Driven Two-Stage Active Learning for Low-Budget Semantic Segmentation | Jeongin Kim et.al. | 2510.22229 | null |
| 2025-10-25 | Simplifying Knowledge Transfer in Pretrained Models | Siddharth Jain et.al. | 2510.22208 | null |
| 2025-10-25 | Bridging Perception and Reasoning: Dual-Pipeline Neuro-Symbolic Landing for UAVs in Cluttered Environments | Weixian Qian et.al. | 2510.22204 | null |
| 2025-10-24 | AURASeg: Attention Guided Upsampling with Residual Boundary-Assistive Refinement for Drivable-Area Segmentation | Narendhiran Vijayakumar et.al. | 2510.21536 | null |
| 2025-10-24 | Unveiling the Spatial-temporal Effective Receptive Fields of Spiking Neural Networks | Jieyuan Zhang et.al. | 2510.21403 | null |
| 2025-10-24 | Urban 3D Change Detection Using LiDAR Sensor for HD Map Maintenance and Smart Mobility | Hezam Albagami et.al. | 2510.21112 | null |
| 2025-10-24 | WaveSeg: Enhancing Segmentation Precision via High-Frequency Prior and Mamba-Driven Spectrum Decomposition | Guoan Xu et.al. | 2510.21079 | null |
| 2025-10-23 | ACS-SegNet: An Attention-Based CNN-SegFormer Segmentation Network for Tissue Segmentation in Histopathology | Nima Torbati et.al. | 2510.20754 | null |
| 2025-10-22 | Uncertainty evaluation of segmentation models for Earth observation | Melanie Rey et.al. | 2510.19586 | null |
| 2025-10-22 | Automated Morphological Analysis of Neurons in Fluorescence Microscopy Using YOLOv8 | Banan Alnemri et.al. | 2510.19455 | null |
| 2025-10-21 | ε-Seg: Sparsely Supervised Semantic Segmentation of Microscopy Data | Sheida Rahnamai Kordasiabi et.al. | 2510.18637 | null |
| 2025-10-21 | Learning to Navigate Under Imperfect Perception: Conformalised Segmentation for Safe Reinforcement Learning | Daniel Bethell et.al. | 2510.18485 | null |
| 2025-10-21 | DART: A Structured Dataset of Regulatory Drug Documents in Italian for Clinical NLP | Mariano Barone et.al. | 2510.18475 | null |
| 2025-10-20 | Accelerating Vision Transformers with Adaptive Patch Sizes | Rohan Choudhury et.al. | 2510.18091 | link |
| 2025-10-17 | 3D Weakly Supervised Semantic Segmentation via Class-Aware and Geometry-Guided Pseudo-Label Refinement | Xiaoxu Xu et.al. | 2510.17875 | null |
| 2025-10-20 | 4DSegStreamer: Streaming 4D Panoptic Segmentation via Dual Threads | Ling Liu et.al. | 2510.17664 | null |
| 2025-10-20 | Expose Camouflage in the Water: Underwater Camouflaged Instance Segmentation and Dataset | Chuhong Wang et.al. | 2510.17585 | null |
| 2025-10-20 | M2H: Multi-Task Learning with Efficient Window-Based Cross-Task Attention for Monocular Spatial Perception | U. V. B. L Udugama et.al. | 2510.17363 | null |
| 2025-10-20 | Exploring Structural Degradation in Dense Representations for Self-supervised Learning | Siran Dai et.al. | 2510.17299 | null |
| 2025-10-19 | ArmFormer: Lightweight Transformer Architecture for Real-Time Multi-Class Weapon Segmentation and Classification | Akhila Kambhatla et.al. | 2510.16854 | null |
| 2025-10-19 | Needles in the Landscape: Semi-Supervised Pseudolabeling for Archaeological Site Discovery under Label Scarcity | Simon Jaxy et.al. | 2510.16814 | null |
| 2025-10-19 | An Efficient Semantic Segmentation Decoder for In-Car or Distributed Applications | Danish Nazir et.al. | 2510.16747 | null |
| 2025-10-19 | UKANFormer: Noise-Robust Semantic Segmentation for Coral Reef Mapping via a Kolmogorov-Arnold Network-Transformer Hybrid | Tianyang Dou et.al. | 2510.16730 | null |
| 2025-10-18 | Self-Supervised Learning to Fly using Efficient Semantic Segmentation and Metric Depth Estimation for Low-Cost Autonomous UAVs | Sebastian Mocanu et.al. | 2510.16624 | null |
| 2025-10-18 | Cataract-LMM: Large-Scale, Multi-Source, Multi-Task Benchmark for Deep Learning in Surgical Video Analysis | Mohammad Javad Ahmadi et.al. | 2510.16371 | null |
| 2025-10-17 | Neuro-Symbolic Spatial Reasoning in Segmentation | Jiayi Lin et.al. | 2510.15841 | null |
| 2025-10-17 | Semantic segmentation with coarse annotations | Jort de Jong et.al. | 2510.15756 | null |
| 2025-10-17 | Semantic4Safety: Causal Insights from Zero-shot Street View Imagery Segmentation for Urban Road Safety | Huan Chen et.al. | 2510.15434 | null |
| 2025-10-17 | MARIS: Marine Open-Vocabulary Instance Segmentation with Geometric Enhancement and Semantic Alignment | Bingyu Li et.al. | 2510.15398 | null |
| 2025-10-17 | TranSimHub: A Unified Air-Ground Simulation Platform for Multi-Modal Perception and Decision-Making | Maonan Wang et.al. | 2510.15365 | null |
| 2025-10-17 | RankSEG-RMA: An Efficient Segmentation Algorithm via Reciprocal Moment Approximation | Zixun Wang et.al. | 2510.15362 | null |
| 2025-10-17 | Symmetric Entropy-Constrained Video Coding for Machines | Yuxiao Sun et.al. | 2510.15347 | null |
| 2025-10-16 | Comprehensive language-image pre-training for 3D medical image understanding | Tassilo Wald et.al. | 2510.15042 | null |
| 2025-10-16 | MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning | Mattia Segu et.al. | 2510.15026 | null |
| 2025-10-16 | Multi-modal video data-pipelines for machine learning with minimal human supervision | Mihai-Cristian Pîrvu et.al. | 2510.14862 | null |
| 2025-10-15 | PoissonNet: A Local-Global Approach for Learning on Surfaces | Arman Maesumi et.al. | 2510.14146 | null |
| 2025-10-15 | Multi-Scale High-Resolution Logarithmic Grapher Module for Efficient Vision GNNs | Mustafa Munir et.al. | 2510.13740 | null |
| 2025-10-15 | Dedelayed: Deleting remote inference delay via on-device correction | Dan Jacobellis et.al. | 2510.13714 | null |
| 2025-10-15 | Novel Class Discovery for Point Cloud Segmentation via Joint Learning of Causal Representation and Reasoning | Yang Li et.al. | 2510.13307 | null |
| 2025-10-15 | FlyAwareV2: A Multimodal Cross-Domain UAV Dataset for Urban Scene Understanding | Francesco Barbato et.al. | 2510.13243 | null |
| 2025-10-14 | SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding | Zhiliu Yang et.al. | 2510.12749 | null |
| 2025-10-14 | Multiplicative Loss for Enhancing Semantic Segmentation in Medical and Cellular Images | Yuto Yokoi et.al. | 2510.12258 | null |
| 2025-10-14 | BEEP3D: Box-Supervised End-to-End Pseudo-Mask Generation for 3D Instance Segmentation | Youngju Yoo et.al. | 2510.12182 | null |
| 2025-10-13 | A Framework for Low-Effort Training Data Generation for Urban Semantic Segmentation | Denis Zavadski et.al. | 2510.11567 | null |
| 2025-10-13 | Building and Evaluating a Realistic Virtual World for Large Scale Urban Exploration from 360° Videos | Mizuki Takenawa et.al. | 2510.11447 | null |
| 2025-10-13 | Uncertainty-Aware ControlNet: Bridging Domain Gaps with Synthetic Image Generation | Joshua Niemeijer et.al. | 2510.11346 | null |
| 2025-10-12 | DAGLFNet: Deep Attention-Guided Global-Local Feature Fusion for Pseudo-Image Point Cloud Segmentation | Chuang Chen et.al. | 2510.10471 | null |
| 2025-10-11 | MRI Brain Tumor Detection with Computer Vision | Jack Krolik et.al. | 2510.10250 | null |
| 2025-10-11 | SparseUWSeg: Active Sparse Point-Label Augmentation for Underwater Semantic Segmentation | César Borja et.al. | 2510.10163 | null |
| 2025-10-11 | An Unsupervised Time Series Anomaly Detection Approach for Efficient Online Process Monitoring of Additive Manufacturing | Frida Cantu et.al. | 2510.09977 | null |
| 2025-10-10 | Cell Instance Segmentation: The Devil Is in the Boundaries | Peixian Liang et.al. | 2510.09848 | null |
| 2025-10-10 | A methodology for clinically driven interactive segmentation evaluation | Parhom Esmaeili et.al. | 2510.09499 | null |
| 2025-10-10 | SilvaScenes: Tree Segmentation and Species Classification from Under-Canopy Images in Natural Forests | David-Alexandre Duclos et.al. | 2510.09458 | null |
| 2025-10-10 | Instance-Aware Robust Consistency Regularization for Semi-Supervised Nuclei Instance Segmentation | Zenan Lin et.al. | 2510.09329 | null |
| 2025-10-10 | SOS: Synthetic Object Segments Improve Detection, Segmentation, and Grounding | Weikai Huang et.al. | 2510.09110 | null |
| 2025-10-10 | Exploring Single Domain Generalization of LiDAR-based Semantic Segmentation under Imperfect Labels | Weitong Kong et.al. | 2510.09035 | null |
| 2025-10-10 | Pinpointing crucial steps: Attribution-based Credit Assignment for Verifiable Reinforcement Learning | Junxi Yin et.al. | 2510.08899 | null |
| 2025-10-09 | FOLK: Fast Open-Vocabulary 3D Instance Segmentation via Label-guided Knowledge Distillation | Hongrui Wu et.al. | 2510.08849 | null |
| 2025-10-08 | Out-of-Distribution Detection in LiDAR Semantic Segmentation Using Epistemic Uncertainty from Hierarchical GMMs | Hanieh Shojaei Miandashti et.al. | 2510.08631 | null |
| 2025-10-08 | HARP-NeXt: High-Speed and Accurate Range-Point Fusion Network for 3D LiDAR Semantic Segmentation | Samir Abou Haidar et.al. | 2510.06876 | null |
| 2025-10-08 | Semantic Segmentation Algorithm Based on Light Field and LiDAR Fusion | Jie Luo et.al. | 2510.06687 | null |
| 2025-10-08 | Through the Perspective of LiDAR: A Feature-Enriched and Uncertainty-Aware Annotation Pipeline for Terrestrial Point Cloud Segmentation | Fei Zhang et.al. | 2510.06582 | null |
| 2025-10-07 | Dropping the D: RGB-D SLAM Without the Depth Sensor | Mert Kiray et.al. | 2510.06216 | link |
| 2025-10-07 | Overlap-aware segmentation for topological reconstruction of obscured objects | J. Schueler et.al. | 2510.06194 | null |
| 2025-10-07 | Shaken or Stirred? An Analysis of MetaFormer’s Token Mixing for Medical Imaging | Ron Keuth et.al. | 2510.05971 | null |
| 2025-10-07 | ALISE: Annotation-Free LiDAR Instance Segmentation for Autonomous Driving | Yongxuan Lyu et.al. | 2510.05752 | null |
| 2025-07-25 | Co-Win: Joint Object Detection and Instance Segmentation in LiDAR Point Clouds via Collaborative Window Processing | Haichuan Li et.al. | 2507.19691 | null |
| 2025-07-25 | SurgPIS: Surgical-instrument-level Instances and Part-level Semantics for Weakly-supervised Part-aware Instance Segmentation | Meng Wei et.al. | 2507.19592 | null |
| 2025-07-24 | HybridTM: Combining Transformer and Mamba for 3D Semantic Segmentation | Xinyu Wang et.al. | 2507.18575 | null |
| 2025-07-24 | Synthetic Data Augmentation for Enhanced Chicken Carcass Instance Segmentation | Yihong Feng et.al. | 2507.18558 | null |
| 2025-07-24 | Iwin Transformer: Hierarchical Vision Transformer using Interleaved Windows | Simin Huo et.al. | 2507.18405 | link |
| 2025-07-24 | GVCCS: A Dataset for Contrail Identification and Tracking on Visible Whole Sky Camera Sequences | Gabriel Jarry et.al. | 2507.18330 | null |
| 2025-07-24 | SemiSegECG: A Multi-Dataset Benchmark for Semi-Supervised Semantic Segmentation in ECG Delineation | Minje Park et.al. | 2507.18323 | link |
| 2025-07-24 | Unsupervised Domain Adaptation for 3D LiDAR Semantic Segmentation Using Contrastive Learning and Multi-Model Pseudo Labeling | Abhishek Kaushik et.al. | 2507.18176 | null |
| 2025-07-23 | AFRDA: Attentive Feature Refinement for Domain Adaptive Semantic Segmentation | Md. Al-Masrur Khan et.al. | 2507.17957 | link |
| 2025-07-23 | Exploring Spatial Diversity for Region-based Active Learning | Lile Cai et.al. | 2507.17367 | null |
| 2025-07-23 | Exploring Active Learning for Semiconductor Defect Segmentation | Lile Cai et.al. | 2507.17359 | null |
| 2025-07-23 | Swin-TUNA: A Novel PEFT Approach for Accurate Food Image Segmentation | Haotian Chen et.al. | 2507.17347 | null |
| 2025-07-23 | On Temporal Guidance and Iterative Refinement in Audio Source Separation | Tobias Morocutti et.al. | 2507.17297 | null |
| 2025-07-23 | ScSAM: Debiasing Morphology and Distributional Variability in Subcellular Semantic Segmentation | Bo Fang et.al. | 2507.17149 | null |
| 2025-07-22 | MultiTaskDeltaNet: Change Detection-based Image Segmentation for Operando ETEM with Application to Carbon Gasification Kinetics | Yushuo Niu et.al. | 2507.16803 | null |
| 2025-07-22 | A2Mamba: Attention-augmented State Space Models for Visual Recognition | Meng Lou et.al. | 2507.16624 | link |
| 2025-07-22 | Semantic Segmentation for Preoperative Planning in Transcatheter Aortic Valve Replacement | Cedric Zöllner et.al. | 2507.16573 | null |
| 2025-07-22 | Comparative validation of surgical phase recognition, instrument keypoint estimation, and instrument instance segmentation in endoscopy: Results of the PhaKIR 2024 challenge | Tobias Rueckert et.al. | 2507.16559 | null |
| 2025-07-23 | EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion | Shang Liu et.al. | 2507.16535 | null |
| 2025-07-22 | Advancing Visual Large Language Model for Multi-granular Versatile Perception | Wentao Xiang et.al. | 2507.16213 | null |
| 2025-07-22 | AMMNet: An Asymmetric Multi-Modal Network for Remote Sensing Semantic Segmentation | Hui Ye et.al. | 2507.16158 | null |
| 2025-07-21 | Improved Semantic Segmentation from Ultra-Low-Resolution RGB Images Applied to Privacy-Preserving Object-Goal Navigation | Xuying Huang et.al. | 2507.16034 | null |
| 2025-07-21 | ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction | Danhui Chen et.al. | 2507.15803 | null |
| 2025-07-21 | ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting | Ruijie Zhu et.al. | 2507.15454 | link |
| 2025-07-21 | Rethinking Occlusion in FER: A Semantic-Aware Perspective and Go Beyond | Huiyu Zhai et.al. | 2507.15401 | null |
| 2025-07-20 | Towards Geometric and Textural Consistency 3D Scene Generation via Single Image-guided Model Generation and Layout Optimization | Xiang Tang et.al. | 2507.14841 | null |
| 2025-07-20 | A Novel Downsampling Strategy Based on Information Complementarity for Medical Image Segmentation | Wenbo Yue et.al. | 2507.14790 | null |
| 2025-07-19 | GTPBD: A Fine-Grained Global Terraced Parcel and Boundary Dataset | Zhiwei Zhang et.al. | 2507.14697 | null |
| 2025-07-19 | Artificial Intelligence in the Food Industry: Food Waste Estimation based on Computer Vision, a Brief Case Study in a University Dining Hall | Shayan Rokhva et.al. | 2507.14662 | null |
| 2025-07-19 | Multispectral State-Space Feature Fusion: Bridging Shared and Cross-Parametric Interactions for Object Detection | Jifeng Shen et.al. | 2507.14643 | null |
| 2025-07-19 | DiSCO-3D: Discovering and segmenting Sub-Concepts from Open-vocabulary queries in NeRF | Doriand Petit et.al. | 2507.14596 | null |
| 2025-07-18 | Semantic Segmentation based Scene Understanding in Autonomous Vehicles | Ehsan Rassekh et.al. | 2507.14303 | null |
| 2025-07-18 | Leveraging Pathology Foundation Models for Panoptic Segmentation of Melanoma in H&E Images | Jiaqi Lv et.al. | 2507.13974 | null |
| 2025-07-17 | SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation | Shiqi Huang et.al. | 2507.12857 | null |
| 2025-07-17 | A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique | Homare Sueyoshi et.al. | 2507.12730 | null |
| 2025-07-16 | VolSegGS: Segmentation and Tracking in Dynamic Volumetric Scenes via Deformable 3D Gaussians | Siyuan Yao et.al. | 2507.12667 | null |
| 2025-07-16 | NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting | Kuangshi Ai et.al. | 2507.12621 | null |
| 2025-07-16 | Out-of-distribution data supervision towards biomedical semantic segmentation | Yiquan Gao et.al. | 2507.12105 | null |
| 2025-07-16 | Tree-SLAM: semantic object SLAM for efficient mapping of individual trees in orchards | David Rapado-Rincon et.al. | 2507.12093 | null |
| 2025-07-16 | Frequency-Dynamic Attention Modulation for Dense Prediction | Linwei Chen et.al. | 2507.12006 | null |
| 2025-07-16 | SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation | Jun Yin et.al. | 2507.11994 | null |
| 2025-07-16 | Prototypical Progressive Alignment and Reweighting for Generalizable Semantic Segmentation | Yuhang Zhang et.al. | 2507.11955 | null |
| 2025-07-16 | Spatial Frequency Modulation for Semantic Segmentation | Linwei Chen et.al. | 2507.11893 | link |
| 2025-07-15 | SToFM: a Multi-scale Foundation Model for Spatial Transcriptomics | Suyuan Zhao et.al. | 2507.11588 | null |
| 2025-07-15 | Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping | Yujie Zhang et.al. | 2507.11279 | null |
| 2025-07-15 | Personalized OVSS: Understanding Personal Concept in Open-Vocabulary Semantic Segmentation | Sunghyun Park et.al. | 2507.11030 | null |
| 2025-07-15 | Graph Aggregation Prototype Learning for Semantic Change Detection in Remote Sensing | Zhengyi Xu et.al. | 2507.10938 | null |
| 2025-07-14 | Static or Temporal? Semantic Scene Simplification to Aid Wayfinding in Immersive Simulations of Bionic Vision | Justin M. Kasowski et.al. | 2507.10813 | null |
| 2025-07-14 | rt-RISeg: Real-Time Model-Free Robot Interactive Segmentation for Active Instance-Level Object Understanding | Howard H. Qian et.al. | 2507.10776 | null |
| 2025-07-14 | FGSSNet: Feature-Guided Semantic Segmentation of Real World Floorplans | Hugo Norrby et.al. | 2507.10343 | null |
| 2025-07-14 | Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks | Ben Hamscher et.al. | 2507.10239 | null |
| 2025-07-14 | Spatial Lifting for Dense Prediction | Mingzhi Xu et.al. | 2507.10222 | null |
| 2025-07-14 | DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation | Ivan Martinović et.al. | 2507.10118 | null |
| 2025-07-13 | MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression | Ofir Gordon et.al. | 2507.09616 | null |
| 2025-07-13 | Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive | You Huang et.al. | 2507.09612 | null |
| 2025-07-13 | SegVec3D: A Method for Vector Embedding of 3D Objects Oriented Towards Robot manipulation | Zhihan Kang et.al. | 2507.09459 | null |
| 2025-07-11 | Multimodal HD Mapping for Intersections by Intelligent Roadside Units | Zhongzhang Chen et.al. | 2507.08903 | null |
| 2025-07-11 | Image Translation with Kernel Prediction Networks for Semantic Segmentation | Cristina Mata et.al. | 2507.08554 | null |
| 2025-07-11 | From Enhancement to Understanding: Build a Generalized Bridge for Low-light Vision via Semantically Consistent Unsupervised Fine-tuning | Sen Wang et.al. | 2507.08380 | null |
| 2025-07-11 | SurfDist: Interpretable Three-Dimensional Instance Segmentation Using Curved Surface Patches | Jackson Borchardt et.al. | 2507.08223 | null |
| 2025-07-10 | RAPS-3D: Efficient interactive segmentation for 3D radiological imaging | Théo Danielou et.al. | 2507.07730 | null |
| 2025-07-10 | LOSC: LiDAR Open-voc Segmentation Consolidator | Nermin Samet et.al. | 2507.07605 | null |
| 2025-07-10 | Diffusion-Guided Knowledge Distillation for Weakly-Supervised Low-Light Semantic Segmentation | Chunyan Wang et.al. | 2507.07578 | null |
| 2025-07-10 | Seg-Wild: Interactive Segmentation based on 3D Gaussian Splatting for Unconstrained Image Collections | Yongtang Bao et.al. | 2507.07395 | null |
| 2025-07-08 | CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings | Cristina Mata et.al. | 2507.07125 | null |
| 2025-07-09 | A multi-modal dataset for insect biodiversity with imagery and DNA at the trap and individual level | Johanna Orsholm et.al. | 2507.06972 | null |
| 2025-07-09 | SemRaFiner: Panoptic Segmentation in Sparse and Noisy Radar Point Clouds | Matthias Zeller et.al. | 2507.06906 | null |
| 2025-07-09 | Know Your Attention Maps: Class-specific Token Masking for Weakly Supervised Semantic Segmentation | Joelle Hanna et.al. | 2507.06848 | null |
| 2025-07-09 | Ambiguity-aware Point Cloud Segmentation by Adaptive Margin Contrastive Learning | Yang Chen et.al. | 2507.06592 | null |
| 2025-07-08 | Centralized Copy-Paste: Enhanced Data Augmentation Strategy for Wildland Fire Semantic Segmentation | Joon Tai Kim et.al. | 2507.06321 | null |
| 2025-07-08 | FineGrasp: Towards Robust Grasping for Delicate Objects | Yun Du et.al. | 2507.05978 | null |
| 2025-07-08 | Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation | Quanzhu Niu et.al. | 2507.05948 | link |
| 2025-07-08 | I$^2$R: Inter and Intra-image Refinement in Few Shot Segmentation | Ourui Fu et.al. | 2507.05838 | null |
| 2025-07-09 | Empowering Bridge Digital Twins by Bridging the Data Gap with a Unified Synthesis Framework | Wang Wang et.al. | 2507.05814 | null |
| 2025-07-08 | SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning | Xin Hu et.al. | 2507.05798 | null |
| 2025-07-08 | DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation | Young Hun Kim et.al. | 2507.05627 | null |
| 2025-07-07 | OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts | Shiting Xiao et.al. | 2507.05427 | null |
| 2025-07-07 | Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations | Xiang Xu et.al. | 2507.05260 | null |
| 2025-07-07 | All in One: Visual-Description-Guided Unified Point Cloud Segmentation | Zongyan Han et.al. | 2507.05211 | null |
| 2025-07-07 | RAM-W600: A Multi-Task Wrist Dataset and Benchmark for Rheumatoid Arthritis | Songxiao Yang et.al. | 2507.05193 | null |
| 2025-07-07 | MOSU: Autonomous Long-range Robot Navigation with Multi-modal Scene Understanding | Jing Liang et.al. | 2507.04686 | null |
| 2025-07-06 | Street design and driving behavior: evidence from a large-scale study in Milan, Amsterdam, and Dubai | Giacomo Orsi et.al. | 2507.04434 | null |
| 2025-07-06 | CLIP-RL: Surgical Scene Segmentation Using Contrastive Language-Vision Pretraining & Reinforcement Learning | Fatmaelzahraa Ali Ahmed et.al. | 2507.04317 | null |
| 2025-07-06 | Surg-SegFormer: A Dual Transformer-Based Model for Holistic Surgical Scene Segmentation | Fatimaelzahraa Ahmed et.al. | 2507.04304 | null |
| 2025-07-05 | Differentiable High-Performance Ray Tracing-Based Simulation of Radio Propagation with Point Clouds | Niklas Vaara et.al. | 2507.04021 | null |
| 2025-07-05 | NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World Models | Siyu Li et.al. | 2507.04002 | null |
| 2025-07-05 | CoT-Segmenter: Enhancing OOD Detection in Dense Road Scenes via Chain-of-Thought Reasoning | Jeonghyo Song et.al. | 2507.03984 | null |
| 2025-07-03 | LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion | Fangfu Liu et.al. | 2507.02813 | link |
| 2025-07-03 | No time to train! Training-Free Reference-Based Instance Segmentation | Miguel Espinosa et.al. | 2507.02798 | link |
| 2025-07-03 | From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images | Danrong Zhang et.al. | 2507.02781 | null |
| 2025-07-03 | MedFormer: Hierarchical Medical Vision Transformer with Content-Aware Dual Sparse Selection Attention | Zunhui Xia et.al. | 2507.02488 | null |
| 2025-07-03 | Continual Multiple Instance Learning with Enhanced Localization for Histopathological Whole Slide Image Analysis | Byung Hyun Lee et.al. | 2507.02395 | null |
| 2025-07-03 | Perception Activator: An intuitive and portable framework for brain cognitive exploration | Le Xu et.al. | 2507.02311 | null |
| 2025-07-02 | How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks | Rahul Ramachandran et.al. | 2507.01955 | link |
| 2025-07-02 | 3D Reconstruction and Information Fusion between Dormant and Canopy Seasons in Commercial Orchards Using Deep Learning and Fast GICP | Ranjan Sapkota et.al. | 2507.01912 | null |
| 2025-07-02 | A Gift from the Integration of Discriminative and Diffusion-based Generative Learning: Boundary Refinement Remote Sensing Semantic Segmentation | Hao Wang et.al. | 2507.01573 | null |
| 2025-07-02 | NOCTIS: Novel Object Cyclic Threshold based Instance Segmentation | Max Gandyra et.al. | 2507.01463 | null |
| 2025-07-01 | Towards Open-World Human Action Segmentation Using Graph Convolutional Networks | Hao Xing et.al. | 2507.00756 | null |
| 2025-07-01 | Rectifying Magnitude Neglect in Linear Attention | Qihang Fan et.al. | 2507.00698 | link |
| 2025-07-02 | ExPaMoE: An Expandable Parallel Mixture of Experts for Continual Test-Time Adaptation | JianChao Zhao et.al. | 2507.00502 | null |
| 2025-07-01 | Process-aware and high-fidelity microstructure generation using stable diffusion | Hoang Cuong Phan et.al. | 2507.00459 | null |
| 2025-07-01 | PlantSegNeRF: A few-shot, cross-dataset method for plant 3D instance point cloud reconstruction via joint-channel NeRF with multi-view image instance matching | Xin Yang et.al. | 2507.00371 | null |
| 2025-06-30 | SurgiSR4K: A High-Resolution Endoscopic Video Dataset for Robotic-Assisted Minimally Invasive Procedures | Fengyi Jiang et.al. | 2507.00209 | null |
| 2025-06-30 | Controllable Reference-Based Real-World Remote Sensing Image Super-Resolution with Generative Diffusion Priors | Ce Wang et.al. | 2506.23801 | null |
| 2025-06-30 | Deep Learning-Based Semantic Segmentation for Real-Time Kidney Imaging and Measurements with Augmented Reality-Assisted Ultrasound | Gijs Luijten et.al. | 2506.23721 | null |
| 2025-06-30 | PGOV3D: Open-Vocabulary 3D Semantic Segmentation with Partial-to-Global Curriculum | Shiqi Zhang et.al. | 2506.23607 | null |
| 2025-06-30 | Interactive Interface For Semantic Segmentation Dataset Synthesis | Ngoc-Do Tran et.al. | 2506.23470 | null |
| 2025-06-30 | Contrastive Learning with Diffusion Features for Weakly Supervised Medical Image Segmentation | Dewen Zeng et.al. | 2506.23460 | null |
| 2025-06-29 | Layer Decomposition and Morphological Reconstruction for Task-Oriented Infrared Image Enhancement | Siyuan Chai et.al. | 2506.23353 | null |
| 2025-06-29 | FastSeg: Efficient Training-Free Open-Vocabulary Segmentation via Hierarchical Attention Refinement Method | Quang-Huy Che et.al. | 2506.23323 | null |
| 2025-06-29 | BPD-Neo: An MRI Dataset for Lung-Trachea Segmentation with Clinical Data for Neonatal Bronchopulmonary Dysplasia | Rachit Saluja et.al. | 2506.23305 | null |
| 2025-06-29 | High-quality Pseudo-labeling for Point Cloud Segmentation with Scene-level Annotation | Lunhao Duan et.al. | 2506.23227 | null |
| 2025-06-29 | DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation | Jihun Kim et.al. | 2506.23104 | null |
| 2025-06-27 | Partial CLIP is Enough: Chimera-Seg for Zero-shot Semantic Segmentation | Jialei Chen et.al. | 2506.22032 | null |
| 2025-06-27 | TASeg: Text-aware RGB-T Semantic Segmentation based on Fine-tuning Vision Foundation Models | Meng Yu et.al. | 2506.21975 | null |
| 2025-06-27 | SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images | Naftaly Wambugu et.al. | 2506.21945 | null |
| 2025-06-26 | Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object Detection | Tobias J. Riedlinger et.al. | 2506.21486 | null |
| 2025-06-26 | PanSt3R: Multi-view Consistent Panoptic Segmentation | Lojze Zust et.al. | 2506.21348 | null |
| 2025-06-26 | HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation | Diego Biagini et.al. | 2506.21287 | null |
| 2025-06-27 | ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation | Xiwei Xuan et.al. | 2506.21233 | null |
| 2025-06-26 | Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4 | Jongyeon Park et.al. | 2506.21174 | null |
| 2025-06-27 | DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation | Wenzhou Lyu et.al. | 2506.21034 | null |
| 2025-06-26 | TSDASeg: A Two-Stage Model with Direct Alignment for Interactive Point Cloud Segmentation | Chade Li et.al. | 2506.20991 | null |
| 2025-06-26 | Segment Anything in Pathology Images with Natural Language | Zhixuan Chen et.al. | 2506.20988 | null |
| 2025-06-25 | A Deep Learning Approach to Identify Rock Bolts in Complex 3D Point Clouds of Underground Mines Captured Using Mobile Laser Scanners | Dibyayan Patra et.al. | 2506.20464 | null |
| 2025-06-26 | Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition | Man Duc Chuc et.al. | 2506.20174 | null |
| 2025-06-24 | A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects | Shulan Ruan et.al. | 2506.19769 | null |
| 2025-06-24 | USIS16K: High-Quality Dataset for Underwater Salient Instance Segmentation | Lin Hong et.al. | 2506.19472 | null |
| 2025-06-24 | A Global-Local Cross-Attention Network for Ultra-high Resolution Remote Sensing Image Semantic Segmentation | Chen Yi et.al. | 2506.19406 | null |
| 2025-06-25 | AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation | Ziyan Zhao et.al. | 2506.19269 | null |
| 2025-06-23 | Orthogonal Projection Subspace to Aggregate Online Prior-knowledge for Continual Test-time Adaptation | Jinlong Li et.al. | 2506.19022 | null |
| 2025-06-23 | Multi-Scale Spectral Attention Module-based Hyperspectral Segmentation in Autonomous Driving Scenarios | Imad Ali Shah et.al. | 2506.18682 | null |
| 2025-06-23 | SafeClick: Error-Tolerant Interactive Segmentation of Any Medical Volumes via Hierarchical Expert Consensus | Yifan Gao et.al. | 2506.18404 | null |
| 2025-06-23 | Jet Reconstruction with Mamba Networks in Collider Events | Jinmian Li et.al. | 2506.18336 | null |
| 2025-06-22 | OSDMamba: Enhancing Oil Spill Detection from Remote Sensing Images Using Selective State Space Model | Shuaiyu Chen et.al. | 2506.18006 | null |
| 2025-06-22 | Relation3D: Enhancing Relation Modeling for Point Cloud Instance Segmentation | Jiahao Lu et.al. | 2506.17891 | null |
| 2025-06-22 | Cross-modal State Space Modeling for Real-time RGB-thermal Wild Scene Semantic Segmentation | Xiaodong Guo et.al. | 2506.17869 | null |
| 2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159 | link |
| 2025-06-20 | ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds | Binbin Xiang et.al. | 2506.16991 | link |
| 2025-06-20 | LunarLoc: Segment-Based Global Localization on the Moon | Annika Thomas et.al. | 2506.16940 | link |
| 2025-06-19 | From Semantic To Instance: A Semi-Self-Supervised Learning Approach | Keyhan Najafian et.al. | 2506.16563 | null |
| 2025-06-19 | Structured Semantic 3D Reconstruction (S23DR) Challenge 2025 – Winning solution | Jan Skvrna et.al. | 2506.16421 | null |
| 2025-06-19 | LBMamba: Locally Bi-directional Mamba | Jingwei Zhang et.al. | 2506.15976 | link |
| 2025-06-19 | Heterogeneous-Modal Unsupervised Domain Adaptation via Latent Space Bridging | Jiawen Yang et.al. | 2506.15971 | null |
| 2025-06-19 | Polyline Path Masked Attention for Vision Transformer | Zhongchen Zhao et.al. | 2506.15940 | link |
| 2025-06-18 | MapFM: Foundation Model-Driven HD Mapping with Multi-Task Contextual Learning | Leonid Ivanov et.al. | 2506.15313 | link |
| 2025-06-18 | Enhancing point cloud analysis via neighbor aggregation correction based on cross-stage structure correlation | Jiaqi Shi et.al. | 2506.15160 | link |
| 2025-06-17 | Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset | Nikolaos Dionelis et.al. | 2506.14765 | null |
| 2025-06-17 | FocalClick-XL: Towards Unified and High-quality Interactive Segmentation | Xi Chen et.al. | 2506.14686 | null |
| 2025-06-17 | VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy | Zhuoyue Tan et.al. | 2506.14525 | null |
| 2025-06-17 | DepthSeg: Depth prompting in remote sensing semantic segmentation | Ning Zhou et.al. | 2506.14382 | null |
| 2025-06-17 | Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment | Weiming Zhang et.al. | 2506.14271 | null |
| 2025-06-16 | HierVL: Semi-Supervised Segmentation leveraging Hierarchical Vision-Language Synergy with Dynamic Text-Spatial Query Alignment | Numair Nadeem et.al. | 2506.13925 | null |
| 2025-06-16 | A Comprehensive Survey on Video Scene Parsing: Advances, Challenges, and Prospects | Guohuan Xie et.al. | 2506.13552 | null |
| 2025-06-16 | Open-Set LiDAR Panoptic Segmentation Guided by Uncertainty-Aware Learning | Rohit Mohan et.al. | 2506.13265 | null |
| 2025-06-16 | ViewPCL: a point cloud based active learning method for multi-view segmentation | Christian Hilaire et.al. | 2506.13043 | null |
| 2025-06-15 | A large-scale, physically-based synthetic dataset for satellite pose estimation | Szabolcs Velkei et.al. | 2506.12782 | null |
| 2025-06-15 | Unleashing Diffusion and State Space Models for Medical Image Segmentation | Rong Wu et.al. | 2506.12747 | null |
| 2025-06-15 | Combining Self-attention and Dilation Convolutional for Semantic Segmentation of Coal Maceral Groups | Zhenghao Xi et.al. | 2506.12712 | null |
| 2025-06-13 | O2Former: Direction-Aware and Multi-Scale Query Enhancement for SAR Ship Instance Segmentation | F. Gao et.al. | 2506.11913 | null |
| 2025-06-13 | Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling | Yunhan Ren et.al. | 2506.11661 | null |
| 2025-06-13 | A$^2$LC: Active and Automated Label Correction for Semantic Segmentation | Youjin Jeon et.al. | 2506.11599 | null |
| 2025-06-13 | OV-MAP: Open-Vocabulary Zero-Shot 3D Instance Segmentation Map for Robots | Juno Kim et.al. | 2506.11585 | null |
| 2025-06-12 | GynSurg: A Comprehensive Gynecology Laparoscopic Surgery Dataset | Sahar Nasirihaghighi et.al. | 2506.11356 | null |
| 2025-06-12 | Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes | Masahiro Yasuda et.al. | 2506.10676 | link |
| 2025-06-12 | Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models | Francisco Caetano et.al. | 2506.10634 | link |
| 2025-06-12 | Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration | Jun Wang et.al. | 2506.10573 | null |
| 2025-06-12 | ALBERT: Advanced Localization and Bidirectional Encoder Representations from Transformers for Automotive Damage Evaluation | Teerapong Panboonyuen et.al. | 2506.10524 | null |
| 2025-06-12 | Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation | Shuyang Li et.al. | 2506.10503 | null |
| 2025-06-12 | Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success | Che Wang et.al. | 2506.10359 | null |
| 2025-06-11 | Deep Semantic Segmentation for Multi-Source Localization Using Angle of Arrival Measurements | Mustafa Atahan Nuhoglu et.al. | 2506.10107 | null |
| 2025-06-11 | Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation | Siyu Chen et.al. | 2506.09881 | link |
| 2025-06-11 | Accurate and efficient zero-shot 6D pose estimation with frozen foundation models | Andrea Caraffa et.al. | 2506.09784 | null |
| 2025-06-11 | The Four Color Theorem for Cell Instance Segmentation | Ye Zhang et.al. | 2506.09724 | link |
| 2025-06-11 | Enhancing Human-Robot Collaboration: A Sim2Real Domain Adaptation Algorithm for Point Cloud Segmentation in Industrial Environments | Fatemeh Mohammadi Amin et.al. | 2506.09552 | null |
| 2025-06-12 | Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20$^{th}$ century Urban Landscapes with Satellite Imageries | Tianxiang Hao et.al. | 2506.09476 | null |
| 2025-06-11 | MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning | Tong Wang et.al. | 2506.09327 | null |
| 2025-06-10 | WetCat: Automating Skill Assessment in Wetlab Cataract Surgery Videos | Negin Ghamsarian et.al. | 2506.08896 | null |
| 2025-06-11 | RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation | Jiayi Song et.al. | 2506.08772 | null |
| 2025-06-10 | ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction | Juan Yeo et.al. | 2506.08678 | null |
| 2025-06-10 | ECMNet: Lightweight Semantic Segmentation with Efficient CNN-Mamba Network | Feixiang Du et.al. | 2506.08629 | null |
| 2025-06-09 | LogoSP: Local-global Grouping of Superpoints for Unsupervised Semantic Segmentation of 3D Point Clouds | Zihui Zhang et.al. | 2506.07857 | null |
| 2025-06-09 | SAM2Auto: Auto Annotation Using FLASH | Arash Rocky et.al. | 2506.07850 | null |
| 2025-06-09 | F2Net: A Frequency-Fused Network for Ultra-High Resolution Remote Sensing Segmentation | Hengzhi Chen et.al. | 2506.07847 | null |
| 2025-06-09 | Trend-Aware Fashion Recommendation with Visual Segmentation and Semantic Similarity | Mohamed Djilani et.al. | 2506.07773 | null |
| 2025-06-09 | OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting | Jens Piekenbrinck et.al. | 2506.07697 | null |
| 2025-06-09 | Adapter Naturally Serves as Decoupler for Cross-Domain Few-Shot Semantic Segmentation | Jintao Tong et.al. | 2506.07376 | null |
| 2025-06-09 | Multiple Object Stitching for Unsupervised Representation Learning | Chengchao Shen et.al. | 2506.07364 | link |
| 2025-06-08 | BRIGHT+: Upgrading the BRIGHT Benchmark with MARCUS, a Multi-Agent RAG Clean-Up Suite | Liyang Chen et.al. | 2506.07116 | null |
| 2025-06-08 | Technical Report for ICRA 2025 GOOSE 3D Semantic Segmentation Challenge: Adaptive Point Cloud Understanding for Heterogeneous Robotic Systems | Xiaoya Zhang et.al. | 2506.06995 | null |
| 2025-06-07 | Position Prediction Self-Supervised Learning for Multimodal Satellite Imagery Semantic Segmentation | John Waithaka et.al. | 2506.06852 | null |
| 2025-06-06 | Rethinking Semi-supervised Segmentation Beyond Accuracy: Reliability and Robustness | Steven Landgraf et.al. | 2506.05917 | null |
| 2025-06-06 | You Only Estimate Once: Unified, One-stage, Real-Time Category-level Articulated Object 6D Pose Estimation for Robotic Grasping | Jingshun Huang et.al. | 2506.05719 | null |
| 2025-06-05 | FRAME: Pre-Training Video Feature Representations via Anticipation and Memory | Sethuraman TV et.al. | 2506.05543 | null |
| 2025-06-05 | U-NetMN and SegNetMN: Modified U-Net and SegNet models for bimodal SAR image segmentation | Marwane Kzadri et.al. | 2506.05444 | null |
| 2025-06-05 | Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting | Alfred T. Christiansen et.al. | 2506.05009 | null |
| 2025-06-05 | Bringing SAM to new heights: Leveraging elevation data for tree crown segmentation from drone imagery | Mélisande Teng et.al. | 2506.04970 | null |
| 2025-06-05 | CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx | Lukas Picek et.al. | 2506.04931 | null |
| 2025-06-05 | OpenMaskDINO3D: Reasoning 3D Segmentation via Large Language Model | Kunshen Zhang et.al. | 2506.04837 | null |
| 2025-06-05 | Gen-n-Val: Agentic Image Data Generation and Validation | Jing-En Huang et.al. | 2506.04676 | null |
| 2025-06-04 | You Only Train Once | Christos Sakaridis et.al. | 2506.04349 | null |
| 2025-06-04 | AetherVision-Bench: An Open-Vocabulary RGB-Infrared Benchmark for Multi-Angle Segmentation across Aerial and Ground Perspectives | Aniruddh Sikdar et.al. | 2506.03709 | null |
| 2025-06-04 | OV-COAST: Cost Aggregation with Optimal Transport for Open-Vocabulary Semantic Segmentation | Aditya Gandhamal et.al. | 2506.03706 | null |
| 2025-06-04 | BiXFormer: A Robust Framework for Maximizing Modality Effectiveness in Multi-Modal Semantic Segmentation | Jialei Chen et.al. | 2506.03675 | null |
| 2025-06-03 | Cross-Modal Urban Sensing: Evaluating Sound-Vision Alignment Across Street-Level and Aerial Imagery | Pengyu Chen et.al. | 2506.03388 | null |
| 2025-06-03 | Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding | Weiqing Xiao et.al. | 2506.03134 | null |
| 2025-06-03 | GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal | Shufan Qing et.al. | 2506.02736 | link |
| 2025-06-03 | Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather | Longyu Yang et.al. | 2506.02396 | null |
| 2025-06-04 | SAB3R: Semantic-Augmented Backbone in 3D Reconstruction | Xuweiyi Chen et.al. | 2506.02112 | null |
| 2025-06-02 | SEMNAV: A Semantic Segmentation-Driven Approach to Visual Semantic Navigation | Rafael Flor-Rodríguez et.al. | 2506.01418 | null |
| 2025-06-01 | Perceptual Inductive Bias Is What You Need Before Contrastive Learning | Tianqin Li et.al. | 2506.01201 | null |
| 2025-06-01 | GeoChain: Multimodal Chain-of-Thought for Geographic Reasoning | Sahiti Yerramilli et.al. | 2506.00785 | null |
| 2025-05-31 | BAGNet: A Boundary-Aware Graph Attention Network for 3D Point Cloud Semantic Segmentation | Wei Tao et.al. | 2506.00475 | null |
| 2025-05-30 | Bi-Manual Joint Camera Calibration and Scene Representation | Haozhan Tang et.al. | 2505.24819 | null |
| 2025-06-02 | NUC-Net: Non-uniform Cylindrical Partition Network for Efficient LiDAR Semantic Segmentation | Xuzhi Wang et.al. | 2505.24634 | null |
| 2025-05-30 | SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds | Cheng Zeng et.al. | 2505.24475 | null |
| 2025-05-30 | Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation | Roger Ferrod et.al. | 2505.24361 | null |
| 2025-05-30 | Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors | Peiran Xu et.al. | 2505.24103 | null |
| 2025-05-29 | MaskAdapt: Unsupervised Geometry-Aware Domain Adaptation Using Multimodal Contextual Learning and RGB-Depth Masking | Numair Nadeem et.al. | 2505.24026 | null |
| 2025-05-29 | Semantics-Guided Generative Image Compression | Cheng-Lin Wu et.al. | 2505.24015 | null |
| 2025-05-29 | Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts | Xuweiyi Chen et.al. | 2505.23926 | null |
| 2025-05-29 | TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models | Yao Xiao et.al. | 2505.23769 | link |
| 2025-05-29 | Bridging Classical and Modern Computer Vision: PerceptiveNet for Tree Crown Semantic Segmentation | Georgios Voulgaris et.al. | 2505.23597 | null |
| 2025-05-29 | VITON-DRR: Details Retention Virtual Try-on via Non-rigid Registration | Ben Li et.al. | 2505.23439 | link |
| 2025-05-29 | Adaptive Spatial Augmentation for Semi-supervised Semantic Segmentation | Lingyan Ran et.al. | 2505.23438 | null |
| 2025-05-29 | Federated Unsupervised Semantic Segmentation | Evangelos Charalampakis et.al. | 2505.23292 | null |
| 2025-05-29 | LeMoRe: Learn More Details for Lightweight Semantic Segmentation | Mian Muhammad Naeem Abid et.al. | 2505.23093 | link |
| 2025-05-28 | ConfLUNet: Multiple sclerosis lesion instance segmentation in presence of confluent lesions | Maxence Wynen et.al. | 2505.22537 | null |
| 2025-05-28 | Universal Domain Adaptation for Semantic Segmentation | Seun-An Choe et.al. | 2505.22458 | null |
| 2025-05-28 | LiDAR Based Semantic Perception for Forklifts in Outdoor Environments | Benjamin Serfling et.al. | 2505.22258 | null |
| 2025-05-29 | YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction | Mingzhuang Wang et.al. | 2505.22250 | null |
| 2025-05-28 | Enjoying Information Dividend: Gaze Track-based Medical Weakly Supervised Segmentation | Zhisong Wang et.al. | 2505.22230 | null |
| 2025-05-28 | A Survey on Training-free Open-Vocabulary Semantic Segmentation | Naomi Kombol et.al. | 2505.22209 | null |
| 2025-05-28 | S2AFormer: Strip Self-Attention for Efficient Vision Transformer | Guoan Xu et.al. | 2505.22195 | null |
| 2025-05-28 | LiDARDustX: A LiDAR Dataset for Dusty Unstructured Road Environments | Chenfeng Wei et.al. | 2505.21914 | null |
| 2025-05-29 | CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation | Pardis Taghavi et.al. | 2505.21904 | null |
| 2025-05-28 | Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation | Mehrdad Noori et.al. | 2505.21844 | null |
| 2025-05-27 | Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO | Muzhi Zhu et.al. | 2505.21457 | null |
| 2025-05-27 | Object-Centric Action-Enhanced Representations for Robot Visuo-Motor Policy Learning | Nikos Giannakakis et.al. | 2505.20962 | null |
| 2025-05-27 | DSOcc: Leveraging Depth Awareness and Semantic Aid to Boost Camera-Based 3D Semantic Occupancy Prediction | Naiyu Fang et.al. | 2505.20951 | null |
| 2025-05-26 | Vision-Based Risk Aware Emergency Landing for UAVs in Complex Urban Environments | Julio de la Torre-Vanegas et.al. | 2505.20423 | null |
| 2025-05-26 | A fully automated urban PV parameterization framework for improved estimation of energy production profiles | Bowen Tian et.al. | 2505.19876 | null |
| 2025-05-26 | Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation | Nagito Saito et.al. | 2505.19846 | null |
| 2025-05-26 | The Missing Point in Vision Transformers for Universal Image Segmentation | Sajjad Shahabodini et.al. | 2505.19795 | null |
| 2025-05-26 | ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting | Wenhua Wu et.al. | 2505.19420 | null |
| 2025-05-25 | A Joint Learning Framework with Feature Reconstruction and Prediction for Incomplete Satellite Image Time Series in Agricultural Semantic Segmentation | Yuze Wang et.al. | 2505.19159 | link |
| 2025-05-25 | SPARS: Self-Play Adversarial Reinforcement Learning for Segmentation of Liver Tumours | Catalina Tan et.al. | 2505.18989 | link |
| 2025-05-25 | How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation | Yining Pan et.al. | 2505.18956 | null |
| 2025-05-25 | LLM-Guided Taxonomy and Hierarchical Uncertainty for 3D Point Cloud Active Learning | Chenxi Li et.al. | 2505.18924 | null |
| 2025-05-24 | ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts | Shiu-hong Kao et.al. | 2505.18561 | null |
| 2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153 | null |
| 2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015 | null |
| 2025-05-23 | Semantic segmentation with reward | Xie Ting et.al. | 2505.17905 | null |
| 2025-05-23 | Hephaestus Minicubes: A Global, Multi-Modal Dataset for Volcanic Unrest Monitoring | Nikolas Papadopoulos et.al. | 2505.17782 | null |
| 2025-05-23 | EMRA-proxy: Enhancing Multi-Class Region Semantic Segmentation in Remote Sensing Images with Attention Proxy | Yichun Yu et.al. | 2505.17665 | null |
| 2025-05-22 | Deep mineralogical segmentation of thin section images based on QEMSCAN maps | Jean Pablo Vieira de Mello et.al. | 2505.17008 | link |
| 2025-05-22 | OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning | Zongyan Han et.al. | 2505.16974 | link |
| 2025-05-22 | NovelSeek: When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification | NovelSeek Team et.al. | 2505.16938 | link |
| 2025-05-22 | TextureSAM: Towards a Texture Aware Foundation Model for Segmentation | Inbal Cohen et.al. | 2505.16540 | null |
| 2025-05-22 | Detailed Evaluation of Modern Machine Learning Approaches for Optic Plastics Sorting | Vaishali Maheshkar et.al. | 2505.16513 | null |
| 2025-05-22 | Sketchy Bounding-box Supervision for 3D Instance Segmentation | Qian Deng et.al. | 2505.16399 | null |
| 2025-05-22 | Style Transfer with Diffusion Models for Synthetic-to-Real Domain Adaptation | Estelle Chigot et.al. | 2505.16360 | link |
| 2025-05-22 | RE-TRIP: Reflectivity Instance Augmented Triangle Descriptor for 3D Place Recognition | Yechan Park et.al. | 2505.16165 | link |
| 2025-05-21 | VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation | Niccolo Avogaro et.al. | 2505.15592 | null |
| 2025-05-21 | UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset | Hua Li et.al. | 2505.15581 | link |
| 2025-05-21 | seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic Segmentation | Andrew Caunes et.al. | 2505.15545 | link |
| 2025-05-21 | Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation | Ce Zhang et.al. | 2505.15491 | null |
| 2025-05-21 | gen2seg: Generative Models Enable Generalizable Instance Segmentation | Om Khangaonkar et.al. | 2505.15263 | null |
| 2025-05-21 | Zero-Shot Gaze-based Volumetric Medical Image Segmentation | Tatyana Shmykova et.al. | 2505.15256 | null |
| 2025-05-21 | From Pixels to Images: Deep Learning Advances in Remote Sensing Image Semantic Segmentation | Quanwei Liu et.al. | 2505.15147 | null |
| 2025-05-20 | Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning | Amine Elhafsi et.al. | 2505.14938 | null |
| 2025-05-20 | Instance Segmentation for Point Sets | Abhimanyu Talwar et.al. | 2505.14583 | null |
| 2025-05-20 | ReservoirTTA: Prolonged Test-time Adaptation for Evolving and Recurring Domains | Guillaume Vray et.al. | 2505.14511 | link |
| 2025-05-20 | Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation | Bin-Bin Gao et.al. | 2505.14239 | link |
| 2025-05-20 | Intra-class Patch Swap for Self-Distillation | Hongjun Choi et.al. | 2505.14124 | link |
| 2025-05-20 | Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts | Xi Chen et.al. | 2505.14088 | null |
| 2025-05-20 | Scaling Vision Mamba Across Resolutions via Fractal Traversal | Bo Li et.al. | 2505.14062 | null |
| 2025-05-20 | EGFormer: Towards Efficient and Generalizable Multimodal Semantic Segmentation | Zelin Zhang et.al. | 2505.14014 | null |
| 2025-05-19 | Self-Supervised Learning for Image Segmentation: A Comprehensive Survey | Thangarajah Akilan et.al. | 2505.13584 | null |
| 2025-05-19 | FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching | Alp Eren Sari et.al. | 2505.13174 | null |
| 2025-05-20 | Industrial Synthetic Segment Pre-training | Shinichi Mae et.al. | 2505.13099 | null |
| 2025-05-19 | Robust Multimodal Segmentation with Representation Regularization and Hybrid Prototype Distillation | Jiaqi Tan et.al. | 2505.12861 | link |
| 2025-05-19 | Enhancing Transformers Through Conditioned Embedded Tokens | Hemanth Saratchandran et.al. | 2505.12789 | null |
| 2025-05-18 | Temporal-Spectral-Spatial Unified Remote Sensing Dense Prediction | Sijie Zhao et.al. | 2505.12280 | link |
| 2025-05-17 | SoftPQ: Robust Instance Segmentation Evaluation via Soft Matching and Tunable Thresholds | Ranit Karmakar et.al. | 2505.12155 | link |
| 2025-05-17 | EarthSynth: Generating Informative Earth Observation with Diffusion Models | Jiancheng Pan et.al. | 2505.12108 | null |
| 2025-05-17 | iSegMan: Interactive Segment-and-Manipulate 3D Gaussians | Yian Zhao et.al. | 2505.11934 | null |
| 2025-05-17 | Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Boosting Off-Road Segmentation via Photometric Distortion and Exponential Moving Average | Wonjune Kim et.al. | 2505.11769 | null |
| 2025-05-16 | DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation | Ziyu Zhao et.al. | 2505.11676 | null |
| 2025-05-16 | SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision | Utsav Rai et.al. | 2505.11439 | null |
| 2025-05-16 | Pseudo-Label Quality Decoupling and Correction for Semi-Supervised Instance Segmentation | Jianghang Lin et.al. | 2505.11075 | null |
| 2025-05-16 | Completely Weakly Supervised Class-Incremental Learning for Semantic Segmentation | David Minkwan Kim et.al. | 2505.10781 | null |
| 2025-05-15 | Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis | Francisco Raverta Capua et.al. | 2505.10751 | null |
| 2025-05-15 | TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation | Manthan Patel et.al. | 2505.10696 | null |
| 2025-05-15 | SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity | Shihao Zou et.al. | 2505.10352 | null |
| 2025-05-15 | APCoTTA: Continual Test-Time Adaptation for Semantic Segmentation of Airborne LiDAR Point Clouds | Yuan Gao et.al. | 2505.09971 | link |
| 2025-05-14 | FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization | Xiaoyang Yu et.al. | 2505.09385 | null |
| 2025-05-14 | MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning | Bin-Bin Gao et.al. | 2505.09265 | link |
| 2025-05-13 | MESSI: A Multi-Elevation Semantic Segmentation Image Dataset of an Urban Environment | Barak Pinkovich et.al. | 2505.08589 | null |
| 2025-05-14 | The RaspGrade Dataset: Towards Automatic Raspberry Ripeness Grading with Deep Learning | Mohamed Lamine Mekhalfi et.al. | 2505.08537 | null |
| 2025-05-13 | Dynamic Snake Upsampling Operator and Boundary-Skeleton Weighted Loss for Tubular Structure Segmentation | Yiqi Chen et.al. | 2505.08525 | null |
| 2025-05-13 | Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency | Adel Ammar et.al. | 2505.08445 | null |
| 2025-05-13 | GNCAF: A GNN-based Neighboring Context Aggregation Framework for Tertiary Lymphoid Structures Semantic Segmentation in WSI | Lei Su et.al. | 2505.08430 | null |
| 2025-05-12 | Vision Foundation Model Embedding-Based Semantic Anomaly Detection | Max Peter Ronecker et.al. | 2505.07998 | null |
| 2025-05-12 | Privacy Risks of Robot Vision: A User Study on Image Modalities and Resolution | Xuying Huang et.al. | 2505.07766 | null |
| 2025-05-12 | Feedback-Driven Pseudo-Label Reliability Assessment: Redefining Thresholding for Semi-Supervised Semantic Segmentation | Negin Ghamsarian et.al. | 2505.07691 | null |
| 2025-05-12 | MAIS: Memory-Attention for Interactive Segmentation | Mauricio Orbes-Arteaga et.al. | 2505.07511 | null |
| 2025-05-13 | TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset | Olaf Wysocki et.al. | 2505.07396 | null |
| 2025-05-11 | Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution | Zihang Liu et.al. | 2505.07071 | link |
| 2025-05-11 | Depth-Sensitive Soft Suppression with RGB-D Inter-Modal Stylization Flow for Domain Generalization Semantic Segmentation | Binbin Wei et.al. | 2505.07050 | null |
| 2025-05-11 | Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Leveraging Color Shift Correction, RoPE-Swin Backbone, and Quantile-based Label Denoising Strategy for Robust Outdoor Scene Understanding | Chih-Chung Hsu et.al. | 2505.06991 | null |
| 2025-05-11 | Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation | Seokjun Kwon et.al. | 2505.06951 | null |
| 2025-05-10 | Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization | Xu Zheng et.al. | 2505.06635 | null |
| 2025-05-10 | RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation | Zhiwen Zeng et.al. | 2505.06515 | null |
| 2025-05-09 | Brain Hematoma Marker Recognition Using Multitask Learning: SwinTransformer and Swin-Unet | Kodai Hirata et.al. | 2505.06185 | null |
| 2025-05-08 | CottonSim: Development of an autonomous visual-guided robotic cotton-picking system in the Gazebo | Thevathayarajh Thayananthan et.al. | 2505.05317 | null |
| 2025-05-08 | RepSNet: A Nucleus Instance Segmentation model based on Boundary Regression and Structural Re-parameterization | Shengchun Xiong et.al. | 2505.05073 | null |
| 2025-05-09 | UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model | Timo Kaiser et.al. | 2505.05049 | link |
| 2025-05-08 | Split Matching for Inductive Zero-shot Semantic Segmentation | Jialei Chen et.al. | 2505.05023 | null |
| 2025-05-08 | Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model | Navin Ranjan et.al. | 2505.04861 | null |
| 2025-05-07 | Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions? | Shashank Agnihotri et.al. | 2505.04835 | link |
| 2025-05-07 | Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer | Sainath Dey et.al. | 2505.04740 | null |
| 2025-05-07 | DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception | Junjie Wang et.al. | 2505.04410 | link |
| 2025-05-07 | MFSeg: Efficient Multi-frame 3D Semantic Segmentation | Chengjie Huang et.al. | 2505.04408 | null |
| 2025-05-06 | Self-Supervised Learning for Robotic Leaf Manipulation: A Hybrid Geometric-Neural Approach | Srecharan Selvam et.al. | 2505.03702 | null |
| 2025-05-06 | CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting | Huawei Sun et.al. | 2505.03679 | null |
| 2025-05-06 | Panoramic Out-of-Distribution Segmentation | Mengfei Duan et.al. | 2505.03539 | link |
| 2025-05-06 | 3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation | Andrew Caunes et.al. | 2505.03300 | null |
| 2025-05-05 | Platelet enumeration in dense aggregates | H. Martin Gillis et.al. | 2505.02751 | null |
| 2025-05-04 | Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation | Volodymyr Havrylov et.al. | 2505.02075 | link |
| 2025-05-04 | Segment Any RGB-Thermal Model with Language-aided Distillation | Dong Xing et.al. | 2505.01950 | null |
| 2025-05-03 | OODTE: A Differential Testing Engine for the ONNX Optimizer | Nikolaos Louloudakis et.al. | 2505.01892 | null |
| 2025-05-03 | A Novel WaveInst-based Network for Tree Trunk Structure Extraction and Pattern Analysis in Forest Inventory | Chenyang Fan et.al. | 2505.01656 | null |
| 2025-05-02 | A Sensor Agnostic Domain Generalization Framework for Leveraging Geospatial Foundation Models: Enhancing Semantic Segmentation via Synergistic Pseudo-Labeling and Generative Learning | Anan Yaghmour et.al. | 2505.01558 | null |
| 2025-05-02 | Rethinking RGB-Event Semantic Segmentation with a Novel Bidirectional Motion-enhanced Event Representation | Zhen Yao et.al. | 2505.01548 | link |
| 2025-05-02 | Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing | Fahong Zhang et.al. | 2505.01385 | null |
| 2025-05-02 | GeloVec: Higher Dimensional Geometric Smoothing for Coherent Visual Feature Extraction in Image Segmentation | Boris Kriuk et.al. | 2505.01057 | null |
| 2025-04-30 | MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection | Qiushi Yang et.al. | 2505.00739 | null |
| 2025-05-03 | Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook | Muyi Bao et.al. | 2505.00630 | null |
| 2025-05-01 | Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation | Feng Xue et.al. | 2505.00378 | null |
| 2025-04-30 | Common3D: Self-Supervised Learning of 3D Morphable Models for Common Objects in Neural Feature Space | Leonhard Sommer et.al. | 2504.21749 | null |
| 2025-04-30 | Real Time Semantic Segmentation of High Resolution Automotive LiDAR Scans | Hannes Reichert et.al. | 2504.21602 | null |
| 2025-04-30 | Make Both Ends Meet: A Synergistic Optimization Infrared Small Target Detection with Streamlined Computational Overhead | Yuxin Jing et.al. | 2504.21581 | null |
| 2025-04-30 | ClassWise-CRF: Category-Specific Fusion for Enhanced Semantic Segmentation of Remote Sensing Imagery | Qinfeng Zhu et.al. | 2504.21491 | null |
| 2025-04-29 | DeepVoid: A Deep Learning Void Detector | Sam Kumagai et.al. | 2504.21134 | null |
| 2025-04-29 | Learning a General Model: Folding Clothing with Topological Dynamics | Yiming Liu et.al. | 2504.20720 | null |
| 2025-04-29 | OG-HFYOLO: Orientation gradient guidance and heterogeneous feature fusion for deformation table cell instance segmentation | Long Liu et.al. | 2504.20682 | link |
| 2025-04-28 | DeepAndes: A Self-Supervised Vision Foundation Model for Multi-Spectral Remote Sensing Imagery of the Andes | Junlin Guo et.al. | 2504.20303 | null |
| 2025-04-28 | Learning Streaming Video Representation via Multitask Training | Yibin Yan et.al. | 2504.20041 | null |
| 2025-04-28 | SRMF: A Data Augmentation and Multimodal Fusion Approach for Long-Tail UHR Satellite Image Segmentation | Yulong Guo et.al. | 2504.19839 | null |
| 2025-04-28 | Open-set Anomaly Segmentation in Complex Scenarios | Song Xia et.al. | 2504.19706 | null |
| 2025-04-28 | SubGrapher: Visual Fingerprinting of Chemical Structures | Lucas Morin et.al. | 2504.19695 | null |
| 2025-04-28 | BARIS: Boundary-Aware Refinement with Environmental Degradation Priors for Robust Underwater Instance Segmentation | Pin-Chi Pan et.al. | 2504.19643 | null |
| 2025-04-28 | Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding | Yan Wang et.al. | 2504.19500 | null |
| 2025-04-28 | GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field | Zuxing Lu et.al. | 2504.19409 | null |
| 2025-04-27 | OpenFusion++: An Open-vocabulary Real-time Scene Understanding System | Xiaofeng Jin et.al. | 2504.19266 | null |
| 2025-04-27 | DeepSPG: Exploring Deep Semantic Prior Guidance for Low-light Image Enhancement with Multimodal Learning | Jialang Lu et.al. | 2504.19127 | null |
| 2025-04-26 | VISUALCENT: Visual Human Analysis using Dynamic Centroid Representation | Niaz Ahmad et.al. | 2504.19032 | null |
| 2025-04-25 | A Data-Centric Approach to 3D Semantic Segmentation of Railway Scenes | Nicolas Münger et.al. | 2504.18213 | null |
| 2025-04-25 | Multi-Grained Compositional Visual Clue Learning for Image Intent Recognition | Yin Tang et.al. | 2504.18201 | null |
| 2025-04-25 | What is the Added Value of UDA in the VFM Era? | Brunó B. Englert et.al. | 2504.18190 | null |
| 2025-04-25 | Back to Fundamentals: Low-Level Visual Features Guided Progressive Token Pruning | Yuanbing Ouyang et.al. | 2504.17996 | null |
| 2025-04-24 | Virtual Roads, Smarter Safety: A Digital Twin Framework for Mixed Autonomous Traffic Safety Analysis | Hao Zhang et.al. | 2504.17968 | null |
| 2025-04-24 | Masked strategies for images with small objects | H. Martin Gillis et.al. | 2504.17935 | null |
| 2025-04-24 | Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images | Zebo Huang et.al. | 2504.17582 | null |
| 2025-04-23 | Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection | Jens Petersen et.al. | 2504.17076 | null |
| 2025-04-23 | SemanticSugarBeets: A Multi-Task Framework and Dataset for Inspecting Harvest and Storage Characteristics of Sugar Beets | Gerardus Croonen et.al. | 2504.16684 | null |
| 2025-04-23 | Federated EndoViT: Pretraining Vision Transformers via Federated Learning on Endoscopic Image Collections | Max Kirchner et.al. | 2504.16612 | null |
| 2025-04-23 | SAIP-Net: Enhancing Remote Sensing Image Segmentation via Spectral Adaptive Information Propagation | Zhongtao Wang et.al. | 2504.16564 | null |
| 2025-04-23 | Beyond Anonymization: Object Scrubbing for Privacy-Preserving 2D and 3D Vision Tasks | Murat Bilgehan Ertan et.al. | 2504.16557 | null |
| 2025-04-22 | Efficient Adaptation of Deep Neural Networks for Semantic Segmentation in Space Applications | Leonardo Olivi et.al. | 2504.15991 | null |
| 2025-04-22 | DINOv2-powered Few-Shot Semantic Segmentation: A Unified Framework via Cross-Model Distillation and 4D Correlation Mining | Wei Zhuo et.al. | 2504.15669 | null |
| 2025-04-21 | Segmentation with Noisy Labels via Spatially Correlated Distributions | Ryu Tadokoro et.al. | 2504.14795 | link |
| 2025-04-20 | NVSMask3D: Hard Visual Prompting with Camera Pose Interpolation for 3D Open Vocabulary Instance Segmentation | Junyuan Fang et.al. | 2504.14638 | null |
| 2025-04-19 | Exploring Modality Guidance to Enhance VFM-based Feature Fusion for UDA in 3D Semantic Segmentation | Johannes Spoecklberger et.al. | 2504.14231 | null |
| 2025-04-19 | Segment Any Crack: Deep Semantic Segmentation Adaptation for Crack Detection | Ghodsiyeh Rostami et.al. | 2504.14138 | null |
| 2025-04-19 | Lightweight Road Environment Segmentation using Vector Quantization | Jiyong Kwag et.al. | 2504.14113 | null |
| 2025-04-18 | Occlusion-Ordered Semantic Instance Segmentation | Soroosh Baselizadeh et.al. | 2504.14054 | null |
| 2025-04-18 | HDBFormer: Efficient RGB-D Semantic Segmentation with A Heterogeneous Dual-Branch Framework | Shuobin Wei et.al. | 2504.13579 | null |
| 2025-04-18 | Learning from Noisy Pseudo-labels for All-Weather Land Cover Mapping | Wang Liu et.al. | 2504.13458 | link |
| 2025-04-18 | DADU: Dual Attention-based Deep Supervised UNet for Automated Semantic Segmentation of Cardiac Images | Racheal Mukisa et.al. | 2504.13415 | null |
| 2025-04-18 | Cardiac MRI Semantic Segmentation for Ventricles and Myocardium using Deep Learning | Racheal Mukisa et.al. | 2504.13391 | null |
| 2025-04-17 | SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling | Yasin Almalioglu et.al. | 2504.13310 | null |
| 2025-04-17 | Digital Twin Generation from Visual Data: A Survey | Andrew Melnik et.al. | 2504.13159 | null |
| 2025-04-17 | High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion | Libo Zhang et.al. | 2504.12844 | null |
| 2025-04-17 | Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation | Siyu Chen et.al. | 2504.12753 | link |
| 2025-04-17 | Parsimonious Dataset Construction for Laparoscopic Cholecystectomy Structure Segmentation | Yuning Zhou et.al. | 2504.12573 | null |
| 2025-04-17 | Privacy-Preserving Operating Room Workflow Analysis using Digital Twins | Alejandra Perez et.al. | 2504.12552 | null |
| 2025-04-16 | 3D-PointZshotS: Geometry-Aware 3D Point Cloud Zero-Shot Semantic Segmentation Narrowing the Visual-Semantic Gap | Minmin Yang et.al. | 2504.12442 | null |
| 2025-04-16 | Remote sensing colour image semantic segmentation of trails created by large herbivorous Mammals | Jose Francisco Diez-Pastor et.al. | 2504.12121 | null |
| 2025-04-17 | DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Mengshi Qi et.al. | 2504.12080 | link |
| 2025-04-16 | Single-shot Star-convex Polygon-based Instance Segmentation for Spatially-correlated Biomedical Objects | Trina De et.al. | 2504.12078 | null |
| 2025-04-16 | CAGS: Open-Vocabulary 3D Scene Understanding with Context-Aware Gaussian Splatting | Wei Sun et.al. | 2504.11893 | null |
| 2025-04-15 | CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image | Jingshun Huang et.al. | 2504.11230 | null |
| 2025-04-15 | Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation | Andrea Simonelli et.al. | 2504.11024 | null |
| 2025-04-15 | PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation | Bo-Cheng Hu et.al. | 2504.10986 | null |
| 2025-04-15 | LightFormer: A lightweight and efficient decoder for remote sensing image segmentation | Sihang Chen et.al. | 2504.10834 | null |
| 2025-04-15 | OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding | Dianbing Xi et.al. | 2504.10825 | null |
| 2025-04-15 | Efficient and Robust Remote Sensing Image Denoising Using Randomized Approximation of Geodesics’ Gramian on the Manifold Underlying the Patch Space | Kelum Gajamannage et.al. | 2504.10820 | null |
| 2025-04-14 | Real-time Seafloor Segmentation and Mapping | Michele Grimaldi et.al. | 2504.10750 | null |
| 2025-04-14 | FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation | Yasser Benigmim et.al. | 2504.10487 | null |
| 2025-04-14 | The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer | Weixian Lei et.al. | 2504.10462 | null |
| 2025-04-14 | M2S-RoAD: Multi-Modal Semantic Segmentation for Road Damage Using Camera and LiDAR Data | Tzu-Yun Tseng et.al. | 2504.10123 | null |
| 2025-04-14 | DUDA: Distilled Unsupervised Domain Adaptation for Lightweight Semantic Segmentation | Beomseok Kang et.al. | 2504.09814 | null |
| 2025-04-14 | IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme | Dinh Dai Quan Tran et.al. | 2504.09797 | null |
| 2025-04-14 | Advancing RFI-Detection in Radio Astronomy with Liquid State Machines | Nicholas J Pritchard et.al. | 2504.09796 | null |
| 2025-04-12 | Evolved Hierarchical Masking for Self-Supervised Learning | Zhanzhou Feng et.al. | 2504.09155 | null |
| 2025-04-11 | Data-Importance-Aware Power Allocation for Adaptive Real-Time Communication in Computer Vision Applications | Chunmei Xu et.al. | 2504.08922 | null |
| 2025-04-11 | Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing | Vinal Asodia et.al. | 2504.08704 | null |
| 2025-04-11 | Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation | Bram Vanherle et.al. | 2504.08473 | link |
| 2025-04-11 | SN-LiDAR: Semantic Neural Fields for Novel Space-time View LiDAR Synthesis | Yi Chen et.al. | 2504.08361 | null |
| 2025-04-11 | DSM: Building A Diverse Semantic Map for 3D Visual Grounding | Qinghongbing Xie et.al. | 2504.08307 | null |
| 2025-04-10 | ChildlikeSHAPES: Semantic Hierarchical Region Parsing for Animating Figure Drawings | Astitva Srivastava et.al. | 2504.08022 | null |
| 2025-04-10 | P2Object: Single Point Supervised Object Detection and Instance Segmentation | Pengfei Chen et.al. | 2504.07813 | null |
| 2025-04-10 | Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation | Yanglin Huang et.al. | 2504.07691 | null |
| 2025-04-10 | SydneyScapes: Image Segmentation for Australian Environments | Hongyu Lyu et.al. | 2504.07542 | null |
| 2025-04-10 | RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability | Jonggwon Park et.al. | 2504.07416 | null |
| 2025-04-09 | RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration | Omar Alama et.al. | 2504.06994 | null |
| 2025-04-09 | Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting | Daiwei Zhang et.al. | 2504.06978 | null |
| 2025-04-09 | Domain Generalization through Attenuation of Domain-Specific Information | Reiji Saito et.al. | 2504.06781 | null |
| 2025-04-08 | SemiDAViL: Semi-supervised Domain Adaptation with Vision-Language Guidance for Semantic Segmentation | Hritam Basak et.al. | 2504.06389 | null |
| 2025-04-09 | Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation | Xiaoxing Hu et.al. | 2504.06220 | null |
| 2025-04-08 | WoundAmbit: Bridging State-of-the-Art Semantic Segmentation and Real-World Wound Care | Vanessa Borst et.al. | 2504.06185 | null |
| 2025-04-08 | Towards Varroa destructor mite detection using a narrow spectra illumination | Samuel Bielik et.al. | 2504.06099 | null |
| 2025-04-08 | econSG: Efficient and Multi-view Consistent Open-Vocabulary 3D Semantic Gaussians | Can Zhang et.al. | 2504.06003 | null |
| 2025-04-08 | Turin3D: Evaluating Adaptation Strategies under Label Scarcity in Urban LiDAR Segmentation with Semi-Supervised Techniques | Luca Barco et.al. | 2504.05882 | null |
| 2025-04-08 | DefMamba: Deformable Visual State Space Model | Leiye Liu et.al. | 2504.05794 | null |
| 2025-04-08 | Transferable Mask Transformer: Cross-domain Semantic Segmentation with Region-adaptive Transferability Estimation | Enming Zhang et.al. | 2504.05774 | null |
| 2025-04-07 | S^4M: Boosting Semi-Supervised Instance Segmentation with SAM | Heeji Yoon et.al. | 2504.05301 | null |
| 2025-04-07 | BoxSeg: Quality-Aware and Peer-Assisted Learning for Box-supervised Instance Segmentation | Jinxiang Lai et.al. | 2504.05137 | null |
| 2025-04-07 | Balancing Robustness and Efficiency in Embedded DNNs Through Activation Function Selection | Jon Gutiérrez Zaballa et.al. | 2504.05119 | null |
| 2025-04-07 | Prior2Former – Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation | Sebastian Schmidt et.al. | 2504.04841 | null |
| 2025-04-07 | DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation | Bo-Wen Yin et.al. | 2504.04701 | link |
| 2025-04-06 | Statistical Guarantees Of False Discovery Rate In Medical Instance Segmentation Tasks Based on Conformal Risk Control | Mengxia Dai et.al. | 2504.04482 | null |
| 2025-04-06 | Evaluation framework for Image Segmentation Algorithms | Tatiana Merkulova et.al. | 2504.04435 | null |
| 2025-04-05 | CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation | Kai Fang et.al. | 2504.04156 | null |
| 2025-04-05 | DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning | Xiao-Hui Li et.al. | 2504.04085 | null |
| 2025-04-04 | Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation | Xin Zhang et.al. | 2504.03193 | null |
| 2025-04-03 | Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation | Feng Gao et.al. | 2504.02647 | null |
| 2025-04-03 | Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results | Andrei Dumitriu et.al. | 2504.02558 | null |
| 2025-04-03 | Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery | Mykola Lavreniuk et.al. | 2504.02534 | null |
| 2025-04-03 | Semantic segmentation of forest stands using deep learning | Håkon Næss Sandum et.al. | 2504.02471 | null |
| 2025-04-03 | Taylor Series-Inspired Local Structure Fitting Network for Few-shot Point Cloud Semantic Segmentation | Changshuo Wang et.al. | 2504.02454 | null |
| 2025-04-03 | Benchmark of Segmentation Techniques for Pelvic Fracture in CT and X-ray: Summary of the PENGWIN 2024 Challenge | Yudi Sang et.al. | 2504.02382 | null |
| 2025-04-03 | APSeg: Auto-Prompt Model with Acquired and Injected Knowledge for Nuclear Instance Segmentation and Classification | Liying Xu et.al. | 2504.02222 | null |
| 2025-04-02 | Scene-Centric Unsupervised Panoptic Segmentation | Oliver Hahn et.al. | 2504.01955 | link |
| 2025-04-02 | Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation | Junjie Chen et.al. | 2504.01668 | null |
| 2025-04-03 | Robust Unsupervised Domain Adaptation for 3D Point Cloud Segmentation Under Source Adversarial Attacks | Haosheng Li et.al. | 2504.01659 | null |
| 2025-04-02 | ProtoGuard-guided PROPEL: Class-Aware Prototype Enhancement and Progressive Labeling for Incremental 3D Point Cloud Segmentation | Haosheng Li et.al. | 2504.01648 | null |
| 2025-04-02 | Benchmarking the Spatial Robustness of DNNs via Natural and Adversarial Localized Corruptions | Giulia Marchiori Pietrosanti et.al. | 2504.01632 | null |
| 2025-04-02 | Instance Migration Diffusion for Nuclear Instance Segmentation in Pathology | Lirui Qi et.al. | 2504.01577 | null |
| 2025-04-02 | Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training | Luca Ciampi et.al. | 2504.01547 | null |
| 2025-04-02 | Beyond Nearest Neighbor Interpolation in Data Augmentation | Olivier Rukundo et.al. | 2504.01527 | null |
| 2025-04-02 | Multimodal Point Cloud Semantic Segmentation With Virtual Point Enhancement | Zaipeng Duan et.al. | 2504.01449 | null |
| 2025-04-02 | v-CLR: View-Consistent Learning for Open-World Instance Segmentation | Chang-Bin Zhang et.al. | 2504.01383 | null |
| 2025-03-31 | Pre-training with 3D Synthetic Data: Learning 3D Point Cloud Instance Segmentation from 3D Synthetic Scenes | Daichi Otsuka et.al. | 2503.24229 | null |
| 2025-03-31 | Spectral-Adaptive Modulation Networks for Visual Perception | Guhnoo Yun et.al. | 2503.23947 | null |
| 2025-03-31 | Bridge the Gap Between Visual and Linguistic Comprehension for Generalized Zero-shot Semantic Segmentation | Xiaoqing Guo et.al. | 2503.23806 | null |
| 2025-03-31 | Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks | Yu Zhou et.al. | 2503.23751 | null |
| 2025-03-31 | Semantic Packet Aggregation and Repeated Transmission for Text-to-Image Generation | Seunghun Lee et.al. | 2503.23734 | null |
| 2025-03-31 | CrossFormer: Cross-Segment Semantic Fusion for Document Segmentation | Tongke Ni et.al. | 2503.23671 | null |
| 2025-03-30 | BoundMatch: Boundary detection applied to semi-supervised segmentation for urban-driving scenes | Haruya Ishikawa et.al. | 2503.23519 | null |
| 2025-03-30 | Improving underwater semantic segmentation with underwater image quality attention and multi-scale aggregation attention | Xin Zuo et.al. | 2503.23422 | null |
| 2025-03-29 | Open-Vocabulary Semantic Segmentation with Uncertainty Alignment for Robotic Scene Understanding in Indoor Building Environments | Yifan Xu et.al. | 2503.23105 | null |
| 2025-03-28 | Enhancing DeepLabV3+ to Fuse Aerial and Satellite Images for Semantic Segmentation | Anas Berka et.al. | 2503.22909 | null |
| 2025-03-28 | KEVS: Enhancing Segmentation of Visceral Adipose Tissue in Pre-Cystectomy CT with Gaussian Kernel Density Estimation | Thomas Boucher et.al. | 2503.22592 | null |
| 2025-03-28 | A Dataset for Semantic Segmentation in the Presence of Unknowns | Zakaria Laskar et.al. | 2503.22309 | null |
| 2025-03-28 | Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation | Minho Park et.al. | 2503.22172 | null |
| 2025-03-28 | Beyond Background Shift: Rethinking Instance Replay in Continual Semantic Segmentation | Hongmei Yin et.al. | 2503.22136 | null |
| 2025-03-28 | Semantic segmentation for building houses from wooden cubes | Ivan Beleacov et.al. | 2503.22125 | null |
| 2025-03-28 | Baseline Systems and Evaluation Metrics for Spatial Semantic Segmentation of Sound Scenes | Binh Thien Nguyen et.al. | 2503.22088 | null |
| 2025-03-28 | A Deep Learning Framework for Boundary-Aware Semantic Segmentation | Tai An et.al. | 2503.22050 | null |
| 2025-03-27 | Foveated Instance Segmentation | Hongyi Zeng et.al. | 2503.21854 | null |
| 2025-03-27 | Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation | Reza Qorbani et.al. | 2503.21780 | link |
| 2025-03-27 | A Unified Image-Dense Annotation Generation Model for Underwater Scenes | Hongkai Lin et.al. | 2503.21771 | link |
| 2025-03-27 | Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving | Lucas Nunes et.al. | 2503.21449 | link |
| 2025-03-26 | Exploring CLIP’s Dense Knowledge for Weakly Supervised Semantic Segmentation | Zhiwei Yang et.al. | 2503.20826 | null |
| 2025-03-26 | Exploiting Temporal State Space Sharing for Video Semantic Segmentation | Syed Ariff Syed Hesham et.al. | 2503.20824 | null |
| 2025-03-26 | Assessing SAM for Tree Crown Instance Segmentation from Drone Imagery | Mélisande Teng et.al. | 2503.20199 | null |
| 2025-03-25 | Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception | Luke Chen et.al. | 2503.20011 | null |
| 2025-03-25 | The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs | Jonathan Sauder et.al. | 2503.20000 | null |
| 2025-03-25 | LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation | Vladan Stojnić et.al. | 2503.19777 | link |
| 2025-03-25 | OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations | Christina Kassab et.al. | 2503.19764 | null |
| 2025-03-25 | Show or Tell? Effectively prompting Vision-Language Models for semantic segmentation | Niccolo Avogaro et.al. | 2503.19647 | null |
| 2025-03-25 | Exploring Textual Semantics Diversity for Image Transmission in Semantic Communication Systems using Visual Language Model | Peishan Huang et.al. | 2503.19386 | null |
| 2025-03-25 | BIMII-Net: Brain-Inspired Multi-Iterative Interactive Network for RGB-T Road Scene Semantic Segmentation | Hanshuo Qiu et.al. | 2503.19303 | null |
| 2025-03-25 | Multiscale Feature Importance-based Bit Allocation for End-to-End Feature Coding for Machines | Junle Liu et.al. | 2503.19278 | null |
| 2025-03-25 | Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications | Ben Rahman et.al. | 2503.19276 | null |
| 2025-03-24 | DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation | Karim Abou Zeid et.al. | 2503.18944 | link |
| 2025-03-24 | Exploring the Integration of Key-Value Attention Into Pure and Hybrid Transformers for Semantic Segmentation | DeShin Hwa et.al. | 2503.18862 | null |
| 2025-03-24 | EgoSurgery-HTS: A Dataset for Egocentric Hand-Tool Segmentation in Open Surgery Videos | Nathan Darjana et.al. | 2503.18755 | null |
| 2025-03-24 | HiRes-FusedMIM: A High-Resolution RGB-DSM Pre-trained Model for Building-Level Remote Sensing Applications | Guneet Mutreja et.al. | 2503.18540 | null |
| 2025-03-24 | Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality Robustness | Chenfei Liao et.al. | 2503.18445 | null |
| 2025-03-24 | PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes | Xinhua Xu et.al. | 2503.18393 | null |
| 2025-03-24 | MaSS13K: A Matting-level Semantic Segmentation Benchmark | Chenxi Xie et.al. | 2503.18364 | null |
| 2025-03-23 | PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding | Hongjia Zhai et.al. | 2503.18107 | null |
| 2025-03-23 | Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images | Yara AlaaEldin et.al. | 2503.17982 | null |
| 2025-03-23 | FisherTune: Fisher-Guided Robust Tuning of Vision Foundation Models for Domain Generalized Segmentation | Dong Zhao et.al. | 2503.17940 | null |
| 2025-03-21 | Center-guided Classifier for Semantic Segmentation of Remote Sensing Images | Wei Zhang et.al. | 2503.16963 | null |
| 2025-03-21 | Seg2Box: 3D Object Detection by Point-Wise Semantics Supervision | Maoji Zheng et.al. | 2503.16811 | null |
| 2025-03-20 | SAGE: Semantic-Driven Adaptive Gaussian Splatting in Extended Reality | Chiara Schiavo et.al. | 2503.16747 | null |
| 2025-03-20 | Panoptic-CUDAL Technical Report: Rural Australia Point Cloud Dataset in Rainy Conditions | Tzu-Yun Tseng et.al. | 2503.16378 | null |
| 2025-03-20 | M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation | Markus Karmann et.al. | 2503.16254 | null |
| 2025-03-20 | Controllable Segmentation-Based Text-Guided Style Editing | Jingwen Li et.al. | 2503.16129 | null |
| 2025-03-20 | No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather | Junsung Park et.al. | 2503.15910 | null |
| 2025-03-19 | High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight | Cédric Vincent et.al. | 2503.15676 | link |
| 2025-03-19 | Transport-Related Surface Detection with Machine Learning: Analyzing Temporal Trends in Madrid and Vienna | Miguel Ureña Pliego et.al. | 2503.15653 | link |
| 2025-03-19 | CAM-Seg: A Continuous-valued Embedding Approach for Semantic Image Generation | Masud Ahmed et.al. | 2503.15617 | null |
| 2025-03-19 | SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes | Weixiao Gao et.al. | 2503.15300 | null |
| 2025-03-19 | Semantic Segmentation of Transparent and Opaque Drinking Glasses with the Help of Zero-shot Learning | Annalena Blänsdorf et.al. | 2503.15004 | null |
| 2025-03-19 | USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network | Joseph Emmanuel DL Dayo et.al. | 2503.14950 | null |
| 2025-03-19 | SemanticFlow: A Self-Supervised Framework for Joint Scene Flow Prediction and Instance Segmentation in Dynamic Environments | Yinqi Chen et.al. | 2503.14837 | null |
| 2025-03-18 | Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting | Runsong Zhu et.al. | 2503.14029 | link |
| 2025-03-18 | PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds | Barza Nisar et.al. | 2503.13914 | null |
| 2025-03-18 | Exploiting Inherent Class Label: Towards Robust Scribble Supervised Semantic Segmentation | Xinliang Zhang et.al. | 2503.13895 | link |
| 2025-03-17 | SED-MVS: Segmentation-Driven and Edge-Aligned Deformation Multi-View Stereo with Depth Restoration and Occlusion Constraint | Zhenlong Yuan et.al. | 2503.13721 | null |
| 2025-03-17 | Let Synthetic Data Shine: Domain Reassembly and Soft-Fusion for Single Domain Generalization | Hao Li et.al. | 2503.13617 | null |
| 2025-03-17 | Clustering is back: Reaching state-of-the-art LiDAR instance segmentation without training | Corentin Sautier et.al. | 2503.13203 | null |
| 2025-03-17 | 3D Hierarchical Panoptic Segmentation in Real Orchard Environments Across Different Sensors | Matteo Sodano et.al. | 2503.13188 | null |
| 2025-03-17 | DehazeMamba: SAR-guided Optical Remote Sensing Image Dehazing with Adaptive State Space Model | Zhicheng Zhao et.al. | 2503.13073 | null |
| 2025-03-17 | Adaptive Transformer Attention and Multi-Scale Fusion for Spine 3D Segmentation | Yanlin Xiang et.al. | 2503.12853 | null |
| 2025-03-17 | LangDA: Building Context-Awareness via Language for Domain Adaptive Semantic Segmentation | Chang Liu et.al. | 2503.12780 | null |
| 2025-03-17 | TransDiff: Diffusion-Based Method for Manipulating Transparent Objects Using a Single RGB-D Image | Haoxiao Wang et.al. | 2503.12779 | null |
| 2025-03-16 | Point Cloud Based Scene Segmentation: A Survey | Dan Halperin et.al. | 2503.12595 | null |
| 2025-03-16 | BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature Analysis | Weiguang Zhao et.al. | 2503.12539 | null |
| 2025-03-16 | SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs | Guibiao Liao et.al. | 2503.12535 | null |
| 2025-03-16 | Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation | Edgar Heinert et.al. | 2503.12453 | null |
| 2025-03-14 | COIN: Confidence Score-Guided Distillation for Annotation-Free Cell Segmentation | Sanghyun Jo et.al. | 2503.11439 | null |
| 2025-03-14 | CyclePose – Leveraging Cycle-Consistency for Annotation-Free Nuclei Segmentation in Fluorescence Microscopy | Jonas Utz et.al. | 2503.11266 | null |
| 2025-03-14 | SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets | Hao Liu et.al. | 2503.11133 | null |
| 2025-03-14 | A Novel Decomposed Feature-Oriented Framework for Open-Set Semantic Segmentation on LiDAR Data | Wenbang Deng et.al. | 2503.11097 | null |
| 2025-03-12 | Knowledge Consultation for Semi-Supervised Semantic Segmentation | Thuan Than et.al. | 2503.10693 | null |
| 2025-03-13 | RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing | Fengxiang Wang et.al. | 2503.10392 | link |
| 2025-03-13 | OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions | Maxim Popov et.al. | 2503.10331 | null |
| 2025-03-12 | CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation | Hariprasath Govindarajan et.al. | 2503.09878 | null |
| 2025-03-12 | Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets | Hannah Kniesel et.al. | 2503.09221 | null |
| 2025-03-12 | Learning Appearance and Motion Cues for Panoptic Tracking | Juana Valeria Hurtado et.al. | 2503.09191 | null |
| 2025-03-11 | SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories | Muzhi Zhu et.al. | 2503.08625 | null |
| 2025-03-11 | SAS: Segment Any 3D Scene with Integrated 2D Priors | Zhuoyuan Li et.al. | 2503.08512 | null |
| 2025-03-11 | WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images | Yansong Guo et.al. | 2503.08407 | null |
| 2025-03-11 | nnInteractive: Redefining 3D Promptable Segmentation | Fabian Isensee et.al. | 2503.08373 | link |
| 2025-03-11 | SegDesicNet: Lightweight Semantic Segmentation in Remote Sensing with Geo-Coordinate Embeddings for Domain Adaptation | Sachin Verma et.al. | 2503.08290 | null |
| 2025-03-11 | Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation | Deyi Ji et.al. | 2503.08043 | null |
| 2025-03-11 | DiffEGG: Diffusion-Driven Edge Generation as a Pixel-Annotation-Free Alternative for Instance Annotation | Sanghyun Jo et.al. | 2503.07982 | null |
| 2025-03-10 | Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models? | Yuru Jia et.al. | 2503.07890 | null |
| 2025-03-10 | REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding | Yan Tai et.al. | 2503.07413 | link |
| 2025-03-10 | Semantic Communications with Computer Vision Sensing for Edge Video Transmission | Yubo Peng et.al. | 2503.07252 | null |
| 2025-03-10 | OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation | Ding Zhong et.al. | 2503.07098 | null |
| 2025-03-10 | Approximate Size Targets Are Sufficient for Accurate Semantic Segmentation | Xingye Fan et.al. | 2503.06954 | null |
| 2025-03-10 | Aligning Instance-Semantic Sparse Representation towards Unsupervised Object Segmentation and Shape Abstraction with Repeatable Primitives | Jiaxin Li et.al. | 2503.06947 | null |
| 2025-03-10 | HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors | Siyu Li et.al. | 2503.06821 | null |
| 2025-03-09 | CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving | Rui Song et.al. | 2503.06744 | null |
| 2025-03-09 | Continuous Online Adaptation Driven by User Interaction for Medical Image Segmentation | Wentian Xu et.al. | 2503.06717 | null |
| 2025-03-09 | MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation | Chenfei Liao et.al. | 2503.06700 | null |
| 2025-03-09 | Asymmetric Decision-Making in Online Knowledge Distillation: Unifying Consensus and Divergence | Zhaowei Chen et.al. | 2503.06685 | null |
| 2025-03-07 | Joint 3D Point Cloud Segmentation using Real-Sim Loop: From Panels to Trees and Branches | Tian Qiu et.al. | 2503.05630 | null |
| 2025-03-07 | TomatoScanner: phenotyping tomato fruit based on only RGB image | Xiaobei Zhao et.al. | 2503.05568 | null |
| 2025-03-07 | S4M: Segment Anything with 4 Extreme Points | Adrien Meyer et.al. | 2503.05534 | null |
| 2025-03-07 | Kaiwu: A Multimodal Manipulation Dataset and Framework for Robot Learning and Human-Robot Interaction | Shuo Jiang et.al. | 2503.05231 | null |
| 2025-03-06 | EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images | Rohit Menon et.al. | 2503.04441 | null |
| 2025-03-06 | PointsToWood: A deep learning framework for complete canopy leaf-wood segmentation of TLS data across diverse European forests | Harry J. F. Owen et.al. | 2503.04420 | null |
| 2025-03-06 | Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes | Hui Zhang et.al. | 2503.04235 | null |
| 2025-03-06 | MASTER: Multimodal Segmentation with Text Prompts | Fuyang Liu et.al. | 2503.04199 | null |
| 2025-03-06 | Towards Intelligent Transportation with Pedestrians and Vehicles In-the-Loop: A Surveillance Video-Assisted Federated Digital Twin Framework | Xiaolong Li et.al. | 2503.04170 | null |
| 2025-03-06 | H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision | Yunxiao Shi et.al. | 2503.04059 | null |
| 2025-03-06 | GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding | Xihan Wang et.al. | 2503.04034 | null |
| 2025-03-06 | DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation | Amin Karimi et.al. | 2503.04006 | null |
| 2025-03-05 | COARSE: Collaborative Pseudo-Labeling with Coarse Real Labels for Off-Road Semantic Segmentation | Aurelio Noca et.al. | 2503.03947 | null |
| 2025-03-05 | SurgiSAM2: Fine-tuning a foundational model for surgical video anatomy segmentation and detection | Devanish N. Kamtam et.al. | 2503.03942 | null |
| 2025-03-05 | Automatic Drywall Analysis for Progress Tracking and Quality Control in Construction | Mariusz Trzeciakiewicz et.al. | 2503.03422 | null |
| 2025-03-05 | Golden Cudgel Network for Real-Time Semantic Segmentation | Guoyu Yang et.al. | 2503.03325 | null |
| 2025-03-05 | Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters | Julia Hindel et.al. | 2503.03299 | null |
| 2025-03-05 | Interactive Segmentation and Report Generation for CT Images | Yannian Gu et.al. | 2503.03294 | null |
| 2025-03-05 | Car-STAGE: Automated framework for large-scale high-dimensional simulated time-series data generation based on user-defined criteria | Asma A. Almutairi et.al. | 2503.03100 | null |
| 2025-03-05 | AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model | Wenlun Zhang et.al. | 2503.03088 | null |
| 2025-03-04 | Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance | Jiayi Zhao et.al. | 2503.02581 | link |
| 2025-03-04 | MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments | Ege Özsoy et.al. | 2503.02579 | link |
| 2025-03-04 | TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping | Xinying Hong et.al. | 2503.02578 | null |
| 2025-03-04 | Exploring Token-Level Augmentation in Vision Transformer for Semi-Supervised Semantic Segmentation | Dengke Zhang et.al. | 2503.02459 | null |
| 2025-03-04 | Label-Efficient LiDAR Panoptic Segmentation | Ahmet Selim Çanakçı et.al. | 2503.02372 | null |
| 2025-03-03 | SAGE: A Framework of Precise Retrieval for RAG | Jintao Zhang et.al. | 2503.01713 | null |
| 2025-03-04 | UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface | Hao Tang et.al. | 2503.01342 | link |
| 2025-03-03 | OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging | Yijie Tang et.al. | 2503.01309 | null |
| 2025-03-03 | Convex Hull-based Algebraic Constraint for Visual Quadric SLAM | Xiaolong Yu et.al. | 2503.01254 | link |
| 2025-03-03 | Identity documents recognition and detection using semantic segmentation with convolutional neural network | Mykola Kozlenko et.al. | 2503.01085 | null |
| 2025-02-28 | The Common Objects Underwater (COU) Dataset for Robust Underwater Object Detection | Rishi Mukherjee et.al. | 2502.20651 | null |
| 2025-02-27 | Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds | Mohamed Abdelsamad et.al. | 2502.20316 | null |
| 2025-02-27 | OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels | Meng Lou et.al. | 2502.20087 | link |
| 2025-02-28 | SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird’s-Eye-View Segmentation | Zijie Zhou et.al. | 2502.20077 | link |
| 2025-03-03 | 3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds | Hengshuo Chu et.al. | 2502.20041 | null |
| 2025-02-27 | Learning Mask Invariant Mutual Information for Masked Image Modeling | Tao Huang et.al. | 2502.19718 | null |
| 2025-02-28 | You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving | Guangfeng Jiang et.al. | 2502.19698 | null |
| 2025-02-26 | Knowledge Distillation for Semantic Segmentation: A Label Space Unification Approach | Anton Backhaus et.al. | 2502.19177 | null |
| 2025-02-26 | Enhanced Neuromorphic Semantic Segmentation Latency through Stream Event | D. Hareb et.al. | 2502.18982 | null |
| 2025-02-28 | OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation | Yunpeng Gao et.al. | 2502.18041 | null |
| 2025-02-25 | CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems | Rui Liu et.al. | 2502.17821 | null |
| 2025-02-24 | CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation | Vishal Thengane et.al. | 2502.17429 | link |
| 2025-02-25 | DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks | Canyu Zhao et.al. | 2502.17157 | link |
| 2025-02-24 | SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations | Wendi Liu et.al. | 2502.17056 | null |
| 2025-02-25 | VPNeXt – Rethinking Dense Decoding for Plain Vision Transformer | Xikai Tang et.al. | 2502.16654 | null |
| 2025-02-23 | Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration | Kim Jun-Seong et.al. | 2502.16652 | null |
| 2025-02-23 | OpenVox: Real-time Instance-level Open-vocabulary Probabilistic Voxel Representation | Yinan Deng et.al. | 2502.16528 | null |
| 2025-02-23 | Deep learning approaches to surgical video segmentation and object detection: A Scoping Review | Devanish N. Kamtam et.al. | 2502.16459 | null |
| 2025-02-22 | Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field | Wenhao Hu et.al. | 2502.16303 | null |
| 2025-02-22 | Importance-Aware Source-Channel Coding for Multi-Modal Task-Oriented Semantic Communication | Yi Ma et.al. | 2502.16194 | null |
| 2025-02-22 | FeatSharp: Your Vision Model Features, Sharper | Mike Ranzinger et.al. | 2502.16025 | link |
| 2025-02-21 | Aligning Task- and Reconstruction-Oriented Communications for Edge Intelligence | Yufeng Diao et.al. | 2502.15472 | null |
| 2025-02-21 | DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation | Luzhou Ge et.al. | 2502.15309 | link |
| 2025-02-21 | Confidence-Weighted Boundary-Aware Learning for Semi-Supervised Semantic Segmentation | Ebenezer Tarubinga et.al. | 2502.15152 | link |
| 2025-02-20 | RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird’s Eye View Segmentation | Henrique Piñeiro Monteagudo et.al. | 2502.14792 | null |
| 2025-02-20 | Multi-dataset synergistic in supervised learning to pre-label structural components in point clouds from shell construction scenes | Lukas Rauch et.al. | 2502.14721 | null |
| 2025-02-20 | Reliable Explainability of Deep Learning Spatial-Spectral Classifiers for Improved Semantic Segmentation in Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2502.14416 | null |
| 2025-02-20 | Bayesian SegNet for Semantic Segmentation with Improved Interpretation of Microstructural Evolution During Irradiation of Materials | Marjolein Oostrom et.al. | 2502.14184 | null |
| 2025-02-19 | SegRet: An Efficient Design for Semantic Segmentation with Retentive Network | Zhiyuan Li et.al. | 2502.14014 | link |
| 2025-02-19 | Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model | Huiying Shi et.al. | 2502.13990 | null |
| 2025-02-19 | MGFI-Net: A Multi-Grained Feature Integration Network for Enhanced Medical Image Segmentation | Yucheng Zeng et.al. | 2502.13808 | null |
| 2025-02-19 | CARE: Confidence-Aware Regression Estimation of building density fine-tuning EO Foundation Models | Nikolaos Dionelis et.al. | 2502.13734 | null |
| 2025-02-18 | WeedsGalore: A Multispectral and Multitemporal UAV-based Dataset for Crop and Weed Segmentation in Agricultural Maize Fields | Ekin Celikkan et.al. | 2502.13103 | link |
| 2025-02-18 | Enhancing Power Grid Inspections with Machine Learning | Diogo Lavado et.al. | 2502.13037 | null |
| 2025-02-18 | DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Tanzhe Li et.al. | 2502.12627 | null |
| 2025-02-17 | From Open-Vocabulary to Vocabulary-Free Semantic Segmentation | Klara Reichard et.al. | 2502.11891 | null |
| 2025-02-16 | Leveraging Multimodal-LLMs Assisted by Instance Segmentation for Intelligent Traffic Monitoring | Murat Arda Onsu et.al. | 2502.11304 | null |
| 2025-02-16 | Text-promptable Propagation for Referring Medical Image Sequence Segmentation | Runtian Yuan et.al. | 2502.11093 | null |
| 2025-02-16 | Detecting Cadastral Boundary from Satellite Images Using U-Net model | Neda Rahimpour Anaraki et.al. | 2502.11044 | null |
| 2025-02-15 | NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing | Shutong Zhang et.al. | 2502.10720 | null |
| 2025-02-15 | Deep Learning for Wound Tissue Segmentation: A Comprehensive Evaluation using A Novel Dataset | Muhammad Ashad Kabir et.al. | 2502.10652 | null |
| 2025-02-14 | Artificial Intelligence to Assess Dental Findings from Panoramic Radiographs – A Multinational Study | Yin-Chih Chelsea Wang et.al. | 2502.10277 | null |
| 2025-02-14 | FrGNet: A fourier-guided weakly-supervised framework for nuclear instance segmentation | Peng Ling et.al. | 2502.09874 | null |
| 2025-02-12 | Towards Fine-grained Interactive Segmentation in Images and Videos | Yuan Yao et.al. | 2502.09660 | null |
| 2025-02-13 | Instance Segmentation of Scene Sketches Using Natural Image Priors | Mia Tang et.al. | 2502.09608 | null |
| 2025-02-13 | SQ-GAN: Semantic Image Communications Using Masked Vector Quantization | Francesco Pezone et.al. | 2502.09520 | null |
| 2025-02-13 | FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation | Bin Yang et.al. | 2502.09274 | null |
| 2025-02-13 | Memory-based Ensemble Learning in CMR Semantic Segmentation | Yiwei Liu et.al. | 2502.09269 | link |
| 2025-02-13 | Latents of latents to delineate pixels: hybrid Matryoshka autoencoder-to-U-Net pairing for segmenting large medical images in GPU-poor and low-data regimes | Tahir Syed et.al. | 2502.08988 | null |
| 2025-02-12 | HistoSmith: Single-Stage Histology Image-Label Generation via Conditional Latent Diffusion for Enhanced Cell Segmentation and Classification | Valentina Vadori et.al. | 2502.08754 | link |
| 2025-02-12 | Generalized Class Discovery in Instance Segmentation | Cuong Manh Hoang et.al. | 2502.08149 | null |
| 2025-02-12 | Knowledge Swapping via Learning and Unlearning | Mingyu Xing et.al. | 2502.08075 | null |
| 2025-02-11 | Efficient Continuous Group Convolutions for Local SE(3) Equivariance in 3D Point Clouds | Lisa Weijler et.al. | 2502.07505 | link |
| 2025-02-11 | A Survey on Mamba Architecture for Vision Applications | Fady Ibrahim et.al. | 2502.07161 | null |
| 2025-02-09 | A Comprehensive Review of U-Net and Its Variants: Advances and Applications in Medical Image Segmentation | Wang Jiangtao et.al. | 2502.06895 | null |
| 2025-02-10 | SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement | Yuqi Lin et.al. | 2502.06756 | null |
| 2025-02-10 | A Large-scale AI-generated Image Inpainting Benchmark | Paschalis Giakoumoglou et.al. | 2502.06593 | link |
| 2025-02-11 | Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation | Emanuele Mule et.al. | 2502.06288 | null |
| 2025-02-10 | Unsupervised deep learning for semantic segmentation of multispectral LiDAR forest point clouds | Lassi Ruoppa et.al. | 2502.06227 | null |
| 2025-02-09 | Traveling Waves Integrate Spatial Information Into Spectral Representations | Mozes Jacobs et.al. | 2502.06034 | null |
| 2025-02-11 | VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer | Xinyu Liu et.al. | 2502.05979 | null |
| 2025-02-09 | LegalSeg: Unlocking the Structure of Indian Legal Judgments Through Rhetorical Role Classification | Shubham Kumar Nigam et.al. | 2502.05836 | null |
| 2025-02-08 | Convolutional Neural Network Segmentation for Satellite Imagery Data to Identify Landforms Using U-Net Architecture | Mitul Goswami et.al. | 2502.05476 | null |
| 2025-02-08 | LMS-Net: A Learned Mumford-Shah Network For Few-Shot Medical Image Segmentation | Shengdong Zhang et.al. | 2502.05473 | null |
| 2025-02-08 | A Novel Convolutional-Free Method for 3D Medical Imaging Segmentation | Canxuan Gang et.al. | 2502.05396 | null |
| 2025-02-07 | IPSeg: Image Posterior Mitigates Semantic Drift in Class-Incremental Segmentation | Xiao Yu et.al. | 2502.04870 | null |
| 2025-02-07 | AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers | Runqing Jiang et.al. | 2502.04628 | null |
| 2025-02-05 | DILLEMA: Diffusion and Large Language Models for Multi-Modal Augmentation | Luciano Baresi et.al. | 2502.04378 | link |
| 2025-02-06 | Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation | Jiahao Lu et.al. | 2502.04139 | null |
| 2025-02-06 | Adaptive Margin Contrastive Learning for Ambiguity-aware 3D Semantic Segmentation | Yang Chen et.al. | 2502.04111 | null |
| 2025-02-06 | LeAP: Consistent multi-domain 3D labeling using Foundation Models | Simon Gebraad et.al. | 2502.03901 | null |
| 2025-02-06 | Optimized Unet with Attention Mechanism for Multi-Scale Semantic Segmentation | Xuan Li et.al. | 2502.03813 | null |
| 2025-02-05 | Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics | Indrashis Das et.al. | 2502.03654 | link |
| 2025-02-05 | ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models | Ying Zhang et.al. | 2502.03266 | link |
| 2025-02-05 | Disentangling CLIP Features for Enhanced Localized Understanding | Samyak Rawelekar et.al. | 2502.02977 | null |
| 2025-02-05 | From DeepSense to Open RAN: AI/ML Advancements in Dynamic Spectrum Sensing and Their Applications | Ryan Barker et.al. | 2502.02889 | null |
| 2025-02-04 | Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications | William O’Donnell et.al. | 2502.02624 | null |
| 2025-02-04 | COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation | Xueqing Deng et.al. | 2502.02589 | null |
| 2025-02-04 | Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation | Junha Lee et.al. | 2502.02548 | null |
| 2025-02-04 | Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and Classification | Valentina Vadori et.al. | 2502.02471 | null |
| 2025-02-04 | Transfer Risk Map: Mitigating Pixel-level Negative Transfer in Medical Segmentation | Shutong Duan et.al. | 2502.02340 | null |
| 2025-02-04 | UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation | Tao Zhang et.al. | 2502.02257 | link |
| 2025-02-04 | Deep Ensemble approach for Enhancing Brain Tumor Segmentation in Resource-Limited Settings | Jeremiah Fadugba et.al. | 2502.02179 | null |
| 2025-02-04 | Memory Efficient Transformer Adapter for Dense Predictions | Dong Zhang et.al. | 2502.01962 | null |
| 2025-02-03 | Deep Unfolding Multi-modal Image Fusion Network via Attribution Analysis | Haowen Bai et.al. | 2502.01467 | null |
| 2025-02-03 | Temporal-consistent CAMs for Weakly Supervised Video Segmentation in Waste Sorting | Andrea Marelli et.al. | 2502.01455 | null |
| 2025-02-03 | ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies | Costin F. Ciusdel et.al. | 2502.01335 | null |
| 2025-01-31 | Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches | Ying Zang et.al. | 2501.19329 | null |
| 2025-01-31 | GO: The Great Outdoors Multimodal Dataset | Peng Jiang et.al. | 2501.19274 | null |
| 2025-01-31 | Medical Semantic Segmentation with Diffusion Pretrain | David Li et.al. | 2501.19265 | null |
| 2025-01-31 | ContextFormer: Redefining Efficiency in Semantic Segmentation | Mian Muhammad Naeem Abid et.al. | 2501.19255 | null |
| 2025-01-31 | Integrating Semi-Supervised and Active Learning for Semantic Segmentation | Wanli Ma et.al. | 2501.19227 | null |
| 2025-01-31 | Improving vision-language alignment with graph spiking hybrid Networks | Siyu Zhang et.al. | 2501.19069 | null |
| 2025-01-31 | SynthmanticLiDAR: A Synthetic Dataset for Semantic Segmentation on LiDAR Imaging | Javier Montalvo et.al. | 2501.19035 | null |
| 2025-01-31 | Project-and-Fuse: Improving RGB-D Semantic Segmentation via Graph Convolution Networks | Xiaoyan Jiang et.al. | 2501.18851 | null |
| 2025-01-30 | INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation | Jian Hu et.al. | 2501.18753 | null |
| 2025-02-03 | Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models | Hao Dong et.al. | 2501.18592 | link |
| 2025-01-30 | Tuning Vision Foundation Model via Test-Time Prompt-Guided Training for VFSS Segmentations | Chengxi Zeng et.al. | 2501.18474 | null |
| 2025-01-30 | Ground Awareness in Deep Learning for Large Outdoor Point Cloud Segmentation | Kevin Qiu et.al. | 2501.18246 | null |
| 2025-01-30 | ContourFormer: Real-Time Contour-Based End-to-End Instance Segmentation Transformer | Weiwei Yao et.al. | 2501.17688 | null |
| 2025-01-29 | Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation | Lin Chen et.al. | 2501.17642 | null |
| 2025-01-29 | 3DSES: an indoor Lidar point cloud segmentation dataset with real and pseudo-labels from a 3D model | Maxime Mérizette et.al. | 2501.17534 | null |
| 2025-01-29 | Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models | Muhammad Atta ur Rahman et.al. | 2501.16769 | null |
| 2025-01-28 | AdaSemSeg: An Adaptive Few-shot Semantic Segmentation of Seismic Facies | Surojit Saha et.al. | 2501.16760 | null |
| 2025-01-28 | SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios | Yinqi Chen et.al. | 2501.16754 | null |
| 2025-01-27 | Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation | Philip Hughes et.al. | 2501.16467 | null |
| 2025-01-27 | DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation | Han Sun et.al. | 2501.16410 | null |
| 2025-01-27 | The Linear Attention Resurrection in Vision Transformer | Chuanyang Zheng et.al. | 2501.16182 | null |
| 2025-01-27 | D-PLS: Decoupled Semantic Segmentation for 4D-Panoptic-LiDAR-Segmentation | Maik Steinhauser et.al. | 2501.15870 | null |
| 2025-01-26 | iFormer: Integrating ConvNet and Transformer for Mobile Application | Chuanyang Zheng et.al. | 2501.15369 | link |
| 2025-01-25 | A Training-free Synthetic Data Selection Method for Semantic Segmentation | Hao Tang et.al. | 2501.15201 | null |
| 2025-01-24 | 3DLabelProp: Geometric-Driven Domain Generalization for LiDAR Semantic Segmentation in Autonomous Driving | Jules Sanchez et.al. | 2501.14605 | link |
| 2025-01-24 | Effective Defect Detection Using Instance Segmentation for NDI | Ashiqur Rahman et.al. | 2501.14149 | null |
| 2025-01-23 | ME-CPT: Multi-Task Enhanced Cross-Temporal Point Transformer for Urban 3D Change Detection | Luqi Zhang et.al. | 2501.14004 | link |
| 2025-01-23 | IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models | Jiayi Lei et.al. | 2501.13920 | null |
| 2025-01-23 | Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning | Zuyao You et.al. | 2501.13893 | link |
| 2025-01-23 | Where Do You Go? Pedestrian Trajectory Prediction using Scene Features | Mohammad Ali Rezaei et.al. | 2501.13848 | null |
| 2025-01-23 | Overcoming Support Dilution for Robust Few-shot Semantic Segmentation | Wailing Tang et.al. | 2501.13529 | null |
| 2025-01-22 | Revisiting Data Augmentation for Ultrasound Images | Adam Tupper et.al. | 2501.13193 | link |
| 2025-01-22 | A Novel Scene Coupling Semantic Mask Network for Remote Sensing Image Segmentation | Xiaowen Ma et.al. | 2501.13130 | link |
| 2025-01-22 | Hybridization of Attention UNet with Repeated Atrous Spatial Pyramid Pooling for Improved Brain Tumour Segmentation | Satyaki Roy Chowdhury et.al. | 2501.13129 | null |
| 2025-01-22 | Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks | Alessio Quercia et.al. | 2501.12824 | null |
| 2025-01-19 | Comparative Analysis of Hand-Crafted and Machine-Driven Histopathological Features for Prostate Cancer Classification and Segmentation | Feda Bolus Al Baqain et.al. | 2501.12415 | null |
| 2025-01-21 | Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems | Stefano Carlo Lambertenghi et.al. | 2501.12269 | null |
| 2025-01-21 | A margin-based replacement for cross-entropy loss | Michael W. Spratling et.al. | 2501.12191 | null |
| 2025-01-21 | Foreign object segmentation in chest x-rays through anatomy-guided shape insertion | Constantin Seibold et.al. | 2501.12022 | null |
| 2025-01-21 | Data-driven Detection and Evaluation of Damages in Concrete Structures: Using Deep Learning and Computer Vision | Saeid Ataei et.al. | 2501.11836 | null |
| 2025-01-20 | MedicoSAM: Towards foundation models for medical image segmentation | Anwai Archit et.al. | 2501.11734 | link |
| 2025-01-20 | Automatic Labelling & Semantic Segmentation with 4D Radar Tensors | Botao Sun et.al. | 2501.11351 | null |
| 2025-01-20 | Enhancing Uncertainty Estimation in Semantic Segmentation via Monte-Carlo Frequency Dropout | Tal Zeevi et.al. | 2501.11258 | link |
| 2025-01-20 | Advancing Oyster Phenotype Segmentation with Multi-Network Ensemble and Multi-Scale mechanism | Wenli Yang et.al. | 2501.11203 | null |
| 2025-01-19 | Rethinking Early-Fusion Strategies for Improved Multimodal Image Segmentation | Zhengwen Shen et.al. | 2501.10958 | null |
| 2025-01-18 | OpenEarthMap-SAR: A Benchmark Synthetic Aperture Radar Dataset for Global High-Resolution Land Cover Mapping | Junshi Xia et.al. | 2501.10891 | null |
| 2025-01-17 | Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks | Michael Schwingshackl et.al. | 2501.10080 | link |
| 2025-01-17 | Robust Change Captioning in Remote Sensing: SECOND-CC Dataset and MModalCC Framework | Ali Can Karaca et.al. | 2501.10075 | link |
| 2025-01-17 | One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression | Keita Miwa et.al. | 2501.10064 | null |
| 2025-01-17 | LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks | Wei Lu et.al. | 2501.10040 | link |
| 2025-01-16 | The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning | Wonjun Jo et.al. | 2501.09485 | null |
| 2025-01-16 | Scaling up self-supervised learning for improved surgical foundation models | Tim J. M. Jaspers et.al. | 2501.09436 | link |
| 2025-01-16 | SVIA: A Street View Image Anonymization Framework for Self-Driving Applications | Dongyu Liu et.al. | 2501.09393 | link |
| 2025-01-15 | UNIR-Net: A Novel Approach for Restoring Underwater Images with Non-Uniform Illumination Using Synthetic Data | Ezequiel Perez-Zarate et.al. | 2501.09053 | link |
| 2025-01-15 | Pseudolabel guided pixels contrast for domain adaptive semantic segmentation | Jianzi Xiang et.al. | 2501.09040 | link |
| 2025-01-14 | FLAVARS: A Multimodal Foundational Language and Vision Alignment Model for Remote Sensing | Isaac Corley et.al. | 2501.08490 | null |
| 2025-01-14 | Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers | Efstathios Karypidis et.al. | 2501.08303 | link |
| 2025-01-14 | SmartEraser: Remove Anything from Images using Masked-Region Guidance | Longtao Jiang et.al. | 2501.08279 | null |
| 2025-01-14 | A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation | Steven Landgraf et.al. | 2501.08188 | null |
| 2025-01-14 | Threshold Attention Network for Semantic Segmentation of Remote Sensing Images | Wei Long et.al. | 2501.07984 | null |
| 2025-01-14 | SkipClick: Combining Quick Responses and Low-Level Features for Interactive Segmentation in Winter Sports Contexts | Robin Schön et.al. | 2501.07960 | null |
| 2025-01-14 | Balance Divergence for Knowledge Distillation | Yafei Qi et.al. | 2501.07804 | null |
| 2025-01-13 | Kolmogorov-Arnold Network for Remote Sensing Image Semantic Segmentation | Xianping Ma et.al. | 2501.07390 | link |
| 2025-01-13 | TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations | Daniel Steininger et.al. | 2501.07360 | link |
| 2025-01-13 | Toward Realistic Camouflaged Object Detection: Benchmarks and Method | Zhimeng Xin et.al. | 2501.07297 | link |
| 2025-01-13 | Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion | Li Liang et.al. | 2501.07260 | link |
| 2025-01-12 | LarvSeg: Exploring Image Classification Data For Large Vocabulary Semantic Segmentation via Category-wise Attentive Classifier | Haojun Yu et.al. | 2501.06862 | link |
| 2025-01-12 | SAM-DA: Decoder Adapter for Efficient Medical Domain Adaptation | Javier Gamazo Tejero et.al. | 2501.06836 | null |
| 2025-01-12 | Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation | Zhenyang Feng et.al. | 2501.06749 | null |
| 2025-01-11 | Parking Space Detection in the City of Granada | Crespo-Orti Luis et.al. | 2501.06651 | link |
| 2025-01-06 | The 2nd Place Solution from the 3D Semantic Segmentation Track in the 2024 Waymo Open Dataset Challenge | Qing Wu et.al. | 2501.05472 | null |
| 2025-01-09 | Domain-Incremental Semantic Segmentation for Autonomous Driving under Adverse Driving Conditions | Shishir Muralidhara et.al. | 2501.05246 | null |
| 2025-01-09 | Advancing ALS Applications with Large-Scale Pre-training: Dataset Development and Downstream Assessment | Haoyi Xiu et.al. | 2501.05095 | null |
| 2025-01-08 | Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation | Ulindu De Silva et.al. | 2501.04696 | link |
| 2025-01-08 | Rapid Automated Mapping of Clouds on Titan With Instance Segmentation | Zachary Yahn et.al. | 2501.04459 | link |
| 2025-01-07 | Superpixel Boundary Correction for Weakly-Supervised Semantic Segmentation on Histopathology Images | Hongyi Wu et.al. | 2501.03891 | null |
| 2025-01-07 | AutoFish: Dataset and Benchmark for Fine-grained Analysis of Fish | Stefan Hein Bengtson et.al. | 2501.03767 | null |
| 2025-01-07 | Image Segmentation: Inducing graph-based learning | Aryan Singh et.al. | 2501.03765 | link |
| 2025-01-06 | 4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation | Jiexi Zhong et.al. | 2501.02937 | null |
| 2025-01-08 | GLoG-CSUnet: Enhancing Vision Transformers with Adaptable Radiomic Features for Medical Image Segmentation | Niloufar Eghbali et.al. | 2501.02788 | link |
| 2025-01-04 | Unsupervised Class Generation to Expand Semantic Segmentation Datasets | Javier Montalvo et.al. | 2501.02264 | null |
| 2025-01-03 | DreamMask: Boosting Open-vocabulary Panoptic Segmentation with Synthetic Data | Yuanpeng Tu et.al. | 2501.02048 | null |
| 2025-01-03 | Semantic Segmentation for Sequential Historical Maps by Learning from Only One Map | Yunshuang Yuan et.al. | 2501.01845 | null |
| 2025-01-03 | Dedicated Inference Engine and Binary-Weight Neural Networks for Lightweight Instance Segmentation | Tse-Wei Chen et.al. | 2501.01841 | null |
| 2025-01-03 | IAM: Enhancing RGB-D Instance Segmentation with New Benchmarks | Aecheon Jung et.al. | 2501.01685 | link |
| 2025-01-03 | Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation | Rini Smita Thakur et.al. | 2501.01640 | null |
| 2025-01-02 | A Multi-task Supervised Compression Model for Split Computing | Yoshitomo Matsubara et.al. | 2501.01420 | link |
| 2025-01-02 | Leverage Cross-Attention for End-to-End Open-Vocabulary Panoptic Reconstruction | Xuan Yu et.al. | 2501.01119 | null |
| 2025-01-02 | Evidential Calibrated Uncertainty-Guided Interactive Segmentation paradigm for Ultrasound Images | Jiang Shang et.al. | 2501.01072 | null |
| 2025-01-02 | Efficient Connectivity-Preserving Instance Segmentation with Supervoxel-Based Loss Function | Anna Grim et.al. | 2501.01022 | link |
| 2025-01-03 | FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation | Bingyu Li et.al. | 2501.00877 | link |
| 2024-12-31 | Exploiting Boundary Loss for the Hierarchical Panoptic Segmentation of Plants and Leaves | Madeleine Darbyshire et.al. | 2501.00527 | link |
| 2024-12-31 | H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters | Pedram Fekri et.al. | 2501.00514 | null |
| 2024-12-31 | A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images | Dawen Yu et.al. | 2501.00360 | null |
| 2024-12-31 | PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM | Runnan Chen et.al. | 2501.00352 | null |
| 2024-12-31 | OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies | Runnan Chen et.al. | 2501.00326 | link |
| 2024-12-30 | HisynSeg: Weakly-Supervised Histopathological Image Segmentation via Image-Mixing Synthesis and Consistency Regularization | Zijie Fang et.al. | 2412.20924 | link |
| 2024-12-30 | LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training | Fardin Ayar et.al. | 2412.20881 | null |
| 2024-12-29 | Image Augmentation Agent for Weakly Supervised Semantic Segmentation | Wangyu Wu et.al. | 2412.20439 | null |
| 2024-12-27 | Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP | Zhongxing Xu et.al. | 2412.19650 | null |
| 2024-12-27 | An Actionable Hierarchical Scene Representation Enhancing Autonomous Inspection Missions in Unknown Environments | Vignesh Kottayam Viswanathan et.al. | 2412.19582 | null |
| 2024-12-27 | Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation | Chengyang Ye et.al. | 2412.19492 | link |
| 2024-12-26 | Impact of color and mixing proportion of synthetic point clouds on semantic segmentation | Shaojie Zhou et.al. | 2412.19145 | null |
| 2024-12-25 | Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model | Yi-Chia Chen et.al. | 2412.18917 | link |
| 2024-12-24 | AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction | Pufan Zou et.al. | 2412.18255 | null |
| 2024-12-25 | VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis | Shicheng Yin et.al. | 2412.18178 | link |
| 2024-12-24 | UniPLV: Towards Label-Efficient Open-World 3D Scene Understanding by Regional Visual Language Supervision | Yuru Wang et.al. | 2412.18131 | null |
| 2024-12-24 | LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding | Hao Li et.al. | 2412.17635 | null |
| 2024-12-25 | AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation | Jiaqi Ma et.al. | 2412.17601 | link |
| 2024-12-24 | Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation | Jianjian Yin et.al. | 2412.17331 | link |
| 2024-12-22 | Multi-Scale Foreground-Background Confidence for Out-of-Distribution Segmentation | Samuel Marschall et.al. | 2412.16990 | null |
| 2024-12-22 | Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection | Yuhang Gan et.al. | 2412.16918 | null |
| 2024-12-22 | MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection | Xu Zheng et.al. | 2412.16876 | null |
| 2024-12-22 | Adversarial Diffusion Model for Unsupervised Domain-Adaptive Semantic Segmentation | Jongmin Yu et.al. | 2412.16859 | null |
| 2024-12-21 | A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection | Shahid Ansari et.al. | 2412.16755 | null |
| 2024-12-21 | IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks | Yaming Zhang et.al. | 2412.16654 | link |
| 2024-12-21 | V"Mean"ba: Visual State Space Models only need 1 hidden dimension | Tien-Yu Chi et.al. | 2412.16602 | null |
| 2024-12-20 | SegCol Challenge: Semantic Segmentation for Tools and Fold Edges in Colonoscopy data | Xinwei Ju et.al. | 2412.16078 | null |
| 2024-12-20 | Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer | Xinyue Chen et.al. | 2412.15835 | link |
| 2024-12-19 | MultiverSeg: Scalable Interactive Segmentation of Biomedical Imaging Datasets with In-Context Guidance | Hallee E. Wong et.al. | 2412.15058 | link |
| 2024-12-19 | GIRAFE: Glottal Imaging Dataset for Advanced Segmentation, Analysis, and Facilitative Playbacks Evaluation | G. Andrade-Miranda et.al. | 2412.15054 | link |
| 2024-12-19 | PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation | Shoumeng Qiu et.al. | 2412.14821 | link |
| 2024-12-19 | Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers | Rui Ding et.al. | 2412.14633 | null |
| 2024-12-19 | Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation | Zhenxin Lei et.al. | 2412.14587 | null |
| 2024-12-18 | Split Learning in Computer Vision for Semantic Segmentation Delay Minimization | Nikos G. Evgenidis et.al. | 2412.14272 | null |
| 2024-12-18 | Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation | Jianyu Zhang et.al. | 2412.14145 | null |
| 2024-12-18 | Prompt Categories Cluster for Weakly Supervised Semantic Segmentation | Wangyu Wu et.al. | 2412.13823 | null |
| 2024-12-18 | Federated Source-free Domain Adaptation for Classification: Weighted Cluster Aggregation for Unlabeled Data | Junki Mori et.al. | 2412.13757 | null |
| 2024-12-18 | Optical aberrations in autonomous driving: Physics-informed parameterized temperature scaling for neural network uncertainty calibration | Dominik Werner Wolf et.al. | 2412.13695 | null |
| 2024-12-18 | GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting | Yuning Peng et.al. | 2412.13654 | link |
| 2024-12-18 | RelationField: Relate Anything in Radiance Fields | Sebastian Koch et.al. | 2412.13652 | null |
| 2024-12-17 | S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging | Yimu Pan et.al. | 2412.13156 | null |
| 2024-12-17 | Efficient Event-based Semantic Segmentation with Spike-driven Lightweight Transformer-based Networks | Xiaxin Zhu et.al. | 2412.12843 | null |
| 2024-12-17 | ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation | Shiqi Huang et.al. | 2412.12798 | link |
| 2024-12-17 | Open-World Panoptic Segmentation | Matteo Sodano et.al. | 2412.12740 | null |
| 2024-12-17 | SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing | Chen Chen et.al. | 2412.12685 | link |
| 2024-12-17 | Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation | Dongyue Wu et.al. | 2412.12672 | link |
| 2024-12-17 | Adaptive Prototype Replay for Class Incremental Semantic Segmentation | Guilin Zhu et.al. | 2412.12669 | null |
| 2024-12-17 | SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation | Shuangping Huang et.al. | 2412.12660 | null |
| 2024-12-16 | Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation | Hongwei Niu et.al. | 2412.12050 | link |
| 2024-12-16 | SAMIC: Segment Anything with In-Context Spatial Prompt Engineering | Savinay Nagendra et.al. | 2412.11998 | null |
| 2024-12-16 | SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation | Yunxiang Fu et.al. | 2412.11890 | link |
| 2024-12-16 | Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic Segmentation | Svetlana Pavlitska et.al. | 2412.11608 | null |
| 2024-12-16 | PyPotteryLens: An Open-Source Deep Learning Framework for Automated Digitisation of Archaeological Pottery Documentation | Lorenzo Cardarelli et.al. | 2412.11574 | null |
| 2024-12-15 | Volumetric Mapping with Panoptic Refinement via Kernel Density Estimation for Mobile Robots | Khang Nguyen et.al. | 2412.11241 | link |
| 2024-12-15 | MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation | Zhiwei Yang et.al. | 2412.11076 | link |
| 2024-12-15 | Classification Drives Geographic Bias in Street Scene Segmentation | Rahul Nair et.al. | 2412.11061 | null |
| 2024-12-15 | SAM-IF: Leveraging SAM for Incremental Few-Shot Instance Segmentation | Xudong Zhou et.al. | 2412.11034 | null |
| 2024-12-14 | RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone | Mustafa Munir et.al. | 2412.10995 | link |
| 2024-12-13 | A Universal Degradation-based Bridging Technique for Domain Adaptive Semantic Segmentation | Wangkai Li et.al. | 2412.10339 | null |
| 2024-12-13 | SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians | Siyun Liang et.al. | 2412.10231 | null |
| 2024-12-13 | SPT: Sequence Prompt Transformer for Interactive Image Segmentation | Senlin Cheng et.al. | 2412.10224 | null |
| 2024-12-13 | TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views | Liang Zhao et.al. | 2412.10051 | null |
| 2024-12-13 | Object-Focused Data Selection for Dense Prediction Tasks | Niclas Popp et.al. | 2412.10032 | null |
| 2024-12-12 | MaskTerial: A Foundation Model for Automated 2D Material Flake Detection | Jan-Lucas Uslu et.al. | 2412.09333 | null |
| 2024-12-12 | Towards Open-Vocabulary Video Semantic Segmentation | Xinhao Li et.al. | 2412.09329 | null |
| 2024-12-12 | FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation | Yuntian Bo et.al. | 2412.09319 | link |
| 2024-12-12 | VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation | Roberto Alcover-Couso et.al. | 2412.09240 | null |
| 2024-12-12 | STEAM: Squeeze and Transform Enhanced Attention Module | Rishabh Sabharwal et.al. | 2412.09023 | null |
| 2024-12-11 | SegFace: Face Segmentation of Long-Tail Classes | Kartik Narayan et.al. | 2412.08647 | link |
| 2024-12-11 | EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation | Hongwei Niu et.al. | 2412.08628 | null |
| 2024-12-12 | Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning | Fan Lu et.al. | 2412.08614 | link |
| 2024-12-11 | Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion | Bingzhi Shen et.al. | 2412.08315 | null |
| 2024-12-11 | Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction | Bohan Li et.al. | 2412.08243 | null |
| 2024-12-11 | THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots | Zeshun Li et.al. | 2412.08096 | null |
| 2024-12-11 | Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation | Zhigang Cen et.al. | 2412.08034 | null |
| 2024-12-10 | Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation | Kurt H. W. Stolle et.al. | 2412.07966 | link |
| 2024-12-11 | CADSpotting: Robust Panoptic Symbol Spotting on Large-Scale CAD Drawings | Jiazuo Mu et.al. | 2412.07377 | null |
| 2024-12-09 | SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception | Yaniv Benny et.al. | 2412.06968 | null |
| 2024-12-10 | ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet | Andrei-Robert Alexandrescu et.al. | 2412.06742 | null |
| 2024-12-09 | Active Learning with Context Sampling and One-vs-Rest Entropy for Semantic Segmentation | Fei Wu et.al. | 2412.06470 | null |
| 2024-12-09 | Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework | Jiuyi Xu et.al. | 2412.06268 | null |
| 2024-12-09 | GCUNet: A GNN-Based Contextual Learning Network for Tertiary Lymphoid Structure Semantic Segmentation in Whole Slide Image | Lei Su et.al. | 2412.06129 | null |
| 2024-12-08 | Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation | Zipeng Qi et.al. | 2412.05969 | null |
| 2024-12-08 | CSG: A Context-Semantic Guided Diffusion Approach in De Novo Musculoskeletal Ultrasound Image Generation | Elay Dahan et.al. | 2412.05833 | null |
| 2024-12-07 | Integrating YOLO11 and Convolution Block Attention Module for Multi-Season Segmentation of Tree Trunks and Branches in Commercial Apple Orchards | Ranjan Sapkota et.al. | 2412.05728 | null |
| 2024-12-10 | RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts | Xu Liu et.al. | 2412.05679 | link |
| 2024-12-06 | FogROS2-FT: Fault Tolerant Cloud Robotics | Kaiyuan Chen et.al. | 2412.05408 | null |
| 2024-12-06 | DreamColour: Controllable Video Colour Editing without Training | Chaitat Utintu et.al. | 2412.05180 | null |
| 2024-12-05 | Assessing and Learning Alignment of Unimodal Vision and Language Models | Le Zhang et.al. | 2412.04616 | link |
| 2024-12-05 | Towards Real-Time Open-Vocabulary Video Instance Segmentation | Bin Yan et.al. | 2412.04434 | null |
| 2024-12-05 | A Hitchhiker’s Guide to Understanding Performances of Two-Class Classifiers | Anaïs Halin et.al. | 2412.04377 | null |
| 2024-12-05 | Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts | Chenyang Zhu et.al. | 2412.04220 | null |
| 2024-12-05 | Text Change Detection in Multilingual Documents Using Image Comparison | Doyoung Park et.al. | 2412.04137 | null |
| 2024-12-05 | SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning | Seokju Yun et.al. | 2412.04077 | null |
| 2024-12-05 | Quality Control in Open-Ended Crowdsourcing: A Survey | Lei Chai et.al. | 2412.03991 | null |
| 2024-12-05 | Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation | Hao Zhu et.al. | 2412.03968 | link |
| 2024-12-05 | LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model | Yuan Xue et.al. | 2412.03841 | null |
| 2024-12-04 | Designing DNNs for a trade-off between robustness and processing performance in embedded devices | Jon Gutiérrez-Zaballa et.al. | 2412.03682 | null |
| 2024-12-04 | FLAIR: VLM with Fine-grained Language-informed Image Representations | Rui Xiao et.al. | 2412.03561 | link |
| 2024-12-04 | Benchmarking Pretrained Attention-based Models for Real-Time Recognition in Robot-Assisted Esophagectomy | Ronald L. P. D. de Jong et.al. | 2412.03401 | null |
| 2024-12-04 | Task-driven Image Fusion with Learnable Fusion Loss | Haowen Bai et.al. | 2412.03240 | null |
| 2024-12-04 | Biologically-inspired Semi-supervised Semantic Segmentation for Biomedical Imaging | Luca Ciampi et.al. | 2412.03192 | null |
| 2024-12-04 | Is Foreground Prototype Sufficient? Few-Shot Medical Image Segmentation with Background-Fused Prototype | Song Tang et.al. | 2412.02983 | null |
| 2024-12-04 | Progressive Vision-Language Prompt for Multi-Organ Multi-Class Cell Semantic Segmentation with Single Branch | Qing Zhang et.al. | 2412.02978 | null |
| 2024-12-04 | Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution | Jiahua Xiao et.al. | 2412.02960 | null |
| 2024-12-04 | Panoptic Diffusion Models: co-generation of images and segmentation maps | Yinghan Long et.al. | 2412.02929 | null |
| 2024-12-03 | SJTU: Spatial judgments in multimodal models towards unified segmentation through coordinate detection | Joongwon Chae et.al. | 2412.02565 | null |
| 2024-12-03 | Multi-scale and Multi-path Cascaded Convolutional Network for Semantic Segmentation of Colorectal Polyps | Malik Abdul Manan et.al. | 2412.02443 | null |
| 2024-12-03 | AH-OCDA: Amplitude-based Curriculum Learning and Hopfield Segmentation Model for Open Compound Domain Adaptation | Jaehyun Choi et.al. | 2412.02280 | null |
| 2024-12-03 | Vision Transformers for Weakly-Supervised Microorganism Enumeration | Javier Ureña Santiago et.al. | 2412.02250 | link |
| 2024-12-03 | Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance | Jing Zeng et.al. | 2412.02249 | null |
| 2024-12-02 | INSIGHT: Explainable Weakly-Supervised Medical Image Analysis | Wenbo Zhang et.al. | 2412.02012 | null |
| 2024-12-02 | Global Average Feature Augmentation for Robust Semantic Segmentation with Transformers | Alberto Gonzalo Rodriguez Salgado et.al. | 2412.01941 | null |
| 2024-12-02 | COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training | Sanghwan Kim et.al. | 2412.01814 | null |
| 2024-12-02 | Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior | Yi Yu et.al. | 2412.01646 | null |
| 2024-12-02 | Epipolar Attention Field Transformers for Bird’s Eye View Semantic Segmentation | Christian Witte et.al. | 2412.01595 | null |
| 2024-11-29 | LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention | Zewen Du et.al. | 2411.19585 | link |
| 2024-11-29 | Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding | Wenbo Zhang et.al. | 2411.19551 | null |
| 2024-11-29 | Retrieval-guided Cross-view Image Synthesis | Hongji Yang et.al. | 2411.19510 | null |
| 2024-11-29 | Adaptive Interactive Segmentation for Multimodal Medical Imaging via Selection Engine | Zhi Li et.al. | 2411.19447 | link |
| 2024-11-28 | GMS-VINS: Multi-category Dynamic Objects Semantic Segmentation for Enhanced Visual-Inertial Odometry Using a Promptable Foundation Model | Rui Zhou et.al. | 2411.19289 | null |
| 2024-11-28 | InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception | Haijie Li et.al. | 2411.19235 | null |
| 2024-11-28 | MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers | Jongseong Bae et.al. | 2411.18995 | null |
| 2024-11-28 | Textured As-Is BIM via GIS-informed Point Cloud Segmentation | Mohamed S. H. Alabassy et.al. | 2411.18898 | null |
| 2024-11-27 | The Last Mile to Supervised Performance: Semi-Supervised Domain Adaptation for Semantic Segmentation | Daniel Morales-Brotons et.al. | 2411.18728 | null |
| 2024-11-27 | HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior | Li-Yuan Tsao et.al. | 2411.18662 | link |
| 2024-11-26 | Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation | Sudarshan Rajagopalan et.al. | 2411.17814 | null |
| 2024-11-26 | Efficient Multi-modal Large Language Models via Visual Token Grouping | Minbin Huang et.al. | 2411.17773 | null |
| 2024-11-26 | Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation | Niharika Hegde et.al. | 2411.17610 | null |
| 2024-11-26 | A Bilayer Segmentation-Recombination Network for Accurate Segmentation of Overlapping C. elegans | Mengqian Dinga et.al. | 2411.17557 | null |
| 2024-11-26 | Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.17543 | null |
| 2024-11-26 | Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning | Hoàng-Ân Lê et.al. | 2411.17536 | link |
| 2024-11-26 | TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba | Xiaowen Ma et.al. | 2411.17473 | link |
| 2024-11-26 | Self-supervised Video Instance Segmentation Can Boost Geographic Entity Alignment in Historical Maps | Xue Xia et.al. | 2411.17425 | null |
| 2024-11-26 | MRIFE: A Mask-Recovering and Interactive-Feature-Enhancing Semantic Segmentation Network For Relic Landslide Detection | Juefei He et.al. | 2411.17167 | null |
| 2024-11-26 | Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation | Chanyoung Kim et.al. | 2411.17150 | null |
| 2024-11-26 | ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction | Chang Li et.al. | 2411.17088 | null |
| 2024-11-26 | SCASeg: Strip Cross-Attention for Efficient Semantic Segmentation | Guoan Xu et.al. | 2411.17061 | null |
| 2024-11-25 | Deformable Mamba for Wide Field of View Segmentation | Jie Hu et.al. | 2411.16481 | link |
| 2024-11-25 | A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models | Manuel Schwonberg et.al. | 2411.16407 | null |
| 2024-11-25 | CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation | Leon Sick et.al. | 2411.16319 | null |
| 2024-11-25 | An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models | Wentao Qu et.al. | 2411.16308 | null |
| 2024-11-25 | A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads | Rafael S. Toledo et.al. | 2411.16295 | null |
| 2024-11-25 | Weakly supervised image segmentation for defect-based grading of fresh produce | Manuel Knott et.al. | 2411.16219 | null |
| 2024-11-25 | Learn from Foundation Model: Fruit Detection Model without Manual Annotation | Yanan Wang et.al. | 2411.16196 | null |
| 2024-11-25 | Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking | Phuc Nguyen et.al. | 2411.16183 | null |
| 2024-11-25 | Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training | Man Yao et.al. | 2411.16061 | link |
| 2024-11-24 | Deep Learning for automated multi-scale functional field boundaries extraction using multi-date Sentinel-2 and PlanetScope imagery: Case Study of Netherlands and Pakistan | Saba Zahid et.al. | 2411.15923 | null |
| 2024-11-22 | Effective SAM Combination for Open-Vocabulary Semantic Segmentation | Minhyeok Lee et.al. | 2411.14723 | null |
| 2024-11-21 | Revisiting the Integration of Convolution and Attention for Vision Backbone | Lei Zhu et.al. | 2411.14429 | link |
| 2024-11-21 | CompetitorFormer: Competitor Transformer for 3D Instance Segmentation | Duanchu Wang et.al. | 2411.14179 | null |
| 2024-11-21 | CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation | Lin Sun et.al. | 2411.13836 | link |
| 2024-11-21 | Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals | Hussni Mohd Zakir et.al. | 2411.13774 | null |
| 2024-11-20 | FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting | Ola Shorinwa et.al. | 2411.13753 | null |
| 2024-11-20 | DIS-Mine: Instance Segmentation for Disaster-Awareness in Poor-Light Condition in Underground Mines | Mizanur Rahman Jewel et.al. | 2411.13544 | null |
| 2024-11-21 | Entropy Bootstrapping for Weakly Supervised Nuclei Detection | James Willoughby et.al. | 2411.13528 | null |
| 2024-11-20 | BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation | Umamaheswaran Raman Kumar et.al. | 2411.13251 | null |
| 2024-11-20 | XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation | Ziyi Wang et.al. | 2411.13243 | link |
| 2024-11-20 | Automating Sonologists USG Commands with AI and Voice Interface | Emad Mohamed et.al. | 2411.13006 | null |
| 2024-11-19 | Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline | Junlong Cheng et.al. | 2411.12814 | link |
| 2024-11-19 | A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation | Jiaqi Yang et.al. | 2411.12615 | link |
| 2024-11-19 | SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation | Ron Keuth et.al. | 2411.12602 | link |
| 2024-11-19 | ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator | Xiao Jiang et.al. | 2411.12250 | null |
| 2024-11-18 | ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements | M. Arda Aydın et.al. | 2411.12044 | link |
| 2024-11-18 | Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation | Hanieh Shojaei Miandashti et.al. | 2411.11935 | null |
| 2024-11-18 | MGNiceNet: Unified Monocular Geometric Scene Understanding | Markus Schön et.al. | 2411.11466 | null |
| 2024-11-18 | MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models | Harshita Sharma et.al. | 2411.11362 | null |
| 2024-11-18 | Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications | Scarlett Raine et.al. | 2411.11287 | null |
| 2024-11-18 | Zero-Shot Automatic Annotation and Instance Segmentation using LLM-Generated Datasets: Eliminating Field Imaging and Manual Annotation for Deep Learning Model Development | Ranjan Sapkota et.al. | 2411.11285 | null |
| 2024-11-16 | Attention-based U-Net Method for Autonomous Lane Detection | Mohammadhamed Tangestanizadeh et.al. | 2411.10902 | null |
| 2024-11-16 | Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation | Jaisidh Singh et.al. | 2411.10845 | null |
| 2024-11-16 | Diffusion-Based Semantic Segmentation of Lumbar Spine MRI Scans of Lower Back Pain Patients | Maria Monzon et.al. | 2411.10755 | null |
| 2024-11-15 | Repurposing Stable Diffusion Attention for Training-Free Unsupervised Interactive Segmentation | Markus Karmann et.al. | 2411.10411 | null |
| 2024-11-15 | Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images | Ammar Qammaz et.al. | 2411.10334 | null |
| 2024-11-15 | RETR: Multi-View Radar Detection Transformer for Indoor Perception | Ryoma Yataka et.al. | 2411.10293 | null |
| 2024-11-15 | CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation | Dengke Zhang et.al. | 2411.10086 | link |
| 2024-11-14 | OneNet: A Channel-Wise 1D Convolutional U-Net | Sanghyun Byun et.al. | 2411.09838 | link |
| 2024-11-14 | Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks | Zengyi Yang et.al. | 2411.09387 | null |
| 2024-11-14 | Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation | Yuheng Shi et.al. | 2411.09219 | link |
| 2024-11-14 | Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery | Ashim Dahal et.al. | 2411.09101 | link |
| 2024-11-13 | CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation | Xuming Zhang et.al. | 2411.09023 | null |
| 2024-11-14 | Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation | Yangyang Li et.al. | 2411.08756 | null |
| 2024-11-13 | Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model | Jun Xie et.al. | 2411.08592 | null |
| 2024-11-13 | UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation | Chengyuan Zhang et.al. | 2411.08569 | null |
| 2024-11-13 | Detection and classification of radio sources with deep learning | S. Riggi et.al. | 2411.08519 | null |
| 2024-11-12 | Isometric Transformations for Image Augmentation in Mueller Matrix Polarimetry | Christopher Hahne et.al. | 2411.07918 | link |
| 2024-11-12 | INTRABENCH: Interactive Radiological Benchmark | Constantin Ulrich et.al. | 2411.07885 | null |
| 2024-11-12 | Horticultural Temporal Fruit Monitoring via 3D Instance Segmentation and Re-Identification using Point Clouds | Daniel Fusaro et.al. | 2411.07799 | link |
| 2024-11-12 | Semantic segmentation on multi-resolution optical and microwave data using deep learning | Jai G Singla et.al. | 2411.07581 | null |
| 2024-11-12 | GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting | Umangi Jain et.al. | 2411.07555 | null |
| 2024-11-11 | Data-Centric Learning Framework for Real-Time Detection of Aiming Beam in Fluorescence Lifetime Imaging Guided Surgery | Mohamed Abul Hassan et.al. | 2411.07395 | null |
| 2024-11-11 | SAMPart3D: Segment Any Part in 3D Objects | Yunhan Yang et.al. | 2411.07184 | link |
| 2024-11-11 | SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation | Jiale Chen et.al. | 2411.06991 | null |
| 2024-11-11 | Fast and Efficient Transformer-based Method for Bird’s Eye View Instance Prediction | Miguel Antunes-García et.al. | 2411.06851 | link |
| 2024-11-11 | Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision | Yueyang Cang et.al. | 2411.06727 | null |
| 2024-11-10 | Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments | Deegan Atha et.al. | 2411.06632 | null |
| 2024-11-09 | Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing | Kaixuan Lu et.al. | 2411.06091 | null |
| 2024-11-08 | Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model | Shuchang Lyu et.al. | 2411.05878 | link |
| 2024-11-08 | Agricultural Landscape Understanding At Country-Scale | Radhika Dua et.al. | 2411.05359 | null |
| 2024-11-08 | Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation | Sien Li et.al. | 2411.05307 | link |
| 2024-11-07 | In the Era of Prompt Learning with Vision-Language Models | Ankit Jha et.al. | 2411.04892 | null |
| 2024-11-08 | ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset | Olaf Wysocki et.al. | 2411.04865 | link |
| 2024-11-06 | Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts | Zhitong Gao et.al. | 2411.03829 | link |
| 2024-11-06 | SA3DIP: Segment Any 3D Instance with Potential 3D Priors | Xi Yang et.al. | 2411.03819 | link |
| 2024-11-06 | Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model | Yansong Qu et.al. | 2411.03672 | null |
| 2024-11-05 | Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation | Zhiling Yue et.al. | 2411.03551 | null |
| 2024-11-05 | SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture | Andrew Heschl et.al. | 2411.03505 | link |
| 2024-11-05 | Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need | Qishuai Wen et.al. | 2411.03033 | link |
| 2024-11-05 | Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation | Xavier Timoneda et.al. | 2411.02969 | null |
| 2024-11-05 | Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery | Mohammad Kakooei et.al. | 2411.02935 | null |
| 2024-11-05 | CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation | Jinchao Ge et.al. | 2411.02715 | null |
| 2024-11-04 | Deep Learning on 3D Semantic Segmentation: A Detailed Review | Thodoris Betsas et.al. | 2411.02104 | null |
| 2024-11-04 | Tree level change detection over Ahmedabad city using very high resolution satellite images and Deep Learning | Jai G Singla et.al. | 2411.02009 | null |
| 2024-11-04 | Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models | Sharat Agarwal et.al. | 2411.01925 | null |
| 2024-11-04 | DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability | Bo Gao et.al. | 2411.01819 | null |
| 2024-11-04 | Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations | Thanh Nguyen Canh et.al. | 2411.01816 | null |
| 2024-11-05 | MSTA3D: Multi-scale Twin-attention for 3D Instance Segmentation | Duc Dang Trung Tran et.al. | 2411.01781 | null |
| 2024-11-03 | PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation | Xinyu Xu et.al. | 2411.01624 | null |
| 2024-11-01 | Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions | Lixiao Yang et.al. | 2411.01039 | null |
| 2024-11-01 | Event-guided Low-light Video Semantic Segmentation | Zhen Yao et.al. | 2411.00639 | null |
| 2024-11-01 | Automated Classification of Cell Shapes: A Comparative Evaluation of Shape Descriptors | Valentina Vadori et.al. | 2411.00561 | null |
| 2024-10-31 | Federated Black-Box Adaptation for Semantic Segmentation | Jay N. Paranjape et.al. | 2410.24181 | null |
| 2024-10-31 | COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes | Muhammad Ali et.al. | 2410.24139 | link |
| 2024-10-31 | Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model | Hao Zhang et.al. | 2410.23905 | link |
| 2024-10-30 | S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving | Maciej K. Wozniak et.al. | 2410.23085 | null |
| 2024-10-31 | CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation | Ziyang Gong et.al. | 2410.22629 | link |
| 2024-10-29 | Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation | Zhaochong An et.al. | 2410.22489 | link |
| 2024-10-29 | Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation | Jintao Tong et.al. | 2410.22135 | null |
| 2024-10-29 | Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models | Imad Ali Shah et.al. | 2410.22101 | null |
| 2024-10-29 | Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation | Ruihao Xia et.al. | 2410.21708 | link |
| 2024-10-28 | Domain Adaptation with a Single Vision-Language Embedding | Mohammad Fahes et.al. | 2410.21361 | null |
| 2024-10-28 | IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks | Manjunath D et.al. | 2410.20953 | null |
| 2024-10-27 | A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models | Camilo Espinosa-Curilem et.al. | 2410.20595 | link |
| 2024-10-27 | Unlocking Comics: The AI4VA Dataset for Visual Understanding | Peter Grönquist et.al. | 2410.20459 | link |
| 2024-10-27 | Historical Test-time Prompt Tuning for Vision Foundation Models | Jingyi Zhang et.al. | 2410.20346 | null |
| 2024-10-25 | OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery | Philipe Dias et.al. | 2410.19965 | null |
| 2024-10-25 | IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation | Kaixian Qu et.al. | 2410.19697 | null |
| 2024-10-25 | Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation | Yao Wu et.al. | 2410.19446 | link |
| 2024-10-25 | Context-Based Visual-Language Place Recognition | Soojin Woo et.al. | 2410.19341 | link |
| 2024-10-24 | Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks | Alexander Jaus et.al. | 2410.18684 | null |
| 2024-10-24 | Unsupervised semantic segmentation of urban high-density multispectral point clouds | Oona Oinonen et.al. | 2410.18520 | null |
| 2024-10-26 | CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator | Stefanos Pasios et.al. | 2410.18238 | link |
| 2024-10-23 | Towards Safer Planetary Exploration: A Hybrid Architecture for Terrain Traversability Analysis in Mars Rovers | Achille Chiuchiarelli et.al. | 2410.17738 | null |
| 2024-10-23 | YOLOv11: An Overview of the Key Architectural Enhancements | Rahima Khanam et.al. | 2410.17725 | null |
| 2024-10-23 | PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting | Yu Wang et.al. | 2410.17505 | null |
| 2024-10-22 | EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding | Zhiyi Pan et.al. | 2410.17207 | null |
| 2024-10-22 | LIMIS: Towards Language-based Interactive Medical Image Segmentation | Lena Heinemann et.al. | 2410.16939 | null |
| 2024-10-22 | DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Zhixiong Nan et.al. | 2410.16707 | null |
| 2024-10-22 | SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments | Jumman Hossain et.al. | 2410.16686 | null |
| 2024-10-22 | NucleiMix: Realistic Data Augmentation for Nuclei Instance Segmentation | Jiamu Wang et.al. | 2410.16671 | null |
| 2024-10-21 | PlaneSAM: Multimodal Plane Instance Segmentation Using the Segment Anything Model | Zhongchen Deng et.al. | 2410.16545 | null |
| 2024-10-21 | TIPS: Text-Image Pretraining with Spatial Awareness | Kevis-Kokitsi Maninis et.al. | 2410.16512 | link |
| 2024-10-21 | GenGMM: Generalized Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation | Nazanin Moradinasab et.al. | 2410.16485 | null |
| 2024-10-21 | Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation | Ruting Chi et.al. | 2410.16063 | null |
| 2024-10-21 | LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training | Thomas Kreutz et.al. | 2410.15833 | link |
| 2024-10-21 | TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight | Hyun-Kurl Jang et.al. | 2410.15674 | link |
| 2024-10-21 | Deep Learning and Machine Learning – Object Detection and Semantic Segmentation: From Theory to Applications | Jintao Ren et.al. | 2410.15584 | null |
| 2024-10-20 | Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation | Fnu Neha et.al. | 2410.15472 | null |
| 2024-10-20 | Improving 3D Medical Image Segmentation at Boundary Regions using Local Self-attention and Global Volume Mixing | Daniya Najiha Abdul Kareem et.al. | 2410.15360 | null |
| 2024-10-18 | On the Influence of Shape, Texture and Color for Learning Semantic Segmentation | Annika Mütze et.al. | 2410.14878 | null |
| 2024-10-18 | Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+ | Arpan Mahara et.al. | 2410.14836 | null |
| 2024-10-18 | Impact of imperfect annotations on CNN training and performance for instance segmentation and classification in digital pathology | Laura Gálvez Jiménez et.al. | 2410.14365 | null |
| 2024-10-17 | ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding | Guangda Ji et.al. | 2410.13924 | link |
| 2024-10-17 | Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks | Clément Playout et.al. | 2410.13822 | link |
| 2024-10-18 | Enhanced Prompt-leveraged Weakly Supervised Cancer Segmentation based on Segment Anything | Joonhyeon Song et.al. | 2410.13621 | link |
| 2024-10-17 | Day-Night Adaptation: An Innovative Source-free Adaptation Framework for Medical Image Segmentation | Ziyang Chen et.al. | 2410.13472 | null |
| 2024-10-17 | SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation in Remote Sensing | Bin Wang et.al. | 2410.13471 | link |
| 2024-10-17 | Railway LiDAR semantic segmentation based on intelligent semi-automated data annotation | Florian Wulff et.al. | 2410.13383 | null |
| 2024-10-17 | LESS: Label-Efficient and Single-Stage Referring 3D Segmentation | Xuexun Liu et.al. | 2410.13294 | link |
| 2024-10-17 | Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation | Houze Liu et.al. | 2410.13099 | null |
| 2024-10-16 | Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation | Wenbo Xu et.al. | 2410.13094 | null |
| 2024-10-16 | Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation | Anthony Opipari et.al. | 2410.12995 | null |
| 2024-10-16 | Risk Assessment for Autonomous Landing in Urban Environments using Semantic Segmentation | Jesús Alejandro Loera-Ponce et.al. | 2410.12988 | null |
| 2024-10-16 | VividMed: Vision Language Model with Versatile Visual Grounding for Medicine | Lingxiao Luo et.al. | 2410.12694 | null |
| 2024-10-16 | Cascade learning in multi-task encoder-decoder networks for concurrent bone segmentation and glenohumeral joint assessment in shoulder CT scans | Luca Marsilio et.al. | 2410.12641 | null |
| 2024-10-16 | Order-Aware Interactive Segmentation | Bin Wang et.al. | 2410.12214 | null |
| 2024-10-16 | SAM-Guided Masked Token Prediction for 3D Scene Understanding | Zhimin Chen et.al. | 2410.12158 | null |
| 2024-10-15 | WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation | Chenghao Qian et.al. | 2410.12075 | link |
| 2024-10-15 | Development and Testing of a Wood Panels Bark Removal Equipment Based on Deep Learning | Rijun Wang et.al. | 2410.11913 | null |
| 2024-10-15 | Fractal Calibration for long-tailed object detection | Konstantinos Panagiotis Alexandridis et.al. | 2410.11774 | link |
| 2024-10-15 | RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation | Anton Antonov et.al. | 2410.11722 | link |
| 2024-10-15 | InvSeg: Test-Time Prompt Inversion for Semantic Segmentation | Jiayi Lin et.al. | 2410.11473 | null |
| 2024-10-15 | MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation | Xianping Ma et.al. | 2410.11160 | link |
| 2024-10-14 | Locality Alignment Improves Vision-Language Models | Ian Covert et.al. | 2410.11087 | null |
| 2024-10-14 | Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes | Tim Broedermann et.al. | 2410.10791 | null |
| 2024-10-14 | UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation | Lihe Yang et.al. | 2410.10777 | link |
| 2024-10-14 | PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion | Runsong Zhu et.al. | 2410.10659 | link |
| 2024-10-14 | Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation | Daniel Fusaro et.al. | 2410.10510 | link |
| 2024-10-14 | LKASeg: Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections | Xuezhi Xiang et.al. | 2410.10433 | null |
| 2024-10-14 | V2M: Visual 2-Dimensional Mamba for Image Representation Learning | Chengkun Wang et.al. | 2410.10382 | link |
| 2024-10-14 | GlobalMamba: Global Image Serialization for Vision Mamba | Chengkun Wang et.al. | 2410.10316 | link |
| 2024-10-13 | UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation | Ye Sun et.al. | 2410.09909 | null |
| 2024-10-13 | AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model | Yuchen Li et.al. | 2410.09714 | null |
| 2024-10-12 | An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation | Wei Liang et.al. | 2410.09443 | null |
| 2024-10-11 | Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation | Varduhi Yeghiazaryan et.al. | 2410.08946 | null |
| 2024-10-11 | Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation | Hanieh Shojaei et.al. | 2410.08687 | null |
| 2024-10-11 | DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention | Nguyen Huu Bao Long et.al. | 2410.08582 | link |
| 2024-10-10 | Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? | Samir Abou Haidar et.al. | 2410.08365 | null |
| 2024-10-10 | Interactive4D: Interactive 4D LiDAR Segmentation | Ilya Fradlin et.al. | 2410.08206 | link |
| 2024-10-10 | Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation | Zhiyi Pan et.al. | 2410.08091 | null |
| 2024-10-10 | Shift and matching queries for video semantic segmentation | Tsubasa Mizuno et.al. | 2410.07635 | null |
| 2024-10-10 | 3D Vision-Language Gaussian Splatting | Qucheng Peng et.al. | 2410.07577 | null |
| 2024-10-09 | Segmenting objects with Bayesian fusion of active contour models and convnet priors | Przemyslaw Polewski et.al. | 2410.07421 | null |
| 2024-10-11 | Bridge the Points: Graph-based Few-shot Segment Anything Semantically | Anqi Zhang et.al. | 2410.06964 | null |
| 2024-10-09 | Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation | Seungho Lee et.al. | 2410.06893 | null |
| 2024-10-09 | Rethinking the Evaluation of Visible and Infrared Image Fusion | Dayan Guan et.al. | 2410.06811 | link |
| 2024-10-10 | QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model | Fei Xie et.al. | 2410.06806 | link |
| 2024-10-09 | Transesophageal Echocardiography Generation using Anatomical Models | Emmanuel Oladokun et.al. | 2410.06781 | null |
| 2024-10-09 | Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy | Qinfeng Zhu et.al. | 2410.06725 | null |
| 2024-10-09 | Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments | Meng Yu et.al. | 2410.06626 | null |
| 2024-10-09 | Towards Natural Image Matting in the Wild via Real-Scenario Prior | Ruihao Xia et.al. | 2410.06593 | link |
| 2024-10-08 | Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions | Mateus Karvat et.al. | 2410.06380 | link |
| 2024-10-08 | Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts | Zhiwei Lin et.al. | 2410.05963 | null |
| 2024-10-07 | Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation | Vince Zhu et.al. | 2410.04689 | null |
| 2024-10-06 | In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding | Shenghao Li et.al. | 2410.04529 | null |
| 2024-10-05 | ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments | Lorenzo Terenzi et.al. | 2410.04250 | null |
| 2024-10-04 | SpecSAR-Former: A Lightweight Transformer-based Network for Global LULC Mapping Using Integrated Sentinel-1 and Sentinel-2 | Hao Yu et.al. | 2410.03962 | null |
| 2024-10-04 | Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features | Benyuan Meng et.al. | 2410.03558 | link |
| 2024-10-04 | Semantic Segmentation Based Quality Control of Histopathology Whole Slide Images | Abhijeet Patil et.al. | 2410.03289 | link |
| 2024-10-04 | HRVMamba: High-Resolution Visual State Space Model for Dense Prediction | Hao Zhang et.al. | 2410.03174 | null |
| 2024-10-03 | HiFiSeg: High-Frequency Information Enhanced Polyp Segmentation with Global-Local Vision Transformer | Jingjing Ren et.al. | 2410.02528 | null |
| 2024-10-06 | SynCo: Synthetic Hard Negatives in Contrastive Learning for Better Unsupervised Visual Representations | Nikolaos Giakoumoglou et.al. | 2410.02401 | link |
| 2024-10-04 | Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation | Muzhi Zhu et.al. | 2410.02369 | null |
| 2024-10-03 | ProtoSeg: A Prototype-Based Point Cloud Instance Segmentation Method | Remco Royen et.al. | 2410.02352 | null |
| 2024-10-03 | RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds | Remco Royen et.al. | 2410.02323 | null |
| 2024-10-03 | Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network | Yangyang Qiu et.al. | 2410.02224 | null |
| 2024-10-03 | Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images | Qingyuan Liu et.al. | 2410.02207 | null |
| 2024-10-02 | SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images | Kaiyu Li et.al. | 2410.01768 | link |
| 2024-10-02 | One-Shot Robust Imitation Learning for Long-Horizon Visuomotor Tasks from Unsegmented Demonstrations | Shaokang Wu et.al. | 2410.01630 | null |
| 2024-10-02 | Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation | Zhaofeng Shi et.al. | 2410.01341 | null |
| 2024-10-02 | VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings | Andrea Carrara et.al. | 2410.01336 | null |
| 2024-10-01 | RobustEMD: Domain Robust Matching for Cross-domain Few-shot Medical Image Segmentation | Yazhou Zhu et.al. | 2410.01110 | null |
| 2024-10-01 | Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer | Vlatko Spasev et.al. | 2410.01092 | null |
| 2024-10-01 | Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time | Chiao-An Yang et.al. | 2410.01083 | link |
| 2024-10-01 | DeepAerialMapper: Deep Learning-based Semi-automatic HD Map Creation for Highly Automated Vehicles | Robert Krajewski et.al. | 2410.00769 | null |
| 2024-10-01 | Optimizing Drug Delivery in Smart Pharmacies: A Novel Framework of Multi-Stage Grasping Network Combined with Adaptive Robotics Mechanism | Rui Tang et.al. | 2410.00753 | null |
| 2024-10-01 | Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection | Pengxi Zeng et.al. | 2410.00582 | null |
| 2024-09-30 | AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation | Boyu Han et.al. | 2409.20398 | null |
| 2024-09-30 | Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation | Tillmann Rheude et.al. | 2409.20287 | link |
| 2024-09-30 | Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model | Fulong Ma et.al. | 2409.20164 | null |
| 2024-09-30 | Segmenting Wood Rot using Computer Vision Models | Roland Kammerbauer et.al. | 2409.20137 | null |
| 2024-09-30 | Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels | Heeseong Shin et.al. | 2409.19846 | null |
| 2024-09-27 | ProMerge: Prompt and Merge for Unsupervised Instance Segmentation | Dylan Li et.al. | 2409.18961 | null |
| 2024-09-27 | Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation | Raphael Hagmanns et.al. | 2409.18788 | null |
| 2024-09-27 | Learning from Pattern Completion: Self-supervised Controllable Generation | Zhiqiang Chen et.al. | 2409.18694 | link |
| 2024-09-27 | Reducing Semantic Ambiguity In Domain Adaptive Semantic Segmentation Via Probabilistic Prototypical Pixel Contrast | Xiaoke Hao et.al. | 2409.18543 | link |
| 2024-10-01 | Get It For Free: Radar Segmentation without Expert Labels and Its Application in Odometry and Localization | Siru Li et.al. | 2409.18434 | null |
| 2024-09-27 | Search3D: Hierarchical Open-Vocabulary 3D Segmentation | Ayca Takmaz et.al. | 2409.18431 | null |
| 2024-09-26 | Efficient Microscopic Image Instance Segmentation for Food Crystal Quality Control | Xiaoyu Ji et.al. | 2409.18291 | null |
| 2024-09-26 | Amodal Instance Segmentation with Diffusion Shape Prior Estimation | Minh Tran et.al. | 2409.18256 | null |
| 2024-09-26 | Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning | Siyi Lu et.al. | 2409.17659 | null |
| 2024-09-26 | Global-Local Medical SAM Adaptor Based on Full Adaption | Meng Wang et.al. | 2409.17486 | null |
| 2024-09-25 | VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection | Liangyu Zhong et.al. | 2409.17330 | null |
| 2024-09-25 | 2024 BRAVO Challenge Track 1 1st Place Report: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation | Tommie Kerssies et.al. | 2409.17208 | link |
| 2024-09-25 | WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks | Alberto Bacchin et.al. | 2409.16999 | link |
| 2024-09-25 | Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis | Illia Tsiporenko et.al. | 2409.16940 | null |
| 2024-09-24 | A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation | Avisha Kumar et.al. | 2409.16441 | null |
| 2024-09-24 | Instance Segmentation of Reinforced Concrete Bridges with Synthetic Point Clouds | Asad Ur Rahman et.al. | 2409.16381 | null |
| 2024-09-24 | Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation | Yong Xien Chng et.al. | 2409.16278 | null |
| 2024-09-24 | Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation | Hannah Kerner et.al. | 2409.16252 | link |
| 2024-09-24 | Deep Learning for Precision Agriculture: Post-Spraying Evaluation and Deposition Estimation | Harry Rogers et.al. | 2409.16213 | link |
| 2024-09-24 | Potential Field as Scene Affordance for Behavior Change-Based Visual Risk Object Identification | Pang-Yuan Pao et.al. | 2409.15846 | null |
| 2024-09-24 | Layer-wise Model Merging for Unsupervised Domain Adaptation in Segmentation Tasks | Roberto Alcover-Couso et.al. | 2409.15813 | null |
| 2024-09-24 | DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation | Soojin Jang et.al. | 2409.15801 | null |
| 2024-09-24 | Autonomous Hiking Trail Navigation via Semantic Segmentation and Geometric Analysis | Camndon Reed et.al. | 2409.15671 | null |
| 2024-09-23 | Adapting Segment Anything Model for Unseen Object Instance Segmentation | Rui Cao et.al. | 2409.15481 | null |
| 2024-09-23 | ZeroSCD: Zero-Shot Street Scene Change Detection | Shyam Sundar Kannan et.al. | 2409.15255 | null |
| 2024-09-23 | Diffusion-based RGB-D Semantic Segmentation with Deformable Attention Transformer | Minh Bui et.al. | 2409.15117 | null |
| 2024-09-18 | Applications of Knowledge Distillation in Remote Sensing: A Survey | Yassine Himeur et.al. | 2409.12111 | null |
| 2024-09-18 | Panoptic-Depth Forecasting | Juana Valeria Hurtado et.al. | 2409.12008 | null |
| 2024-09-18 | Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments | Gang Chen et.al. | 2409.11975 | null |
| 2024-09-17 | Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks | Edgar Heinert et.al. | 2409.11373 | null |
| 2024-09-17 | MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping | Amirreza Fateh et.al. | 2409.11316 | link |
| 2024-09-17 | Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark | Clifford Broni-Bediako et.al. | 2409.11227 | link |
| 2024-09-17 | HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios | Nick Theisen et.al. | 2409.11205 | link |
| 2024-09-16 | Are Deep Learning Models Robust to Partial Object Occlusion in Visual Recognition Tasks? | Kaleb Kassaw et.al. | 2409.10775 | null |
| 2024-09-16 | Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning | Amin Karimi Monsefi et.al. | 2409.10362 | null |
| 2024-09-16 | BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images | Wentao Wang et.al. | 2409.10269 | null |
| 2024-09-15 | Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation | Zhanteng Xie et.al. | 2409.09899 | null |
| 2024-09-15 | Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation | Qilong Zhangli et.al. | 2409.09893 | null |
| 2024-09-15 | High Definition Map Mapping and Update: A General Overview and Future Directions | Benny Wijaya et.al. | 2409.09726 | null |
| 2024-09-14 | One missing piece in Vision and Language: A Survey on Comics Understanding | Emanuele Vivoli et.al. | 2409.09502 | link |
| 2024-09-14 | Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation | Hugo Porta et.al. | 2409.09497 | null |
| 2024-09-14 | LACOSTE: Exploiting stereo and temporal contexts for surgical instrument segmentation | Qiyuan Wang et.al. | 2409.09360 | null |
| 2024-09-16 | QueryCAD: Grounded Question Answering for CAD Models | Claudius Kienle et.al. | 2409.08704 | null |
| 2024-09-13 | AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation | Zechao Sun et.al. | 2409.08516 | null |
| 2024-09-13 | VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation | Ezra MacDonald et.al. | 2409.08461 | link |
| 2024-09-12 | Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding | Hongyu Li et.al. | 2409.08251 | null |
| 2024-09-12 | Bayesian Self-Training for Semi-Supervised 3D Segmentation | Ozan Unal et.al. | 2409.08102 | null |
| 2024-09-12 | Depth Matters: Exploring Deep Interactions of RGB-D for Semantic Segmentation in Traffic Scenes | Siyu Chen et.al. | 2409.07995 | null |
| 2024-09-12 | UNIT: Unsupervised Online Instance Segmentation through Time | Corentin Sautier et.al. | 2409.07887 | null |
| 2024-09-12 | SURGIVID: Annotation-Efficient Surgical Video Object Discovery | Çağhan Köksal et.al. | 2409.07801 | null |
| 2024-09-12 | Lagrange Duality and Compound Multi-Attention Transformer for Semi-Supervised Medical Image Segmentation | Fuchen Zheng et.al. | 2409.07793 | link |
| 2024-09-12 | ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation | Fuchen Zheng et.al. | 2409.07779 | link |
| 2024-09-12 | Open-Vocabulary Remote Sensing Image Semantic Segmentation | Qinglong Cao et.al. | 2409.07683 | null |
| 2024-09-11 | Token Turing Machines are Efficient Vision Models | Purvish Jajal et.al. | 2409.07613 | null |
| 2024-09-11 | AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution | Wangduo Xie et.al. | 2409.07171 | null |
| 2024-09-11 | Insight Any Instance: Promptable Instance Segmentation for Remote Sensing Images | Xuexue Li et.al. | 2409.07022 | null |
| 2024-09-11 | Brain-Inspired Stepwise Patch Merging for Vision Transformers | Yonghao Yu et.al. | 2409.06963 | null |
| 2024-09-10 | Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds | Mu Cai et.al. | 2409.06827 | link |
| 2024-09-10 | A Semantic Segmentation Approach on Sweet Orange Leaf Diseases Detection Utilizing YOLO | Sabit Ahamed Preanto et.al. | 2409.06671 | null |
| 2024-09-10 | Towards Localizing Structural Elements: Merging Geometrical Detection with Semantic Verification in RGB-D Data | Ali Tourani et.al. | 2409.06625 | null |
| 2024-09-10 | PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation | Yin Hu et.al. | 2409.06309 | null |
| 2024-09-10 | EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation | Nischal Khanal et.al. | 2409.06183 | link |
| 2024-09-09 | SVS-GAN: Leveraging GANs for Semantic Video Synthesis | Khaled M. Seyam et.al. | 2409.06074 | null |
| 2024-09-09 | Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance | Quang-Huy Che et.al. | 2409.06002 | null |
| 2024-09-09 | Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features | Jacob Gildenblat et.al. | 2409.05697 | null |
| 2024-09-09 | ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions | Furqan Ahmed Shaik et.al. | 2409.05327 | null |
| 2024-09-08 | RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network | Zhiwei Lin et.al. | 2409.04979 | null |
| 2024-09-06 | Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation | Björn Michele et.al. | 2409.04409 | link |
| 2024-09-06 | Advancing SEM Based Nano-Scale Defect Analysis in Semiconductor Manufacturing for Advanced IC Nodes | Bappaditya Dey et.al. | 2409.04310 | null |
| 2024-09-06 | CISCA and CytoDArk0: a Cell Instance Segmentation and Classification method for histo(patho)logical image Analyses and a new, open, Nissl-stained dataset for brain cytoarchitecture studies | Valentina Vadori et.al. | 2409.04175 | null |
| 2024-09-05 | Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution | Marga Don et.al. | 2409.03754 | link |
| 2024-09-05 | MaskVal: Simple but Effective Uncertainty Quantification for 6D Pose Estimation | Philipp Quentin et.al. | 2409.03556 | null |
| 2024-09-05 | LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones | Moritz Nottebaum et.al. | 2409.03460 | link |
| 2024-09-05 | Automatic occlusion removal from 3D maps for maritime situational awareness | Felix Sattler et.al. | 2409.03451 | null |
| 2024-09-05 | Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications | Tong Bu et.al. | 2409.03368 | null |
| 2024-09-05 | MouseSIS: A Frames-and-Events Dataset for Space-Time Instance Segmentation of Mice | Friedhelm Hamann et.al. | 2409.03358 | null |
| 2024-09-05 | UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking | Md. Mahfuzur Rahman et.al. | 2409.03245 | null |
| 2024-09-05 | Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation | Xixi Jiang et.al. | 2409.03228 | link |
| 2024-09-05 | iSeg: An Iterative Refinement-based Framework for Training-free Segmentation | Lin Sun et.al. | 2409.03209 | link |
| 2024-09-04 | iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation | Hayeon Jo et.al. | 2409.02838 | null |
| 2024-09-04 | CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation | Minhee Cho et.al. | 2409.02699 | null |
| 2024-09-04 | Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation | Tiantian Zhang et.al. | 2409.02567 | null |
| 2024-09-04 | SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction | Sumin Son et.al. | 2409.02513 | null |
| 2024-09-03 | K-Origins: Better Colour Quantification for Neural Networks | Lewis Mason et.al. | 2409.02281 | null |
| 2024-09-03 | AllWeatherNet: Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions | Chenghao Qian et.al. | 2409.02045 | null |
| 2024-09-03 | MetaFood3D: Large 3D Food Object Dataset with Nutrition Values | Yuhao Chen et.al. | 2409.01966 | null |
| 2024-09-03 | Segmenting Object Affordances: Reproducibility and Sensitivity to Scale | Tommaso Apicella et.al. | 2409.01814 | link |
| 2024-09-03 | Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation | Haodong Wang et.al. | 2409.01662 | null |
| 2024-09-02 | Semantic Segmentation from Image Labels by Reconstruction from Structured Decomposition | Xuanrui Zeng et.al. | 2409.01472 | link |
| 2024-08-30 | Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes | Li Zhang et.al. | 2408.17421 | link |
| 2024-08-30 | Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations | Ahmed Hammam et.al. | 2408.17311 | null |
| 2024-08-30 | Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training | Zizheng Huang et.al. | 2408.17081 | link |
| 2024-08-30 | Transient Fault Tolerant Semantic Segmentation for Autonomous Driving | Leonardo Iurada et.al. | 2408.16952 | link |
| 2024-08-29 | Eigen-Cluster VIS: Improving Weakly-supervised Video Instance Segmentation by Leveraging Spatio-temporal Consistency | Farnoosh Arefi et.al. | 2408.16661 | link |
| 2024-08-29 | SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection | Rohit Venkata Sai Dulam et.al. | 2408.16645 | null |
| 2024-08-29 | A Simple and Generalist Approach for Panoptic Segmentation | Nedyalko Prisadnikov et.al. | 2408.16504 | null |
| 2024-08-29 | MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation | Linyan Yang et.al. | 2408.16478 | null |
| 2024-08-29 | Multi-source Domain Adaptation for Panoramic Semantic Segmentation | Jing Jiang et.al. | 2408.16469 | null |
| 2024-08-29 | EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More | Kanghao Chen et.al. | 2408.16254 | null |
| 2024-08-28 | InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentation | Thibaut Goldsborough et.al. | 2408.15954 | link |
| 2024-08-28 | SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors | Zhiqing Zhang et.al. | 2408.15887 | null |
| 2024-08-28 | DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries | Yu Yang et.al. | 2408.15813 | null |
| 2024-08-28 | TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation | Junbao Zhou et.al. | 2408.15657 | link |
| 2024-08-27 | Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images | Silvia Seidlitz et.al. | 2408.15373 | link |
| 2024-08-27 | An Investigation on The Position Encoding in Vision-Based Dynamics Prediction | Jiageng Zhu et.al. | 2408.15201 | null |
| 2024-08-27 | Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation | Elona Shatri et.al. | 2408.15002 | null |
| 2024-08-27 | Applying ViT in Generalized Few-shot Semantic Segmentation | Liyuan Geng et.al. | 2408.14957 | link |
| 2024-08-27 | Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack | Naufal Suryanto et.al. | 2408.14879 | null |
| 2024-08-27 | MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation | Yuanbing Zhu et.al. | 2408.14776 | null |
| 2024-08-26 | Physically Feasible Semantic Segmentation | Shamik Basu et.al. | 2408.14672 | link |
| 2024-08-26 | A Survey of Camouflaged Object Detection and Beyond | Fengyang Xiao et.al. | 2408.14562 | null |
| 2024-08-26 | Satellite Sunroof: High-res Digital Surface Models and Roof Segmentation for Global Solar Mapping | Vishal Batchu et.al. | 2408.14400 | null |
| 2024-08-25 | OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation | Muhammad Rameez ur Rahman et.al. | 2408.13936 | link |
| 2024-08-25 | Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation | Yuwen Pan et.al. | 2408.13838 | null |
| 2024-08-25 | TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather | Xiongwei Zhao et.al. | 2408.13802 | link |
| 2024-08-25 | ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation | Xin Zhang et.al. | 2408.13771 | null |
| 2024-08-25 | Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation | Zhaoyang Li et.al. | 2408.13752 | null |
| 2024-08-24 | ESA: Annotation-Efficient Active Learning for Semantic Segmentation | Jinchao Ge et.al. | 2408.13491 | link |
| 2024-08-23 | Accuracy Improvement of Cell Image Segmentation Using Feedback Former | Hinako Mitsuoka et.al. | 2408.12974 | null |
| 2024-08-23 | Image Segmentation in Foundation Model Era: A Survey | Tianfei Zhou et.al. | 2408.12957 | null |
| 2024-08-23 | Symmetric masking strategy enhances the performance of Masked Image Modeling | Khanh-Binh Nguyen et.al. | 2408.12772 | null |
| 2024-08-22 | Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets | Wolfgang Boettcher et.al. | 2408.12489 | null |
| 2024-08-22 | The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation | Tuyen Tran et.al. | 2408.12447 | null |
| 2024-08-22 | ISETHDR: A Physics-based Synthetic Radiance Dataset for High Dynamic Range Driving Scenes | Zhenyi Liu et.al. | 2408.12048 | link |
| 2024-08-21 | EmbodiedSAM: Online Segment Any 3D Thing in Real Time | Xiuwei Xu et.al. | 2408.11811 | null |
| 2024-08-21 | NuSegDG: Integration of Heterogeneous Space and Gaussian Kernel for Domain-Generalized Nuclei Segmentation | Zhenye Lou et.al. | 2408.11787 | link |
| 2024-08-21 | Open-Ended 3D Point Cloud Instance Segmentation | Phuc D. A. Nguyen et.al. | 2408.11747 | null |
| 2024-08-21 | UNetMamba: Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images | Enze Zhu et.al. | 2408.11545 | null |
| 2024-08-22 | SAM-REF: Rethinking Image-Prompt Synergy for Refinement in Segment Anything | Chongkai Yu et.al. | 2408.11535 | null |
| 2024-08-21 | Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation | Chuandong Liu et.al. | 2408.11280 | null |
| 2024-08-20 | An Interpretable Deep Learning Approach for Morphological Script Type Analysis | Malamatenia Vlachou-Efstathiou et.al. | 2408.11150 | null |
| 2024-08-20 | NeCo: Improving DINOv2’s spatial representations in 19 GPU hours with Patch Neighbor Consistency | Valentinos Pariza et.al. | 2408.11054 | null |
| 2024-08-20 | CO2Wounds-V2: Extended Chronic Wounds Dataset From Leprosy Patients | Karen Sanchez et.al. | 2408.10827 | null |
| 2024-08-20 | Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant | Guofeng Mei et.al. | 2408.10652 | null |
| 2024-08-20 | Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended? | Chen Liang et.al. | 2408.10627 | null |
| 2024-08-20 | Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation | Jiawei Han et.al. | 2408.10537 | link |
| 2024-08-21 | LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS | Xinyu Liu et.al. | 2408.10469 | null |
| 2024-08-19 | Leveraging Superfluous Information in Contrastive Representation Learning | Xuechu Yu et.al. | 2408.10292 | null |
| 2024-08-19 | Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network | Rasha Alshawi et.al. | 2408.10181 | null |
| 2024-08-19 | Dynamic Label Injection for Imbalanced Industrial Defect Segmentation | Emanuele Caruso et.al. | 2408.10031 | link |
| 2024-08-19 | Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis | Kira Maag et.al. | 2408.10021 | null |
| 2024-08-19 | DiscoNeRF: Class-Agnostic Object Field for 3D Object Discovery | Corentin Dumery et.al. | 2408.09928 | null |
| 2024-08-19 | 3D-Aware Instance Segmentation and Tracking in Egocentric Videos | Yash Bhalgat et.al. | 2408.09860 | null |
| 2024-08-19 | Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving | Jun Yan et.al. | 2408.09839 | link |
| 2024-08-18 | OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras | Muhammad Rameez Ur Rahman et.al. | 2408.09424 | link |
| 2024-08-18 | VrdONE: One-stage Video Visual Relation Detection | Xinjie Jiang et.al. | 2408.09408 | link |
| 2024-08-18 | Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration | Hao Ai et.al. | 2408.09336 | null |
| 2024-08-17 | Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology | Junchao Zhu et.al. | 2408.09278 | link |
| 2024-08-16 | Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation | Tri Ton et.al. | 2408.08591 | null |
| 2024-08-16 | Tuning a SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation | Linghao Zheng et.al. | 2408.08576 | null |
| 2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et.al. | 2408.08575 | null |
| 2024-08-15 | 5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks | Dongshuo Yin et.al. | 2408.08345 | link |
| 2024-08-14 | MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis | Nimeesha Chan et.al. | 2408.07773 | link |
| 2024-08-15 | MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation | Beoungwoo Kang et.al. | 2408.07576 | link |
| 2024-08-15 | MagicFace: Training-free Universal-Style Human Image Customized Synthesis | Yibin Wang et.al. | 2408.07433 | null |
| 2024-08-14 | Segment Using Just One Example | Pratik Vora et.al. | 2408.07393 | null |
| 2024-08-14 | Ensemble architecture in polyp segmentation | Hao-Yun Hsu et.al. | 2408.07262 | link |
| 2024-08-14 | Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks | Raghavendra Singh et.al. | 2408.07243 | null |
| 2024-08-14 | Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training | Ethan Kou et.al. | 2408.07239 | null |
| 2024-08-13 | ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Jingyun Wang et.al. | 2408.06747 | link |
| 2024-08-10 | Dilated Convolution with Learnable Spacings | Ismail Khalfaoui-Hassani et.al. | 2408.06383 | null |
| 2024-08-12 | Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images | Siladittya Manna et.al. | 2408.06235 | null |
| 2024-08-12 | A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting | Felix Assion et.al. | 2408.06071 | null |
| 2024-08-13 | ClickAttention: Click Region Similarity Guided Interactive Segmentation | Long Xu et.al. | 2408.06021 | null |
| 2024-08-12 | Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning | Xinrong Hu et.al. | 2408.05889 | null |
| 2024-08-11 | Seg-CycleGAN: SAR-to-optical image translation guided by a downstream task | Hannuo Zhang et.al. | 2408.05777 | null |
| 2024-08-11 | MacFormer: Semantic Segmentation with Fine Object Boundaries | Guoan Xu et.al. | 2408.05699 | null |
| 2024-08-13 | Performance Evaluation of YOLOv8 Model Configurations, for Instance Segmentation of Strawberry Fruit Development Stages in an Open Field Environment | Abdul-Razak Alhassan Gamani et.al. | 2408.05661 | null |
| 2024-08-10 | Multimodal generative semantic communication based on latent diffusion model | Weiqi Fu et.al. | 2408.05455 | null |
| 2024-08-09 | PRISM Lite: A lightweight model for interactive 3D placenta segmentation in ultrasound | Hao Li et.al. | 2408.05372 | link |
| 2024-08-09 | In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Dahyun Kang et.al. | 2408.04961 | link |
| 2024-08-09 | ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | Mengcheng Lan et.al. | 2408.04883 | link |
| 2024-08-09 | Extracting Signal Electron Trajectories in the COMET Phase-I Cylindrical Drift Chamber Using Deep Learning | Fumihiro Kaneko et.al. | 2408.04795 | null |
| 2024-08-08 | Embodied Uncertainty-Aware Object Segmentation | Xiaolin Fang et.al. | 2408.04760 | null |
| 2024-08-08 | SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation | Jieming Yu et.al. | 2408.04593 | null |
| 2024-08-08 | Robust Approximate Characterization of Single-Cell Heterogeneity in Microbial Growth | Richard D. Paul et.al. | 2408.04501 | link |
| 2024-08-08 | SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios | Sriram Mandalika et.al. | 2408.04482 | null |
| 2024-08-08 | What could go wrong? Discovering and describing failure modes in computer vision | Gabriela Csurka et.al. | 2408.04471 | null |
| 2024-08-07 | Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation | Yiqing Shen et.al. | 2408.04098 | null |
| 2024-08-07 | CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Tianfang Zhang et.al. | 2408.03703 | link |
| 2024-08-07 | SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology | Mingya Zhang et.al. | 2408.03651 | link |
| 2024-08-06 | Post-Mortem Human Iris Segmentation Analysis with Deep Learning | Afzal Hossain et.al. | 2408.03448 | null |
| 2024-08-06 | Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression | Jonas Schmitt et.al. | 2408.03046 | link |
| 2024-08-06 | Evaluation of Segment Anything Model 2: The Role of SAM2 in the Underwater Environment | Shijie Lian et.al. | 2408.02924 | link |
| 2024-08-05 | Scribble-Based Interactive Segmentation of Medical Hyperspectral Images | Zhonghao Wang et.al. | 2408.02708 | null |
| 2024-08-05 | Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation | Sai Prasanna et.al. | 2408.02297 | null |
| 2024-08-05 | Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs | Jeongkee Lim et.al. | 2408.02261 | null |
| 2024-08-05 | Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders | Muhammad Abdullah Jamal et.al. | 2408.02245 | null |
| 2024-08-04 | Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation | Ye Du et.al. | 2408.02039 | null |
| 2024-08-03 | NuLite – Lightweight and Fast Model for Nuclei Instance Segmentation and Classification | Cristian Tommasino et.al. | 2408.01797 | null |
| 2024-08-03 | Bayesian Active Learning for Semantic Segmentation | Sima Didari et.al. | 2408.01694 | null |
| 2024-08-03 | A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection | Omkar Oak et.al. | 2408.01692 | null |
| 2024-08-03 | Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation | Balázs Opra et.al. | 2408.01640 | null |
| 2024-08-02 | Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans | Lukas Kratochvila et.al. | 2408.01526 | null |
| 2024-08-02 | Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation | Yuanzhi Su et.al. | 2408.01356 | null |
| 2024-08-02 | StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation | Bingyu Li et.al. | 2408.01343 | null |
| 2024-08-02 | Amodal Segmentation for Laparoscopic Surgery Video Instruments | Ruohua Shi et.al. | 2408.01067 | null |
| 2024-08-02 | Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Yabin Zhu et.al. | 2408.00969 | null |
| 2024-08-01 | Medical SAM 2: Segment medical images as video via Segment Anything Model 2 | Jiayuan Zhu et.al. | 2408.00874 | link |
| 2024-08-01 | Leaf Angle Estimation using Mask R-CNN and LETR Vision Transformer | Venkat Margapuri et.al. | 2408.00749 | null |
| 2024-08-01 | Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation | Siyu Jiao et.al. | 2408.00744 | link |
| 2024-08-01 | Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function | Matias Oscar Volman Stern et.al. | 2408.00707 | null |
| 2024-08-01 | AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation | Asbjørn Munk et.al. | 2408.00640 | null |
| 2024-08-01 | SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation | Shengbo Tan et.al. | 2408.00496 | null |
| 2024-08-01 | A Simple Background Augmentation Method for Object Detection with Diffusion Model | Yuhang Li et.al. | 2408.00350 | null |
| 2024-07-31 | Con4m: Context-aware Consistency Learning Framework for Segmented Time Series Classification | Junru Chen et.al. | 2408.00041 | null |
| 2024-07-31 | Open-Vocabulary Audio-Visual Semantic Segmentation | Ruohao Guo et.al. | 2407.21721 | link |
| 2024-07-31 | MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment | Anurag Das et.al. | 2407.21654 | null |
| 2024-07-31 | MaskUno: Switch-Split Block For Enhancing Instance Segmentation | Jawad Haidar et.al. | 2407.21498 | null |
| 2024-07-31 | Small Object Few-shot Segmentation for Vision-based Industrial Inspection | Zilong Zhang et.al. | 2407.21351 | null |
| 2024-07-31 | On-the-fly Point Feature Representation for Point Clouds Analysis | Jiangyi Wang et.al. | 2407.21335 | null |
| 2024-07-31 | Fine-grained Metrics for Point Cloud Semantic Segmentation | Zhuheng Lu et.al. | 2407.21289 | null |
| 2024-07-30 | PLANesT-3D: A new annotated dataset for segmentation of 3D plant point clouds | Kerem Mertoğlu et.al. | 2407.21150 | null |
| 2024-07-30 | Learning Ordinality in Semantic Segmentation | Rafael Cristino et.al. | 2407.20959 | null |
| 2024-07-29 | Improving 2D Feature Representations by 3D-Aware Fine-Tuning | Yuanwen Yue et.al. | 2407.20229 | link |
| 2024-07-29 | Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection With Sky-Annotated Dataset | Yimian Dai et.al. | 2407.20078 | link |
| 2024-07-29 | Language-driven Grasp Detection with Mask-guided Attention | Tuan Van Vo et.al. | 2407.19877 | null |
| 2024-07-29 | Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets | Muhammad Abdullah Jamal et.al. | 2407.19714 | null |
| 2024-07-29 | ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement | Ezequiel Perez-Zarate et.al. | 2407.19708 | link |
| 2024-07-28 | ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding | Zhen Chen et.al. | 2407.19435 | link |
| 2024-07-28 | Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets | Tianxiao Zhang et.al. | 2407.19394 | link |
| 2024-07-27 | Ensembling convolutional neural networks for human skin segmentation | Patryk Kuban et.al. | 2407.19310 | null |
| 2024-07-27 | Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | Gang Pan et.al. | 2407.19271 | null |
| 2024-07-26 | Sparse Refinement for Efficient High-Resolution Semantic Segmentation | Zhijian Liu et.al. | 2407.19014 | null |
| 2024-07-26 | A Survey on Cell Nuclei Instance Segmentation and Classification: Leveraging Context and Attention | João D. Nunes et.al. | 2407.18673 | null |
| 2024-07-26 | Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation | Jingjun Yi et.al. | 2407.18568 | null |
| 2024-07-25 | Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception | Julia Hindel et.al. | 2407.18145 | null |
| 2024-07-25 | LKCell: Efficient Cell Nuclei Instance Segmentation with Large Convolution Kernels | Ziwei Cui et.al. | 2407.18054 | link |
| 2024-07-25 | TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework | Guanfeng Tang et.al. | 2407.18038 | null |
| 2024-07-25 | Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions | Jan Nikolas Morshuis et.al. | 2407.18026 | link |
| 2024-07-26 | Quality Assured: Rethinking Annotation Strategies in Imaging AI | Tim Rädsch et.al. | 2407.17596 | null |
| 2024-07-24 | Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation | Hyunwoo Yu et.al. | 2407.17261 | link |
| 2024-07-24 | Trans2Unet: Neural fusion for Nuclei Semantic Segmentation | Dinh-Phu Tran et.al. | 2407.17181 | null |
| 2024-07-24 | PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning | Mu Chen et.al. | 2407.17101 | null |
| 2024-07-25 | Enhancing Environmental Monitoring through Multispectral Imaging: The WasteMS Dataset for Semantic Segmentation of Lakeside Waste | Qinfeng Zhu et.al. | 2407.17028 | link |
| 2024-07-24 | Progressive Query Refinement Framework for Bird’s-Eye-View Semantic Segmentation from Surrounding Images | Dooseop Choi et.al. | 2407.17003 | link |
| 2024-07-24 | McGAN: Generating Manufacturable Designs by Embedding Manufacturing Rules into Conditional Generative Adversarial Network | Zhichao Wang et.al. | 2407.16943 | null |
| 2024-07-23 | SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation | Pengfei Chen et.al. | 2407.16682 | null |
| 2024-07-23 | Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving | Anam Manzoor et.al. | 2407.16647 | null |
| 2024-07-23 | Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging | Daniela L. Ramos et.al. | 2407.16608 | null |
| 2024-07-23 | Strike a Balance in Continual Panoptic Segmentation | Jinpeng Chen et.al. | 2407.16354 | link |
| 2024-07-23 | Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision | Aditya Krishnan et.al. | 2407.16102 | null |
| 2024-07-22 | Enhancing Cell Instance Segmentation in Scanning Electron Microscopy Images via a Deep Contour Closing Operator | Florian Robert et.al. | 2407.15817 | null |
| 2024-07-22 | MILAN: Milli-Annotations for Lidar Semantic Segmentation | Nermin Samet et.al. | 2407.15797 | null |
| 2024-07-22 | Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond | Silvio Galesso et.al. | 2407.15739 | link |
| 2024-07-22 | MSSPlace: Multi-Sensor Place Recognition with Visual and Text Semantics | Alexander Melekhin et.al. | 2407.15663 | link |
| 2024-07-22 | Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling | Bo Yuan et.al. | 2407.15429 | link |
| 2024-07-22 | Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data | Junha Song et.al. | 2407.15383 | null |
| 2024-07-21 | Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation | Xiaoyang Wu et.al. | 2407.15282 | null |
| 2024-07-20 | Downstream-Pretext Domain Knowledge Traceback for Active Learning | Beichen Zhang et.al. | 2407.14720 | null |
| 2024-07-19 | Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model | Kun Zhao et.al. | 2407.14326 | null |
| 2024-07-19 | Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation | Zhengyuan Xie et.al. | 2407.14142 | link |
| 2024-07-19 | MC-PanDA: Mask Confidence for Panoptic Domain Adaptation | Ivan Martinović et.al. | 2407.14110 | link |
| 2024-07-19 | GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation | Florian Chabot et.al. | 2407.14108 | null |
| 2024-07-19 | Scale Disparity of Instances in Interactive Point Cloud Segmentation | Chenrui Han et.al. | 2407.14009 | null |
| 2024-07-18 | Many Perception Tasks are Highly Redundant Functions of their Input Data | Rahul Ramesh et.al. | 2407.13841 | null |
| 2024-07-18 | GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model | Abdelrahman Shaker et.al. | 2407.13772 | link |
| 2024-07-18 | SegPoint: Segment Any Point Cloud via Large Language Model | Shuting He et.al. | 2407.13761 | null |
| 2024-07-18 | MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis | Ziming Zhong et.al. | 2407.13675 | link |
| 2024-07-18 | Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models | Xiaoyu Zhu et.al. | 2407.13642 | null |
| 2024-07-18 | FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures | Hao Lu et.al. | 2407.13500 | null |
| 2024-07-18 | FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions | Sohyun Lee et.al. | 2407.13437 | null |
| 2024-07-18 | Lightweight Uncertainty Quantification with Simplex Semantic Segmentation for Terrain Traversability | Judith Dijk et.al. | 2407.13392 | null |
| 2024-07-18 | Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation | Chang Liu et.al. | 2407.13363 | null |
| 2024-07-18 | Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation | Shoumeng Qiu et.al. | 2407.13254 | link |
| 2024-07-18 | OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird’s-eye-view Vehicle Semantic Segmentation | Jian Sun et.al. | 2407.13137 | null |
| 2024-07-17 | FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification | Yiqing Shen et.al. | 2407.12658 | null |
| 2024-07-17 | Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation | Prantik Howlader et.al. | 2407.12630 | link |
| 2024-07-17 | Instance-wise Uncertainty for Class Imbalance in Semantic Segmentation | Luís Almeida et.al. | 2407.12609 | null |
| 2024-07-17 | Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks | Antoni Kowalczuk et.al. | 2407.12588 | link |
| 2024-07-17 | Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation | Ruijie Xu et.al. | 2407.12489 | link |
| 2024-07-17 | Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation | Hyun Seok Seong et.al. | 2407.12463 | null |
| 2024-07-17 | Close the Sim2real Gap via Physically-based Structured Light Synthetic Data Simulation | Kaixin Bai et.al. | 2407.12449 | null |
| 2024-07-17 | ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference | Mengcheng Lan et.al. | 2407.12442 | null |
| 2024-07-17 | Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model | Tao Wang et.al. | 2407.12319 | null |
| 2024-07-16 | FoodMem: Near Real-time and Precise Food Video Segmentation | Ahmad AlMughrabi et.al. | 2407.12121 | null |
| 2024-07-16 | Mitigating Background Shift in Class-Incremental Semantic Segmentation | Gilhan Park et.al. | 2407.11859 | link |
| 2024-07-16 | Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation | Juncheng Ma et.al. | 2407.11820 | null |
| 2024-07-16 | Click-Gaussian: Interactive Segmentation to Any 3D Gaussians | Seokhun Choi et.al. | 2407.11793 | null |
| 2024-07-16 | XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach | Truong Thanh Hung Nguyen et.al. | 2407.11771 | null |
| 2024-07-16 | OAM-TCD: A globally diverse dataset of high-resolution tree cover maps | Josh Veitch-Michaelis et.al. | 2407.11743 | link |
| 2024-07-16 | SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds | Yanbo Wang et.al. | 2407.11569 | link |
| 2024-07-16 | SGIFormer: Semantic-guided and Geometric-enhanced Interleaving Transformer for 3D Instance Segmentation | Lei Yao et.al. | 2407.11564 | link |
| 2024-07-16 | Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes | Zhi Cai et.al. | 2407.11464 | link |
| 2024-07-16 | Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations | Yunya Gao et.al. | 2407.11381 | link |
| 2024-07-16 | Generative AI Driven Task-Oriented Adaptive Semantic Communications | Yuzhou Fu et.al. | 2407.11354 | null |
| 2024-07-15 | No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations | Walter Simoncini et.al. | 2407.10964 | link |
| 2024-07-15 | APC: Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation | Wangyu Wu et.al. | 2407.10649 | null |
| 2024-07-15 | Automated Label Unification for Multi-Dataset Semantic Segmentation with GNNs | Rong Ma et.al. | 2407.10534 | null |
| 2024-07-14 | Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data | Tuo Feng et.al. | 2407.10200 | link |
| 2024-07-14 | RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation | Li Li et.al. | 2407.10159 | link |
| 2024-07-14 | Part2Object: Hierarchical Unsupervised 3D Instance Segmentation | Cheng Shi et.al. | 2407.10084 | link |
| 2024-07-14 | HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation | Chengjie Jiang et.al. | 2407.10047 | null |
| 2024-07-13 | Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation | Anqi Zhang et.al. | 2407.09838 | null |
| 2024-07-13 | Enhancing Semantic Segmentation with Adaptive Focal Loss: A Novel Approach | Md Rakibul Islam et.al. | 2407.09828 | null |
| 2024-07-13 | 3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance | Xiaoxu Xu et.al. | 2407.09826 | link |
| 2024-07-12 | FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background | Muhammad Ali et.al. | 2407.09379 | link |
| 2024-07-12 | WSESeg: Introducing a Dataset for the Segmentation of Winter Sports Equipment with a Baseline for Interactive Segmentation | Robin Schön et.al. | 2407.09288 | null |
| 2024-07-12 | A Fair Ranking and New Model for Panoptic Scene Graph Generation | Julian Lorenz et.al. | 2407.09216 | link |
| 2024-07-12 | Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy | Julian Wyatt et.al. | 2407.09192 | null |
| 2024-07-12 | From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation | Hanrong Shi et.al. | 2407.09191 | null |
| 2024-07-12 | Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off | Levente Halmosi et.al. | 2407.09150 | link |
| 2024-07-12 | Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation | Wei Cong et.al. | 2407.09047 | null |
| 2024-07-12 | Textual Query-Driven Mask Transformer for Domain Generalized Segmentation | Byeonghyun Pak et.al. | 2407.09033 | link |
| 2024-07-12 | Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation | Zihao Li et.al. | 2407.08994 | null |
| 2024-07-11 | SLoRD: Structural Low-Rank Descriptors for Shape Consistency in Vertebrae Segmentation | Xin You et.al. | 2407.08555 | null |
| 2024-07-11 | Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation | Tong Shao et.al. | 2407.08268 | link |
| 2024-07-11 | Enrich the content of the image Using Context-Aware Copy Paste | Qiushi Guo et.al. | 2407.08151 | null |
| 2024-07-10 | MambaVision: A Hybrid Mamba-Transformer Vision Backbone | Ali Hatamizadeh et.al. | 2407.08083 | link |
| 2024-07-10 | Interactive Segmentation Model for Placenta Segmentation from 3D Ultrasound images | Hao Li et.al. | 2407.08020 | link |
| 2024-07-10 | Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift | Elliot Vincent et.al. | 2407.07616 | link |
| 2024-07-10 | H-FCBFormer Hierarchical Fully Convolutional Branch Transformer for Occlusal Contact Segmentation with Articulating Paper | Ryan Banks et.al. | 2407.07604 | link |
| 2024-07-11 | Trainable Highly-expressive Activation Functions | Irit Chelly et.al. | 2407.07564 | null |
| 2024-07-10 | Panoptic Segmentation of Galactic Structures in LSB Images | Felix Richards et.al. | 2407.07494 | null |
| 2024-07-10 | Deformable-Heatmap-Segmentation for Automobile Visual Perception | Hongyu Jin et.al. | 2407.07493 | null |
| 2024-07-10 | Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining | Tianfang Sun et.al. | 2407.07465 | null |
| 2024-07-11 | HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation | Guoan Xu et.al. | 2407.07441 | null |
| 2024-07-10 | Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation | Hao Fang et.al. | 2407.07427 | link |
| 2024-07-09 | ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation | Yuyuan Liu et.al. | 2407.07171 | link |
| 2024-07-09 | Improved Block Merging for 3D Point Cloud Instance Segmentation | Leon Denis et.al. | 2407.06991 | null |
| 2024-07-09 | Joint prototype and coefficient prediction for 3D instance segmentation | Remco Royen et.al. | 2407.06958 | null |
| 2024-07-08 | Training-free CryoET Tomogram Segmentation | Yizhou Zhao et.al. | 2407.06833 | link |
| 2024-07-09 | CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM | Aditya Murali et.al. | 2407.06795 | null |
| 2024-07-09 | LuSNAR: A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration | Jiayi Liu et.al. | 2407.06512 | link |
| 2024-07-08 | Leveraging image captions for selective whole slide image annotation | Jingna Qiu et.al. | 2407.06363 | null |
| 2024-07-08 | Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots | Siva Krishna Ravipati et.al. | 2407.06077 | null |
| 2024-07-08 | Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts | Puzuo Wang et.al. | 2407.06043 | null |
| 2024-07-08 | RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation | Sarah Elmahdy et.al. | 2407.06016 | link |
| 2024-07-07 | Semantic Segmentation for Real-World and Synthetic Vehicle’s Forward-Facing Camera Images | Tuan T. Nguyen et.al. | 2407.05452 | null |
| 2024-07-07 | Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness | Idris Hamoud et.al. | 2407.05448 | null |
| 2024-07-06 | A Study of Test-time Contrastive Concepts for Open-world, Open-vocabulary Semantic Segmentation | Monika Wysoczańska et.al. | 2407.05061 | null |
| 2024-07-06 | BlessemFlood21: Advancing Flood Analysis with a High-Resolution Georeferenced Dataset for Humanitarian Aid Support | Vladyslav Polushko et.al. | 2407.05007 | null |
| 2024-07-05 | Explainable Metric Learning for Deflating Data Bias | Emma Andrews et.al. | 2407.04866 | null |
| 2024-07-05 | Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge | Yuanze Lin et.al. | 2407.04681 | null |
| 2024-07-05 | LMSeg: A deep graph message-passing network for efficient and accurate semantic segmentation of large-scale 3D landscape meshes | Zexian Huang et.al. | 2407.04326 | null |
| 2024-07-04 | Slice-100K: A Multimodal Dataset for Extrusion-based 3D Printing | Anushrut Jignasu et.al. | 2407.04180 | link |
| 2024-07-04 | Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier | Prantik Howlader et.al. | 2407.04036 | link |
| 2024-07-04 | Performance of Medical Image Fusion in High-level Analysis Tasks: A Mutual Enhancement Framework for Unaligned PAT and MRI Image Fusion | Yutian Zhong et.al. | 2407.03992 | link |
| 2024-07-04 | Relative Difficulty Distillation for Semantic Segmentation | Dong Liang et.al. | 2407.03719 | null |
| 2024-07-04 | POSTURE: Pose Guided Unsupervised Domain Adaptation for Human Body Part Segmentation | Arindam Dutta et.al. | 2407.03549 | null |
| 2024-07-03 | A Unified Framework for 3D Scene Understanding | Wei Xu et.al. | 2407.03263 | null |
| 2024-07-03 | ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation | Chang Li et.al. | 2407.03033 | null |
| 2024-07-03 | Context-Aware Video Instance Segmentation | Seunghun Lee et.al. | 2407.03010 | link |
| 2024-07-03 | ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation | Yipin Guo et.al. | 2407.02881 | null |
| 2024-07-03 | Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation | Tao Chen et.al. | 2407.02768 | null |
| 2024-07-03 | ADFQ-ViT: Activation-Distribution-Friendly Post-Training Quantization for Vision Transformers | Yanfeng Jiang et.al. | 2407.02763 | null |
| 2024-07-02 | Open Panoramic Segmentation | Junwei Zheng et.al. | 2407.02685 | link |
| 2024-07-02 | Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction | Tinghuai Wang et.al. | 2407.02639 | null |
| 2024-07-02 | Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather | Junsung Park et.al. | 2407.02286 | link |
| 2024-07-02 | MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders | Baijiong Lin et.al. | 2407.02228 | link |
| 2024-07-02 | Occlusion-Aware Seamless Segmentation | Yihong Cao et.al. | 2407.02182 | link |
| 2024-07-02 | VRBiom: A New Periocular Dataset for Biometric Applications of HMD | Ketan Kotwal et.al. | 2407.02150 | null |
| 2024-07-02 | HRSAM: Efficiently Segment Anything in High-Resolution Images | You Huang et.al. | 2407.02109 | null |
| 2024-07-02 | Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts | Pasquale De Marinis et.al. | 2407.02075 | link |
| 2024-07-02 | LiDAR-based HD Map Localization using Semantic Generalized ICP with Road Marking Detection | Yansong Gong et.al. | 2407.02061 | null |
| 2024-07-02 | Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning | Chengchao Shen et.al. | 2407.02014 | link |
| 2024-07-01 | Label-free Neural Semantic Image Synthesis | Jiayi Wang et.al. | 2407.01790 | null |
| 2024-07-01 | PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction | Xuan Yu et.al. | 2407.01349 | null |
| 2024-06-28 | EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model | Yuxuan Zhang et.al. | 2406.20076 | link |
| 2024-07-01 | Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding | Yifan Tang et.al. | 2406.19791 | null |
| 2024-06-28 | PM-VIS+: High-Performance Video Instance Segmentation without Video Annotation | Zhangjing Yang et.al. | 2406.19665 | link |
| 2024-06-28 | Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation | Junsung Park et.al. | 2406.19638 | link |
| 2024-06-28 | PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation | Deyi Ji et.al. | 2406.19632 | null |
| 2024-06-27 | Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model | Haobo Yuan et.al. | 2406.19369 | null |
| 2024-06-27 | ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation | Nazanin Moradinasab et.al. | 2406.19225 | null |
| 2024-06-30 | Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO | Fuseini Mumuni et.al. | 2406.19057 | null |
| 2024-06-27 | Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation | Tao Lian et.al. | 2406.18809 | null |
| 2024-07-01 | 3D Feature Distillation with Object-Centric Priors | Georgios Tziafas et.al. | 2406.18742 | null |
| 2024-06-26 | CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data | Nikolaos Dionelis et.al. | 2406.18279 | null |
| 2024-06-26 | CoDA: Interactive Segmentation and Morphological Analysis of Dendroid Structures Exemplified on Stony Cold-Water Corals | Kira Schmitt et.al. | 2406.18236 | link |
| 2024-06-26 | The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval | Boris Meinardus et.al. | 2406.18113 | link |
| 2024-06-26 | Few-Shot Medical Image Segmentation with High-Fidelity Prototypes | Song Tang et.al. | 2406.18074 | link |
| 2024-06-25 | Semi-supervised classification of dental conditions in panoramic radiographs using large language model and instance segmentation: A real-world dataset evaluation | Bernardo Silva et.al. | 2406.17915 | null |
| 2024-06-25 | Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation | Xuming Zhang et.al. | 2406.17679 | null |
| 2024-06-25 | DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation | Ahmad Mohammadshirazi et.al. | 2406.17591 | link |
| 2024-06-25 | Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation | Felix Stillger et.al. | 2406.17541 | null |
| 2024-06-25 | Investigating Self-Supervised Methods for Label-Efficient Learning | Srinivasa Rao Nandam et.al. | 2406.17460 | null |
| 2024-06-25 | Pseudo Labelling for Enhanced Masked Autoencoders | Srinivasa Rao Nandam et.al. | 2406.17450 | null |
| 2024-06-25 | Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model | Zhuoyuan Li et.al. | 2406.17442 | null |
| 2024-06-25 | Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Qi Ma et.al. | 2406.17438 | null |
| 2024-06-25 | Depth-Guided Semi-Supervised Instance Segmentation | Xin Chen et.al. | 2406.17413 | null |
| 2024-06-25 | XAMI – A Benchmark Dataset for Artefact Detection in XMM-Newton Optical Images | Elisabeta-Iulia Dima et.al. | 2406.17323 | link |
| 2024-06-24 | GMT: Guided Mask Transformer for Leaf Instance Segmentation | Feng Chen et.al. | 2406.17109 | null |
| 2024-06-24 | Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation | Yizheng Wu et.al. | 2406.16776 | link |
| 2024-06-24 | μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation | Pierangela Bruno et.al. | 2406.16724 | null |
| 2024-06-24 | GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection | Harnaik Dhami et.al. | 2406.16625 | null |
| 2024-06-24 | LOGCAN++: Local-global class-aware network for semantic segmentation of remote sensing images | Xiaowen Ma et.al. | 2406.16502 | link |
| 2024-06-24 | Cascade Reward Sampling for Efficient Decoding-Time Alignment | Bolian Li et.al. | 2406.16306 | link |
| 2024-06-24 | SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments | Neng Wang et.al. | 2406.16279 | link |
| 2024-06-23 | UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery | Pengfei Zhang et.al. | 2406.16129 | null |
| 2024-06-23 | CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery | Oluwatosin Alabi et.al. | 2406.16039 | null |
| 2024-06-22 | Fine-grained Background Representation for Weakly Supervised Semantic Segmentation | Xu Yin et.al. | 2406.15755 | null |
| 2024-06-21 | TraceNet: Segment one thing efficiently | Mingyuan Wu et.al. | 2406.14874 | null |
| 2024-06-19 | 3D Instance Segmentation Using Deep Learning on RGB-D Indoor Data | Siddiqui Muhammad Yasir et.al. | 2406.14581 | null |
| 2024-06-20 | Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery | Ilham Adi Panuntun et.al. | 2406.14220 | null |
| 2024-06-20 | Trusting Semantic Segmentation Networks | Samik Some et.al. | 2406.14201 | null |
| 2024-06-20 | EvSegSNN: Neuromorphic Semantic Segmentation for Event Data | Dalia Hareb et.al. | 2406.14178 | null |
| 2024-06-20 | Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images | Qinfeng Zhu et.al. | 2406.14086 | link |
| 2024-06-20 | 2nd Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Bin Cao et.al. | 2406.13939 | null |
| 2024-06-19 | Search-based DNN Testing and Retraining with GAN-enhanced Simulations | Mohammed Oualid Attaoui et.al. | 2406.13359 | null |
| 2024-06-19 | Deep Learning-Based 3D Instance and Semantic Segmentation: A Review | Siddiqui Muhammad Yasir et.al. | 2406.13308 | null |
| 2024-06-18 | Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation | Guoyu Yang et.al. | 2406.12496 | link |
| 2024-06-18 | Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines | Honglei Zhang et.al. | 2406.12367 | null |
| 2024-06-18 | Agriculture-Vision Challenge 2024 – The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble | Wang Liu et.al. | 2406.12271 | null |
| 2024-06-17 | OoDIS: Anomaly Instance Segmentation Benchmark | Alexey Nekrasov et.al. | 2406.11835 | link |
| 2024-06-17 | Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT | Maximilian E. Tschuchnig et.al. | 2406.11650 | null |
| 2024-06-17 | Learning from Exemplars for Interactive Image Segmentation | Kun Li et.al. | 2406.11472 | null |
| 2024-06-17 | SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation | Zhenchao Lin et.al. | 2406.11441 | link |
| 2024-06-17 | Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding | Yunsong Wang et.al. | 2406.11283 | null |
| 2024-06-17 | Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation | Bingfeng Zhang et.al. | 2406.11189 | null |
| 2024-06-16 | α-SSC: Uncertainty-Aware Camera-based 3D Semantic Scene Completion | Sanbao Su et.al. | 2406.11021 | null |
| 2024-06-16 | Benchmarking Label Noise in Instance Segmentation: Spatial Noise Matters | Moshe Kimhi et.al. | 2406.10891 | link |
| 2024-06-16 | PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery | Libo Wang et.al. | 2406.10828 | link |
| 2024-06-15 | GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR | Bharat Singh et.al. | 2406.10722 | null |
| 2024-06-14 | Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations | Daan de Geus et.al. | 2406.10114 | null |
| 2024-06-14 | ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers | Narges Norouzi et.al. | 2406.09936 | null |
| 2024-06-14 | Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions | Aldi Piroli et.al. | 2406.09906 | null |
| 2024-06-14 | Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation | Brunó B. Englert et.al. | 2406.09896 | link |
| 2024-06-14 | Open-Vocabulary Semantic Segmentation with Image Embedding Balancing | Xiangheng Shan et.al. | 2406.09829 | link |
| 2024-06-14 | 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities | Roman Bachmann et.al. | 2406.09406 | null |
| 2024-06-13 | Instance-level quantitative saliency in multiple sclerosis lesion segmentation | Federico Spagnolo et.al. | 2406.09335 | null |
| 2024-06-13 | APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation | Weizhao He et.al. | 2406.08372 | null |
| 2024-06-12 | Dataset Enhancement with Instance-Level Augmentations | Orest Kupyn et.al. | 2406.08249 | link |
| 2024-06-12 | 2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation | Zhensong Xu et.al. | 2406.08192 | null |
| 2024-06-13 | A$^{2}$-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder | Lixian Zhang et.al. | 2406.08079 | null |
| 2024-06-12 | OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding | Yinan Deng et.al. | 2406.08009 | link |
| 2024-06-12 | SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation | Chanda Grover Kamra et.al. | 2406.07986 | link |
| 2024-06-12 | Small Scale Data-Free Knowledge Distillation | He Liu et.al. | 2406.07876 | link |
| 2024-06-11 | Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph | Sergey Linok et.al. | 2406.07113 | null |
| 2024-06-11 | PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving | Yining Shi et.al. | 2406.07037 | null |
| 2024-06-11 | RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks | Zhechao Wang et.al. | 2406.07032 | null |
| 2024-06-12 | LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection | Jiahua Xu et.al. | 2406.07023 | null |
| 2024-06-11 | Dual Thinking and Perceptual Analysis of Deep Learning Models using Human Adversarial Examples | Kailas Dayanandan et.al. | 2406.06967 | link |
| 2024-06-11 | UVIS: Unsupervised Video Instance Segmentation | Shuaiyi Huang et.al. | 2406.06908 | null |
| 2024-06-10 | Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation | Dong Zhao et.al. | 2406.06813 | null |
| 2024-06-10 | Merlin: A Vision Language Foundation Model for 3D Computed Tomography | Louis Blankemeier et.al. | 2406.06512 | link |
| 2024-06-10 | UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving | Daniel Bogdoll et.al. | 2406.06370 | null |
| 2024-06-10 | Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset | Shijie Lian et.al. | 2406.06039 | link |
| 2024-06-09 | Scaling Graph Convolutions for Mobile Vision | William Avery et.al. | 2406.05850 | link |
| 2024-06-09 | Solution for CVPR 2024 UG2+ Challenge Track on All Weather Semantic Segmentation | Jun Yu et.al. | 2406.05837 | null |
| 2024-06-09 | Convolution and Attention-Free Mamba-based Cardiac Image Segmentation | Abbas Khan et.al. | 2406.05786 | null |
| 2024-06-09 | Separating the “Chirp” from the “Chat”: Self-supervised Visual Grounding of Sound and Language | Mark Hamilton et.al. | 2406.05629 | link |
| 2024-06-08 | A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+ | Jianzhao Wang et.al. | 2406.05513 | null |
| 2024-06-08 | Layered Image Vectorization via Semantic Simplification | Zhenyu Wang et.al. | 2406.05404 | null |
| 2024-06-08 | 1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR’24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation | Qingfeng Liu et.al. | 2406.05352 | null |
| 2024-06-07 | Semantic Segmentation on VSPW Dataset through Masked Video Consistency | Chen Liang et.al. | 2406.04979 | null |
| 2024-06-07 | Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment | Venkanna Babu Guthula et.al. | 2406.04949 | null |
| 2024-06-06 | Characterizing segregation in blast rock piles: a deep-learning approach leveraging aerial image analysis | Chengeng Liu et.al. | 2406.04149 | null |
| 2024-06-07 | 3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation | Ruipu Wu et.al. | 2406.04002 | null |
| 2024-06-06 | Frequency-based Matcher for Long-tailed Semantic Segmentation | Shan Li et.al. | 2406.03917 | link |
| 2024-06-07 | Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge | Nan Zhang et.al. | 2406.03799 | link |
| 2024-06-06 | Instance Segmentation and Teeth Classification in Panoramic X-rays | Devichand Budagam et.al. | 2406.03747 | link |
| 2024-06-06 | DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation | Zilu Guo et.al. | 2406.03702 | link |
| 2024-06-05 | Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation | Maximilian Zenk et.al. | 2406.03323 | null |
| 2024-06-05 | Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy | Yunho Kim et.al. | 2406.02989 | null |
| 2024-06-04 | W-RIZZ: A Weakly-Supervised Framework for Relative Traversability Estimation in Mobile Robotics | Andre Schreiber et.al. | 2406.02822 | link |
| 2024-06-04 | Window to Wall Ratio Detection using SegFormer | Zoe De Simone et.al. | 2406.02706 | link |
| 2024-06-04 | Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation | Mohamed El Amine Boudjoghra et.al. | 2406.02548 | link |
| 2024-06-04 | Generative Active Learning for Long-tailed Instance Segmentation | Muzhi Zhu et.al. | 2406.02435 | link |
| 2024-06-04 | Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning | Heather Doig et.al. | 2406.01932 | null |
| 2024-06-03 | MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild | Zeren Jiang et.al. | 2406.01595 | null |
| 2024-06-03 | Towards Flexible Interactive Reflection Removal with Human Guidance | Xiao Chen et.al. | 2406.01555 | link |
| 2024-06-03 | EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding | Thanh-Dat Truong et.al. | 2406.01429 | null |
| 2024-06-03 | An expert-driven data generation pipeline for histological images | Roberto Basla et.al. | 2406.01403 | link |
| 2024-06-03 | TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation | Antonio Santo et.al. | 2406.01395 | link |
| 2024-06-03 | MP-PolarMask: A Faster and Finer Instance Segmentation for Concave Images | Ke-Lei Wang et.al. | 2406.01356 | null |
| 2024-06-03 | ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds | Ka Lung Cheung et.al. | 2406.01337 | link |
| 2024-05-31 | Uncertainty Quantification for Bird’s Eye View Semantic Segmentation: Methods and Benchmarks | Linlin Yu et.al. | 2405.20986 | null |
| 2024-05-31 | Extreme Point Supervised Instance Segmentation | Hyeonjun Lee et.al. | 2405.20729 | null |
| 2024-05-31 | Revisiting and Maximizing Temporal Knowledge in Semi-supervised Semantic Segmentation | Wooseok Shin et.al. | 2405.20610 | link |
| 2024-05-30 | P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation | Qi Zhang et.al. | 2405.20443 | null |
| 2024-05-30 | SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow | Chaoyang Wang et.al. | 2405.20282 | link |
| 2024-05-30 | MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion | Angel Villar-Corrales et.al. | 2405.19921 | link |
| 2024-05-30 | Open-Set Domain Adaptation for Semantic Segmentation | Seun-An Choe et.al. | 2405.19899 | link |
| 2024-05-30 | DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation | Ron Keuth et.al. | 2405.19746 | link |
| 2024-05-30 | Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes | Yong-Qiang Mao et.al. | 2405.19735 | null |
| 2024-05-30 | CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation | Ankush Gajanan Arudkar et.al. | 2405.19672 | null |
| 2024-05-29 | Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation | Lianlei Shan et.al. | 2405.19568 | null |
| 2024-05-29 | Enabling Visual Recognition at Radio Frequency | Haowen Lai et.al. | 2405.19516 | null |
| 2024-05-29 | Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models | Tianrun Chen et.al. | 2405.19326 | null |
| 2024-05-29 | A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation | Niclas Vödisch et.al. | 2405.19035 | link |
| 2024-05-29 | Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation | Zelin Peng et.al. | 2405.18840 | null |
| 2024-05-29 | FocSAM: Delving Deeply into Focused Objects in Segmenting Anything | You Huang et.al. | 2405.18706 | null |
| 2024-05-28 | Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation | JuneHyoung Kwon et.al. | 2405.18148 | null |
| 2024-05-28 | Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images | Lianlei Shan et.al. | 2405.18078 | null |
| 2024-05-28 | RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields | Mihnea-Bogdan Jurca et.al. | 2405.18033 | null |
| 2024-05-28 | DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture | Shentong Mo et.al. | 2405.17995 | link |
| 2024-05-28 | Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation | Yangxiao Lu et.al. | 2405.17859 | link |
| 2024-05-28 | The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention | Xingyu Ding et.al. | 2405.17776 | null |
| 2024-05-27 | Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation | Steven Landgraf et.al. | 2405.17097 | null |
| 2024-05-27 | DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking | Hongtao Wang et.al. | 2405.16980 | null |
| 2024-05-27 | Collective Perception Datasets for Autonomous Driving: A Comprehensive Review | Sven Teufel et.al. | 2405.16973 | null |
| 2024-05-27 | Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models | Qian Wang et.al. | 2405.16947 | null |
| 2024-05-27 | A re-calibration method for object detection with multi-modal alignment bias in autonomous driving | Zhihang Song et.al. | 2405.16848 | null |
| 2024-05-26 | Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning | Neha Kalibhat et.al. | 2405.16401 | null |
| 2024-05-25 | Video Prediction Models as General Visual Encoders | James Maier et.al. | 2405.16382 | null |
| 2024-05-25 | BOLD: Boolean Logic Deep Learning | Van Minh Nguyen et.al. | 2405.16339 | null |
| 2024-05-25 | Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation | Huizhou Chen et.al. | 2405.16099 | null |
| 2024-05-25 | Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality | Hakim Ikebayashi et.al. | 2405.16008 | null |
| 2024-05-24 | Visualize and Paint GAN Activations | Rudolf Herdt et.al. | 2405.15636 | null |
| 2024-05-24 | Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets | Hoàng-Ân Lê et.al. | 2405.15394 | null |
| 2024-05-24 | Autonomous Quilt Spreading for Caregiving Robots | Yuchun Guo et.al. | 2405.15373 | null |
| 2024-05-24 | U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation | Bingyu Li et.al. | 2405.15365 | link |
| 2024-05-24 | Cross-Domain Few-Shot Semantic Segmentation via Doubly Matching Transformation | Jiayi Chen et.al. | 2405.15265 | null |
| 2024-05-23 | Mamba-R: Vision Mamba ALSO Needs Registers | Feng Wang et.al. | 2405.14858 | null |
| 2024-05-23 | Efficient Robot Learning for Perception and Mapping | Niclas Vödisch et.al. | 2405.14688 | null |
| 2024-05-23 | Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation | Daniel Kienzle et.al. | 2405.14467 | null |
| 2024-05-23 | MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models | Jiuming Liu et.al. | 2405.14338 | null |
| 2024-05-23 | Tuning-free Universally-Supervised Semantic Segmentation | Xiaobo Yang et.al. | 2405.14294 | null |
| 2024-05-23 | SCMix: Stochastic Compound Mixing for Open Compound Domain Adaptation in Semantic Segmentation | Kai Yao et.al. | 2405.14278 | null |
| 2024-05-23 | Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations | Mohammed Baharoon et.al. | 2405.14239 | null |
| 2024-05-23 | Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification | Taylor Archibald et.al. | 2405.14162 | null |
| 2024-05-23 | Skip-SCAR: A Modular Approach to ObjectGoal Navigation with Sparsity and Adaptive Skips | Yaotian Liu et.al. | 2405.14154 | null |
| 2024-05-22 | TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System | Diogo Lavado et.al. | 2405.13989 | null |
| 2024-05-21 | Transparency Distortion Robustness for SOTA Image Segmentation Tasks | Volker Knauthe et.al. | 2405.12864 | null |
| 2024-05-20 | A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation | Sushmita Sarker et.al. | 2405.11903 | null |
| 2024-05-20 | Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments | Jooyong Park et.al. | 2405.11855 | null |
| 2024-05-20 | Improving the Explain-Any-Concept by Introducing Nonlinearity to the Trainable Surrogate Model | Mounes Zaval et.al. | 2405.11837 | null |
| 2024-05-20 | Universal Organizer of SAM for Unsupervised Semantic Segmentation | Tingting Li et.al. | 2405.11742 | null |
| 2024-05-19 | Interpreting a Semantic Segmentation Model for Coastline Detection | Conor O’Sullivan et.al. | 2405.11500 | null |
| 2024-05-19 | Unifying 3D Vision-Language Understanding via Promptable Queries | Ziyu Zhu et.al. | 2405.11442 | null |
| 2024-05-18 | PS6D: Point Cloud Based Symmetry-Aware 6D Object Pose Estimation in Robot Bin-Picking | Yifan Yang et.al. | 2405.11257 | null |
| 2024-05-17 | CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation | Mushui Liu et.al. | 2405.10530 | link |
| 2024-05-16 | 4D Panoptic Scene Graph Generation | Jingkang Yang et.al. | 2405.10305 | link |
| 2024-05-16 | Towards Task-Compatible Compressible Representations | Anderson de Andrade et.al. | 2405.10244 | link |
| 2024-05-16 | DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data | Chengxiang Fan et.al. | 2405.10185 | link |
| 2024-05-16 | An Integrated Framework for Multi-Granular Explanation of Video Summarization | Konstantinos Tsigos et.al. | 2405.10082 | null |
| 2024-05-16 | A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance | Andrea Matteazzi et.al. | 2405.10046 | null |
| 2024-05-16 | Towards Realistic Incremental Scenario in Class Incremental Semantic Segmentation | Jihwan Kwak et.al. | 2405.09858 | null |
| 2024-05-15 | Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation | Guo Yachan et.al. | 2405.09682 | null |
| 2024-05-14 | CLIP with Quality Captions: A Strong Pretraining for Vision Tasks | Pavan Kumar Anasosalu Vasu et.al. | 2405.08911 | null |
| 2024-05-14 | Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study | Qinfeng Zhu et.al. | 2405.08493 | null |
| 2024-05-14 | TEDNet: Twin Encoder Decoder Neural Network for 2D Camera and LiDAR Road Detection | Martín Bayón-Gutiérrez et.al. | 2405.08429 | link |
| 2024-05-13 | IMAFD: An Interpretable Multi-stage Approach to Flood Detection from time series Multispectral Data | Ziyang Zhang et.al. | 2405.07916 | null |
| 2024-05-13 | PLUTO: Pathology-Universal Transformer | Dinkar Juyal et.al. | 2405.07905 | null |
| 2024-05-12 | PotatoGANs: Utilizing Generative Adversarial Networks, Instance Segmentation, and Explainable AI for Enhanced Potato Disease Identification and Classification | Mohammad Shafiul Alam et.al. | 2405.07332 | link |
| 2024-05-12 | Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception | Haoming Chen et.al. | 2405.07201 | null |
| 2024-05-11 | Global Motion Understanding in Large-Scale Video Object Segmentation | Volodymyr Fedynyak et.al. | 2405.07031 | null |
| 2024-05-10 | GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs | Mustafa Munir et.al. | 2405.06849 | link |
| 2024-05-10 | Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach | Elham Ravanbakhsh et.al. | 2405.06586 | null |
| 2024-05-10 | Semantic and Spatial Adaptive Pixel-level Classifier for Semantic Segmentation | Xiaowen Ma et.al. | 2405.06525 | link |
| 2024-05-10 | Multi-Target Unsupervised Domain Adaptation for Semantic Segmentation without External Data | Yonghao Xu et.al. | 2405.06502 | null |
| 2024-05-10 | Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data | Rongyu Zhang et.al. | 2405.06413 | null |
| 2024-05-10 | Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation | Zhenliang Ni et.al. | 2405.06228 | link |
| 2024-05-10 | Zero-shot Degree of Ill-posedness Estimation for Active Small Object Change Detection | Koji Takeda et.al. | 2405.06185 | null |
| 2024-05-10 | Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging | Zhuchen Shao et.al. | 2405.06175 | null |
| 2024-05-09 | Mask-TS Net: Mask Temperature Scaling Uncertainty Calibration for Polyp Segmentation | Yudian Zhang et.al. | 2405.05830 | null |
| 2024-05-09 | CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks | Nick et.al. | 2405.05755 | null |
| 2024-05-08 | OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies | Lingdong Kong et.al. | 2405.05259 | link |
| 2024-05-08 | Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving | Lingdong Kong et.al. | 2405.05258 | link |
| 2024-05-08 | Weakly-supervised Semantic Segmentation via Dual-stream Contrastive Learning of Cross-image Contextual Information | Qi Lai et.al. | 2405.04913 | null |
| 2024-05-08 | DeepDamageNet: A two-step deep-learning model for multi-disaster building damage segmentation and classification using satellite imagery | Irene Alisjahbana et.al. | 2405.04800 | null |
| 2024-05-07 | A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images | László Kopácsi et.al. | 2405.04650 | null |
| 2024-05-07 | FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse Landscapes | Charles Gaydon et.al. | 2405.04634 | link |
| 2024-05-07 | AugmenTory: A Fast and Flexible Polygon Augmentation Library | Tanaz Ghahremani et.al. | 2405.04442 | null |
| 2024-05-07 | A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields | Raiyan Rahman et.al. | 2405.04305 | null |
| 2024-05-07 | ELiTe: Efficient Image-to-LiDAR Knowledge Transfer for Semantic Segmentation | Zhibo Zhang et.al. | 2405.04121 | null |
| 2024-05-07 | Structured Click Control in Transformer-based Interactive Segmentation | Long Xu et.al. | 2405.04009 | link |
| 2024-05-06 | PTQ4SAM: Post-Training Quantization for Segment Anything | Chengtao Lv et.al. | 2405.03144 | link |
| 2024-05-04 | MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning | Vishal Nedungadi et.al. | 2405.02771 | null |
| 2024-05-04 | Few-Shot Fruit Segmentation via Transfer Learning | Jordan A. James et.al. | 2405.02556 | null |
| 2024-05-03 | Panoptic-SLAM: Visual SLAM in Dynamic Environments using Panoptic Segmentation | Gabriel Fischer Abati et.al. | 2405.02177 | null |
| 2024-05-03 | Towards general deep-learning-based tree instance segmentation models | Jonathan Henrich et.al. | 2405.02061 | null |
| 2024-05-03 | DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model | Peijin Jia et.al. | 2405.02008 | null |
| 2024-05-02 | Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey | Guoping Xu et.al. | 2405.01725 | link |
| 2024-05-02 | Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey | Rokas Gipiškis et.al. | 2405.01636 | null |
| 2024-05-02 | CromSS: Cross-modal pre-training with noisy labels for remote sensing image segmentation | Chenying Liu et.al. | 2405.01217 | null |
| 2024-05-02 | Uncertainty-aware self-training with expectation maximization basis transformation | Zijia Wang et.al. | 2405.01175 | null |
| 2024-05-01 | GraCo: Granularity-Controllable Interactive Segmentation | Yian Zhao et.al. | 2405.00587 | null |
| 2024-05-01 | Exploring Self-Supervised Vision Transformers for Deepfake Detection: A Comparative Analysis | Huy H. Nguyen et.al. | 2405.00355 | null |
| 2024-04-30 | Masked Multi-Query Slot Attention for Unsupervised Object Discovery | Rishav Pramanik et.al. | 2404.19654 | link |
| 2024-04-30 | UniFS: Universal Few-shot Instance Perception with Point Representations | Sheng Jin et.al. | 2404.19401 | null |
| 2024-04-30 | DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical Documents | Taylor Archibald et.al. | 2404.19259 | null |
| 2024-04-29 | Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing | Leonardo Rossi et.al. | 2404.18924 | null |
| 2024-04-29 | IPixMatch: Boost Semi-supervised Semantic Segmentation with Inter-Pixel Relation | Kebin Wu et.al. | 2404.18891 | null |
| 2024-04-29 | From Density to Geometry: YOLOv8 Instance Segmentation for Reverse Engineering of Optimized Structures | Thomas Rochefort-Beaudoin et.al. | 2404.18763 | null |
| 2024-04-29 | Towards Long-term Robotics in the Wild | Stephen Hausler et.al. | 2404.18477 | null |
| 2024-04-29 | Clicks2Line: Using Lines for Interactive Image Segmentation | Chaewon Lee et.al. | 2404.18461 | null |
| 2024-04-29 | MFP: Making Full Use of Probability Maps for Interactive Image Segmentation | Chaewon Lee et.al. | 2404.18448 | null |
| 2024-04-28 | Panoptic Segmentation and Labelling of Lumbar Spine Vertebrae using Modified Attention Unet | Rikathi Pal et.al. | 2404.18291 | null |
| 2024-04-28 | Garbage Segmentation and Attribute Analysis by Robotic Dogs | Nuo Xu et.al. | 2404.18112 | null |
| 2024-04-27 | Multi-Stream Cellular Test-Time Adaptation of Real-Time Models Evolving in Dynamic Environments | Benoît Gérin et.al. | 2404.17930 | link |
| 2024-04-27 | GLIMS: Attention-Guided Lightweight Multi-Scale Hybrid Network for Volumetric Semantic Segmentation | Ziya Ata Yazıcı et.al. | 2404.17854 | link |
| 2024-04-26 | Optimizing Universal Lesion Segmentation: State Space Model-Guided Hierarchical Networks with Feature Importance Adjustment | Kazi Shahriar Sanjid et.al. | 2404.17235 | null |
| 2024-04-25 | Calculation of Femur Caput Collum Diaphyseal angle for X-Rays images using Semantic Segmentation | Deepak Bhatia et.al. | 2404.17083 | null |
| 2024-04-25 | Boosting Unsupervised Semantic Segmentation with Principal Mask Proposals | Oliver Hahn et.al. | 2404.16818 | link |
| 2024-04-25 | Self-Balanced R-CNN for Instance Segmentation | Leonardo Rossi et.al. | 2404.16633 | link |
| 2024-04-26 | Multi-Scale Representations by Varying Window Attention for Semantic Segmentation | Haotian Yan et.al. | 2404.16573 | link |
| 2024-04-25 | 360SFUDA++: Towards Source-free UDA for Panoramic Segmentation by Learning Reliable Category Prototypes | Xu Zheng et.al. | 2404.16501 | null |
| 2024-04-25 | Semantic Segmentation Refiner for Ultrasound Applications with Zero-Shot Foundation Models | Hedda Cohen Indelman et.al. | 2404.16325 | null |
| 2024-04-25 | Style Adaptation for Domain-adaptive Semantic Segmentation | Ting Li et.al. | 2404.16301 | null |
| 2024-04-25 | A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation | Yifan Zhao et.al. | 2404.16266 | link |
| 2024-04-24 | Does SAM dream of EIG? Characterizing Interactive Segmenter Performance using Expected Information Gain | Kuan-I Chung et.al. | 2404.16155 | null |
| 2024-04-24 | 3D Freehand Ultrasound using Visual Inertial and Deep Inertial Odometry for Measuring Patellar Tracking | Russell Buchanan et.al. | 2404.15847 | null |
| 2024-04-24 | Vision Transformer-based Adversarial Domain Adaptation | Yahan Li et.al. | 2404.15817 | link |
| 2024-04-23 | PRISM: A Promptable and Robust Interactive Segmentation Model with Visual Prompts | Hao Li et.al. | 2404.15028 | link |
| 2024-04-23 | Unknown Object Grasping for Assistive Robotics | Elle Miller et.al. | 2404.15001 | null |
| 2024-04-22 | Surgical-DeSAM: Decoupling SAM for Instrument Segmentation in Robotic Surgery | Yuyang Sheng et.al. | 2404.14040 | link |
| 2024-04-22 | OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks | Sophia Sirko-Galouchenko et.al. | 2404.14027 | null |
| 2024-04-22 | PM-VIS: High-Performance Box-Supervised Video Instance Segmentation | Zhangjing Yang et.al. | 2404.13863 | null |
| 2024-04-21 | Semantic-Rearrangement-Based Multi-Level Alignment for Domain Generalized Segmentation | Guanlong Jiao et.al. | 2404.13701 | null |
| 2024-04-21 | PV-S3: Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images | Abhishek Jha et.al. | 2404.13693 | null |
| 2024-04-21 | A Complete System for Automated 3D Semantic-Geometric Mapping of Corrosion in Industrial Environments | Rui Pimentel de Figueiredo et.al. | 2404.13691 | null |
| 2024-04-21 | LMFNet: An Efficient Multimodal Fusion Approach for Semantic Segmentation in High-Resolution Remote Sensing | Tong Wang et.al. | 2404.13659 | null |
| 2024-04-21 | Towards Unified Representation of Multi-Modal Pre-training for 3D Understanding via Differentiable Rendering | Ben Fei et.al. | 2404.13619 | null |
| 2024-04-20 | FisheyeDetNet: Object Detection on Fisheye Surround View Camera Systems for Automated Driving | Ganesh Sistu et.al. | 2404.13443 | null |
| 2024-04-20 | AMMUNet: Multi-Scale Attention Map Merging for Remote Sensing Image Segmentation | Yang Yang et.al. | 2404.13408 | null |
| 2024-04-19 | Nuclei Instance Segmentation of Cryosectioned H&E Stained Histological Images using Triple U-Net Architecture | Zarif Ahmed et.al. | 2404.12986 | null |
| 2024-04-19 | FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving | Xingtai Gui et.al. | 2404.12867 | null |
| 2024-04-19 | Foundation Model assisted Weakly Supervised LiDAR Semantic Segmentation | Yilong Chen et.al. | 2404.12861 | null |
| 2024-04-19 | COIN: Counterfactual inpainting for weakly supervised semantic segmentation for medical images | Dmytro Shvetsov et.al. | 2404.12832 | link |
| 2024-04-19 | A Point-Based Approach to Efficient LiDAR Multi-Task Perception | Christopher Lang et.al. | 2404.12798 | null |
| 2024-04-19 | Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework | Zhuohong Li et.al. | 2404.12721 | link |
| 2024-04-19 | Improving Prediction Accuracy of Semantic Segmentation Methods Using Convolutional Autoencoder Based Pre-processing Layers | Hisashi Shimodaira et.al. | 2404.12718 | null |
| 2024-04-19 | Show and Grasp: Few-shot Semantic Segmentation for Robot Grasping through Zero-shot Foundation Models | Leonardo Barcellona et.al. | 2404.12717 | null |
| 2024-04-18 | Spot-Compose: A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds | Oliver Lemke et.al. | 2404.12440 | null |
| 2024-04-18 | A Perspective on Deep Vision Performance with Standard Image and Video Codecs | Christoph Reich et.al. | 2404.12330 | null |
| 2024-04-18 | Performance Evaluation of Segment Anything Model with Variational Prompting for Application to Non-Visible Spectrum Imagery | Yona Falinie A. Gaus et.al. | 2404.12285 | null |
| 2024-04-18 | Deep Gaussian mixture model for unsupervised image segmentation | Matthias Schwab et.al. | 2404.12252 | null |
| 2024-04-18 | Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training | Jin Gao et.al. | 2404.12210 | link |
| 2024-04-18 | How to Benchmark Vision Foundation Models for Semantic Segmentation? | Tommie Kerssies et.al. | 2404.12172 | null |
| 2024-04-17 | Mushroom Segmentation and 3D Pose Estimation from Point Clouds using Fully Convolutional Geometric Features and Implicit Pose Encoding | George Retsinas et.al. | 2404.12144 | link |
| 2024-04-18 | Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation | Chongjie Si et.al. | 2404.11981 | null |
| 2024-04-18 | The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models | Cheng Shi et.al. | 2404.11957 | link |
| 2024-04-18 | Group-On: Boosting One-Shot Segmentation with Supportive Query | Hanjing Zhou et.al. | 2404.11871 | null |
| 2024-04-17 | Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach | Mir Rayat Imtiaz Hossain et.al. | 2404.11732 | null |
| 2024-04-17 | A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching | Francesco Pro et.al. | 2404.11302 | link |
| 2024-04-17 | Learning from Unlabelled Data with Transformers: Domain Adaptation for Semantic Segmentation of High Resolution Aerial Images | Nikolaos Dionelis et.al. | 2404.11299 | link |
| 2024-04-17 | Criteria for Uncertainty-based Corner Cases Detection in Instance Segmentation | Florian Heidecker et.al. | 2404.11266 | null |
| 2024-04-16 | A Concise Tiling Strategy for Preserving Spatial Context in Earth Observation Imagery | Ellianna Abrahams et.al. | 2404.10927 | link |
| 2024-04-16 | Vocabulary-free Image Classification and Semantic Segmentation | Alessandro Conti et.al. | 2404.10864 | link |
| 2024-04-16 | Gasformer: A Transformer-based Architecture for Segmenting Methane Emissions from Livestock in Optical Gas Imaging | Toqi Tahamid Sarker et.al. | 2404.10841 | link |
| 2024-04-16 | Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark | Jiangning Zhang et.al. | 2404.10760 | null |
| 2024-04-16 | ECLAIR: A High-Fidelity Aerial LiDAR Dataset for Semantic Segmentation | Iaroslav Melekhov et.al. | 2404.10699 | null |
| 2024-04-16 | Contextrast: Contextual Contrastive Learning for Semantic Segmentation | Changki Sung et.al. | 2404.10633 | null |
| 2024-04-16 | Label merge-and-split: A graph-colouring approach for memory-efficient brain parcellation | Aaron Kujawa et.al. | 2404.10572 | null |
| 2024-04-16 | LAECIPS: Large Vision Model Assisted Adaptive Edge-Cloud Collaboration for IoT-based Perception System | Shijing Hu et.al. | 2404.10498 | null |
| 2024-04-16 | Adversarial Identity Injection for Semantic Face Image Synthesis | Giuseppe Tarollo et.al. | 2404.10408 | null |
| 2024-04-16 | Domain-Rectifying Adapter for Cross-Domain Few-Shot Segmentation | Jiapeng Su et.al. | 2404.10322 | null |
| 2024-04-16 | Learnable Prompt for Few-Shot Semantic Segmentation in Remote Sensing Domain | Steve Andreas Immanuel et.al. | 2404.10307 | link |
| 2024-04-15 | NOISe: Nuclei-Aware Osteoclast Instance Segmentation for Mouse-to-Human Domain Transfer | Sai Kumar Reddy Manne et.al. | 2404.10130 | link |
| 2024-04-15 | Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL | Fangwei Zhong et.al. | 2404.09857 | null |
| 2024-04-15 | In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation | Han Xue et.al. | 2404.09633 | null |
| 2024-04-15 | The revenge of BiSeNet: Efficient Multi-Task Image Segmentation | Gabriele Rosi et.al. | 2404.09570 | null |
| 2024-04-15 | kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies | Zhongrui Gui et.al. | 2404.09447 | null |
| 2024-04-15 | Human-in-the-Loop Segmentation of Multi-species Coral Imagery | Scarlett Raine et.al. | 2404.09406 | null |
| 2024-04-14 | Bridging Data Islands: Geographic Heterogeneity-Aware Federated Learning for Collaborative Remote Sensing Semantic Segmentation | Jieyi Tan et.al. | 2404.09292 | null |
| 2024-04-12 | Structured Model Pruning for Efficient Inference in Computational Pathology | Mohammed Adnan et.al. | 2404.08831 | null |
| 2024-04-12 | COCONut: Modernizing COCO Segmentation | Xueqing Deng et.al. | 2404.08639 | null |
| 2024-04-12 | Benchmarking the Cell Image Segmentation Models Robustness under the Microscope Optical Aberrations | Boyuan Peng et.al. | 2404.08549 | null |
| 2024-04-12 | Analyzing Decades-Long Environmental Changes in Namibia Using Archival Aerial Photography and Deep Learning | Girmaw Abebe Tadesse et.al. | 2404.08544 | null |
| 2024-04-12 | LaSagnA: Language-based Segmentation Assistant for Complex Queries | Cong Wei et.al. | 2404.08506 | link |
| 2024-04-12 | Adapting the Segment Anything Model During Usage in Novel Situations | Robin Schön et.al. | 2404.08421 | null |
| 2024-04-12 | Let It Flow: Simultaneous Optimization of 3D Flow and Object Clustering | Patrik Vacek et.al. | 2404.08363 | null |
| 2024-04-12 | AdaContour: Adaptive Contour Descriptor with Hierarchical Representation | Tianyu Ding et.al. | 2404.08292 | null |
| 2024-04-12 | Tackling Ambiguity from Perspective of Uncertainty Inference and Affinity Diversification for Weakly Supervised Semantic Segmentation | Zhiwei Yang et.al. | 2404.08195 | link |
| 2024-04-12 | Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation | Sina Hajimiri et.al. | 2404.08181 | link |
| 2024-04-11 | Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification | Ricardo Pereira et.al. | 2404.07739 | null |
| 2024-04-11 | OpenTrench3D: A Photogrammetric 3D Point Cloud Dataset for Semantic Segmentation of Underground Utilities | Lasse H. Hansen et.al. | 2404.07711 | link |
| 2024-04-11 | ViM-UNet: Vision Mamba for Biomedical Segmentation | Anwai Archit et.al. | 2404.07705 | link |
| 2024-04-11 | Implicit and Explicit Language Guidance for Diffusion-based Visual Perception | Hefeng Wang et.al. | 2404.07600 | null |
| 2024-04-11 | Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling | Sourajit Saha et.al. | 2404.07410 | null |
| 2024-04-10 | AI-Guided Defect Detection Techniques to Model Single Crystal Diamond Growth | Rohan Reddy Mekala et.al. | 2404.07306 | null |
| 2024-04-10 | RESSCAL3D: Resolution Scalable 3D Semantic Segmentation of Point Clouds | Remco Royen et.al. | 2404.06863 | null |
| 2024-04-10 | O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation | Muer Tie et.al. | 2404.06836 | null |
| 2024-04-10 | Convolution-based Probability Gradient Loss for Semantic Segmentation | Guohang Shan et.al. | 2404.06704 | null |
| 2024-04-09 | Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation | Luca Barsellotti et.al. | 2404.06542 | null |
| 2024-04-09 | QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding | Yash Mehan et.al. | 2404.06442 | null |
| 2024-04-09 | DaF-BEVSeg: Distortion-aware Fisheye Camera based Bird’s Eye View Segmentation with Occlusion Reasoning | Senthil Yogamani et.al. | 2404.06352 | null |
| 2024-04-09 | Automated National Urban Map Extraction | Hasan Nasrallah et.al. | 2404.06202 | null |
| 2024-04-09 | Hierarchical Insights: Exploiting Structural Similarities for Reliable 3D Semantic Segmentation | Mariella Dreissig et.al. | 2404.06124 | null |
| 2024-04-09 | Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation | Zong-Wei Hong et.al. | 2404.06029 | null |
| 2024-04-08 | Evaluating the Efficacy of Cut-and-Paste Data Augmentation in Semantic Segmentation for Satellite Imagery | Ionut M. Motoi et.al. | 2404.05693 | null |
| 2024-04-08 | AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation | Jiannan Ge et.al. | 2404.05667 | null |
| 2024-04-08 | Impact of LiDAR visualisations on semantic segmentation of archaeological objects | Raveerat Jaturapitpornchai et.al. | 2404.05512 | null |
| 2024-04-08 | Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance | Dazhong Shen et.al. | 2404.05384 | link |
| 2024-04-08 | GPS-free Autonomous Navigation in Cluttered Tree Rows with Deep Semantic Segmentation | Alessandro Navone et.al. | 2404.05338 | null |
| 2024-04-08 | Human Detection from 4D Radar Data in Low-Visibility Field Conditions | Mikael Skog et.al. | 2404.05307 | null |
| 2024-04-08 | iVPT: Improving Task-relevant Information Sharing in Visual Prompt Tuning by Cross-layer Dynamic Connection | Nan Zhou et.al. | 2404.05207 | null |
| 2024-04-08 | UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather | Haimei Zhao et.al. | 2404.05145 | null |
| 2024-04-07 | D2SL: Decouple Defogging and Semantic Learning for Foggy Domain-Adaptive Segmentation | Xuan Sun et.al. | 2404.04807 | null |
| 2024-04-06 | HawkDrive: A Transformer-driven Visual Perception System for Autonomous Driving in Night Scene | Ziang Guo et.al. | 2404.04653 | link |
| 2024-04-05 | Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation | Zifu Wan et.al. | 2404.04256 | link |
| 2024-04-05 | Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation | Ji-Jia Wu et.al. | 2404.04231 | link |
| 2024-04-05 | MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor and Connector | Junbo Li et.al. | 2404.04155 | null |
| 2024-04-04 | Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation | Elham Amin Mansour et.al. | 2404.03799 | null |
| 2024-04-04 | Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball | Simon Weber et.al. | 2404.03778 | null |
| 2024-04-04 | OW-VISCap: Open-World Video Instance Segmentation and Captioning | Anwesa Choudhuri et.al. | 2404.03657 | null |
| 2024-04-04 | Background Noise Reduction of Attention Map for Weakly Supervised Semantic Segmentation | Izumi Fujimori et.al. | 2404.03394 | null |
| 2024-04-04 | iSeg: Interactive 3D Segmentation via Interactive Attention | Itai Lang et.al. | 2404.03219 | null |
| 2024-04-04 | CORP: A Multi-Modal Dataset for Campus-Oriented Roadside Perception Tasks | Beibei Wang et.al. | 2404.03191 | null |
| 2024-04-03 | GPU-Accelerated RSF Level Set Evolution for Large-Scale Microvascular Segmentation | Meher Niger et.al. | 2404.02813 | null |
| 2024-04-03 | RS-Mamba for Large Remote Sensing Image Dense Prediction | Sijie Zhao et.al. | 2404.02668 | link |
| 2024-04-03 | A Satellite Band Selection Framework for Amazon Forest Deforestation Detection Task | Eduardo Neto et.al. | 2404.02659 | null |
| 2024-04-03 | SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation | Junyan Ye et.al. | 2404.02638 | link |
| 2024-04-03 | Active learning for efficient annotation in precision agriculture: a use-case on crop-weed semantic segmentation | Bart M. van Marrewijk et.al. | 2404.02580 | null |
| 2024-04-03 | HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Zhongyu Xia et.al. | 2404.02517 | link |
| 2024-04-03 | Optimizing traffic signs and lights visibility for the teleoperation of autonomous vehicles through ROI compression | I. Dror et.al. | 2404.02481 | null |
| 2024-04-03 | RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation | Xianping Ma et.al. | 2404.02457 | link |
| 2024-04-02 | Constrained Robotic Navigation on Preferred Terrains Using LLMs and Speech Instruction: Exploiting the Power of Adverbs | Faraz Lotfi et.al. | 2404.02294 | null |
| 2024-04-02 | Segment Any 3D Object with Language | Seungjun Lee et.al. | 2404.02157 | link |
| 2024-04-02 | Multi-Level Label Correction by Distilling Proximate Patterns for Semi-supervised Semantic Segmentation | Hui Xiao et.al. | 2404.02065 | null |
| 2024-04-01 | What is Point Supervision Worth in Video Instance Segmentation? | Shuaiyi Huang et.al. | 2404.01990 | null |
| 2024-04-02 | Synthetic Data for Robust Stroke Segmentation | Liam Chalcroft et.al. | 2404.01946 | link |
| 2024-04-02 | Improving Bird’s Eye View Semantic Segmentation by Task Decomposition | Tianhao Zhao et.al. | 2404.01925 | null |
| 2024-04-02 | Rethinking Annotator Simulation: Realistic Evaluation of Whole-Body PET Lesion Interactive Segmentation Methods | Zdravko Marinov et.al. | 2404.01816 | null |
| 2024-04-02 | Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model | Qinfeng Zhu et.al. | 2404.01705 | link |
| 2024-04-02 | Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss | Jaeha Kim et.al. | 2404.01692 | null |
| 2024-04-02 | JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments | Duy-Tho Le et.al. | 2404.01686 | null |
| 2024-04-01 | SUGAR: Pre-training 3D Visual Representations for Robotics | Shizhe Chen et.al. | 2404.01491 | null |
| 2024-03-29 | ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning | Beomyoung Kim et.al. | 2403.20126 | link |
| 2024-03-29 | Modeling Weather Uncertainty for Multi-weather Co-Presence Estimation | Qi Bi et.al. | 2403.20092 | null |
| 2024-03-29 | Using Images as Covariates: Measuring Curb Appeal with Deep Learning | Ardyn Nordstrom et.al. | 2403.19915 | null |
| 2024-03-29 | MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection | Ali Behrouz et.al. | 2403.19888 | link |
| 2024-03-28 | Segmentation Re-thinking Uncertainty Estimation Metrics for Semantic Segmentation | Qitian Ma et.al. | 2403.19826 | null |
| 2024-04-01 | Efficient 3D Instance Mapping and Localization with Neural Fields | George Tang et.al. | 2403.19797 | null |
| 2024-03-28 | ENet-21: An Optimized light CNN Structure for Lane Detection | Seyed Rasoul Hosseini et.al. | 2403.19782 | null |
| 2024-03-29 | Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers | Pingcheng Dong et.al. | 2403.19591 | link |
| 2024-03-28 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | Donghyun Kim et.al. | 2403.19588 | link |
| 2024-03-28 | Learning Multiple Representations with Inconsistency-Guided Detail Regularization for Mask-Guided Matting | Weihao Jiang et.al. | 2403.19213 | null |
| 2024-03-27 | Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D | Mukund Varma T et.al. | 2403.18922 | null |
| 2024-03-27 | Annolid: Annotate, Segment, and Track Anything You Need | Chen Yang et.al. | 2403.18690 | null |
| 2024-03-27 | I2CKD : Intra- and Inter-Class Knowledge Distillation for Semantic Segmentation | Ayoub Karine et.al. | 2403.18490 | null |
| 2024-03-28 | ViTAR: Vision Transformer with Any Resolution | Qihang Fan et.al. | 2403.18361 | null |
| 2024-03-27 | Generating Diverse Agricultural Data for Vision-Based Farming Applications | Mikolaj Cieslak et.al. | 2403.18351 | null |
| 2024-03-27 | Road Obstacle Detection based on Unknown Objectness Scores | Chihiro Noguchi et.al. | 2403.18207 | null |
| 2024-03-26 | Spectral Convolutional Transformer: Harmonizing Real vs. Complex Multi-View Spectral Operators for Vision Transformer | Badri N. Patro et.al. | 2403.18063 | link |
| 2024-03-26 | The Need for Speed: Pruning Transformers with One Recipe | Samir Khaki et.al. | 2403.17921 | link |
| 2024-03-26 | Compressed Multi-task embeddings for Data-Efficient Downstream training and inference in Earth Observation | Carlos Gomes et.al. | 2403.17886 | null |
| 2024-03-26 | PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition | Chenhongyi Yang et.al. | 2403.17695 | link |
| 2024-03-26 | Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion | Kazi Shahriar Sanjid et.al. | 2403.17432 | null |
| 2024-03-25 | Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions | Ye Li et.al. | 2403.17009 | link |
| 2024-03-25 | DreamLIP: Language-Image Pre-training with Long Captions | Kecheng Zheng et.al. | 2403.17007 | link |
| 2024-03-25 | TwinLiteNetPlus: A Stronger Model for Real-time Drivable Area and Lane Segmentation | Quang-Huy Che et.al. | 2403.16958 | null |
| 2024-03-25 | HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation | Linglin Jing et.al. | 2403.16788 | null |
| 2024-03-25 | Clustering Propagation for Universal Medical Image Segmentation | Yuhang Ding et.al. | 2403.16646 | null |
| 2024-03-25 | SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation | Aysim Toker et.al. | 2403.16605 | null |
| 2024-03-25 | Self-Supervised Learning for Medical Image Data with Anatomy-Oriented Imaging Planes | Tianwei Zhang et.al. | 2403.16499 | null |
| 2024-03-25 | GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation | Weiming Zhang et.al. | 2403.16370 | null |
| 2024-03-24 | AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D Scans | Cedric Perauer et.al. | 2403.16318 | null |
| 2024-03-24 | Dual-modal Prior Semantic Guided Infrared and Visible Image Fusion for Intelligent Transportation System | Jing Li et.al. | 2403.16227 | null |
| 2024-03-24 | Segment Anything Model for Road Network Graph Extraction | Congrui Hetang et.al. | 2403.16051 | link |
| 2024-03-24 | SM2C: Boost the Semi-supervised Segmentation for Medical Image by using Meta Pseudo Labels and Mixed Images | Yifei Wang et.al. | 2403.16009 | null |
| 2024-03-22 | Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting | Jun Guo et.al. | 2403.15624 | null |
| 2024-03-22 | A2DMN: Anatomy-Aware Dilated Multiscale Network for Breast Ultrasound Semantic Segmentation | Kyle Lucke et.al. | 2403.15560 | null |
| 2024-03-22 | InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding | Yi Wang et.al. | 2403.15377 | link |
| 2024-03-22 | Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations | Pranav Kulkarni et.al. | 2403.15218 | null |
| 2024-03-22 | Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion | Sofia Casarin et.al. | 2403.15194 | null |
| 2024-03-22 | IFSENet : Harnessing Sparse Iterations for Interactive Few-shot Segmentation Excellence | Shreyas Chandgothia et.al. | 2403.15089 | null |
| 2024-03-22 | Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model using 3D Whole-body CT Scans | Heng Guo et.al. | 2403.15063 | null |
| 2024-03-22 | BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation | Jiahao Lu et.al. | 2403.15019 | null |
| 2024-03-22 | Improve Cross-domain Mixed Sampling with Guidance Training for Adaptive Segmentation | Wenlve Zhou et.al. | 2403.14995 | null |
| 2024-03-21 | WeatherProof: Leveraging Language Guidance for Semantic Segmentation in Adverse Weather | Blake Gella et.al. | 2403.14874 | null |
| 2024-03-21 | PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model | Zheng Zhang et.al. | 2403.14598 | link |
| 2024-03-21 | Learning to Project for Cross-Task Knowledge Distillation | Dylan Auty et.al. | 2403.14494 | null |
| 2024-03-21 | OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation | Bohao Peng et.al. | 2403.14418 | link |
| 2024-03-21 | Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models | Pablo Marcos-Manchón et.al. | 2403.14291 | link |
| 2024-03-21 | OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation | Kwanyoung Kim et.al. | 2403.14183 | null |
| 2024-03-21 | Evidential Semantic Mapping in Off-road Environments with Uncertainty-aware Bayesian Kernel Inference | Junyoung Kim et.al. | 2403.14138 | null |
| 2024-03-21 | Soft Masked Transformer for Point Cloud Processing with Skip Attention-Based Upsampling | Yong He et.al. | 2403.14124 | null |
| 2024-03-21 | Semantics from Space: Satellite-Guided Thermal Semantic Segmentation Annotation for Aerial Field Robots | Connor Lee et.al. | 2403.14056 | null |
| 2024-03-20 | When Cars meet Drones: Hyperbolic Federated Learning for Source-Free Domain Adaptation in Adverse Weather | Giulia Rizzoli et.al. | 2403.13762 | null |
| 2024-03-20 | Next day fire prediction via semantic segmentation | Konstantinos Alexis et.al. | 2403.13545 | null |
| 2024-03-20 | MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining | Di Wang et.al. | 2403.13430 | link |
| 2024-03-20 | AMCO: Adaptive Multimodal Coupling of Vision and Proprioception for Quadruped Robot Navigation in Outdoor Environments | Mohamed Elnoor et.al. | 2403.13235 | null |
| 2024-03-20 | Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation | Linshan Wu et.al. | 2403.13225 | link |
| 2024-03-19 | Reflectivity Is All You Need!: Advancing LiDAR Semantic Segmentation | Kasi Viswanath et.al. | 2403.13188 | null |
| 2024-03-19 | As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks? | Anjun Hu et.al. | 2403.12693 | null |
| 2024-03-19 | PCT: Perspective Cue Training Framework for Multi-Camera BEV Segmentation | Haruya Ishikawa et.al. | 2403.12530 | null |
| 2024-03-19 | Semantics, Distortion, and Style Matter: Towards Source-free UDA for Panoramic Segmentation | Xu Zheng et.al. | 2403.12505 | null |
| 2024-03-19 | CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation | Wenqi Zhu et.al. | 2403.12455 | link |
| 2024-03-19 | Multi-Object RANSAC: Efficient Plane Clustering Method in a Clutter | Seunghyeon Lim et.al. | 2403.12449 | null |
| 2024-03-18 | EffiPerception: an Efficient Framework for Various Perception Tasks | Xinhao Xiang et.al. | 2403.12317 | null |
| 2024-03-18 | Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Yuqi Zhang et.al. | 2403.11812 | null |
| 2024-03-18 | Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation | Wangbo Zhao et.al. | 2403.11808 | link |
| 2024-03-18 | LSKNet: A Foundation Lightweight Backbone for Remote Sensing | Yuxuan Li et.al. | 2403.11735 | null |
| 2024-03-18 | TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models | Lisa Weijler et.al. | 2403.11691 | null |
| 2024-03-18 | Better (pseudo-)labels for semi-supervised instance segmentation | François Porcher et.al. | 2403.11675 | null |
| 2024-03-18 | Synthesizing multi-log grasp poses | Arvid Fälldin et.al. | 2403.11623 | null |
| 2024-03-18 | OurDB: Ouroboric Domain Bridging for Multi-Target Domain Adaptive Semantic Segmentation | Seungbeom Woo et.al. | 2403.11582 | null |
| 2024-03-18 | MISS: Memory-efficient Instance Segmentation Framework By Visual Inductive Priors Flow Propagation | Chih-Chung Hsu et.al. | 2403.11576 | null |
| 2024-03-18 | Augment Before Copy-Paste: Data and Memory Efficiency-Oriented Instance Segmentation Framework for Sport-scenes | Chih-Chung Hsu et.al. | 2403.11572 | null |
| 2024-03-18 | Circle Representation for Medical Instance Object Segmentation | Juming Xiong et.al. | 2403.11507 | link |
| 2024-03-18 | MCD: Diverse Large-Scale Multi-Campus Dataset for Robot Perception | Thien-Minh Nguyen et.al. | 2403.11496 | null |
| 2024-03-18 | Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting | Mingkui Tan et.al. | 2403.11491 | null |
| 2024-03-18 | ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation | Minh Tran et.al. | 2403.11376 | null |
| 2024-03-14 | PosSAM: Panoptic Open-vocabulary Segment Anything | Vibashan VS et.al. | 2403.09620 | link |
| 2024-03-14 | WeakSurg: Weakly supervised surgical instrument segmentation using temporal equivariance and semantic continuity | Qiyuan Wang et.al. | 2403.09551 | null |
| 2024-03-14 | Annotation Free Semantic Segmentation with Vision Foundation Models | Soroush Seifi et.al. | 2403.09307 | null |
| 2024-03-14 | StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images | Robert Jewsbury et.al. | 2403.09302 | link |
| 2024-03-14 | Customizing Segmentation Foundation Model via Prompt Learning for Instance Segmentation | Hyung-Il Kim et.al. | 2403.09199 | null |
| 2024-03-14 | When Semantic Segmentation Meets Frequency Aliasing | Linwei Chen et.al. | 2403.09065 | link |
| 2024-03-13 | CART: Caltech Aerial RGB-Thermal Dataset in the Wild | Connor Lee et.al. | 2403.08997 | link |
| 2024-03-13 | SLCF-Net: Sequential LiDAR-Camera Fusion for Semantic Scene Completion using a 3D Recurrent U-Net | Helin Cao et.al. | 2403.08885 | null |
| 2024-03-13 | Segmentation of Knee Bones for Osteoarthritis Assessment: A Comparative Analysis of Supervised, Few-Shot, and Zero-Shot Learning Approaches | Yun Xin Teoh et.al. | 2403.08761 | null |
| 2024-03-13 | Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution | Samuel Sze et.al. | 2403.08748 | null |
| 2024-03-13 | Semantic Segmentation of Solar Radio Spikes at Low Frequencies | Pearse C. Murphy et.al. | 2403.08546 | null |
| 2024-03-13 | Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation | Zicheng Zhang et.al. | 2403.08426 | null |
| 2024-03-13 | LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving | Sicen Guo et.al. | 2403.08215 | null |
| 2024-03-13 | Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks | Fuzhi Wu et.al. | 2403.08157 | link |
| 2024-03-12 | Mitigating the Impact of Attribute Editing on Face Recognition | Sudipta Banerjee et.al. | 2403.08092 | null |
| 2024-03-12 | Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation | Feilong Tang et.al. | 2403.07630 | link |
| 2024-03-12 | PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution | Honghao Chen et.al. | 2403.07589 | null |
| 2024-03-12 | Open-World Semantic Segmentation Including Class Similarity | Matteo Sodano et.al. | 2403.07532 | null |
| 2024-03-11 | Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation | Theodore Barfoot et.al. | 2403.06759 | link |
| 2024-03-11 | Forest Inspection Dataset for Aerial Semantic Segmentation and Depth Estimation | Bianca-Cerasela-Zelia Blaga et.al. | 2403.06621 | link |
| 2024-03-11 | OMH: Structured Sparsity via Optimally Matched Hierarchy for Unsupervised Semantic Segmentation | Baran Ozaydin et.al. | 2403.06546 | null |
| 2024-03-11 | 3D Semantic Segmentation-Driven Representations for 3D Object Detection | Hayeon O et.al. | 2403.06501 | link |
| 2024-03-11 | Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy | Jiuming Liu et.al. | 2403.06467 | link |
| 2024-03-11 | Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation | Xiaoyang Wang et.al. | 2403.06462 | null |
| 2024-03-11 | Refining Segmentation On-the-Fly: An Interactive Framework for Point Cloud Semantic Segmentation | Peng Zhang et.al. | 2403.06401 | null |
| 2024-03-10 | Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning | Woo-Jin Ahn et.al. | 2403.06122 | link |
| 2024-03-09 | Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation | Hairong Shi et.al. | 2403.05912 | null |
| 2024-03-09 | Segmentation Guided Sparse Transformer for Under-Display Camera Image Restoration | Jingyun Xue et.al. | 2403.05906 | null |
| 2024-03-08 | Attention-guided Feature Distillation for Semantic Segmentation | Amir M. Mansourian et.al. | 2403.05451 | link |
| 2024-03-08 | Generalized Correspondence Matching via Flexible Hierarchical Refinement and Patch Descriptor Distillation | Yu Han et.al. | 2403.05388 | null |
| 2024-03-08 | Frequency-Adaptive Dilated Convolution for Semantic Segmentation | Linwei Chen et.al. | 2403.05369 | link |
| 2024-03-08 | Embedded Deployment of Semantic Segmentation in Medicine through Low-Resolution Inputs | Erik Ostrowski et.al. | 2403.05340 | null |
| 2024-03-08 | LVIC: Multi-modality segmentation by Lifting Visual Info as Cue | Zichao Dong et.al. | 2403.05159 | null |
| 2024-03-07 | SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising | Tao Zhou et.al. | 2403.04194 | link |
| 2024-03-06 | ECAP: Extensive Cut-and-Paste Augmentation for Unsupervised Domain Adaptive Semantic Segmentation | Erik Brorsson et.al. | 2403.03854 | link |
| 2024-03-06 | Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision | Yajie Liu et.al. | 2403.03707 | null |
| 2024-03-06 | Causal Prototype-inspired Contrast Adaptation for Unsupervised Domain Adaptive Semantic Segmentation of High-resolution Remote Sensing Imagery | Jingru Zhu et.al. | 2403.03704 | null |
| 2024-03-06 | GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding | Zi-Ting Chou et.al. | 2403.03608 | null |
| 2024-03-06 | Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator | Wonhyeok Choi et.al. | 2403.03468 | null |
| 2024-03-05 | CenterDisks: Real-time instance segmentation with disk covering | Katia Jodogne-Del Litto et.al. | 2403.03296 | link |
| 2024-03-05 | Improved LiDAR Odometry and Mapping using Deep Semantic Segmentation and Novel Outliers Detection | Mohamed Afifi et.al. | 2403.03111 | null |
| 2024-03-05 | ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving | Han Lu et.al. | 2403.02877 | null |
| 2024-03-05 | DDF: A Novel Dual-Domain Image Fusion Strategy for Remote Sensing Image Semantic Segmentation with Unsupervised Domain Adaptation | Lingyan Ran et.al. | 2403.02784 | null |
| 2024-03-05 | Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels | Zhuohong Li et.al. | 2403.02746 | null |
| 2024-03-05 | FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird’s-Eye View and Perspective View | Jiawei Hou et.al. | 2403.02710 | null |
| 2024-03-05 | Deep Common Feature Mining for Efficient Video Semantic Segmentation | Yaoyan Zheng et.al. | 2403.02689 | null |
| 2024-03-04 | Self-Supervised Facial Representation Learning with Facial Region Awareness | Zheng Gao et.al. | 2403.02138 | null |
| 2024-03-04 | Semi-Supervised Semantic Segmentation Based on Pseudo-Labels: A Survey | Lingyan Ran et.al. | 2403.01909 | null |
| 2024-03-04 | Map-aided annotation for pole base detection | Benjamin Missaoui et.al. | 2403.01868 | null |
| 2024-03-04 | AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation | Haonan Wang et.al. | 2403.01818 | link |
| 2024-03-02 | Benchmarking Segmentation Models with Mask-Preserved Attribute Editing | Zijin Yin et.al. | 2403.01231 | link |
| 2024-03-02 | Boosting Box-supervised Instance Segmentation with Pseudo Depth | Xinyi Yu et.al. | 2403.01214 | null |
| 2024-03-02 | Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation | Lian Xu et.al. | 2403.01156 | null |
| 2024-03-01 | Rethinking Few-shot 3D Point Cloud Semantic Segmentation | Zhaochong An et.al. | 2403.00592 | link |
| 2024-03-01 | Small, Versatile and Mighty: A Range-View Perception Framework | Qiang Meng et.al. | 2403.00325 | null |
| 2024-03-01 | YOLO-MED : Multi-Task Interaction Network for Biomedical Images | Suizhi Huang et.al. | 2403.00245 | null |
| 2024-02-29 | FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything | Safouane El Ghazouali et.al. | 2403.00175 | link |
| 2024-02-29 | Leveraging AI Predicted and Expert Revised Annotations in Interactive Segmentation: Continual Tuning or Full Training? | Tiezheng Zhang et.al. | 2402.19423 | null |
| 2024-03-01 | PEM: Prototype-based Efficient MaskFormer for Image Segmentation | Niccolò Cavagnero et.al. | 2402.19422 | link |
| 2024-02-29 | RSAM-Seg: A SAM-based Approach with Prior Knowledge Integration for Remote Sensing Image Semantic Segmentation | Jie Zhang et.al. | 2402.19004 | null |
| 2024-02-28 | Spatial Coherence Loss for Salient and Camouflaged Object Detection and Beyond | Ziyun Yang et.al. | 2402.18698 | null |
| 2024-02-29 | Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation | Zhiwei Yang et.al. | 2402.18467 | link |
| 2024-02-29 | A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation | Francesco Barbato et.al. | 2402.18402 | null |
| 2024-02-28 | Enhancing Roadway Safety: LiDAR-based Tree Clearance Analysis | Miriam Louise Carnot et.al. | 2402.18309 | null |
| 2024-02-28 | Feature Denoising For Low-Light Instance Segmentation Using Weighted Non-Local Blocks | Joanne Lin et.al. | 2402.18307 | null |
| 2024-02-28 | Self-Supervised Learning in Electron Microscopy: Towards a Foundation Model for Advanced Image Analysis | Bashir Kazimi et.al. | 2402.18286 | null |
| 2024-02-28 | PRCL: Probabilistic Representation Contrastive Learning for Semi-Supervised Semantic Segmentation | Haoyu Xie et.al. | 2402.18117 | null |
| 2024-02-28 | Spannotation: Enhancing Semantic Segmentation for Autonomous Navigation with Efficient Image Annotation | Samuel O. Folorunsho et.al. | 2402.18084 | link |
| 2024-02-27 | Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation | Xinyu Yang et.al. | 2402.17891 | link |
| 2024-02-27 | Mitigating Distributional Shift in Semantic Segmentation via Uncertainty Estimation from Unlabelled Data | David S. W. Williams et.al. | 2402.17653 | null |
| 2024-02-27 | Masked Gamma-SSL: Learning Uncertainty Estimation via Masked Image Modeling | David S. W. Williams et.al. | 2402.17622 | null |