Semantic Segmentation - 2024-08

| Publish Date | Title | Authors | PDF | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2024-08-30 | Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes | Li Zhang et al. | 2408.17421 | translate | read | link |
| 2024-08-30 | Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations | Ahmed Hammam et al. | 2408.17311 | translate | read | null |
| 2024-08-30 | Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training | Zizheng Huang et al. | 2408.17081 | translate | read | link |
| 2024-08-30 | Transient Fault Tolerant Semantic Segmentation for Autonomous Driving | Leonardo Iurada et al. | 2408.16952 | translate | read | link |
| 2024-08-29 | Eigen-Cluster VIS: Improving Weakly-supervised Video Instance Segmentation by Leveraging Spatio-temporal Consistency | Farnoosh Arefi et al. | 2408.16661 | translate | read | link |
| 2024-08-29 | SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection | Rohit Venkata Sai Dulam et al. | 2408.16645 | translate | read | null |
| 2024-08-29 | A Simple and Generalist Approach for Panoptic Segmentation | Nedyalko Prisadnikov et al. | 2408.16504 | translate | read | null |
| 2024-08-29 | MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation | Linyan Yang et al. | 2408.16478 | translate | read | null |
| 2024-08-29 | Multi-source Domain Adaptation for Panoramic Semantic Segmentation | Jing Jiang et al. | 2408.16469 | translate | read | null |
| 2024-08-29 | EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More | Kanghao Chen et al. | 2408.16254 | translate | read | null |
| 2024-08-28 | InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentation | Thibaut Goldsborough et al. | 2408.15954 | translate | read | link |
| 2024-08-28 | SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors | Zhiqing Zhang et al. | 2408.15887 | translate | read | null |
| 2024-08-28 | DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries | Yu Yang et al. | 2408.15813 | translate | read | null |
| 2024-08-28 | TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation | Junbao Zhou et al. | 2408.15657 | translate | read | link |
| 2024-08-27 | Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images | Silvia Seidlitz et al. | 2408.15373 | translate | read | link |
| 2024-08-27 | An Investigation on The Position Encoding in Vision-Based Dynamics Prediction | Jiageng Zhu et al. | 2408.15201 | translate | read | null |
| 2024-08-27 | Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation | Elona Shatri et al. | 2408.15002 | translate | read | null |
| 2024-08-27 | Applying ViT in Generalized Few-shot Semantic Segmentation | Liyuan Geng et al. | 2408.14957 | translate | read | link |
| 2024-08-27 | Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack | Naufal Suryanto et al. | 2408.14879 | translate | read | null |
| 2024-08-27 | MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation | Yuanbing Zhu et al. | 2408.14776 | translate | read | null |
| 2024-08-26 | Physically Feasible Semantic Segmentation | Shamik Basu et al. | 2408.14672 | translate | read | link |
| 2024-08-26 | A Survey of Camouflaged Object Detection and Beyond | Fengyang Xiao et al. | 2408.14562 | translate | read | null |
| 2024-08-26 | Satellite Sunroof: High-res Digital Surface Models and Roof Segmentation for Global Solar Mapping | Vishal Batchu et al. | 2408.14400 | translate | read | null |
| 2024-08-25 | OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation | Muhammad Rameez ur Rahman et al. | 2408.13936 | translate | read | link |
| 2024-08-25 | Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation | Yuwen Pan et al. | 2408.13838 | translate | read | null |
| 2024-08-25 | TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather | Xiongwei Zhao et al. | 2408.13802 | translate | read | link |
| 2024-08-25 | ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation | Xin Zhang et al. | 2408.13771 | translate | read | null |
| 2024-08-25 | Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation | Zhaoyang Li et al. | 2408.13752 | translate | read | null |
| 2024-08-24 | ESA: Annotation-Efficient Active Learning for Semantic Segmentation | Jinchao Ge et al. | 2408.13491 | translate | read | link |
| 2024-08-23 | Accuracy Improvement of Cell Image Segmentation Using Feedback Former | Hinako Mitsuoka et al. | 2408.12974 | translate | read | null |
| 2024-08-23 | Image Segmentation in Foundation Model Era: A Survey | Tianfei Zhou et al. | 2408.12957 | translate | read | null |
| 2024-08-23 | Symmetric masking strategy enhances the performance of Masked Image Modeling | Khanh-Binh Nguyen et al. | 2408.12772 | translate | read | null |
| 2024-08-22 | Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets | Wolfgang Boettcher et al. | 2408.12489 | translate | read | null |
| 2024-08-22 | The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation | Tuyen Tran et al. | 2408.12447 | translate | read | null |
| 2024-08-22 | ISETHDR: A Physics-based Synthetic Radiance Dataset for High Dynamic Range Driving Scenes | Zhenyi Liu et al. | 2408.12048 | translate | read | link |
| 2024-08-21 | EmbodiedSAM: Online Segment Any 3D Thing in Real Time | Xiuwei Xu et al. | 2408.11811 | translate | read | null |
| 2024-08-21 | NuSegDG: Integration of Heterogeneous Space and Gaussian Kernel for Domain-Generalized Nuclei Segmentation | Zhenye Lou et al. | 2408.11787 | translate | read | link |
| 2024-08-21 | Open-Ended 3D Point Cloud Instance Segmentation | Phuc D. A. Nguyen et al. | 2408.11747 | translate | read | null |
| 2024-08-21 | UNetMamba: Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images | Enze Zhu et al. | 2408.11545 | translate | read | null |
| 2024-08-22 | SAM-REF: Rethinking Image-Prompt Synergy for Refinement in Segment Anything | Chongkai Yu et al. | 2408.11535 | translate | read | null |
| 2024-08-21 | Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation | Chuandong Liu et al. | 2408.11280 | translate | read | null |
| 2024-08-20 | An Interpretable Deep Learning Approach for Morphological Script Type Analysis | Malamatenia Vlachou-Efstathiou et al. | 2408.11150 | translate | read | null |
| 2024-08-20 | NeCo: Improving DINOv2’s spatial representations in 19 GPU hours with Patch Neighbor Consistency | Valentinos Pariza et al. | 2408.11054 | translate | read | null |
| 2024-08-20 | CO2Wounds-V2: Extended Chronic Wounds Dataset From Leprosy Patients | Karen Sanchez et al. | 2408.10827 | translate | read | null |
| 2024-08-20 | Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant | Guofeng Mei et al. | 2408.10652 | translate | read | null |
| 2024-08-20 | Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended? | Chen Liang et al. | 2408.10627 | translate | read | null |
| 2024-08-20 | Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation | Jiawei Han et al. | 2408.10537 | translate | read | link |
| 2024-08-21 | LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS | Xinyu Liu et al. | 2408.10469 | translate | read | null |
| 2024-08-19 | Leveraging Superfluous Information in Contrastive Representation Learning | Xuechu Yu et al. | 2408.10292 | translate | read | null |
| 2024-08-19 | Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network | Rasha Alshawi et al. | 2408.10181 | translate | read | null |
| 2024-08-19 | Dynamic Label Injection for Imbalanced Industrial Defect Segmentation | Emanuele Caruso et al. | 2408.10031 | translate | read | link |
| 2024-08-19 | Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis | Kira Maag et al. | 2408.10021 | translate | read | null |
| 2024-08-19 | DiscoNeRF: Class-Agnostic Object Field for 3D Object Discovery | Corentin Dumery et al. | 2408.09928 | translate | read | null |
| 2024-08-19 | 3D-Aware Instance Segmentation and Tracking in Egocentric Videos | Yash Bhalgat et al. | 2408.09860 | translate | read | null |
| 2024-08-19 | Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving | Jun Yan et al. | 2408.09839 | translate | read | link |
| 2024-08-18 | OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras | Muhammad Rameez Ur Rahman et al. | 2408.09424 | translate | read | link |
| 2024-08-18 | VrdONE: One-stage Video Visual Relation Detection | Xinjie Jiang et al. | 2408.09408 | translate | read | link |
| 2024-08-18 | Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration | Hao Ai et al. | 2408.09336 | translate | read | null |
| 2024-08-17 | Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology | Junchao Zhu et al. | 2408.09278 | translate | read | link |
| 2024-08-16 | Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation | Tri Ton et al. | 2408.08591 | translate | read | null |
| 2024-08-16 | Tuning a SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation | Linghao Zheng et al. | 2408.08576 | translate | read | null |
| 2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et al. | 2408.08575 | translate | read | null |
| 2024-08-15 | 5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks | Dongshuo Yin et al. | 2408.08345 | translate | read | link |
| 2024-08-14 | MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis | Nimeesha Chan et al. | 2408.07773 | translate | read | link |
| 2024-08-15 | MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation | Beoungwoo Kang et al. | 2408.07576 | translate | read | link |
| 2024-08-15 | MagicFace: Training-free Universal-Style Human Image Customized Synthesis | Yibin Wang et al. | 2408.07433 | translate | read | null |
| 2024-08-14 | Segment Using Just One Example | Pratik Vora et al. | 2408.07393 | translate | read | null |
| 2024-08-14 | Ensemble architecture in polyp segmentation | Hao-Yun Hsu et al. | 2408.07262 | translate | read | link |
| 2024-08-14 | Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks | Raghavendra Singh et al. | 2408.07243 | translate | read | null |
| 2024-08-14 | Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training | Ethan Kou et al. | 2408.07239 | translate | read | null |
| 2024-08-13 | ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Jingyun Wang et al. | 2408.06747 | translate | read | link |
| 2024-08-10 | Dilated Convolution with Learnable Spacings | Ismail Khalfaoui-Hassani et al. | 2408.06383 | translate | read | null |
| 2024-08-12 | Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images | Siladittya Manna et al. | 2408.06235 | translate | read | null |
| 2024-08-12 | A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting | Felix Assion et al. | 2408.06071 | translate | read | null |
| 2024-08-13 | ClickAttention: Click Region Similarity Guided Interactive Segmentation | Long Xu et al. | 2408.06021 | translate | read | null |
| 2024-08-12 | Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning | Xinrong Hu et al. | 2408.05889 | translate | read | null |
| 2024-08-11 | Seg-CycleGAN : SAR-to-optical image translation guided by a downstream task | Hannuo Zhang et al. | 2408.05777 | translate | read | null |
| 2024-08-11 | MacFormer: Semantic Segmentation with Fine Object Boundaries | Guoan Xu et al. | 2408.05699 | translate | read | null |
| 2024-08-13 | Performance Evaluation of YOLOv8 Model Configurations, for Instance Segmentation of Strawberry Fruit Development Stages in an Open Field Environment | Abdul-Razak Alhassan Gamani et al. | 2408.05661 | translate | read | null |
| 2024-08-10 | Multimodal generative semantic communication based on latent diffusion model | Weiqi Fu et al. | 2408.05455 | translate | read | null |
| 2024-08-09 | PRISM Lite: A lightweight model for interactive 3D placenta segmentation in ultrasound | Hao Li et al. | 2408.05372 | translate | read | link |
| 2024-08-09 | In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Dahyun Kang et al. | 2408.04961 | translate | read | link |
| 2024-08-09 | ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | Mengcheng Lan et al. | 2408.04883 | translate | read | link |
| 2024-08-09 | Extracting Signal Electron Trajectories in the COMET Phase-I Cylindrical Drift Chamber Using Deep Learning | Fumihiro Kaneko et al. | 2408.04795 | translate | read | null |
| 2024-08-08 | Embodied Uncertainty-Aware Object Segmentation | Xiaolin Fang et al. | 2408.04760 | translate | read | null |
| 2024-08-08 | SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation | Jieming Yu et al. | 2408.04593 | translate | read | null |
| 2024-08-08 | Robust Approximate Characterization of Single-Cell Heterogeneity in Microbial Growth | Richard D. Paul et al. | 2408.04501 | translate | read | link |
| 2024-08-08 | SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios | Sriram Mandalika et al. | 2408.04482 | translate | read | null |
| 2024-08-08 | What could go wrong? Discovering and describing failure modes in computer vision | Gabriela Csurka et al. | 2408.04471 | translate | read | null |
| 2024-08-07 | Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation | Yiqing Shen et al. | 2408.04098 | translate | read | null |
| 2024-08-07 | CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Tianfang Zhang et al. | 2408.03703 | translate | read | link |
| 2024-08-07 | SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology | Mingya Zhang et al. | 2408.03651 | translate | read | link |
| 2024-08-06 | Post-Mortem Human Iris Segmentation Analysis with Deep Learning | Afzal Hossain et al. | 2408.03448 | translate | read | null |
| 2024-08-06 | Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression | Jonas Schmitt et al. | 2408.03046 | translate | read | link |
| 2024-08-06 | Evaluation of Segment Anything Model 2: The Role of SAM2 in the Underwater Environment | Shijie Lian et al. | 2408.02924 | translate | read | link |
| 2024-08-05 | Scribble-Based Interactive Segmentation of Medical Hyperspectral Images | Zhonghao Wang et al. | 2408.02708 | translate | read | null |
| 2024-08-05 | Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation | Sai Prasanna et al. | 2408.02297 | translate | read | null |
| 2024-08-05 | Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs | Jeongkee Lim et al. | 2408.02261 | translate | read | null |
| 2024-08-05 | Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders | Muhammad Abdullah Jamal et al. | 2408.02245 | translate | read | null |
| 2024-08-04 | Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation | Ye Du et al. | 2408.02039 | translate | read | null |
| 2024-08-03 | NuLite – Lightweight and Fast Model for Nuclei Instance Segmentation and Classification | Cristian Tommasino et al. | 2408.01797 | translate | read | null |
| 2024-08-03 | Bayesian Active Learning for Semantic Segmentation | Sima Didari et al. | 2408.01694 | translate | read | null |
| 2024-08-03 | A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection | Omkar Oak et al. | 2408.01692 | translate | read | null |
| 2024-08-03 | Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation | Balázs Opra et al. | 2408.01640 | translate | read | null |
| 2024-08-02 | Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans | Lukas Kratochvila et al. | 2408.01526 | translate | read | null |
| 2024-08-02 | Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation | Yuanzhi Su et al. | 2408.01356 | translate | read | null |
| 2024-08-02 | StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation | Bingyu Li et al. | 2408.01343 | translate | read | null |
| 2024-08-02 | Amodal Segmentation for Laparoscopic Surgery Video Instruments | Ruohua Shi et al. | 2408.01067 | translate | read | null |
| 2024-08-02 | Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Yabin Zhu et al. | 2408.00969 | translate | read | null |
| 2024-08-01 | Medical SAM 2: Segment medical images as video via Segment Anything Model 2 | Jiayuan Zhu et al. | 2408.00874 | translate | read | link |
| 2024-08-01 | Leaf Angle Estimation using Mask R-CNN and LETR Vision Transformer | Venkat Margapuri et al. | 2408.00749 | translate | read | null |
| 2024-08-01 | Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation | Siyu Jiao et al. | 2408.00744 | translate | read | link |
| 2024-08-01 | Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function | Matias Oscar Volman Stern et al. | 2408.00707 | translate | read | null |
| 2024-08-01 | AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation | Asbjørn Munk et al. | 2408.00640 | translate | read | null |
| 2024-08-01 | SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation | Shengbo Tan et al. | 2408.00496 | translate | read | null |
| 2024-08-01 | A Simple Background Augmentation Method for Object Detection with Diffusion Model | Yuhang Li et al. | 2408.00350 | translate | read | null |
