Semantic Segmentation - 2024-08
Semantic Segmentation - 2024-08
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-08-30 | Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes | Li Zhang et.al. | 2408.17421 | translate | read | link |
| 2024-08-30 | Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations | Ahmed Hammam et.al. | 2408.17311 | translate | read | null |
| 2024-08-30 | Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training | Zizheng Huang et.al. | 2408.17081 | translate | read | link |
| 2024-08-30 | Transient Fault Tolerant Semantic Segmentation for Autonomous Driving | Leonardo Iurada et.al. | 2408.16952 | translate | read | link |
| 2024-08-29 | Eigen-Cluster VIS: Improving Weakly-supervised Video Instance Segmentation by Leveraging Spatio-temporal Consistency | Farnoosh Arefi et.al. | 2408.16661 | translate | read | link |
| 2024-08-29 | SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection | Rohit Venkata Sai Dulam et.al. | 2408.16645 | translate | read | null |
| 2024-08-29 | A Simple and Generalist Approach for Panoptic Segmentation | Nedyalko Prisadnikov et.al. | 2408.16504 | translate | read | null |
| 2024-08-29 | MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation | Linyan Yang et.al. | 2408.16478 | translate | read | null |
| 2024-08-29 | Multi-source Domain Adaptation for Panoramic Semantic Segmentation | Jing Jiang et.al. | 2408.16469 | translate | read | null |
| 2024-08-29 | EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More | Kanghao Chen et.al. | 2408.16254 | translate | read | null |
| 2024-08-28 | InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentation | Thibaut Goldsborough et.al. | 2408.15954 | translate | read | link |
| 2024-08-28 | SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors | Zhiqing Zhang et.al. | 2408.15887 | translate | read | null |
| 2024-08-28 | DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries | Yu Yang et.al. | 2408.15813 | translate | read | null |
| 2024-08-28 | TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation | Junbao Zhou et.al. | 2408.15657 | translate | read | link |
| 2024-08-27 | Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images | Silvia Seidlitz et.al. | 2408.15373 | translate | read | link |
| 2024-08-27 | An Investigation on The Position Encoding in Vision-Based Dynamics Prediction | Jiageng Zhu et.al. | 2408.15201 | translate | read | null |
| 2024-08-27 | Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation | Elona Shatri et.al. | 2408.15002 | translate | read | null |
| 2024-08-27 | Applying ViT in Generalized Few-shot Semantic Segmentation | Liyuan Geng et.al. | 2408.14957 | translate | read | link |
| 2024-08-27 | Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack | Naufal Suryanto et.al. | 2408.14879 | translate | read | null |
| 2024-08-27 | MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation | Yuanbing Zhu et.al. | 2408.14776 | translate | read | null |
| 2024-08-26 | Physically Feasible Semantic Segmentation | Shamik Basu et.al. | 2408.14672 | translate | read | link |
| 2024-08-26 | A Survey of Camouflaged Object Detection and Beyond | Fengyang Xiao et.al. | 2408.14562 | translate | read | null |
| 2024-08-26 | Satellite Sunroof: High-res Digital Surface Models and Roof Segmentation for Global Solar Mapping | Vishal Batchu et.al. | 2408.14400 | translate | read | null |
| 2024-08-25 | OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation | Muhammad Rameez ur Rahman et.al. | 2408.13936 | translate | read | link |
| 2024-08-25 | Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation | Yuwen Pan et.al. | 2408.13838 | translate | read | null |
| 2024-08-25 | TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather | Xiongwei Zhao et.al. | 2408.13802 | translate | read | link |
| 2024-08-25 | ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation | Xin Zhang et.al. | 2408.13771 | translate | read | null |
| 2024-08-25 | Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation | Zhaoyang Li et.al. | 2408.13752 | translate | read | null |
| 2024-08-24 | ESA: Annotation-Efficient Active Learning for Semantic Segmentation | Jinchao Ge et.al. | 2408.13491 | translate | read | link |
| 2024-08-23 | Accuracy Improvement of Cell Image Segmentation Using Feedback Former | Hinako Mitsuoka et.al. | 2408.12974 | translate | read | null |
| 2024-08-23 | Image Segmentation in Foundation Model Era: A Survey | Tianfei Zhou et.al. | 2408.12957 | translate | read | null |
| 2024-08-23 | Symmetric masking strategy enhances the performance of Masked Image Modeling | Khanh-Binh Nguyen et.al. | 2408.12772 | translate | read | null |
| 2024-08-22 | Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets | Wolfgang Boettcher et.al. | 2408.12489 | translate | read | null |
| 2024-08-22 | The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation | Tuyen Tran et.al. | 2408.12447 | translate | read | null |
| 2024-08-22 | ISETHDR: A Physics-based Synthetic Radiance Dataset for High Dynamic Range Driving Scenes | Zhenyi Liu et.al. | 2408.12048 | translate | read | link |
| 2024-08-21 | EmbodiedSAM: Online Segment Any 3D Thing in Real Time | Xiuwei Xu et.al. | 2408.11811 | translate | read | null |
| 2024-08-21 | NuSegDG: Integration of Heterogeneous Space and Gaussian Kernel for Domain-Generalized Nuclei Segmentation | Zhenye Lou et.al. | 2408.11787 | translate | read | link |
| 2024-08-21 | Open-Ended 3D Point Cloud Instance Segmentation | Phuc D. A. Nguyen et.al. | 2408.11747 | translate | read | null |
| 2024-08-21 | UNetMamba: Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images | Enze Zhu et.al. | 2408.11545 | translate | read | null |
| 2024-08-22 | SAM-REF: Rethinking Image-Prompt Synergy for Refinement in Segment Anything | Chongkai Yu et.al. | 2408.11535 | translate | read | null |
| 2024-08-21 | Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation | Chuandong Liu et.al. | 2408.11280 | translate | read | null |
| 2024-08-20 | An Interpretable Deep Learning Approach for Morphological Script Type Analysis | Malamatenia Vlachou-Efstathiou et.al. | 2408.11150 | translate | read | null |
| 2024-08-20 | NeCo: Improving DINOv2’s spatial representations in 19 GPU hours with Patch Neighbor Consistency | Valentinos Pariza et.al. | 2408.11054 | translate | read | null |
| 2024-08-20 | CO2Wounds-V2: Extended Chronic Wounds Dataset From Leprosy Patients | Karen Sanchez et.al. | 2408.10827 | translate | read | null |
| 2024-08-20 | Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant | Guofeng Mei et.al. | 2408.10652 | translate | read | null |
| 2024-08-20 | Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended? | Chen Liang et.al. | 2408.10627 | translate | read | null |
| 2024-08-20 | Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation | Jiawei Han et.al. | 2408.10537 | translate | read | link |
| 2024-08-21 | LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS | Xinyu Liu et.al. | 2408.10469 | translate | read | null |
| 2024-08-19 | Leveraging Superfluous Information in Contrastive Representation Learning | Xuechu Yu et.al. | 2408.10292 | translate | read | null |
| 2024-08-19 | Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network | Rasha Alshawi et.al. | 2408.10181 | translate | read | null |
| 2024-08-19 | Dynamic Label Injection for Imbalanced Industrial Defect Segmentation | Emanuele Caruso et.al. | 2408.10031 | translate | read | link |
| 2024-08-19 | Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis | Kira Maag et.al. | 2408.10021 | translate | read | null |
| 2024-08-19 | DiscoNeRF: Class-Agnostic Object Field for 3D Object Discovery | Corentin Dumery et.al. | 2408.09928 | translate | read | null |
| 2024-08-19 | 3D-Aware Instance Segmentation and Tracking in Egocentric Videos | Yash Bhalgat et.al. | 2408.09860 | translate | read | null |
| 2024-08-19 | Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving | Jun Yan et.al. | 2408.09839 | translate | read | link |
| 2024-08-18 | OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras | Muhammad Rameez Ur Rahman et.al. | 2408.09424 | translate | read | link |
| 2024-08-18 | VrdONE: One-stage Video Visual Relation Detection | Xinjie Jiang et.al. | 2408.09408 | translate | read | link |
| 2024-08-18 | Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration | Hao Ai et.al. | 2408.09336 | translate | read | null |
| 2024-08-17 | Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology | Junchao Zhu et.al. | 2408.09278 | translate | read | link |
| 2024-08-16 | Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation | Tri Ton et.al. | 2408.08591 | translate | read | null |
| 2024-08-16 | Tuning a SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation | Linghao Zheng et.al. | 2408.08576 | translate | read | null |
| 2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et.al. | 2408.08575 | translate | read | null |
| 2024-08-15 | 5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks | Dongshuo Yin et.al. | 2408.08345 | translate | read | link |
| 2024-08-14 | MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis | Nimeesha Chan et.al. | 2408.07773 | translate | read | link |
| 2024-08-15 | MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation | Beoungwoo Kang et.al. | 2408.07576 | translate | read | link |
| 2024-08-15 | MagicFace: Training-free Universal-Style Human Image Customized Synthesis | Yibin Wang et.al. | 2408.07433 | translate | read | null |
| 2024-08-14 | Segment Using Just One Example | Pratik Vora et.al. | 2408.07393 | translate | read | null |
| 2024-08-14 | Ensemble architecture in polyp segmentation | Hao-Yun Hsu et.al. | 2408.07262 | translate | read | link |
| 2024-08-14 | Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks | Raghavendra Singh et.al. | 2408.07243 | translate | read | null |
| 2024-08-14 | Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training | Ethan Kou et.al. | 2408.07239 | translate | read | null |
| 2024-08-13 | ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Jingyun Wang et.al. | 2408.06747 | translate | read | link |
| 2024-08-10 | Dilated Convolution with Learnable Spacings | Ismail Khalfaoui-Hassani et.al. | 2408.06383 | translate | read | null |
| 2024-08-12 | Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images | Siladittya Manna et.al. | 2408.06235 | translate | read | null |
| 2024-08-12 | A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting | Felix Assion et.al. | 2408.06071 | translate | read | null |
| 2024-08-13 | ClickAttention: Click Region Similarity Guided Interactive Segmentation | Long Xu et.al. | 2408.06021 | translate | read | null |
| 2024-08-12 | Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning | Xinrong Hu et.al. | 2408.05889 | translate | read | null |
| 2024-08-11 | Seg-CycleGAN : SAR-to-optical image translation guided by a downstream task | Hannuo Zhang et.al. | 2408.05777 | translate | read | null |
| 2024-08-11 | MacFormer: Semantic Segmentation with Fine Object Boundaries | Guoan Xu et.al. | 2408.05699 | translate | read | null |
| 2024-08-13 | Performance Evaluation of YOLOv8 Model Configurations, for Instance Segmentation of Strawberry Fruit Development Stages in an Open Field Environment | Abdul-Razak Alhassan Gamani et.al. | 2408.05661 | translate | read | null |
| 2024-08-10 | Multimodal generative semantic communication based on latent diffusion model | Weiqi Fu et.al. | 2408.05455 | translate | read | null |
| 2024-08-09 | PRISM Lite: A lightweight model for interactive 3D placenta segmentation in ultrasound | Hao Li et.al. | 2408.05372 | translate | read | link |
| 2024-08-09 | In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Dahyun Kang et.al. | 2408.04961 | translate | read | link |
| 2024-08-09 | ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | Mengcheng Lan et.al. | 2408.04883 | translate | read | link |
| 2024-08-09 | Extracting Signal Electron Trajectories in the COMET Phase-I Cylindrical Drift Chamber Using Deep Learning | Fumihiro Kaneko et.al. | 2408.04795 | translate | read | null |
| 2024-08-08 | Embodied Uncertainty-Aware Object Segmentation | Xiaolin Fang et.al. | 2408.04760 | translate | read | null |
| 2024-08-08 | SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation | Jieming Yu et.al. | 2408.04593 | translate | read | null |
| 2024-08-08 | Robust Approximate Characterization of Single-Cell Heterogeneity in Microbial Growth | Richard D. Paul et.al. | 2408.04501 | translate | read | link |
| 2024-08-08 | SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios | Sriram Mandalika et.al. | 2408.04482 | translate | read | null |
| 2024-08-08 | What could go wrong? Discovering and describing failure modes in computer vision | Gabriela Csurka et.al. | 2408.04471 | translate | read | null |
| 2024-08-07 | Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation | Yiqing Shen et.al. | 2408.04098 | translate | read | null |
| 2024-08-07 | CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Tianfang Zhang et.al. | 2408.03703 | translate | read | link |
| 2024-08-07 | SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology | Mingya Zhang et.al. | 2408.03651 | translate | read | link |
| 2024-08-06 | Post-Mortem Human Iris Segmentation Analysis with Deep Learning | Afzal Hossain et.al. | 2408.03448 | translate | read | null |
| 2024-08-06 | Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression | Jonas Schmitt et.al. | 2408.03046 | translate | read | link |
| 2024-08-06 | Evaluation of Segment Anything Model 2: The Role of SAM2 in the Underwater Environment | Shijie Lian et.al. | 2408.02924 | translate | read | link |
| 2024-08-05 | Scribble-Based Interactive Segmentation of Medical Hyperspectral Images | Zhonghao Wang et.al. | 2408.02708 | translate | read | null |
| 2024-08-05 | Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation | Sai Prasanna et.al. | 2408.02297 | translate | read | null |
| 2024-08-05 | Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs | Jeongkee Lim et.al. | 2408.02261 | translate | read | null |
| 2024-08-05 | Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders | Muhammad Abdullah Jamal et.al. | 2408.02245 | translate | read | null |
| 2024-08-04 | Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation | Ye Du et.al. | 2408.02039 | translate | read | null |
| 2024-08-03 | NuLite – Lightweight and Fast Model for Nuclei Instance Segmentation and Classification | Cristian Tommasino et.al. | 2408.01797 | translate | read | null |
| 2024-08-03 | Bayesian Active Learning for Semantic Segmentation | Sima Didari et.al. | 2408.01694 | translate | read | null |
| 2024-08-03 | A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection | Omkar Oak et.al. | 2408.01692 | translate | read | null |
| 2024-08-03 | Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation | Balázs Opra et.al. | 2408.01640 | translate | read | null |
| 2024-08-02 | Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans | Lukas Kratochvila et.al. | 2408.01526 | translate | read | null |
| 2024-08-02 | Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation | Yuanzhi Su et.al. | 2408.01356 | translate | read | null |
| 2024-08-02 | StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation | Bingyu Li et.al. | 2408.01343 | translate | read | null |
| 2024-08-02 | Amodal Segmentation for Laparoscopic Surgery Video Instruments | Ruohua Shi et.al. | 2408.01067 | translate | read | null |
| 2024-08-02 | Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Yabin Zhu et.al. | 2408.00969 | translate | read | null |
| 2024-08-01 | Medical SAM 2: Segment medical images as video via Segment Anything Model 2 | Jiayuan Zhu et.al. | 2408.00874 | translate | read | link |
| 2024-08-01 | Leaf Angle Estimation using Mask R-CNN and LETR Vision Transformer | Venkat Margapuri et.al. | 2408.00749 | translate | read | null |
| 2024-08-01 | Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation | Siyu Jiao et.al. | 2408.00744 | translate | read | link |
| 2024-08-01 | Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function | Matias Oscar Volman Stern et.al. | 2408.00707 | translate | read | null |
| 2024-08-01 | AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation | Asbjørn Munk et.al. | 2408.00640 | translate | read | null |
| 2024-08-01 | SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation | Shengbo Tan et.al. | 2408.00496 | translate | read | null |
| 2024-08-01 | A Simple Background Augmentation Method for Object Detection with Diffusion Model | Yuhang Li et.al. | 2408.00350 | translate | read | null |
(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)