Semantic Segmentation - 2024-12

Publish Date Title Authors PDF Translate Read Code
2024-12-31 Exploiting Boundary Loss for the Hierarchical Panoptic Segmentation of Plants and Leaves Madeleine Darbyshire et.al. 2501.00527 translate read link
2024-12-31 H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters Pedram Fekri et.al. 2501.00514 translate read null
2024-12-31 A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images Dawen Yu et.al. 2501.00360 translate read null
2024-12-31 PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM Runnan Chen et.al. 2501.00352 translate read null
2024-12-31 OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies Runnan Chen et.al. 2501.00326 translate read link
2024-12-30 HisynSeg: Weakly-Supervised Histopathological Image Segmentation via Image-Mixing Synthesis and Consistency Regularization Zijie Fang et.al. 2412.20924 translate read link
2024-12-30 LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training Fardin Ayar et.al. 2412.20881 translate read null
2024-12-29 Image Augmentation Agent for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2412.20439 translate read null
2024-12-27 Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP Zhongxing Xu et.al. 2412.19650 translate read null
2024-12-27 An Actionable Hierarchical Scene Representation Enhancing Autonomous Inspection Missions in Unknown Environments Vignesh Kottayam Viswanathan et.al. 2412.19582 translate read null
2024-12-27 Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation Chengyang Ye et.al. 2412.19492 translate read link
2024-12-26 Impact of color and mixing proportion of synthetic point clouds on semantic segmentation Shaojie Zhou et.al. 2412.19145 translate read null
2024-12-25 Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model Yi-Chia Chen et.al. 2412.18917 translate read link
2024-12-24 AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction Pufan Zou et.al. 2412.18255 translate read null
2024-12-25 VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis Shicheng Yin et.al. 2412.18178 translate read link
2024-12-24 UniPLV: Towards Label-Efficient Open-World 3D Scene Understanding by Regional Visual Language Supervision Yuru Wang et.al. 2412.18131 translate read null
2024-12-24 LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding Hao Li et.al. 2412.17635 translate read null
2024-12-25 AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation Jiaqi Ma et.al. 2412.17601 translate read link
2024-12-24 Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation Jianjian Yin et.al. 2412.17331 translate read link
2024-12-22 Multi-Scale Foreground-Background Confidence for Out-of-Distribution Segmentation Samuel Marschall et.al. 2412.16990 translate read null
2024-12-22 Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection Yuhang Gan et.al. 2412.16918 translate read null
2024-12-22 MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection Xu Zheng et.al. 2412.16876 translate read null
2024-12-22 Adversarial Diffusion Model for Unsupervised Domain-Adaptive Semantic Segmentation Jongmin Yu et.al. 2412.16859 translate read null
2024-12-21 A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection Shahid Ansari et.al. 2412.16755 translate read null
2024-12-21 IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks Yaming Zhang et.al. 2412.16654 translate read link
2024-12-21 V”Mean”ba: Visual State Space Models only need 1 hidden dimension Tien-Yu Chi et.al. 2412.16602 translate read null
2024-12-20 SegCol Challenge: Semantic Segmentation for Tools and Fold Edges in Colonoscopy data Xinwei Ju et.al. 2412.16078 translate read null
2024-12-20 Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer Xinyue Chen et.al. 2412.15835 translate read link
2024-12-19 MultiverSeg: Scalable Interactive Segmentation of Biomedical Imaging Datasets with In-Context Guidance Hallee E. Wong et.al. 2412.15058 translate read link
2024-12-19 GIRAFE: Glottal Imaging Dataset for Advanced Segmentation, Analysis, and Facilitative Playbacks Evaluation G. Andrade-Miranda et.al. 2412.15054 translate read link
2024-12-19 PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation Shoumeng Qiu et.al. 2412.14821 translate read link
2024-12-19 Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers Rui Ding et.al. 2412.14633 translate read null
2024-12-19 Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation Zhenxin Lei et.al. 2412.14587 translate read null
2024-12-18 Split Learning in Computer Vision for Semantic Segmentation Delay Minimization Nikos G. Evgenidis et.al. 2412.14272 translate read null
2024-12-18 Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation Jianyu Zhang et.al. 2412.14145 translate read null
2024-12-18 Prompt Categories Cluster for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2412.13823 translate read null
2024-12-18 Federated Source-free Domain Adaptation for Classification: Weighted Cluster Aggregation for Unlabeled Data Junki Mori et.al. 2412.13757 translate read null
2024-12-18 Optical aberrations in autonomous driving: Physics-informed parameterized temperature scaling for neural network uncertainty calibration Dominik Werner Wolf et.al. 2412.13695 translate read null
2024-12-18 GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting Yuning Peng et.al. 2412.13654 translate read link
2024-12-18 RelationField: Relate Anything in Radiance Fields Sebastian Koch et.al. 2412.13652 translate read null
2024-12-17 S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging Yimu Pan et.al. 2412.13156 translate read null
2024-12-17 Efficient Event-based Semantic Segmentation with Spike-driven Lightweight Transformer-based Networks Xiaxin Zhu et.al. 2412.12843 translate read null
2024-12-17 ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation Shiqi Huang et.al. 2412.12798 translate read link
2024-12-17 Open-World Panoptic Segmentation Matteo Sodano et.al. 2412.12740 translate read null
2024-12-17 SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing Chen Chen et.al. 2412.12685 translate read link
2024-12-17 Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation Dongyue Wu et.al. 2412.12672 translate read link
2024-12-17 Adaptive Prototype Replay for Class Incremental Semantic Segmentation Guilin Zhu et.al. 2412.12669 translate read null
2024-12-17 SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation Shuangping Huang et.al. 2412.12660 translate read null
2024-12-16 Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation Hongwei Niu et.al. 2412.12050 translate read link
2024-12-16 SAMIC: Segment Anything with In-Context Spatial Prompt Engineering Savinay Nagendra et.al. 2412.11998 translate read null
2024-12-16 SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation Yunxiang Fu et.al. 2412.11890 translate read link
2024-12-16 Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic Segmentation Svetlana Pavlitska et.al. 2412.11608 translate read null
2024-12-16 PyPotteryLens: An Open-Source Deep Learning Framework for Automated Digitisation of Archaeological Pottery Documentation Lorenzo Cardarelli et.al. 2412.11574 translate read null
2024-12-15 Volumetric Mapping with Panoptic Refinement via Kernel Density Estimation for Mobile Robots Khang Nguyen et.al. 2412.11241 translate read link
2024-12-15 MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation Zhiwei Yang et.al. 2412.11076 translate read link
2024-12-15 Classification Drives Geographic Bias in Street Scene Segmentation Rahul Nair et.al. 2412.11061 translate read null
2024-12-15 SAM-IF: Leveraging SAM for Incremental Few-Shot Instance Segmentation Xudong Zhou et.al. 2412.11034 translate read null
2024-12-14 RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone Mustafa Munir et.al. 2412.10995 translate read link
2024-12-13 A Universal Degradation-based Bridging Technique for Domain Adaptive Semantic Segmentation Wangkai Li et.al. 2412.10339 translate read null
2024-12-13 SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians Siyun Liang et.al. 2412.10231 translate read null
2024-12-13 SPT: Sequence Prompt Transformer for Interactive Image Segmentation Senlin Cheng et.al. 2412.10224 translate read null
2024-12-13 TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views Liang Zhao et.al. 2412.10051 translate read null
2024-12-13 Object-Focused Data Selection for Dense Prediction Tasks Niclas Popp et.al. 2412.10032 translate read null
2024-12-12 MaskTerial: A Foundation Model for Automated 2D Material Flake Detection Jan-Lucas Uslu et.al. 2412.09333 translate read null
2024-12-12 Towards Open-Vocabulary Video Semantic Segmentation Xinhao Li et.al. 2412.09329 translate read null
2024-12-12 FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation Yuntian Bo et.al. 2412.09319 translate read link
2024-12-12 VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation Roberto Alcover-Couso et.al. 2412.09240 translate read null
2024-12-12 STEAM: Squeeze and Transform Enhanced Attention Module Rishabh Sabharwal et.al. 2412.09023 translate read null
2024-12-11 SegFace: Face Segmentation of Long-Tail Classes Kartik Narayan et.al. 2412.08647 translate read link
2024-12-11 EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation Hongwei Niu et.al. 2412.08628 translate read null
2024-12-12 Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning Fan Lu et.al. 2412.08614 translate read link
2024-12-11 Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion Bingzhi Shen et.al. 2412.08315 translate read null
2024-12-11 Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction Bohan Li et.al. 2412.08243 translate read null
2024-12-11 THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots Zeshun Li et.al. 2412.08096 translate read null
2024-12-11 Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation Zhigang Cen et.al. 2412.08034 translate read null
2024-12-10 Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation Kurt H. W. Stolle et.al. 2412.07966 translate read link
2024-12-11 CADSpotting: Robust Panoptic Symbol Spotting on Large-Scale CAD Drawings Jiazuo Mu et.al. 2412.07377 translate read null
2024-12-09 SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception Yaniv Benny et.al. 2412.06968 translate read null
2024-12-10 ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet Andrei-Robert Alexandrescu et.al. 2412.06742 translate read null
2024-12-09 Active Learning with Context Sampling and One-vs-Rest Entropy for Semantic Segmentation Fei Wu et.al. 2412.06470 translate read null
2024-12-09 Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework Jiuyi Xu et.al. 2412.06268 translate read null
2024-12-09 GCUNet: A GNN-Based Contextual Learning Network for Tertiary Lymphoid Structure Semantic Segmentation in Whole Slide Image Lei Su et.al. 2412.06129 translate read null
2024-12-08 Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation Zipeng Qi et.al. 2412.05969 translate read null
2024-12-08 CSG: A Context-Semantic Guided Diffusion Approach in De Novo Musculoskeletal Ultrasound Image Generation Elay Dahan et.al. 2412.05833 translate read null
2024-12-07 Integrating YOLO11 and Convolution Block Attention Module for Multi-Season Segmentation of Tree Trunks and Branches in Commercial Apple Orchards Ranjan Sapkota et.al. 2412.05728 translate read null
2024-12-10 RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts Xu Liu et.al. 2412.05679 translate read link
2024-12-06 FogROS2-FT: Fault Tolerant Cloud Robotics Kaiyuan Chen et.al. 2412.05408 translate read null
2024-12-06 DreamColour: Controllable Video Colour Editing without Training Chaitat Utintu et.al. 2412.05180 translate read null
2024-12-05 Assessing and Learning Alignment of Unimodal Vision and Language Models Le Zhang et.al. 2412.04616 translate read link
2024-12-05 Towards Real-Time Open-Vocabulary Video Instance Segmentation Bin Yan et.al. 2412.04434 translate read null
2024-12-05 A Hitchhiker’s Guide to Understanding Performances of Two-Class Classifiers Anaïs Halin et.al. 2412.04377 translate read null
2024-12-05 Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts Chenyang Zhu et.al. 2412.04220 translate read null
2024-12-05 Text Change Detection in Multilingual Documents Using Image Comparison Doyoung Park et.al. 2412.04137 translate read null
2024-12-05 SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning Seokju Yun et.al. 2412.04077 translate read null
2024-12-05 Quality Control in Open-Ended Crowdsourcing: A Survey Lei Chai et.al. 2412.03991 translate read null
2024-12-05 Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation Hao Zhu et.al. 2412.03968 translate read link
2024-12-05 LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model Yuan Xue et.al. 2412.03841 translate read null
2024-12-04 Designing DNNs for a trade-off between robustness and processing performance in embedded devices Jon Gutiérrez-Zaballa et.al. 2412.03682 translate read null
2024-12-04 FLAIR: VLM with Fine-grained Language-informed Image Representations Rui Xiao et.al. 2412.03561 translate read link
2024-12-04 Benchmarking Pretrained Attention-based Models for Real-Time Recognition in Robot-Assisted Esophagectomy Ronald L. P. D. de Jong et.al. 2412.03401 translate read null
2024-12-04 Task-driven Image Fusion with Learnable Fusion Loss Haowen Bai et.al. 2412.03240 translate read null
2024-12-04 Biologically-inspired Semi-supervised Semantic Segmentation for Biomedical Imaging Luca Ciampi et.al. 2412.03192 translate read null
2024-12-04 Is Foreground Prototype Sufficient? Few-Shot Medical Image Segmentation with Background-Fused Prototype Song Tang et.al. 2412.02983 translate read null
2024-12-04 Progressive Vision-Language Prompt for Multi-Organ Multi-Class Cell Semantic Segmentation with Single Branch Qing Zhang et.al. 2412.02978 translate read null
2024-12-04 Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution Jiahua Xiao et.al. 2412.02960 translate read null
2024-12-04 Panoptic Diffusion Models: co-generation of images and segmentation maps Yinghan Long et.al. 2412.02929 translate read null
2024-12-03 SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection Joongwon Chae et.al. 2412.02565 translate read null
2024-12-03 Multi-scale and Multi-path Cascaded Convolutional Network for Semantic Segmentation of Colorectal Polyps Malik Abdul Manan et.al. 2412.02443 translate read null
2024-12-03 AH-OCDA: Amplitude-based Curriculum Learning and Hopfield Segmentation Model for Open Compound Domain Adaptation Jaehyun Choi et.al. 2412.02280 translate read null
2024-12-03 Vision Transformers for Weakly-Supervised Microorganism Enumeration Javier Ureña Santiago et.al. 2412.02250 translate read link
2024-12-03 Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance Jing Zeng et.al. 2412.02249 translate read null
2024-12-02 INSIGHT: Explainable Weakly-Supervised Medical Image Analysis Wenbo Zhang et.al. 2412.02012 translate read null
2024-12-02 Global Average Feature Augmentation for Robust Semantic Segmentation with Transformers Alberto Gonzalo Rodriguez Salgado et.al. 2412.01941 translate read null
2024-12-02 COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training Sanghwan Kim et.al. 2412.01814 translate read null
2024-12-02 Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior Yi Yu et.al. 2412.01646 translate read null
2024-12-02 Epipolar Attention Field Transformers for Bird’s Eye View Semantic Segmentation Christian Witte et.al. 2412.01595 translate read null

(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)