Semantic Segmentation - 2024-12
Semantic Segmentation - 2024-12
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-12-31 | Exploiting Boundary Loss for the Hierarchical Panoptic Segmentation of Plants and Leaves | Madeleine Darbyshire et.al. | 2501.00527 | translate | read | link |
| 2024-12-31 | H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters | Pedram Fekri et.al. | 2501.00514 | translate | read | null |
| 2024-12-31 | A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images | Dawen Yu et.al. | 2501.00360 | translate | read | null |
| 2024-12-31 | PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM | Runnan Chen et.al. | 2501.00352 | translate | read | null |
| 2024-12-31 | OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies | Runnan Chen et.al. | 2501.00326 | translate | read | link |
| 2024-12-30 | HisynSeg: Weakly-Supervised Histopathological Image Segmentation via Image-Mixing Synthesis and Consistency Regularization | Zijie Fang et.al. | 2412.20924 | translate | read | link |
| 2024-12-30 | LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training | Fardin Ayar et.al. | 2412.20881 | translate | read | null |
| 2024-12-29 | Image Augmentation Agent for Weakly Supervised Semantic Segmentation | Wangyu Wu et.al. | 2412.20439 | translate | read | null |
| 2024-12-27 | Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP | Zhongxing Xu et.al. | 2412.19650 | translate | read | null |
| 2024-12-27 | An Actionable Hierarchical Scene Representation Enhancing Autonomous Inspection Missions in Unknown Environments | Vignesh Kottayam Viswanathan et.al. | 2412.19582 | translate | read | null |
| 2024-12-27 | Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation | Chengyang Ye et.al. | 2412.19492 | translate | read | link |
| 2024-12-26 | Impact of color and mixing proportion of synthetic point clouds on semantic segmentation | Shaojie Zhou et.al. | 2412.19145 | translate | read | null |
| 2024-12-25 | Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model | Yi-Chia Chen et.al. | 2412.18917 | translate | read | link |
| 2024-12-24 | AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction | Pufan Zou et.al. | 2412.18255 | translate | read | null |
| 2024-12-25 | VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis | Shicheng Yin et.al. | 2412.18178 | translate | read | link |
| 2024-12-24 | UniPLV: Towards Label-Efficient Open-World 3D Scene Understanding by Regional Visual Language Supervision | Yuru Wang et.al. | 2412.18131 | translate | read | null |
| 2024-12-24 | LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding | Hao Li et.al. | 2412.17635 | translate | read | null |
| 2024-12-25 | AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation | Jiaqi Ma et.al. | 2412.17601 | translate | read | link |
| 2024-12-24 | Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation | Jianjian Yin et.al. | 2412.17331 | translate | read | link |
| 2024-12-22 | Multi-Scale Foreground-Background Confidence for Out-of-Distribution Segmentation | Samuel Marschall et.al. | 2412.16990 | translate | read | null |
| 2024-12-22 | Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection | Yuhang Gan et.al. | 2412.16918 | translate | read | null |
| 2024-12-22 | MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection | Xu Zheng et.al. | 2412.16876 | translate | read | null |
| 2024-12-22 | Adversarial Diffusion Model for Unsupervised Domain-Adaptive Semantic Segmentation | Jongmin Yu et.al. | 2412.16859 | translate | read | null |
| 2024-12-21 | A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection | Shahid Ansari et.al. | 2412.16755 | translate | read | null |
| 2024-12-21 | IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks | Yaming Zhang et.al. | 2412.16654 | translate | read | link |
| 2024-12-21 | V”Mean”ba: Visual State Space Models only need 1 hidden dimension | Tien-Yu Chi et.al. | 2412.16602 | translate | read | null |
| 2024-12-20 | SegCol Challenge: Semantic Segmentation for Tools and Fold Edges in Colonoscopy data | Xinwei Ju et.al. | 2412.16078 | translate | read | null |
| 2024-12-20 | Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer | Xinyue Chen et.al. | 2412.15835 | translate | read | link |
| 2024-12-19 | MultiverSeg: Scalable Interactive Segmentation of Biomedical Imaging Datasets with In-Context Guidance | Hallee E. Wong et.al. | 2412.15058 | translate | read | link |
| 2024-12-19 | GIRAFE: Glottal Imaging Dataset for Advanced Segmentation, Analysis, and Facilitative Playbacks Evaluation | G. Andrade-Miranda et.al. | 2412.15054 | translate | read | link |
| 2024-12-19 | PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation | Shoumeng Qiu et.al. | 2412.14821 | translate | read | link |
| 2024-12-19 | Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers | Rui Ding et.al. | 2412.14633 | translate | read | null |
| 2024-12-19 | Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation | Zhenxin Lei et.al. | 2412.14587 | translate | read | null |
| 2024-12-18 | Split Learning in Computer Vision for Semantic Segmentation Delay Minimization | Nikos G. Evgenidis et.al. | 2412.14272 | translate | read | null |
| 2024-12-18 | Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation | Jianyu Zhang et.al. | 2412.14145 | translate | read | null |
| 2024-12-18 | Prompt Categories Cluster for Weakly Supervised Semantic Segmentation | Wangyu Wu et.al. | 2412.13823 | translate | read | null |
| 2024-12-18 | Federated Source-free Domain Adaptation for Classification: Weighted Cluster Aggregation for Unlabeled Data | Junki Mori et.al. | 2412.13757 | translate | read | null |
| 2024-12-18 | Optical aberrations in autonomous driving: Physics-informed parameterized temperature scaling for neural network uncertainty calibration | Dominik Werner Wolf et.al. | 2412.13695 | translate | read | null |
| 2024-12-18 | GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting | Yuning Peng et.al. | 2412.13654 | translate | read | link |
| 2024-12-18 | RelationField: Relate Anything in Radiance Fields | Sebastian Koch et.al. | 2412.13652 | translate | read | null |
| 2024-12-17 | S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging | Yimu Pan et.al. | 2412.13156 | translate | read | null |
| 2024-12-17 | Efficient Event-based Semantic Segmentation with Spike-driven Lightweight Transformer-based Networks | Xiaxin Zhu et.al. | 2412.12843 | translate | read | null |
| 2024-12-17 | ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation | Shiqi Huang et.al. | 2412.12798 | translate | read | link |
| 2024-12-17 | Open-World Panoptic Segmentation | Matteo Sodano et.al. | 2412.12740 | translate | read | null |
| 2024-12-17 | SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing | Chen Chen et.al. | 2412.12685 | translate | read | link |
| 2024-12-17 | Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation | Dongyue Wu et.al. | 2412.12672 | translate | read | link |
| 2024-12-17 | Adaptive Prototype Replay for Class Incremental Semantic Segmentation | Guilin Zhu et.al. | 2412.12669 | translate | read | null |
| 2024-12-17 | SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation | Shuangping Huang et.al. | 2412.12660 | translate | read | null |
| 2024-12-16 | Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation | Hongwei Niu et.al. | 2412.12050 | translate | read | link |
| 2024-12-16 | SAMIC: Segment Anything with In-Context Spatial Prompt Engineering | Savinay Nagendra et.al. | 2412.11998 | translate | read | null |
| 2024-12-16 | SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation | Yunxiang Fu et.al. | 2412.11890 | translate | read | link |
| 2024-12-16 | Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic Segmentation | Svetlana Pavlitska et.al. | 2412.11608 | translate | read | null |
| 2024-12-16 | PyPotteryLens: An Open-Source Deep Learning Framework for Automated Digitisation of Archaeological Pottery Documentation | Lorenzo Cardarelli et.al. | 2412.11574 | translate | read | null |
| 2024-12-15 | Volumetric Mapping with Panoptic Refinement via Kernel Density Estimation for Mobile Robots | Khang Nguyen et.al. | 2412.11241 | translate | read | link |
| 2024-12-15 | MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation | Zhiwei Yang et.al. | 2412.11076 | translate | read | link |
| 2024-12-15 | Classification Drives Geographic Bias in Street Scene Segmentation | Rahul Nair et.al. | 2412.11061 | translate | read | null |
| 2024-12-15 | SAM-IF: Leveraging SAM for Incremental Few-Shot Instance Segmentation | Xudong Zhou et.al. | 2412.11034 | translate | read | null |
| 2024-12-14 | RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone | Mustafa Munir et.al. | 2412.10995 | translate | read | link |
| 2024-12-13 | A Universal Degradation-based Bridging Technique for Domain Adaptive Semantic Segmentation | Wangkai Li et.al. | 2412.10339 | translate | read | null |
| 2024-12-13 | SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians | Siyun Liang et.al. | 2412.10231 | translate | read | null |
| 2024-12-13 | SPT: Sequence Prompt Transformer for Interactive Image Segmentation | Senlin Cheng et.al. | 2412.10224 | translate | read | null |
| 2024-12-13 | TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views | Liang Zhao et.al. | 2412.10051 | translate | read | null |
| 2024-12-13 | Object-Focused Data Selection for Dense Prediction Tasks | Niclas Popp et.al. | 2412.10032 | translate | read | null |
| 2024-12-12 | MaskTerial: A Foundation Model for Automated 2D Material Flake Detection | Jan-Lucas Uslu et.al. | 2412.09333 | translate | read | null |
| 2024-12-12 | Towards Open-Vocabulary Video Semantic Segmentation | Xinhao Li et.al. | 2412.09329 | translate | read | null |
| 2024-12-12 | FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation | Yuntian Bo et.al. | 2412.09319 | translate | read | link |
| 2024-12-12 | VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation | Roberto Alcover-Couso et.al. | 2412.09240 | translate | read | null |
| 2024-12-12 | STEAM: Squeeze and Transform Enhanced Attention Module | Rishabh Sabharwal et.al. | 2412.09023 | translate | read | null |
| 2024-12-11 | SegFace: Face Segmentation of Long-Tail Classes | Kartik Narayan et.al. | 2412.08647 | translate | read | link |
| 2024-12-11 | EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation | Hongwei Niu et.al. | 2412.08628 | translate | read | null |
| 2024-12-12 | Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning | Fan Lu et.al. | 2412.08614 | translate | read | link |
| 2024-12-11 | Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion | Bingzhi Shen et.al. | 2412.08315 | translate | read | null |
| 2024-12-11 | Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction | Bohan Li et.al. | 2412.08243 | translate | read | null |
| 2024-12-11 | THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots | Zeshun Li et.al. | 2412.08096 | translate | read | null |
| 2024-12-11 | Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation | Zhigang Cen et.al. | 2412.08034 | translate | read | null |
| 2024-12-10 | Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation | Kurt H. W. Stolle et.al. | 2412.07966 | translate | read | link |
| 2024-12-11 | CADSpotting: Robust Panoptic Symbol Spotting on Large-Scale CAD Drawings | Jiazuo Mu et.al. | 2412.07377 | translate | read | null |
| 2024-12-09 | SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception | Yaniv Benny et.al. | 2412.06968 | translate | read | null |
| 2024-12-10 | ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet | Andrei-Robert Alexandrescu et.al. | 2412.06742 | translate | read | null |
| 2024-12-09 | Active Learning with Context Sampling and One-vs-Rest Entropy for Semantic Segmentation | Fei Wu et.al. | 2412.06470 | translate | read | null |
| 2024-12-09 | Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework | Jiuyi Xu et.al. | 2412.06268 | translate | read | null |
| 2024-12-09 | GCUNet: A GNN-Based Contextual Learning Network for Tertiary Lymphoid Structure Semantic Segmentation in Whole Slide Image | Lei Su et.al. | 2412.06129 | translate | read | null |
| 2024-12-08 | Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation | Zipeng Qi et.al. | 2412.05969 | translate | read | null |
| 2024-12-08 | CSG: A Context-Semantic Guided Diffusion Approach in De Novo Musculoskeletal Ultrasound Image Generation | Elay Dahan et.al. | 2412.05833 | translate | read | null |
| 2024-12-07 | Integrating YOLO11 and Convolution Block Attention Module for Multi-Season Segmentation of Tree Trunks and Branches in Commercial Apple Orchards | Ranjan Sapkota et.al. | 2412.05728 | translate | read | null |
| 2024-12-10 | RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts | Xu Liu et.al. | 2412.05679 | translate | read | link |
| 2024-12-06 | FogROS2-FT: Fault Tolerant Cloud Robotics | Kaiyuan Chen et.al. | 2412.05408 | translate | read | null |
| 2024-12-06 | DreamColour: Controllable Video Colour Editing without Training | Chaitat Utintu et.al. | 2412.05180 | translate | read | null |
| 2024-12-05 | Assessing and Learning Alignment of Unimodal Vision and Language Models | Le Zhang et.al. | 2412.04616 | translate | read | link |
| 2024-12-05 | Towards Real-Time Open-Vocabulary Video Instance Segmentation | Bin Yan et.al. | 2412.04434 | translate | read | null |
| 2024-12-05 | A Hitchhiker’s Guide to Understanding Performances of Two-Class Classifiers | Anaïs Halin et.al. | 2412.04377 | translate | read | null |
| 2024-12-05 | Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts | Chenyang Zhu et.al. | 2412.04220 | translate | read | null |
| 2024-12-05 | Text Change Detection in Multilingual Documents Using Image Comparison | Doyoung Park et.al. | 2412.04137 | translate | read | null |
| 2024-12-05 | SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning | Seokju Yun et.al. | 2412.04077 | translate | read | null |
| 2024-12-05 | Quality Control in Open-Ended Crowdsourcing: A Survey | Lei Chai et.al. | 2412.03991 | translate | read | null |
| 2024-12-05 | Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation | Hao Zhu et.al. | 2412.03968 | translate | read | link |
| 2024-12-05 | LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model | Yuan Xue et.al. | 2412.03841 | translate | read | null |
| 2024-12-04 | Designing DNNs for a trade-off between robustness and processing performance in embedded devices | Jon Gutiérrez-Zaballa et.al. | 2412.03682 | translate | read | null |
| 2024-12-04 | FLAIR: VLM with Fine-grained Language-informed Image Representations | Rui Xiao et.al. | 2412.03561 | translate | read | link |
| 2024-12-04 | Benchmarking Pretrained Attention-based Models for Real-Time Recognition in Robot-Assisted Esophagectomy | Ronald L. P. D. de Jong et.al. | 2412.03401 | translate | read | null |
| 2024-12-04 | Task-driven Image Fusion with Learnable Fusion Loss | Haowen Bai et.al. | 2412.03240 | translate | read | null |
| 2024-12-04 | Biologically-inspired Semi-supervised Semantic Segmentation for Biomedical Imaging | Luca Ciampi et.al. | 2412.03192 | translate | read | null |
| 2024-12-04 | Is Foreground Prototype Sufficient? Few-Shot Medical Image Segmentation with Background-Fused Prototype | Song Tang et.al. | 2412.02983 | translate | read | null |
| 2024-12-04 | Progressive Vision-Language Prompt for Multi-Organ Multi-Class Cell Semantic Segmentation with Single Branch | Qing Zhang et.al. | 2412.02978 | translate | read | null |
| 2024-12-04 | Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution | Jiahua Xiao et.al. | 2412.02960 | translate | read | null |
| 2024-12-04 | Panoptic Diffusion Models: co-generation of images and segmentation maps | Yinghan Long et.al. | 2412.02929 | translate | read | null |
| 2024-12-03 | SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection | Joongwon Chae et.al. | 2412.02565 | translate | read | null |
| 2024-12-03 | Multi-scale and Multi-path Cascaded Convolutional Network for Semantic Segmentation of Colorectal Polyps | Malik Abdul Manan et.al. | 2412.02443 | translate | read | null |
| 2024-12-03 | AH-OCDA: Amplitude-based Curriculum Learning and Hopfield Segmentation Model for Open Compound Domain Adaptation | Jaehyun Choi et.al. | 2412.02280 | translate | read | null |
| 2024-12-03 | Vision Transformers for Weakly-Supervised Microorganism Enumeration | Javier Ureña Santiago et.al. | 2412.02250 | translate | read | link |
| 2024-12-03 | Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance | Jing Zeng et.al. | 2412.02249 | translate | read | null |
| 2024-12-02 | INSIGHT: Explainable Weakly-Supervised Medical Image Analysis | Wenbo Zhang et.al. | 2412.02012 | translate | read | null |
| 2024-12-02 | Global Average Feature Augmentation for Robust Semantic Segmentation with Transformers | Alberto Gonzalo Rodriguez Salgado et.al. | 2412.01941 | translate | read | null |
| 2024-12-02 | COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training | Sanghwan Kim et.al. | 2412.01814 | translate | read | null |
| 2024-12-02 | Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior | Yi Yu et.al. | 2412.01646 | translate | read | null |
| 2024-12-02 | Epipolar Attention Field Transformers for Bird’s Eye View Semantic Segmentation | Christian Witte et.al. | 2412.01595 | translate | read | null |
(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)