Semantic Segmentation - 2024-12

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-12-31	Exploiting Boundary Loss for the Hierarchical Panoptic Segmentation of Plants and Leaves	Madeleine Darbyshire et.al.	2501.00527	translate	read	link
2024-12-31	H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters	Pedram Fekri et.al.	2501.00514	translate	read	null
2024-12-31	A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images	Dawen Yu et.al.	2501.00360	translate	read	null
2024-12-31	PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM	Runnan Chen et.al.	2501.00352	translate	read	null
2024-12-31	OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies	Runnan Chen et.al.	2501.00326	translate	read	link
2024-12-30	HisynSeg: Weakly-Supervised Histopathological Image Segmentation via Image-Mixing Synthesis and Consistency Regularization	Zijie Fang et.al.	2412.20924	translate	read	link
2024-12-30	LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training	Fardin Ayar et.al.	2412.20881	translate	read	null
2024-12-29	Image Augmentation Agent for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2412.20439	translate	read	null
2024-12-27	Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP	Zhongxing Xu et.al.	2412.19650	translate	read	null
2024-12-27	An Actionable Hierarchical Scene Representation Enhancing Autonomous Inspection Missions in Unknown Environments	Vignesh Kottayam Viswanathan et.al.	2412.19582	translate	read	null
2024-12-27	Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation	Chengyang Ye et.al.	2412.19492	translate	read	link
2024-12-26	Impact of color and mixing proportion of synthetic point clouds on semantic segmentation	Shaojie Zhou et.al.	2412.19145	translate	read	null
2024-12-25	Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model	Yi-Chia Chen et.al.	2412.18917	translate	read	link
2024-12-24	AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction	Pufan Zou et.al.	2412.18255	translate	read	null
2024-12-25	VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis	Shicheng Yin et.al.	2412.18178	translate	read	link
2024-12-24	UniPLV: Towards Label-Efficient Open-World 3D Scene Understanding by Regional Visual Language Supervision	Yuru Wang et.al.	2412.18131	translate	read	null
2024-12-24	LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding	Hao Li et.al.	2412.17635	translate	read	null
2024-12-25	AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation	Jiaqi Ma et.al.	2412.17601	translate	read	link
2024-12-24	Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation	Jianjian Yin et.al.	2412.17331	translate	read	link
2024-12-22	Multi-Scale Foreground-Background Confidence for Out-of-Distribution Segmentation	Samuel Marschall et.al.	2412.16990	translate	read	null
2024-12-22	Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection	Yuhang Gan et.al.	2412.16918	translate	read	null
2024-12-22	MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection	Xu Zheng et.al.	2412.16876	translate	read	null
2024-12-22	Adversarial Diffusion Model for Unsupervised Domain-Adaptive Semantic Segmentation	Jongmin Yu et.al.	2412.16859	translate	read	null
2024-12-21	A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection	Shahid Ansari et.al.	2412.16755	translate	read	null
2024-12-21	IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks	Yaming Zhang et.al.	2412.16654	translate	read	link
2024-12-21	V"Mean"ba: Visual State Space Models only need 1 hidden dimension	Tien-Yu Chi et.al.	2412.16602	translate	read	null
2024-12-20	SegCol Challenge: Semantic Segmentation for Tools and Fold Edges in Colonoscopy data	Xinwei Ju et.al.	2412.16078	translate	read	null
2024-12-20	Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer	Xinyue Chen et.al.	2412.15835	translate	read	link
2024-12-19	MultiverSeg: Scalable Interactive Segmentation of Biomedical Imaging Datasets with In-Context Guidance	Hallee E. Wong et.al.	2412.15058	translate	read	link
2024-12-19	GIRAFE: Glottal Imaging Dataset for Advanced Segmentation, Analysis, and Facilitative Playbacks Evaluation	G. Andrade-Miranda et.al.	2412.15054	translate	read	link
2024-12-19	PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation	Shoumeng Qiu et.al.	2412.14821	translate	read	link
2024-12-19	Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers	Rui Ding et.al.	2412.14633	translate	read	null
2024-12-19	Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation	Zhenxin Lei et.al.	2412.14587	translate	read	null
2024-12-18	Split Learning in Computer Vision for Semantic Segmentation Delay Minimization	Nikos G. Evgenidis et.al.	2412.14272	translate	read	null
2024-12-18	Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation	Jianyu Zhang et.al.	2412.14145	translate	read	null
2024-12-18	Prompt Categories Cluster for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2412.13823	translate	read	null
2024-12-18	Federated Source-free Domain Adaptation for Classification: Weighted Cluster Aggregation for Unlabeled Data	Junki Mori et.al.	2412.13757	translate	read	null
2024-12-18	Optical aberrations in autonomous driving: Physics-informed parameterized temperature scaling for neural network uncertainty calibration	Dominik Werner Wolf et.al.	2412.13695	translate	read	null
2024-12-18	GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting	Yuning Peng et.al.	2412.13654	translate	read	link
2024-12-18	RelationField: Relate Anything in Radiance Fields	Sebastian Koch et.al.	2412.13652	translate	read	null
2024-12-17	S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging	Yimu Pan et.al.	2412.13156	translate	read	null
2024-12-17	Efficient Event-based Semantic Segmentation with Spike-driven Lightweight Transformer-based Networks	Xiaxin Zhu et.al.	2412.12843	translate	read	null
2024-12-17	ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation	Shiqi Huang et.al.	2412.12798	translate	read	link
2024-12-17	Open-World Panoptic Segmentation	Matteo Sodano et.al.	2412.12740	translate	read	null
2024-12-17	SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing	Chen Chen et.al.	2412.12685	translate	read	link
2024-12-17	Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation	Dongyue Wu et.al.	2412.12672	translate	read	link
2024-12-17	Adaptive Prototype Replay for Class Incremental Semantic Segmentation	Guilin Zhu et.al.	2412.12669	translate	read	null
2024-12-17	SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation	Shuangping Huang et.al.	2412.12660	translate	read	null
2024-12-16	Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation	Hongwei Niu et.al.	2412.12050	translate	read	link
2024-12-16	SAMIC: Segment Anything with In-Context Spatial Prompt Engineering	Savinay Nagendra et.al.	2412.11998	translate	read	null
2024-12-16	SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation	Yunxiang Fu et.al.	2412.11890	translate	read	link
2024-12-16	Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic Segmentation	Svetlana Pavlitska et.al.	2412.11608	translate	read	null
2024-12-16	PyPotteryLens: An Open-Source Deep Learning Framework for Automated Digitisation of Archaeological Pottery Documentation	Lorenzo Cardarelli et.al.	2412.11574	translate	read	null
2024-12-15	Volumetric Mapping with Panoptic Refinement via Kernel Density Estimation for Mobile Robots	Khang Nguyen et.al.	2412.11241	translate	read	link
2024-12-15	MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation	Zhiwei Yang et.al.	2412.11076	translate	read	link
2024-12-15	Classification Drives Geographic Bias in Street Scene Segmentation	Rahul Nair et.al.	2412.11061	translate	read	null
2024-12-15	SAM-IF: Leveraging SAM for Incremental Few-Shot Instance Segmentation	Xudong Zhou et.al.	2412.11034	translate	read	null
2024-12-14	RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone	Mustafa Munir et.al.	2412.10995	translate	read	link
2024-12-13	A Universal Degradation-based Bridging Technique for Domain Adaptive Semantic Segmentation	Wangkai Li et.al.	2412.10339	translate	read	null
2024-12-13	SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians	Siyun Liang et.al.	2412.10231	translate	read	null
2024-12-13	SPT: Sequence Prompt Transformer for Interactive Image Segmentation	Senlin Cheng et.al.	2412.10224	translate	read	null
2024-12-13	TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views	Liang Zhao et.al.	2412.10051	translate	read	null
2024-12-13	Object-Focused Data Selection for Dense Prediction Tasks	Niclas Popp et.al.	2412.10032	translate	read	null
2024-12-12	MaskTerial: A Foundation Model for Automated 2D Material Flake Detection	Jan-Lucas Uslu et.al.	2412.09333	translate	read	null
2024-12-12	Towards Open-Vocabulary Video Semantic Segmentation	Xinhao Li et.al.	2412.09329	translate	read	null
2024-12-12	FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation	Yuntian Bo et.al.	2412.09319	translate	read	link
2024-12-12	VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation	Roberto Alcover-Couso et.al.	2412.09240	translate	read	null
2024-12-12	STEAM: Squeeze and Transform Enhanced Attention Module	Rishabh Sabharwal et.al.	2412.09023	translate	read	null
2024-12-11	SegFace: Face Segmentation of Long-Tail Classes	Kartik Narayan et.al.	2412.08647	translate	read	link
2024-12-11	EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation	Hongwei Niu et.al.	2412.08628	translate	read	null
2024-12-12	Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning	Fan Lu et.al.	2412.08614	translate	read	link
2024-12-11	Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion	Bingzhi Shen et.al.	2412.08315	translate	read	null
2024-12-11	Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction	Bohan Li et.al.	2412.08243	translate	read	null
2024-12-11	THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots	Zeshun Li et.al.	2412.08096	translate	read	null
2024-12-11	Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation	Zhigang Cen et.al.	2412.08034	translate	read	null
2024-12-10	Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation	Kurt H. W. Stolle et.al.	2412.07966	translate	read	link
2024-12-11	CADSpotting: Robust Panoptic Symbol Spotting on Large-Scale CAD Drawings	Jiazuo Mu et.al.	2412.07377	translate	read	null
2024-12-09	SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception	Yaniv Benny et.al.	2412.06968	translate	read	null
2024-12-10	ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet	Andrei-Robert Alexandrescu et.al.	2412.06742	translate	read	null
2024-12-09	Active Learning with Context Sampling and One-vs-Rest Entropy for Semantic Segmentation	Fei Wu et.al.	2412.06470	translate	read	null
2024-12-09	Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework	Jiuyi Xu et.al.	2412.06268	translate	read	null
2024-12-09	GCUNet: A GNN-Based Contextual Learning Network for Tertiary Lymphoid Structure Semantic Segmentation in Whole Slide Image	Lei Su et.al.	2412.06129	translate	read	null
2024-12-08	Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation	Zipeng Qi et.al.	2412.05969	translate	read	null
2024-12-08	CSG: A Context-Semantic Guided Diffusion Approach in De Novo Musculoskeletal Ultrasound Image Generation	Elay Dahan et.al.	2412.05833	translate	read	null
2024-12-07	Integrating YOLO11 and Convolution Block Attention Module for Multi-Season Segmentation of Tree Trunks and Branches in Commercial Apple Orchards	Ranjan Sapkota et.al.	2412.05728	translate	read	null
2024-12-10	RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts	Xu Liu et.al.	2412.05679	translate	read	link
2024-12-06	FogROS2-FT: Fault Tolerant Cloud Robotics	Kaiyuan Chen et.al.	2412.05408	translate	read	null
2024-12-06	DreamColour: Controllable Video Colour Editing without Training	Chaitat Utintu et.al.	2412.05180	translate	read	null
2024-12-05	Assessing and Learning Alignment of Unimodal Vision and Language Models	Le Zhang et.al.	2412.04616	translate	read	link
2024-12-05	Towards Real-Time Open-Vocabulary Video Instance Segmentation	Bin Yan et.al.	2412.04434	translate	read	null
2024-12-05	A Hitchhiker’s Guide to Understanding Performances of Two-Class Classifiers	Anaïs Halin et.al.	2412.04377	translate	read	null
2024-12-05	Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts	Chenyang Zhu et.al.	2412.04220	translate	read	null
2024-12-05	Text Change Detection in Multilingual Documents Using Image Comparison	Doyoung Park et.al.	2412.04137	translate	read	null
2024-12-05	SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning	Seokju Yun et.al.	2412.04077	translate	read	null
2024-12-05	Quality Control in Open-Ended Crowdsourcing: A Survey	Lei Chai et.al.	2412.03991	translate	read	null
2024-12-05	Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation	Hao Zhu et.al.	2412.03968	translate	read	link
2024-12-05	LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model	Yuan Xue et.al.	2412.03841	translate	read	null
2024-12-04	Designing DNNs for a trade-off between robustness and processing performance in embedded devices	Jon Gutiérrez-Zaballa et.al.	2412.03682	translate	read	null
2024-12-04	FLAIR: VLM with Fine-grained Language-informed Image Representations	Rui Xiao et.al.	2412.03561	translate	read	link
2024-12-04	Benchmarking Pretrained Attention-based Models for Real-Time Recognition in Robot-Assisted Esophagectomy	Ronald L. P. D. de Jong et.al.	2412.03401	translate	read	null
2024-12-04	Task-driven Image Fusion with Learnable Fusion Loss	Haowen Bai et.al.	2412.03240	translate	read	null
2024-12-04	Biologically-inspired Semi-supervised Semantic Segmentation for Biomedical Imaging	Luca Ciampi et.al.	2412.03192	translate	read	null
2024-12-04	Is Foreground Prototype Sufficient? Few-Shot Medical Image Segmentation with Background-Fused Prototype	Song Tang et.al.	2412.02983	translate	read	null
2024-12-04	Progressive Vision-Language Prompt for Multi-Organ Multi-Class Cell Semantic Segmentation with Single Branch	Qing Zhang et.al.	2412.02978	translate	read	null
2024-12-04	Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution	Jiahua Xiao et.al.	2412.02960	translate	read	null
2024-12-04	Panoptic Diffusion Models: co-generation of images and segmentation maps	Yinghan Long et.al.	2412.02929	translate	read	null
2024-12-03	SJTU: Spatial judgments in multimodal models towards unified segmentation through coordinate detection	Joongwon Chae et.al.	2412.02565	translate	read	null
2024-12-03	Multi-scale and Multi-path Cascaded Convolutional Network for Semantic Segmentation of Colorectal Polyps	Malik Abdul Manan et.al.	2412.02443	translate	read	null
2024-12-03	AH-OCDA: Amplitude-based Curriculum Learning and Hopfield Segmentation Model for Open Compound Domain Adaptation	Jaehyun Choi et.al.	2412.02280	translate	read	null
2024-12-03	Vision Transformers for Weakly-Supervised Microorganism Enumeration	Javier Ureña Santiago et.al.	2412.02250	translate	read	link
2024-12-03	Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance	Jing Zeng et.al.	2412.02249	translate	read	null
2024-12-02	INSIGHT: Explainable Weakly-Supervised Medical Image Analysis	Wenbo Zhang et.al.	2412.02012	translate	read	null
2024-12-02	Global Average Feature Augmentation for Robust Semantic Segmentation with Transformers	Alberto Gonzalo Rodriguez Salgado et.al.	2412.01941	translate	read	null
2024-12-02	COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training	Sanghwan Kim et.al.	2412.01814	translate	read	null
2024-12-02	Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior	Yi Yu et.al.	2412.01646	translate	read	null
2024-12-02	Epipolar Attention Field Transformers for Bird’s Eye View Semantic Segmentation	Christian Witte et.al.	2412.01595	translate	read	null
