Semantic Segmentation - 2024-11

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-11-29	LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention	Zewen Du et.al.	2411.19585	translate	read	link
2024-11-29	Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding	Wenbo Zhang et.al.	2411.19551	translate	read	null
2024-11-29	Retrieval-guided Cross-view Image Synthesis	Hongji Yang et.al.	2411.19510	translate	read	null
2024-11-29	Adaptive Interactive Segmentation for Multimodal Medical Imaging via Selection Engine	Zhi Li et.al.	2411.19447	translate	read	link
2024-11-28	GMS-VINS:Multi-category Dynamic Objects Semantic Segmentation for Enhanced Visual-Inertial Odometry Using a Promptable Foundation Model	Rui Zhou et.al.	2411.19289	translate	read	null
2024-11-28	InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception	Haijie Li et.al.	2411.19235	translate	read	null
2024-11-28	MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers	Jongseong Bae et.al.	2411.18995	translate	read	null
2024-11-28	Textured As-Is BIM via GIS-informed Point Cloud Segmentation	Mohamed S. H. Alabassy et.al.	2411.18898	translate	read	null
2024-11-27	The Last Mile to Supervised Performance: Semi-Supervised Domain Adaptation for Semantic Segmentation	Daniel Morales-Brotons et.al.	2411.18728	translate	read	null
2024-11-27	HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior	Li-Yuan Tsao et.al.	2411.18662	translate	read	link
2024-11-26	Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation	Sudarshan Rajagopalan et.al.	2411.17814	translate	read	null
2024-11-26	Efficient Multi-modal Large Language Models via Visual Token Grouping	Minbin Huang et.al.	2411.17773	translate	read	null
2024-11-26	Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation	Niharika Hegde et.al.	2411.17610	translate	read	null
2024-11-26	A Bilayer Segmentation-Recombination Network for Accurate Segmentation of Overlapping C. elegans	Mengqian Dinga et.al.	2411.17557	translate	read	null
2024-11-26	Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving	Jon Gutiérrez-Zaballa et.al.	2411.17543	translate	read	null
2024-11-26	Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning	Hoàng-Ân Lê et.al.	2411.17536	translate	read	link
2024-11-26	TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba	Xiaowen Ma et.al.	2411.17473	translate	read	link
2024-11-26	Self-supervised Video Instance Segmentation Can Boost Geographic Entity Alignment in Historical Maps	Xue Xia et.al.	2411.17425	translate	read	null
2024-11-26	MRIFE: A Mask-Recovering and Interactive-Feature-Enhancing Semantic Segmentation Network For Relic Landslide Detection	Juefei He et.al.	2411.17167	translate	read	null
2024-11-26	Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation	Chanyoung Kim et.al.	2411.17150	translate	read	null
2024-11-26	ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction	Chang Li et.al.	2411.17088	translate	read	null
2024-11-26	SCASeg: Strip Cross-Attention for Efficient Semantic Segmentation	Guoan Xu et.al.	2411.17061	translate	read	null
2024-11-25	Deformable Mamba for Wide Field of View Segmentation	Jie Hu et.al.	2411.16481	translate	read	link
2024-11-25	A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models	Manuel Schwonberg et.al.	2411.16407	translate	read	null
2024-11-25	CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation	Leon Sick et.al.	2411.16319	translate	read	null
2024-11-25	An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models	Wentao Qu et.al.	2411.16308	translate	read	null
2024-11-25	A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads	Rafael S. Toledo et.al.	2411.16295	translate	read	null
2024-11-25	Weakly supervised image segmentation for defect-based grading of fresh produce	Manuel Knott et.al.	2411.16219	translate	read	null
2024-11-25	Learn from Foundation Model: Fruit Detection Model without Manual Annotation	Yanan Wang et.al.	2411.16196	translate	read	null
2024-11-25	Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking	Phuc Nguyen et.al.	2411.16183	translate	read	null
2024-11-25	Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training	Man Yao et.al.	2411.16061	translate	read	link
2024-11-24	Deep Learning for automated multi-scale functional field boundaries extraction using multi-date Sentinel-2 and PlanetScope imagery: Case Study of Netherlands and Pakistan	Saba Zahid et.al.	2411.15923	translate	read	null
2024-11-22	Effective SAM Combination for Open-Vocabulary Semantic Segmentation	Minhyeok Lee et.al.	2411.14723	translate	read	null
2024-11-21	Revisiting the Integration of Convolution and Attention for Vision Backbone	Lei Zhu et.al.	2411.14429	translate	read	link
2024-11-21	CompetitorFormer: Competitor Transformer for 3D Instance Segmentation	Duanchu Wang et.al.	2411.14179	translate	read	null
2024-11-21	CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation	Lin Sun et.al.	2411.13836	translate	read	link
2024-11-21	Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals	Hussni Mohd Zakir et.al.	2411.13774	translate	read	null
2024-11-20	FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting	Ola Shorinwa et.al.	2411.13753	translate	read	null
2024-11-20	DIS-Mine: Instance Segmentation for Disaster-Awareness in Poor-Light Condition in Underground Mines	Mizanur Rahman Jewel et.al.	2411.13544	translate	read	null
2024-11-21	Entropy Bootstrapping for Weakly Supervised Nuclei Detection	James Willoughby et.al.	2411.13528	translate	read	null
2024-11-20	BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation	Umamaheswaran Raman Kumar et.al.	2411.13251	translate	read	null
2024-11-20	XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation	Ziyi Wang et.al.	2411.13243	translate	read	link
2024-11-20	Automating Sonologists USG Commands with AI and Voice Interface	Emad Mohamed et.al.	2411.13006	translate	read	null
2024-11-19	Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline	Junlong Cheng et.al.	2411.12814	translate	read	link
2024-11-19	A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation	Jiaqi Yang et.al.	2411.12615	translate	read	link
2024-11-19	SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation	Ron Keuth et.al.	2411.12602	translate	read	link
2024-11-19	ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator	Xiao Jiang et.al.	2411.12250	translate	read	null
2024-11-18	ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements	M. Arda Aydın et.al.	2411.12044	translate	read	link
2024-11-18	Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation	Hanieh Shojaei Miandashti et.al.	2411.11935	translate	read	null
2024-11-18	MGNiceNet: Unified Monocular Geometric Scene Understanding	Markus Schön et.al.	2411.11466	translate	read	null
2024-11-18	MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models	Harshita Sharma et.al.	2411.11362	translate	read	null
2024-11-18	Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications	Scarlett Raine et.al.	2411.11287	translate	read	null
2024-11-18	Zero-Shot Automatic Annotation and Instance Segmentation using LLM-Generated Datasets: Eliminating Field Imaging and Manual Annotation for Deep Learning Model Development	Ranjan Sapkota et.al.	2411.11285	translate	read	null
2024-11-16	Attention-based U-Net Method for Autonomous Lane Detection	Mohammadhamed Tangestanizadeh et.al.	2411.10902	translate	read	null
2024-11-16	Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation	Jaisidh Singh et.al.	2411.10845	translate	read	null
2024-11-16	Diffusion-Based Semantic Segmentation of Lumbar Spine MRI Scans of Lower Back Pain Patients	Maria Monzon et.al.	2411.10755	translate	read	null
2024-11-15	Repurposing Stable Diffusion Attention for Training-Free Unsupervised Interactive Segmentation	Markus Karmann et.al.	2411.10411	translate	read	null
2024-11-15	Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images	Ammar Qammaz et.al.	2411.10334	translate	read	null
2024-11-15	RETR: Multi-View Radar Detection Transformer for Indoor Perception	Ryoma Yataka et.al.	2411.10293	translate	read	null
2024-11-15	CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation	Dengke Zhang et.al.	2411.10086	translate	read	link
2024-11-14	OneNet: A Channel-Wise 1D Convolutional U-Net	Sanghyun Byun et.al.	2411.09838	translate	read	link
2024-11-14	Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks	Zengyi Yang et.al.	2411.09387	translate	read	null
2024-11-14	Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation	Yuheng Shi et.al.	2411.09219	translate	read	link
2024-11-14	Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery	Ashim Dahal et.al.	2411.09101	translate	read	link
2024-11-13	CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation	Xuming Zhang et.al.	2411.09023	translate	read	null
2024-11-14	Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation	Yangyang Li et.al.	2411.08756	translate	read	null
2024-11-13	Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model	Jun Xie et.al.	2411.08592	translate	read	null
2024-11-13	UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation	Chengyuan Zhang et.al.	2411.08569	translate	read	null
2024-11-13	Detection and classification of radio sources with deep learning	S. Riggi et.al.	2411.08519	translate	read	null
2024-11-12	Isometric Transformations for Image Augmentation in Mueller Matrix Polarimetry	Christopher Hahne et.al.	2411.07918	translate	read	link
2024-11-12	INTRABENCH: Interactive Radiological Benchmark	Constantin Ulrich et.al.	2411.07885	translate	read	null
2024-11-12	Horticultural Temporal Fruit Monitoring via 3D Instance Segmentation and Re-Identification using Point Clouds	Daniel Fusaro et.al.	2411.07799	translate	read	link
2024-11-12	Semantic segmentation on multi-resolution optical and microwave data using deep learning	Jai G Singla et.al.	2411.07581	translate	read	null
2024-11-12	GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting	Umangi Jain et.al.	2411.07555	translate	read	null
2024-11-11	Data-Centric Learning Framework for Real-Time Detection of Aiming Beam in Fluorescence Lifetime Imaging Guided Surgery	Mohamed Abul Hassan et.al.	2411.07395	translate	read	null
2024-11-11	SAMPart3D: Segment Any Part in 3D Objects	Yunhan Yang et.al.	2411.07184	translate	read	link
2024-11-11	SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation	Jiale Chen et.al.	2411.06991	translate	read	null
2024-11-11	Fast and Efficient Transformer-based Method for Bird’s Eye View Instance Prediction	Miguel Antunes-García et.al.	2411.06851	translate	read	link
2024-11-11	Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision	Yueyang Cang et.al.	2411.06727	translate	read	null
2024-11-10	Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments	Deegan Atha et.al.	2411.06632	translate	read	null
2024-11-09	Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing	Kaixuan Lu et.al.	2411.06091	translate	read	null
2024-11-08	Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model	Shuchang Lyu et.al.	2411.05878	translate	read	link
2024-11-08	Agricultural Landscape Understanding At Country-Scale	Radhika Dua et.al.	2411.05359	translate	read	null
2024-11-08	Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation	Sien Li et.al.	2411.05307	translate	read	link
2024-11-07	In the Era of Prompt Learning with Vision-Language Models	Ankit Jha et.al.	2411.04892	translate	read	null
2024-11-08	ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset	Olaf Wysocki et.al.	2411.04865	translate	read	link
2024-11-06	Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts	Zhitong Gao et.al.	2411.03829	translate	read	link
2024-11-06	SA3DIP: Segment Any 3D Instance with Potential 3D Priors	Xi Yang et.al.	2411.03819	translate	read	link
2024-11-06	Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model	Yansong Qu et.al.	2411.03672	translate	read	null
2024-11-05	Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation	Zhiling Yue et.al.	2411.03551	translate	read	null
2024-11-05	SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture	Andrew Heschl et.al.	2411.03505	translate	read	link
2024-11-05	Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need	Qishuai Wen et.al.	2411.03033	translate	read	link
2024-11-05	Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation	Xavier Timoneda et.al.	2411.02969	translate	read	null
2024-11-05	Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery	Mohammad Kakooei et.al.	2411.02935	translate	read	null
2024-11-05	CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation	Jinchao Ge et.al.	2411.02715	translate	read	null
2024-11-04	Deep Learning on 3D Semantic Segmentation: A Detailed Review	Thodoris Betsas et.al.	2411.02104	translate	read	null
2024-11-04	Tree level change detection over Ahmedabad city using very high resolution satellite images and Deep Learning	Jai G Singla et.al.	2411.02009	translate	read	null
2024-11-04	Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models	Sharat Agarwal et.al.	2411.01925	translate	read	null
2024-11-04	DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability	Bo Gao et.al.	2411.01819	translate	read	null
2024-11-04	Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations	Thanh Nguyen Canh et.al.	2411.01816	translate	read	null
2024-11-05	MSTA3D: Multi-scale Twin-attention for 3D Instance Segmentation	Duc Dang Trung Tran et.al.	2411.01781	translate	read	null
2024-11-03	PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation	Xinyu Xu et.al.	2411.01624	translate	read	null
2024-11-01	Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions	Lixiao Yang et.al.	2411.01039	translate	read	null
2024-11-01	Event-guided Low-light Video Semantic Segmentation	Zhen Yao et.al.	2411.00639	translate	read	null
2024-11-01	Automated Classification of Cell Shapes: A Comparative Evaluation of Shape Descriptors	Valentina Vadori et.al.	2411.00561	translate	read	null
