Semantic Segmentation - 2024-08

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-08-30	Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes	Li Zhang et.al.	2408.17421	translate	read	link
2024-08-30	Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations	Ahmed Hammam et.al.	2408.17311	translate	read	null
2024-08-30	Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training	Zizheng Huang et.al.	2408.17081	translate	read	link
2024-08-30	Transient Fault Tolerant Semantic Segmentation for Autonomous Driving	Leonardo Iurada et.al.	2408.16952	translate	read	link
2024-08-29	Eigen-Cluster VIS: Improving Weakly-supervised Video Instance Segmentation by Leveraging Spatio-temporal Consistency	Farnoosh Arefi et.al.	2408.16661	translate	read	link
2024-08-29	SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection	Rohit Venkata Sai Dulam et.al.	2408.16645	translate	read	null
2024-08-29	A Simple and Generalist Approach for Panoptic Segmentation	Nedyalko Prisadnikov et.al.	2408.16504	translate	read	null
2024-08-29	MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation	Linyan Yang et.al.	2408.16478	translate	read	null
2024-08-29	Multi-source Domain Adaptation for Panoramic Semantic Segmentation	Jing Jiang et.al.	2408.16469	translate	read	null
2024-08-29	EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More	Kanghao Chen et.al.	2408.16254	translate	read	null
2024-08-28	InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentation	Thibaut Goldsborough et.al.	2408.15954	translate	read	link
2024-08-28	SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors	Zhiqing Zhang et.al.	2408.15887	translate	read	null
2024-08-28	DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries	Yu Yang et.al.	2408.15813	translate	read	null
2024-08-28	TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation	Junbao Zhou et.al.	2408.15657	translate	read	link
2024-08-27	Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images	Silvia Seidlitz et.al.	2408.15373	translate	read	link
2024-08-27	An Investigation on The Position Encoding in Vision-Based Dynamics Prediction	Jiageng Zhu et.al.	2408.15201	translate	read	null
2024-08-27	Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation	Elona Shatri et.al.	2408.15002	translate	read	null
2024-08-27	Applying ViT in Generalized Few-shot Semantic Segmentation	Liyuan Geng et.al.	2408.14957	translate	read	link
2024-08-27	Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack	Naufal Suryanto et.al.	2408.14879	translate	read	null
2024-08-27	MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation	Yuanbing Zhu et.al.	2408.14776	translate	read	null
2024-08-26	Physically Feasible Semantic Segmentation	Shamik Basu et.al.	2408.14672	translate	read	link
2024-08-26	A Survey of Camouflaged Object Detection and Beyond	Fengyang Xiao et.al.	2408.14562	translate	read	null
2024-08-26	Satellite Sunroof: High-res Digital Surface Models and Roof Segmentation for Global Solar Mapping	Vishal Batchu et.al.	2408.14400	translate	read	null
2024-08-25	OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation	Muhammad Rameez ur Rahman et.al.	2408.13936	translate	read	link
2024-08-25	Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation	Yuwen Pan et.al.	2408.13838	translate	read	null
2024-08-25	TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather	Xiongwei Zhao et.al.	2408.13802	translate	read	link
2024-08-25	ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation	Xin Zhang et.al.	2408.13771	translate	read	null
2024-08-25	Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation	Zhaoyang Li et.al.	2408.13752	translate	read	null
2024-08-24	ESA: Annotation-Efficient Active Learning for Semantic Segmentation	Jinchao Ge et.al.	2408.13491	translate	read	link
2024-08-23	Accuracy Improvement of Cell Image Segmentation Using Feedback Former	Hinako Mitsuoka et.al.	2408.12974	translate	read	null
2024-08-23	Image Segmentation in Foundation Model Era: A Survey	Tianfei Zhou et.al.	2408.12957	translate	read	null
2024-08-23	Symmetric masking strategy enhances the performance of Masked Image Modeling	Khanh-Binh Nguyen et.al.	2408.12772	translate	read	null
2024-08-22	Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets	Wolfgang Boettcher et.al.	2408.12489	translate	read	null
2024-08-22	The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation	Tuyen Tran et.al.	2408.12447	translate	read	null
2024-08-22	ISETHDR: A Physics-based Synthetic Radiance Dataset for High Dynamic Range Driving Scenes	Zhenyi Liu et.al.	2408.12048	translate	read	link
2024-08-21	EmbodiedSAM: Online Segment Any 3D Thing in Real Time	Xiuwei Xu et.al.	2408.11811	translate	read	null
2024-08-21	NuSegDG: Integration of Heterogeneous Space and Gaussian Kernel for Domain-Generalized Nuclei Segmentation	Zhenye Lou et.al.	2408.11787	translate	read	link
2024-08-21	Open-Ended 3D Point Cloud Instance Segmentation	Phuc D. A. Nguyen et.al.	2408.11747	translate	read	null
2024-08-21	UNetMamba: Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images	Enze Zhu et.al.	2408.11545	translate	read	null
2024-08-22	SAM-REF: Rethinking Image-Prompt Synergy for Refinement in Segment Anything	Chongkai Yu et.al.	2408.11535	translate	read	null
2024-08-21	Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation	Chuandong Liu et.al.	2408.11280	translate	read	null
2024-08-20	An Interpretable Deep Learning Approach for Morphological Script Type Analysis	Malamatenia Vlachou-Efstathiou et.al.	2408.11150	translate	read	null
2024-08-20	NeCo: Improving DINOv2’s spatial representations in 19 GPU hours with Patch Neighbor Consistency	Valentinos Pariza et.al.	2408.11054	translate	read	null
2024-08-20	CO2Wounds-V2: Extended Chronic Wounds Dataset From Leprosy Patients	Karen Sanchez et.al.	2408.10827	translate	read	null
2024-08-20	Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant	Guofeng Mei et.al.	2408.10652	translate	read	null
2024-08-20	Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?	Chen Liang et.al.	2408.10627	translate	read	null
2024-08-20	Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation	Jiawei Han et.al.	2408.10537	translate	read	link
2024-08-21	LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS	Xinyu Liu et.al.	2408.10469	translate	read	null
2024-08-19	Leveraging Superfluous Information in Contrastive Representation Learning	Xuechu Yu et.al.	2408.10292	translate	read	null
2024-08-19	Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network	Rasha Alshawi et.al.	2408.10181	translate	read	null
2024-08-19	Dynamic Label Injection for Imbalanced Industrial Defect Segmentation	Emanuele Caruso et.al.	2408.10031	translate	read	link
2024-08-19	Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis	Kira Maag et.al.	2408.10021	translate	read	null
2024-08-19	DiscoNeRF: Class-Agnostic Object Field for 3D Object Discovery	Corentin Dumery et.al.	2408.09928	translate	read	null
2024-08-19	3D-Aware Instance Segmentation and Tracking in Egocentric Videos	Yash Bhalgat et.al.	2408.09860	translate	read	null
2024-08-19	Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving	Jun Yan et.al.	2408.09839	translate	read	link
2024-08-18	OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras	Muhammad Rameez Ur Rahman et.al.	2408.09424	translate	read	link
2024-08-18	VrdONE: One-stage Video Visual Relation Detection	Xinjie Jiang et.al.	2408.09408	translate	read	link
2024-08-18	Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration	Hao Ai et.al.	2408.09336	translate	read	null
2024-08-17	Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology	Junchao Zhu et.al.	2408.09278	translate	read	link
2024-08-16	Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation	Tri Ton et.al.	2408.08591	translate	read	null
2024-08-16	Tuning a SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation	Linghao Zheng et.al.	2408.08576	translate	read	null
2024-08-16	Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs	Jinming Liu et.al.	2408.08575	translate	read	null
2024-08-15	5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks	Dongshuo Yin et.al.	2408.08345	translate	read	link
2024-08-14	MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis	Nimeesha Chan et.al.	2408.07773	translate	read	link
2024-08-15	MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation	Beoungwoo Kang et.al.	2408.07576	translate	read	link
2024-08-15	MagicFace: Training-free Universal-Style Human Image Customized Synthesis	Yibin Wang et.al.	2408.07433	translate	read	null
2024-08-14	Segment Using Just One Example	Pratik Vora et.al.	2408.07393	translate	read	null
2024-08-14	Ensemble architecture in polyp segmentation	Hao-Yun Hsu et.al.	2408.07262	translate	read	link
2024-08-14	Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks	Raghavendra Singh et.al.	2408.07243	translate	read	null
2024-08-14	Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training	Ethan Kou et.al.	2408.07239	translate	read	null
2024-08-13	ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation	Jingyun Wang et.al.	2408.06747	translate	read	link
2024-08-10	Dilated Convolution with Learnable Spacings	Ismail Khalfaoui-Hassani et.al.	2408.06383	translate	read	null
2024-08-12	Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images	Siladittya Manna et.al.	2408.06235	translate	read	null
2024-08-12	A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting	Felix Assion et.al.	2408.06071	translate	read	null
2024-08-13	ClickAttention: Click Region Similarity Guided Interactive Segmentation	Long Xu et.al.	2408.06021	translate	read	null
2024-08-12	Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning	Xinrong Hu et.al.	2408.05889	translate	read	null
2024-08-11	Seg-CycleGAN : SAR-to-optical image translation guided by a downstream task	Hannuo Zhang et.al.	2408.05777	translate	read	null
2024-08-11	MacFormer: Semantic Segmentation with Fine Object Boundaries	Guoan Xu et.al.	2408.05699	translate	read	null
2024-08-13	Performance Evaluation of YOLOv8 Model Configurations, for Instance Segmentation of Strawberry Fruit Development Stages in an Open Field Environment	Abdul-Razak Alhassan Gamani et.al.	2408.05661	translate	read	null
2024-08-10	Multimodal generative semantic communication based on latent diffusion model	Weiqi Fu et.al.	2408.05455	translate	read	null
2024-08-09	PRISM Lite: A lightweight model for interactive 3D placenta segmentation in ultrasound	Hao Li et.al.	2408.05372	translate	read	link
2024-08-09	In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation	Dahyun Kang et.al.	2408.04961	translate	read	link
2024-08-09	ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation	Mengcheng Lan et.al.	2408.04883	translate	read	link
2024-08-09	Extracting Signal Electron Trajectories in the COMET Phase-I Cylindrical Drift Chamber Using Deep Learning	Fumihiro Kaneko et.al.	2408.04795	translate	read	null
2024-08-08	Embodied Uncertainty-Aware Object Segmentation	Xiaolin Fang et.al.	2408.04760	translate	read	null
2024-08-08	SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation	Jieming Yu et.al.	2408.04593	translate	read	null
2024-08-08	Robust Approximate Characterization of Single-Cell Heterogeneity in Microbial Growth	Richard D. Paul et.al.	2408.04501	translate	read	link
2024-08-08	SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios	Sriram Mandalika et.al.	2408.04482	translate	read	null
2024-08-08	What could go wrong? Discovering and describing failure modes in computer vision	Gabriela Csurka et.al.	2408.04471	translate	read	null
2024-08-07	Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation	Yiqing Shen et.al.	2408.04098	translate	read	null
2024-08-07	CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications	Tianfang Zhang et.al.	2408.03703	translate	read	link
2024-08-07	SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology	Mingya Zhang et.al.	2408.03651	translate	read	link
2024-08-06	Post-Mortem Human Iris Segmentation Analysis with Deep Learning	Afzal Hossain et.al.	2408.03448	translate	read	null
2024-08-06	Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression	Jonas Schmitt et.al.	2408.03046	translate	read	link
2024-08-06	Evaluation of Segment Anything Model 2: The Role of SAM2 in the Underwater Environment	Shijie Lian et.al.	2408.02924	translate	read	link
2024-08-05	Scribble-Based Interactive Segmentation of Medical Hyperspectral Images	Zhonghao Wang et.al.	2408.02708	translate	read	null
2024-08-05	Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation	Sai Prasanna et.al.	2408.02297	translate	read	null
2024-08-05	Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs	Jeongkee Lim et.al.	2408.02261	translate	read	null
2024-08-05	Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders	Muhammad Abdullah Jamal et.al.	2408.02245	translate	read	null
2024-08-04	Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation	Ye Du et.al.	2408.02039	translate	read	null
2024-08-03	NuLite – Lightweight and Fast Model for Nuclei Instance Segmentation and Classification	Cristian Tommasino et.al.	2408.01797	translate	read	null
2024-08-03	Bayesian Active Learning for Semantic Segmentation	Sima Didari et.al.	2408.01694	translate	read	null
2024-08-03	A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection	Omkar Oak et.al.	2408.01692	translate	read	null
2024-08-03	Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation	Balázs Opra et.al.	2408.01640	translate	read	null
2024-08-02	Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans	Lukas Kratochvila et.al.	2408.01526	translate	read	null
2024-08-02	Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation	Yuanzhi Su et.al.	2408.01356	translate	read	null
2024-08-02	StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation	Bingyu Li et.al.	2408.01343	translate	read	null
2024-08-02	Amodal Segmentation for Laparoscopic Surgery Video Instruments	Ruohua Shi et.al.	2408.01067	translate	read	null
2024-08-02	Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach	Yabin Zhu et.al.	2408.00969	translate	read	null
2024-08-01	Medical SAM 2: Segment medical images as video via Segment Anything Model 2	Jiayuan Zhu et.al.	2408.00874	translate	read	link
2024-08-01	Leaf Angle Estimation using Mask R-CNN and LETR Vision Transformer	Venkat Margapuri et.al.	2408.00749	translate	read	null
2024-08-01	Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation	Siyu Jiao et.al.	2408.00744	translate	read	link
2024-08-01	Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function	Matias Oscar Volman Stern et.al.	2408.00707	translate	read	null
2024-08-01	AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation	Asbjørn Munk et.al.	2408.00640	translate	read	null
2024-08-01	SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation	Shengbo Tan et.al.	2408.00496	translate	read	null
2024-08-01	A Simple Background Augmentation Method for Object Detection with Diffusion Model	Yuhang Li et.al.	2408.00350	translate	read	null

(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)