Semantic Segmentation - 2024-05

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-05-31	Uncertainty Quantification for Bird’s Eye View Semantic Segmentation: Methods and Benchmarks	Linlin Yu et.al.	2405.20986	translate	read	null
2024-05-31	Extreme Point Supervised Instance Segmentation	Hyeonjun Lee et.al.	2405.20729	translate	read	null
2024-05-31	Revisiting and Maximizing Temporal Knowledge in Semi-supervised Semantic Segmentation	Wooseok Shin et.al.	2405.20610	translate	read	link
2024-05-30	P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation	Qi Zhang et.al.	2405.20443	translate	read	null
2024-05-30	SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow	Chaoyang Wang et.al.	2405.20282	translate	read	link
2024-05-30	MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion	Angel Villar-Corrales et.al.	2405.19921	translate	read	link
2024-05-30	Open-Set Domain Adaptation for Semantic Segmentation	Seun-An Choe et.al.	2405.19899	translate	read	link
2024-05-30	DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation	Ron Keuth et.al.	2405.19746	translate	read	link
2024-05-30	Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes	Yong-Qiang Mao et.al.	2405.19735	translate	read	null
2024-05-30	CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation	Ankush Gajanan Arudkar et.al.	2405.19672	translate	read	null
2024-05-29	Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation	Lianlei Shan et.al.	2405.19568	translate	read	null
2024-05-29	Enabling Visual Recognition at Radio Frequency	Haowen Lai et.al.	2405.19516	translate	read	null
2024-05-29	Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models	Tianrun Chen et.al.	2405.19326	translate	read	null
2024-05-29	A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation	Niclas Vödisch et.al.	2405.19035	translate	read	link
2024-05-29	Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation	Zelin Peng et.al.	2405.18840	translate	read	null
2024-05-29	FocSAM: Delving Deeply into Focused Objects in Segmenting Anything	You Huang et.al.	2405.18706	translate	read	null
2024-05-28	Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation	JuneHyoung Kwon et.al.	2405.18148	translate	read	null
2024-05-28	Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images	Lianlei Shan et.al.	2405.18078	translate	read	null
2024-05-28	RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields	Mihnea-Bogdan Jurca et.al.	2405.18033	translate	read	null
2024-05-28	DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture	Shentong Mo et.al.	2405.17995	translate	read	link
2024-05-28	Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation	Yangxiao Lu et.al.	2405.17859	translate	read	link
2024-05-28	The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention	Xingyu Ding et.al.	2405.17776	translate	read	null
2024-05-27	Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation	Steven Landgraf et.al.	2405.17097	translate	read	null
2024-05-27	DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking	Hongtao Wang et.al.	2405.16980	translate	read	null
2024-05-27	Collective Perception Datasets for Autonomous Driving: A Comprehensive Review	Sven Teufel et.al.	2405.16973	translate	read	null
2024-05-27	Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models	Qian Wang et.al.	2405.16947	translate	read	null
2024-05-27	A re-calibration method for object detection with multi-modal alignment bias in autonomous driving	Zhihang Song et.al.	2405.16848	translate	read	null
2024-05-26	Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning	Neha Kalibhat et.al.	2405.16401	translate	read	null
2024-05-25	Video Prediction Models as General Visual Encoders	James Maier et.al.	2405.16382	translate	read	null
2024-05-25	BOLD: Boolean Logic Deep Learning	Van Minh Nguyen et.al.	2405.16339	translate	read	null
2024-05-25	Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation	Huizhou Chen et.al.	2405.16099	translate	read	null
2024-05-25	Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality	Hakim Ikebayashi et.al.	2405.16008	translate	read	null
2024-05-24	Visualize and Paint GAN Activations	Rudolf Herdt et.al.	2405.15636	translate	read	null
2024-05-24	Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets	Hoàng-Ân Lê et.al.	2405.15394	translate	read	null
2024-05-24	Autonomous Quilt Spreading for Caregiving Robots	Yuchun Guo et.al.	2405.15373	translate	read	null
2024-05-24	U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation	Bingyu Li et.al.	2405.15365	translate	read	link
2024-05-24	Cross-Domain Few-Shot Semantic Segmentation via Doubly Matching Transformation	Jiayi Chen et.al.	2405.15265	translate	read	null
2024-05-23	Mamba-R: Vision Mamba ALSO Needs Registers	Feng Wang et.al.	2405.14858	translate	read	null
2024-05-23	Efficient Robot Learning for Perception and Mapping	Niclas Vödisch et.al.	2405.14688	translate	read	null
2024-05-23	Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation	Daniel Kienzle et.al.	2405.14467	translate	read	null
2024-05-23	MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models	Jiuming Liu et.al.	2405.14338	translate	read	null
2024-05-23	Tuning-free Universally-Supervised Semantic Segmentation	Xiaobo Yang et.al.	2405.14294	translate	read	null
2024-05-23	SCMix: Stochastic Compound Mixing for Open Compound Domain Adaptation in Semantic Segmentation	Kai Yao et.al.	2405.14278	translate	read	null
2024-05-23	Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations	Mohammed Baharoon et.al.	2405.14239	translate	read	null
2024-05-23	Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification	Taylor Archibald et.al.	2405.14162	translate	read	null
2024-05-23	Skip-SCAR: A Modular Approach to ObjectGoal Navigation with Sparsity and Adaptive Skips	Yaotian Liu et.al.	2405.14154	translate	read	null
2024-05-22	TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System	Diogo Lavado et.al.	2405.13989	translate	read	null
2024-05-21	Transparency Distortion Robustness for SOTA Image Segmentation Tasks	Volker Knauthe et.al.	2405.12864	translate	read	null
2024-05-20	A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation	Sushmita Sarker et.al.	2405.11903	translate	read	null
2024-05-20	Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments	Jooyong Park et.al.	2405.11855	translate	read	null
2024-05-20	Improving the Explain-Any-Concept by Introducing Nonlinearity to the Trainable Surrogate Model	Mounes Zaval et.al.	2405.11837	translate	read	null
2024-05-20	Universal Organizer of SAM for Unsupervised Semantic Segmentation	Tingting Li et.al.	2405.11742	translate	read	null
2024-05-19	Interpreting a Semantic Segmentation Model for Coastline Detection	Conor O’Sullivan et.al.	2405.11500	translate	read	null
2024-05-19	Unifying 3D Vision-Language Understanding via Promptable Queries	Ziyu Zhu et.al.	2405.11442	translate	read	null
2024-05-18	PS6D: Point Cloud Based Symmetry-Aware 6D Object Pose Estimation in Robot Bin-Picking	Yifan Yang et.al.	2405.11257	translate	read	null
2024-05-17	CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation	Mushui Liu et.al.	2405.10530	translate	read	link
2024-05-16	4D Panoptic Scene Graph Generation	Jingkang Yang et.al.	2405.10305	translate	read	link
2024-05-16	Towards Task-Compatible Compressible Representations	Anderson de Andrade et.al.	2405.10244	translate	read	link
2024-05-16	DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data	Chengxiang Fan et.al.	2405.10185	translate	read	link
2024-05-16	An Integrated Framework for Multi-Granular Explanation of Video Summarization	Konstantinos Tsigos et.al.	2405.10082	translate	read	null
2024-05-16	A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance	Andrea Matteazzi et.al.	2405.10046	translate	read	null
2024-05-16	Towards Realistic Incremental Scenario in Class Incremental Semantic Segmentation	Jihwan Kwak et.al.	2405.09858	translate	read	null
2024-05-15	Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation	Guo Yachan et.al.	2405.09682	translate	read	null
2024-05-14	CLIP with Quality Captions: A Strong Pretraining for Vision Tasks	Pavan Kumar Anasosalu Vasu et.al.	2405.08911	translate	read	null
2024-05-14	Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study	Qinfeng Zhu et.al.	2405.08493	translate	read	null
2024-05-14	TEDNet: Twin Encoder Decoder Neural Network for 2D Camera and LiDAR Road Detection	Martín Bayón-Gutiérrez et.al.	2405.08429	translate	read	link
2024-05-13	IMAFD: An Interpretable Multi-stage Approach to Flood Detection from time series Multispectral Data	Ziyang Zhang et.al.	2405.07916	translate	read	null
2024-05-13	PLUTO: Pathology-Universal Transformer	Dinkar Juyal et.al.	2405.07905	translate	read	null
2024-05-12	PotatoGANs: Utilizing Generative Adversarial Networks, Instance Segmentation, and Explainable AI for Enhanced Potato Disease Identification and Classification	Mohammad Shafiul Alam et.al.	2405.07332	translate	read	link
2024-05-12	Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception	Haoming Chen et.al.	2405.07201	translate	read	null
2024-05-11	Global Motion Understanding in Large-Scale Video Object Segmentation	Volodymyr Fedynyak et.al.	2405.07031	translate	read	null
2024-05-10	GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs	Mustafa Munir et.al.	2405.06849	translate	read	link
2024-05-10	Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach	Elham Ravanbakhsh et.al.	2405.06586	translate	read	null
2024-05-10	Semantic and Spatial Adaptive Pixel-level Classifier for Semantic Segmentation	Xiaowen Ma et.al.	2405.06525	translate	read	link
2024-05-10	Multi-Target Unsupervised Domain Adaptation for Semantic Segmentation without External Data	Yonghao Xu et.al.	2405.06502	translate	read	null
2024-05-10	Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data	Rongyu Zhang et.al.	2405.06413	translate	read	null
2024-05-10	Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation	Zhenliang Ni et.al.	2405.06228	translate	read	link
2024-05-10	Zero-shot Degree of Ill-posedness Estimation for Active Small Object Change Detection	Koji Takeda et.al.	2405.06185	translate	read	null
2024-05-10	Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging	Zhuchen Shao et.al.	2405.06175	translate	read	null
2024-05-09	Mask-TS Net: Mask Temperature Scaling Uncertainty Calibration for Polyp Segmentation	Yudian Zhang et.al.	2405.05830	translate	read	null
2024-05-09	CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks	Nick et.al.	2405.05755	translate	read	null
2024-05-08	OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies	Lingdong Kong et.al.	2405.05259	translate	read	link
2024-05-08	Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving	Lingdong Kong et.al.	2405.05258	translate	read	link
2024-05-08	Weakly-supervised Semantic Segmentation via Dual-stream Contrastive Learning of Cross-image Contextual Information	Qi Lai et.al.	2405.04913	translate	read	null
2024-05-08	DeepDamageNet: A two-step deep-learning model for multi-disaster building damage segmentation and classification using satellite imagery	Irene Alisjahbana et.al.	2405.04800	translate	read	null
2024-05-07	A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images	László Kopácsi et.al.	2405.04650	translate	read	null
2024-05-07	FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse Landscapes	Charles Gaydon et.al.	2405.04634	translate	read	link
2024-05-07	AugmenTory: A Fast and Flexible Polygon Augmentation Library	Tanaz Ghahremani et.al.	2405.04442	translate	read	null
2024-05-07	A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields	Raiyan Rahman et.al.	2405.04305	translate	read	null
2024-05-07	ELiTe: Efficient Image-to-LiDAR Knowledge Transfer for Semantic Segmentation	Zhibo Zhang et.al.	2405.04121	translate	read	null
2024-05-07	Structured Click Control in Transformer-based Interactive Segmentation	Long Xu et.al.	2405.04009	translate	read	link
2024-05-06	PTQ4SAM: Post-Training Quantization for Segment Anything	Chengtao Lv et.al.	2405.03144	translate	read	link
2024-05-04	MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning	Vishal Nedungadi et.al.	2405.02771	translate	read	null
2024-05-04	Few-Shot Fruit Segmentation via Transfer Learning	Jordan A. James et.al.	2405.02556	translate	read	null
2024-05-03	Panoptic-SLAM: Visual SLAM in Dynamic Environments using Panoptic Segmentation	Gabriel Fischer Abati et.al.	2405.02177	translate	read	null
2024-05-03	Towards general deep-learning-based tree instance segmentation models	Jonathan Henrich et.al.	2405.02061	translate	read	null
2024-05-03	DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model	Peijin Jia et.al.	2405.02008	translate	read	null
2024-05-02	Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey	Guoping Xu et.al.	2405.01725	translate	read	link
2024-05-02	Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey	Rokas Gipiškis et.al.	2405.01636	translate	read	null
2024-05-02	CromSS: Cross-modal pre-training with noisy labels for remote sensing image segmentation	Chenying Liu et.al.	2405.01217	translate	read	null
2024-05-02	Uncertainty-aware self-training with expectation maximization basis transformation	Zijia Wang et.al.	2405.01175	translate	read	null
2024-05-01	GraCo: Granularity-Controllable Interactive Segmentation	Yian Zhao et.al.	2405.00587	translate	read	null
2024-05-01	Exploring Self-Supervised Vision Transformers for Deepfake Detection: A Comparative Analysis	Huy H. Nguyen et.al.	2405.00355	translate	read	null

(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)