Semantic Segmentation - 2024-06

Publish Date Title Authors PDF Translate Read Code
2024-06-28 EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model Yuxuan Zhang et.al. 2406.20076 translate read link
2024-06-28 PM-VIS+: High-Performance Video Instance Segmentation without Video Annotation Zhangjing Yang et.al. 2406.19665 translate read link
2024-06-28 Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation Junsung Park et.al. 2406.19638 translate read link
2024-06-28 PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation Deyi Ji et.al. 2406.19632 translate read null
2024-06-27 Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model Haobo Yuan et.al. 2406.19369 translate read null
2024-06-27 ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation Nazanin Moradinasab et.al. 2406.19225 translate read null
2024-06-30 Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO Fuseini Mumuni et.al. 2406.19057 translate read null
2024-06-27 Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation Tao Lian et.al. 2406.18809 translate read null
2024-06-26 CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data Nikolaos Dionelis et.al. 2406.18279 translate read null
2024-06-26 CoDA: Interactive Segmentation and Morphological Analysis of Dendroid Structures Exemplified on Stony Cold-Water Corals Kira Schmitt et.al. 2406.18236 translate read link
2024-06-26 The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval Meinardus Boris et.al. 2406.18113 translate read link
2024-06-26 Few-Shot Medical Image Segmentation with High-Fidelity Prototypes Song Tang et.al. 2406.18074 translate read link
2024-06-25 Semi-supervised classification of dental conditions in panoramic radiographs using large language model and instance segmentation: A real-world dataset evaluation Bernardo Silva et.al. 2406.17915 translate read null
2024-06-25 Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation Xuming Zhang et.al. 2406.17679 translate read null
2024-06-25 DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation Ahmad Mohammadshirazi et.al. 2406.17591 translate read link
2024-06-25 Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation Felix Stillger et.al. 2406.17541 translate read null
2024-06-25 Investigating Self-Supervised Methods for Label-Efficient Learning Srinivasa Rao Nandam et.al. 2406.17460 translate read null
2024-06-25 Pseudo Labelling for Enhanced Masked Autoencoders Srinivasa Rao Nandam et.al. 2406.17450 translate read null
2024-06-25 Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model Zhuoyuan Li et.al. 2406.17442 translate read null
2024-06-25 Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes Qi Ma et.al. 2406.17438 translate read null
2024-06-25 Depth-Guided Semi-Supervised Instance Segmentation Xin Chen et.al. 2406.17413 translate read null
2024-06-25 XAMI – A Benchmark Dataset for Artefact Detection in XMM-Newton Optical Images Elisabeta-Iulia Dima et.al. 2406.17323 translate read link
2024-06-24 GMT: Guided Mask Transformer for Leaf Instance Segmentation Feng Chen et.al. 2406.17109 translate read null
2024-06-24 Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation Yizheng Wu et.al. 2406.16776 translate read link
2024-06-24 μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation Pierangela Bruno et.al. 2406.16724 translate read null
2024-06-24 GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection Harnaik Dhami et.al. 2406.16625 translate read null
2024-06-24 LOGCAN++: Local-global class-aware network for semantic segmentation of remote sensing images Xiaowen Ma et.al. 2406.16502 translate read link
2024-06-24 Cascade Reward Sampling for Efficient Decoding-Time Alignment Bolian Li et.al. 2406.16306 translate read link
2024-06-24 SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments Neng Wang et.al. 2406.16279 translate read link
2024-06-23 UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery Pengfei Zhang et.al. 2406.16129 translate read null
2024-06-23 CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery Oluwatosin Alabi et.al. 2406.16039 translate read null
2024-06-22 Fine-grained Background Representation for Weakly Supervised Semantic Segmentation Xu Yin et.al. 2406.15755 translate read null
2024-06-21 TraceNet: Segment one thing efficiently Mingyuan Wu et.al. 2406.14874 translate read null
2024-06-19 3D Instance Segmentation Using Deep Learning on RGB-D Indoor Data Siddiqui Muhammad Yasir et.al. 2406.14581 translate read null
2024-06-20 Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery Ilham Adi Panuntun et.al. 2406.14220 translate read null
2024-06-20 Trusting Semantic Segmentation Networks Samik Some et.al. 2406.14201 translate read null
2024-06-20 EvSegSNN: Neuromorphic Semantic Segmentation for Event Data Dalia Hareb et.al. 2406.14178 translate read null
2024-06-20 Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images Qinfeng Zhu et.al. 2406.14086 translate read link
2024-06-20 2nd Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation Bin Cao et.al. 2406.13939 translate read null
2024-06-19 Search-based DNN Testing and Retraining with GAN-enhanced Simulations Mohammed Oualid Attaoui et.al. 2406.13359 translate read null
2024-06-19 Deep Learning-Based 3D Instance and Semantic Segmentation: A Review Siddiqui Muhammad Yasir et.al. 2406.13308 translate read null
2024-06-18 Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation Guoyu Yang et.al. 2406.12496 translate read link
2024-06-18 Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines Honglei Zhang et.al. 2406.12367 translate read null
2024-06-18 Agriculture-Vision Challenge 2024 – The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble Wang Liu et.al. 2406.12271 translate read null
2024-06-17 OoDIS: Anomaly Instance Segmentation Benchmark Alexey Nekrasov et.al. 2406.11835 translate read link
2024-06-17 Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT Maximilian E. Tschuchnig et.al. 2406.11650 translate read null
2024-06-17 Learning from Exemplars for Interactive Image Segmentation Kun Li et.al. 2406.11472 translate read null
2024-06-17 SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation Zhenchao Lin et.al. 2406.11441 translate read link
2024-06-17 Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding Yunsong Wang et.al. 2406.11283 translate read null
2024-06-17 Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation Bingfeng Zhang et.al. 2406.11189 translate read null
2024-06-16 $α$ -SSC: Uncertainty-Aware Camera-based 3D Semantic Scene Completion Sanbao Su et.al. 2406.11021 translate read null
2024-06-16 Benchmarking Label Noise in Instance Segmentation: Spatial Noise Matters Moshe Kimhi et.al. 2406.10891 translate read link
2024-06-16 PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery Libo Wang et.al. 2406.10828 translate read link
2024-06-15 GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR Bharat Singh et.al. 2406.10722 translate read null
2024-06-14 Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations Daan de Geus et.al. 2406.10114 translate read null
2024-06-14 ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers Narges Norouzi et.al. 2406.09936 translate read null
2024-06-14 Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions Aldi Piroli et.al. 2406.09906 translate read null
2024-06-14 Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation Brunó B. Englert et.al. 2406.09896 translate read link
2024-06-14 Open-Vocabulary Semantic Segmentation with Image Embedding Balancing Xiangheng Shan et.al. 2406.09829 translate read link
2024-06-14 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities Roman Bachmann et.al. 2406.09406 translate read null
2024-06-13 Instance-level quantitative saliency in multiple sclerosis lesion segmentation Federico Spagnolo et.al. 2406.09335 translate read null
2024-06-13 APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation Weizhao He et.al. 2406.08372 translate read null
2024-06-12 Dataset Enhancement with Instance-Level Augmentations Orest Kupyn et.al. 2406.08249 translate read link
2024-06-12 2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation Zhensong Xu et.al. 2406.08192 translate read null
2024-06-13 A $^{2}$ -MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder Lixian Zhang et.al. 2406.08079 translate read null
2024-06-12 OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding Yinan Deng et.al. 2406.08009 translate read link
2024-06-12 SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation Chanda Grover Kamra et.al. 2406.07986 translate read link
2024-06-12 Small Scale Data-Free Knowledge Distillation He Liu et.al. 2406.07876 translate read link
2024-06-11 Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph Sergey Linok et.al. 2406.07113 translate read null
2024-06-11 PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving Yining Shi et.al. 2406.07037 translate read null
2024-06-11 RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks Zhechao Wang et.al. 2406.07032 translate read null
2024-06-12 LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection Jiahua Xu et.al. 2406.07023 translate read null
2024-06-11 Dual Thinking and Perceptual Analysis of Deep Learning Models using Human Adversarial Examples Kailas Dayanandan et.al. 2406.06967 translate read link
2024-06-11 UVIS: Unsupervised Video Instance Segmentation Shuaiyi Huang et.al. 2406.06908 translate read null
2024-06-10 Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation Dong Zhao et.al. 2406.06813 translate read null
2024-06-10 Merlin: A Vision Language Foundation Model for 3D Computed Tomography Louis Blankemeier et.al. 2406.06512 translate read link
2024-06-10 UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving Daniel Bogdoll et.al. 2406.06370 translate read null
2024-06-10 Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset Shijie Lian et.al. 2406.06039 translate read link
2024-06-09 Scaling Graph Convolutions for Mobile Vision William Avery et.al. 2406.05850 translate read link
2024-06-09 Solution for CVPR 2024 UG2+ Challenge Track on All Weather Semantic Segmentation Jun Yu et.al. 2406.05837 translate read null
2024-06-09 Convolution and Attention-Free Mamba-based Cardiac Image Segmentation Abbas Khan et.al. 2406.05786 translate read null
2024-06-09 Separating the “Chirp” from the “Chat”: Self-supervised Visual Grounding of Sound and Language Mark Hamilton et.al. 2406.05629 translate read link
2024-06-08 A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+ Jianzhao Wang et.al. 2406.05513 translate read null
2024-06-08 Layered Image Vectorization via Semantic Simplification Zhenyu Wang et.al. 2406.05404 translate read null
2024-06-08 1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR’24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation Qingfeng Liu et.al. 2406.05352 translate read null
2024-06-07 Semantic Segmentation on VSPW Dataset through Masked Video Consistency Chen Liang et.al. 2406.04979 translate read null
2024-06-07 Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment Venkanna Babu Guthula et.al. 2406.04949 translate read null
2024-06-06 Characterizing segregation in blast rock piles a deep-learning approach leveraging aerial image analysis Chengeng Liu et.al. 2406.04149 translate read null
2024-06-07 3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation Ruipu Wu et.al. 2406.04002 translate read null
2024-06-06 Frequency-based Matcher for Long-tailed Semantic Segmentation Shan Li et.al. 2406.03917 translate read link
2024-06-07 Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge Nan Zhang et.al. 2406.03799 translate read link
2024-06-06 Instance Segmentation and Teeth Classification in Panoramic X-rays Devichand Budagam et.al. 2406.03747 translate read link
2024-06-06 DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation Zilu Guo et.al. 2406.03702 translate read link
2024-06-05 Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation Maximilian Zenk et.al. 2406.03323 translate read null
2024-06-05 Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy Yunho Kim et.al. 2406.02989 translate read null
2024-06-04 W-RIZZ: A Weakly-Supervised Framework for Relative Traversability Estimation in Mobile Robotics Andre Schreiber et.al. 2406.02822 translate read link
2024-06-04 Window to Wall Ratio Detection using SegFormer Zoe De Simone et.al. 2406.02706 translate read link
2024-06-04 Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation Mohamed El Amine Boudjoghra et.al. 2406.02548 translate read link
2024-06-04 Generative Active Learning for Long-tailed Instance Segmentation Muzhi Zhu et.al. 2406.02435 translate read link
2024-06-04 Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning Heather Doig et.al. 2406.01932 translate read null
2024-06-03 MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild Zeren Jiang et.al. 2406.01595 translate read null
2024-06-03 Towards Flexible Interactive Reflection Removal with Human Guidance Xiao Chen et.al. 2406.01555 translate read link
2024-06-03 EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding Thanh-Dat Truong et.al. 2406.01429 translate read null
2024-06-03 An expert-driven data generation pipeline for histological images Roberto Basla et.al. 2406.01403 translate read link
2024-06-03 TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation Antonio Santo et.al. 2406.01395 translate read link
2024-06-03 MP-PolarMask: A Faster and Finer Instance Segmentation for Concave Images Ke-Lei Wang et.al. 2406.01356 translate read null
2024-06-03 ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds Ka Lung Cheung et.al. 2406.01337 translate read link

(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)