Semantic Segmentation - 2024-06
Semantic Segmentation - 2024-06
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-06-28 | EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model | Yuxuan Zhang et.al. | 2406.20076 | translate | read | link |
| 2024-06-28 | PM-VIS+: High-Performance Video Instance Segmentation without Video Annotation | Zhangjing Yang et.al. | 2406.19665 | translate | read | link |
| 2024-06-28 | Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation | Junsung Park et.al. | 2406.19638 | translate | read | link |
| 2024-06-28 | PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation | Deyi Ji et.al. | 2406.19632 | translate | read | null |
| 2024-06-27 | Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model | Haobo Yuan et.al. | 2406.19369 | translate | read | null |
| 2024-06-27 | ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation | Nazanin Moradinasab et.al. | 2406.19225 | translate | read | null |
| 2024-06-30 | Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO | Fuseini Mumuni et.al. | 2406.19057 | translate | read | null |
| 2024-06-27 | Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation | Tao Lian et.al. | 2406.18809 | translate | read | null |
| 2024-06-26 | CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data | Nikolaos Dionelis et.al. | 2406.18279 | translate | read | null |
| 2024-06-26 | CoDA: Interactive Segmentation and Morphological Analysis of Dendroid Structures Exemplified on Stony Cold-Water Corals | Kira Schmitt et.al. | 2406.18236 | translate | read | link |
| 2024-06-26 | The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval | Meinardus Boris et.al. | 2406.18113 | translate | read | link |
| 2024-06-26 | Few-Shot Medical Image Segmentation with High-Fidelity Prototypes | Song Tang et.al. | 2406.18074 | translate | read | link |
| 2024-06-25 | Semi-supervised classification of dental conditions in panoramic radiographs using large language model and instance segmentation: A real-world dataset evaluation | Bernardo Silva et.al. | 2406.17915 | translate | read | null |
| 2024-06-25 | Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation | Xuming Zhang et.al. | 2406.17679 | translate | read | null |
| 2024-06-25 | DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation | Ahmad Mohammadshirazi et.al. | 2406.17591 | translate | read | link |
| 2024-06-25 | Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation | Felix Stillger et.al. | 2406.17541 | translate | read | null |
| 2024-06-25 | Investigating Self-Supervised Methods for Label-Efficient Learning | Srinivasa Rao Nandam et.al. | 2406.17460 | translate | read | null |
| 2024-06-25 | Pseudo Labelling for Enhanced Masked Autoencoders | Srinivasa Rao Nandam et.al. | 2406.17450 | translate | read | null |
| 2024-06-25 | Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model | Zhuoyuan Li et.al. | 2406.17442 | translate | read | null |
| 2024-06-25 | Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Qi Ma et.al. | 2406.17438 | translate | read | null |
| 2024-06-25 | Depth-Guided Semi-Supervised Instance Segmentation | Xin Chen et.al. | 2406.17413 | translate | read | null |
| 2024-06-25 | XAMI – A Benchmark Dataset for Artefact Detection in XMM-Newton Optical Images | Elisabeta-Iulia Dima et.al. | 2406.17323 | translate | read | link |
| 2024-06-24 | GMT: Guided Mask Transformer for Leaf Instance Segmentation | Feng Chen et.al. | 2406.17109 | translate | read | null |
| 2024-06-24 | Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation | Yizheng Wu et.al. | 2406.16776 | translate | read | link |
| 2024-06-24 | μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation | Pierangela Bruno et.al. | 2406.16724 | translate | read | null |
| 2024-06-24 | GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection | Harnaik Dhami et.al. | 2406.16625 | translate | read | null |
| 2024-06-24 | LOGCAN++: Local-global class-aware network for semantic segmentation of remote sensing images | Xiaowen Ma et.al. | 2406.16502 | translate | read | link |
| 2024-06-24 | Cascade Reward Sampling for Efficient Decoding-Time Alignment | Bolian Li et.al. | 2406.16306 | translate | read | link |
| 2024-06-24 | SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments | Neng Wang et.al. | 2406.16279 | translate | read | link |
| 2024-06-23 | UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery | Pengfei Zhang et.al. | 2406.16129 | translate | read | null |
| 2024-06-23 | CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery | Oluwatosin Alabi et.al. | 2406.16039 | translate | read | null |
| 2024-06-22 | Fine-grained Background Representation for Weakly Supervised Semantic Segmentation | Xu Yin et.al. | 2406.15755 | translate | read | null |
| 2024-06-21 | TraceNet: Segment one thing efficiently | Mingyuan Wu et.al. | 2406.14874 | translate | read | null |
| 2024-06-19 | 3D Instance Segmentation Using Deep Learning on RGB-D Indoor Data | Siddiqui Muhammad Yasir et.al. | 2406.14581 | translate | read | null |
| 2024-06-20 | Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery | Ilham Adi Panuntun et.al. | 2406.14220 | translate | read | null |
| 2024-06-20 | Trusting Semantic Segmentation Networks | Samik Some et.al. | 2406.14201 | translate | read | null |
| 2024-06-20 | EvSegSNN: Neuromorphic Semantic Segmentation for Event Data | Dalia Hareb et.al. | 2406.14178 | translate | read | null |
| 2024-06-20 | Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images | Qinfeng Zhu et.al. | 2406.14086 | translate | read | link |
| 2024-06-20 | 2nd Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Bin Cao et.al. | 2406.13939 | translate | read | null |
| 2024-06-19 | Search-based DNN Testing and Retraining with GAN-enhanced Simulations | Mohammed Oualid Attaoui et.al. | 2406.13359 | translate | read | null |
| 2024-06-19 | Deep Learning-Based 3D Instance and Semantic Segmentation: A Review | Siddiqui Muhammad Yasir et.al. | 2406.13308 | translate | read | null |
| 2024-06-18 | Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation | Guoyu Yang et.al. | 2406.12496 | translate | read | link |
| 2024-06-18 | Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines | Honglei Zhang et.al. | 2406.12367 | translate | read | null |
| 2024-06-18 | Agriculture-Vision Challenge 2024 – The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble | Wang Liu et.al. | 2406.12271 | translate | read | null |
| 2024-06-17 | OoDIS: Anomaly Instance Segmentation Benchmark | Alexey Nekrasov et.al. | 2406.11835 | translate | read | link |
| 2024-06-17 | Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT | Maximilian E. Tschuchnig et.al. | 2406.11650 | translate | read | null |
| 2024-06-17 | Learning from Exemplars for Interactive Image Segmentation | Kun Li et.al. | 2406.11472 | translate | read | null |
| 2024-06-17 | SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation | Zhenchao Lin et.al. | 2406.11441 | translate | read | link |
| 2024-06-17 | Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding | Yunsong Wang et.al. | 2406.11283 | translate | read | null |
| 2024-06-17 | Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation | Bingfeng Zhang et.al. | 2406.11189 | translate | read | null |
| 2024-06-16 | $α$ -SSC: Uncertainty-Aware Camera-based 3D Semantic Scene Completion | Sanbao Su et.al. | 2406.11021 | translate | read | null |
| 2024-06-16 | Benchmarking Label Noise in Instance Segmentation: Spatial Noise Matters | Moshe Kimhi et.al. | 2406.10891 | translate | read | link |
| 2024-06-16 | PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery | Libo Wang et.al. | 2406.10828 | translate | read | link |
| 2024-06-15 | GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR | Bharat Singh et.al. | 2406.10722 | translate | read | null |
| 2024-06-14 | Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations | Daan de Geus et.al. | 2406.10114 | translate | read | null |
| 2024-06-14 | ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers | Narges Norouzi et.al. | 2406.09936 | translate | read | null |
| 2024-06-14 | Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions | Aldi Piroli et.al. | 2406.09906 | translate | read | null |
| 2024-06-14 | Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation | Brunó B. Englert et.al. | 2406.09896 | translate | read | link |
| 2024-06-14 | Open-Vocabulary Semantic Segmentation with Image Embedding Balancing | Xiangheng Shan et.al. | 2406.09829 | translate | read | link |
| 2024-06-14 | 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities | Roman Bachmann et.al. | 2406.09406 | translate | read | null |
| 2024-06-13 | Instance-level quantitative saliency in multiple sclerosis lesion segmentation | Federico Spagnolo et.al. | 2406.09335 | translate | read | null |
| 2024-06-13 | APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation | Weizhao He et.al. | 2406.08372 | translate | read | null |
| 2024-06-12 | Dataset Enhancement with Instance-Level Augmentations | Orest Kupyn et.al. | 2406.08249 | translate | read | link |
| 2024-06-12 | 2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation | Zhensong Xu et.al. | 2406.08192 | translate | read | null |
| 2024-06-13 | A $^{2}$ -MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder | Lixian Zhang et.al. | 2406.08079 | translate | read | null |
| 2024-06-12 | OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding | Yinan Deng et.al. | 2406.08009 | translate | read | link |
| 2024-06-12 | SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation | Chanda Grover Kamra et.al. | 2406.07986 | translate | read | link |
| 2024-06-12 | Small Scale Data-Free Knowledge Distillation | He Liu et.al. | 2406.07876 | translate | read | link |
| 2024-06-11 | Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph | Sergey Linok et.al. | 2406.07113 | translate | read | null |
| 2024-06-11 | PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving | Yining Shi et.al. | 2406.07037 | translate | read | null |
| 2024-06-11 | RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks | Zhechao Wang et.al. | 2406.07032 | translate | read | null |
| 2024-06-12 | LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection | Jiahua Xu et.al. | 2406.07023 | translate | read | null |
| 2024-06-11 | Dual Thinking and Perceptual Analysis of Deep Learning Models using Human Adversarial Examples | Kailas Dayanandan et.al. | 2406.06967 | translate | read | link |
| 2024-06-11 | UVIS: Unsupervised Video Instance Segmentation | Shuaiyi Huang et.al. | 2406.06908 | translate | read | null |
| 2024-06-10 | Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation | Dong Zhao et.al. | 2406.06813 | translate | read | null |
| 2024-06-10 | Merlin: A Vision Language Foundation Model for 3D Computed Tomography | Louis Blankemeier et.al. | 2406.06512 | translate | read | link |
| 2024-06-10 | UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving | Daniel Bogdoll et.al. | 2406.06370 | translate | read | null |
| 2024-06-10 | Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset | Shijie Lian et.al. | 2406.06039 | translate | read | link |
| 2024-06-09 | Scaling Graph Convolutions for Mobile Vision | William Avery et.al. | 2406.05850 | translate | read | link |
| 2024-06-09 | Solution for CVPR 2024 UG2+ Challenge Track on All Weather Semantic Segmentation | Jun Yu et.al. | 2406.05837 | translate | read | null |
| 2024-06-09 | Convolution and Attention-Free Mamba-based Cardiac Image Segmentation | Abbas Khan et.al. | 2406.05786 | translate | read | null |
| 2024-06-09 | Separating the “Chirp” from the “Chat”: Self-supervised Visual Grounding of Sound and Language | Mark Hamilton et.al. | 2406.05629 | translate | read | link |
| 2024-06-08 | A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+ | Jianzhao Wang et.al. | 2406.05513 | translate | read | null |
| 2024-06-08 | Layered Image Vectorization via Semantic Simplification | Zhenyu Wang et.al. | 2406.05404 | translate | read | null |
| 2024-06-08 | 1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR’24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation | Qingfeng Liu et.al. | 2406.05352 | translate | read | null |
| 2024-06-07 | Semantic Segmentation on VSPW Dataset through Masked Video Consistency | Chen Liang et.al. | 2406.04979 | translate | read | null |
| 2024-06-07 | Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment | Venkanna Babu Guthula et.al. | 2406.04949 | translate | read | null |
| 2024-06-06 | Characterizing segregation in blast rock piles a deep-learning approach leveraging aerial image analysis | Chengeng Liu et.al. | 2406.04149 | translate | read | null |
| 2024-06-07 | 3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation | Ruipu Wu et.al. | 2406.04002 | translate | read | null |
| 2024-06-06 | Frequency-based Matcher for Long-tailed Semantic Segmentation | Shan Li et.al. | 2406.03917 | translate | read | link |
| 2024-06-07 | Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge | Nan Zhang et.al. | 2406.03799 | translate | read | link |
| 2024-06-06 | Instance Segmentation and Teeth Classification in Panoramic X-rays | Devichand Budagam et.al. | 2406.03747 | translate | read | link |
| 2024-06-06 | DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation | Zilu Guo et.al. | 2406.03702 | translate | read | link |
| 2024-06-05 | Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation | Maximilian Zenk et.al. | 2406.03323 | translate | read | null |
| 2024-06-05 | Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy | Yunho Kim et.al. | 2406.02989 | translate | read | null |
| 2024-06-04 | W-RIZZ: A Weakly-Supervised Framework for Relative Traversability Estimation in Mobile Robotics | Andre Schreiber et.al. | 2406.02822 | translate | read | link |
| 2024-06-04 | Window to Wall Ratio Detection using SegFormer | Zoe De Simone et.al. | 2406.02706 | translate | read | link |
| 2024-06-04 | Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation | Mohamed El Amine Boudjoghra et.al. | 2406.02548 | translate | read | link |
| 2024-06-04 | Generative Active Learning for Long-tailed Instance Segmentation | Muzhi Zhu et.al. | 2406.02435 | translate | read | link |
| 2024-06-04 | Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning | Heather Doig et.al. | 2406.01932 | translate | read | null |
| 2024-06-03 | MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild | Zeren Jiang et.al. | 2406.01595 | translate | read | null |
| 2024-06-03 | Towards Flexible Interactive Reflection Removal with Human Guidance | Xiao Chen et.al. | 2406.01555 | translate | read | link |
| 2024-06-03 | EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding | Thanh-Dat Truong et.al. | 2406.01429 | translate | read | null |
| 2024-06-03 | An expert-driven data generation pipeline for histological images | Roberto Basla et.al. | 2406.01403 | translate | read | link |
| 2024-06-03 | TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation | Antonio Santo et.al. | 2406.01395 | translate | read | link |
| 2024-06-03 | MP-PolarMask: A Faster and Finer Instance Segmentation for Concave Images | Ke-Lei Wang et.al. | 2406.01356 | translate | read | null |
| 2024-06-03 | ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds | Ka Lung Cheung et.al. | 2406.01337 | translate | read | link |
(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)