Semantic Segmentation - 2024-11
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2024-11-29 | LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention | Zewen Du et.al. | 2411.19585 | translate | read | link |
| 2024-11-29 | Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding | Wenbo Zhang et.al. | 2411.19551 | translate | read | null |
| 2024-11-29 | Retrieval-guided Cross-view Image Synthesis | Hongji Yang et.al. | 2411.19510 | translate | read | null |
| 2024-11-29 | Adaptive Interactive Segmentation for Multimodal Medical Imaging via Selection Engine | Zhi Li et.al. | 2411.19447 | translate | read | link |
| 2024-11-28 | GMS-VINS: Multi-category Dynamic Objects Semantic Segmentation for Enhanced Visual-Inertial Odometry Using a Promptable Foundation Model | Rui Zhou et.al. | 2411.19289 | translate | read | null |
| 2024-11-28 | InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception | Haijie Li et.al. | 2411.19235 | translate | read | null |
| 2024-11-28 | MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers | Jongseong Bae et.al. | 2411.18995 | translate | read | null |
| 2024-11-28 | Textured As-Is BIM via GIS-informed Point Cloud Segmentation | Mohamed S. H. Alabassy et.al. | 2411.18898 | translate | read | null |
| 2024-11-27 | The Last Mile to Supervised Performance: Semi-Supervised Domain Adaptation for Semantic Segmentation | Daniel Morales-Brotons et.al. | 2411.18728 | translate | read | null |
| 2024-11-27 | HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior | Li-Yuan Tsao et.al. | 2411.18662 | translate | read | link |
| 2024-11-26 | Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation | Sudarshan Rajagopalan et.al. | 2411.17814 | translate | read | null |
| 2024-11-26 | Efficient Multi-modal Large Language Models via Visual Token Grouping | Minbin Huang et.al. | 2411.17773 | translate | read | null |
| 2024-11-26 | Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation | Niharika Hegde et.al. | 2411.17610 | translate | read | null |
| 2024-11-26 | A Bilayer Segmentation-Recombination Network for Accurate Segmentation of Overlapping C. elegans | Mengqian Dinga et.al. | 2411.17557 | translate | read | null |
| 2024-11-26 | Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.17543 | translate | read | null |
| 2024-11-26 | Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning | Hoàng-Ân Lê et.al. | 2411.17536 | translate | read | link |
| 2024-11-26 | TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba | Xiaowen Ma et.al. | 2411.17473 | translate | read | link |
| 2024-11-26 | Self-supervised Video Instance Segmentation Can Boost Geographic Entity Alignment in Historical Maps | Xue Xia et.al. | 2411.17425 | translate | read | null |
| 2024-11-26 | MRIFE: A Mask-Recovering and Interactive-Feature-Enhancing Semantic Segmentation Network For Relic Landslide Detection | Juefei He et.al. | 2411.17167 | translate | read | null |
| 2024-11-26 | Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation | Chanyoung Kim et.al. | 2411.17150 | translate | read | null |
| 2024-11-26 | ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction | Chang Li et.al. | 2411.17088 | translate | read | null |
| 2024-11-26 | SCASeg: Strip Cross-Attention for Efficient Semantic Segmentation | Guoan Xu et.al. | 2411.17061 | translate | read | null |
| 2024-11-25 | Deformable Mamba for Wide Field of View Segmentation | Jie Hu et.al. | 2411.16481 | translate | read | link |
| 2024-11-25 | A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models | Manuel Schwonberg et.al. | 2411.16407 | translate | read | null |
| 2024-11-25 | CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation | Leon Sick et.al. | 2411.16319 | translate | read | null |
| 2024-11-25 | An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models | Wentao Qu et.al. | 2411.16308 | translate | read | null |
| 2024-11-25 | A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads | Rafael S. Toledo et.al. | 2411.16295 | translate | read | null |
| 2024-11-25 | Weakly supervised image segmentation for defect-based grading of fresh produce | Manuel Knott et.al. | 2411.16219 | translate | read | null |
| 2024-11-25 | Learn from Foundation Model: Fruit Detection Model without Manual Annotation | Yanan Wang et.al. | 2411.16196 | translate | read | null |
| 2024-11-25 | Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking | Phuc Nguyen et.al. | 2411.16183 | translate | read | null |
| 2024-11-25 | Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training | Man Yao et.al. | 2411.16061 | translate | read | link |
| 2024-11-24 | Deep Learning for automated multi-scale functional field boundaries extraction using multi-date Sentinel-2 and PlanetScope imagery: Case Study of Netherlands and Pakistan | Saba Zahid et.al. | 2411.15923 | translate | read | null |
| 2024-11-22 | Effective SAM Combination for Open-Vocabulary Semantic Segmentation | Minhyeok Lee et.al. | 2411.14723 | translate | read | null |
| 2024-11-21 | Revisiting the Integration of Convolution and Attention for Vision Backbone | Lei Zhu et.al. | 2411.14429 | translate | read | link |
| 2024-11-21 | CompetitorFormer: Competitor Transformer for 3D Instance Segmentation | Duanchu Wang et.al. | 2411.14179 | translate | read | null |
| 2024-11-21 | CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation | Lin Sun et.al. | 2411.13836 | translate | read | link |
| 2024-11-21 | Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals | Hussni Mohd Zakir et.al. | 2411.13774 | translate | read | null |
| 2024-11-20 | FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting | Ola Shorinwa et.al. | 2411.13753 | translate | read | null |
| 2024-11-20 | DIS-Mine: Instance Segmentation for Disaster-Awareness in Poor-Light Condition in Underground Mines | Mizanur Rahman Jewel et.al. | 2411.13544 | translate | read | null |
| 2024-11-21 | Entropy Bootstrapping for Weakly Supervised Nuclei Detection | James Willoughby et.al. | 2411.13528 | translate | read | null |
| 2024-11-20 | BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation | Umamaheswaran Raman Kumar et.al. | 2411.13251 | translate | read | null |
| 2024-11-20 | XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation | Ziyi Wang et.al. | 2411.13243 | translate | read | link |
| 2024-11-20 | Automating Sonologists USG Commands with AI and Voice Interface | Emad Mohamed et.al. | 2411.13006 | translate | read | null |
| 2024-11-19 | Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline | Junlong Cheng et.al. | 2411.12814 | translate | read | link |
| 2024-11-19 | A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation | Jiaqi Yang et.al. | 2411.12615 | translate | read | link |
| 2024-11-19 | SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation | Ron Keuth et.al. | 2411.12602 | translate | read | link |
| 2024-11-19 | ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator | Xiao Jiang et.al. | 2411.12250 | translate | read | null |
| 2024-11-18 | ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements | M. Arda Aydın et.al. | 2411.12044 | translate | read | link |
| 2024-11-18 | Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation | Hanieh Shojaei Miandashti et.al. | 2411.11935 | translate | read | null |
| 2024-11-18 | MGNiceNet: Unified Monocular Geometric Scene Understanding | Markus Schön et.al. | 2411.11466 | translate | read | null |
| 2024-11-18 | MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models | Harshita Sharma et.al. | 2411.11362 | translate | read | null |
| 2024-11-18 | Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications | Scarlett Raine et.al. | 2411.11287 | translate | read | null |
| 2024-11-18 | Zero-Shot Automatic Annotation and Instance Segmentation using LLM-Generated Datasets: Eliminating Field Imaging and Manual Annotation for Deep Learning Model Development | Ranjan Sapkota et.al. | 2411.11285 | translate | read | null |
| 2024-11-16 | Attention-based U-Net Method for Autonomous Lane Detection | Mohammadhamed Tangestanizadeh et.al. | 2411.10902 | translate | read | null |
| 2024-11-16 | Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation | Jaisidh Singh et.al. | 2411.10845 | translate | read | null |
| 2024-11-16 | Diffusion-Based Semantic Segmentation of Lumbar Spine MRI Scans of Lower Back Pain Patients | Maria Monzon et.al. | 2411.10755 | translate | read | null |
| 2024-11-15 | Repurposing Stable Diffusion Attention for Training-Free Unsupervised Interactive Segmentation | Markus Karmann et.al. | 2411.10411 | translate | read | null |
| 2024-11-15 | Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images | Ammar Qammaz et.al. | 2411.10334 | translate | read | null |
| 2024-11-15 | RETR: Multi-View Radar Detection Transformer for Indoor Perception | Ryoma Yataka et.al. | 2411.10293 | translate | read | null |
| 2024-11-15 | CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation | Dengke Zhang et.al. | 2411.10086 | translate | read | link |
| 2024-11-14 | OneNet: A Channel-Wise 1D Convolutional U-Net | Sanghyun Byun et.al. | 2411.09838 | translate | read | link |
| 2024-11-14 | Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks | Zengyi Yang et.al. | 2411.09387 | translate | read | null |
| 2024-11-14 | Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation | Yuheng Shi et.al. | 2411.09219 | translate | read | link |
| 2024-11-14 | Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery | Ashim Dahal et.al. | 2411.09101 | translate | read | link |
| 2024-11-13 | CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation | Xuming Zhang et.al. | 2411.09023 | translate | read | null |
| 2024-11-14 | Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation | Yangyang Li et.al. | 2411.08756 | translate | read | null |
| 2024-11-13 | Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model | Jun Xie et.al. | 2411.08592 | translate | read | null |
| 2024-11-13 | UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation | Chengyuan Zhang et.al. | 2411.08569 | translate | read | null |
| 2024-11-13 | Detection and classification of radio sources with deep learning | S. Riggi et.al. | 2411.08519 | translate | read | null |
| 2024-11-12 | Isometric Transformations for Image Augmentation in Mueller Matrix Polarimetry | Christopher Hahne et.al. | 2411.07918 | translate | read | link |
| 2024-11-12 | INTRABENCH: Interactive Radiological Benchmark | Constantin Ulrich et.al. | 2411.07885 | translate | read | null |
| 2024-11-12 | Horticultural Temporal Fruit Monitoring via 3D Instance Segmentation and Re-Identification using Point Clouds | Daniel Fusaro et.al. | 2411.07799 | translate | read | link |
| 2024-11-12 | Semantic segmentation on multi-resolution optical and microwave data using deep learning | Jai G Singla et.al. | 2411.07581 | translate | read | null |
| 2024-11-12 | GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting | Umangi Jain et.al. | 2411.07555 | translate | read | null |
| 2024-11-11 | Data-Centric Learning Framework for Real-Time Detection of Aiming Beam in Fluorescence Lifetime Imaging Guided Surgery | Mohamed Abul Hassan et.al. | 2411.07395 | translate | read | null |
| 2024-11-11 | SAMPart3D: Segment Any Part in 3D Objects | Yunhan Yang et.al. | 2411.07184 | translate | read | link |
| 2024-11-11 | SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation | Jiale Chen et.al. | 2411.06991 | translate | read | null |
| 2024-11-11 | Fast and Efficient Transformer-based Method for Bird’s Eye View Instance Prediction | Miguel Antunes-García et.al. | 2411.06851 | translate | read | link |
| 2024-11-11 | Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision | Yueyang Cang et.al. | 2411.06727 | translate | read | null |
| 2024-11-10 | Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments | Deegan Atha et.al. | 2411.06632 | translate | read | null |
| 2024-11-09 | Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing | Kaixuan Lu et.al. | 2411.06091 | translate | read | null |
| 2024-11-08 | Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model | Shuchang Lyu et.al. | 2411.05878 | translate | read | link |
| 2024-11-08 | Agricultural Landscape Understanding At Country-Scale | Radhika Dua et.al. | 2411.05359 | translate | read | null |
| 2024-11-08 | Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation | Sien Li et.al. | 2411.05307 | translate | read | link |
| 2024-11-07 | In the Era of Prompt Learning with Vision-Language Models | Ankit Jha et.al. | 2411.04892 | translate | read | null |
| 2024-11-08 | ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset | Olaf Wysocki et.al. | 2411.04865 | translate | read | link |
| 2024-11-06 | Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts | Zhitong Gao et.al. | 2411.03829 | translate | read | link |
| 2024-11-06 | SA3DIP: Segment Any 3D Instance with Potential 3D Priors | Xi Yang et.al. | 2411.03819 | translate | read | link |
| 2024-11-06 | Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model | Yansong Qu et.al. | 2411.03672 | translate | read | null |
| 2024-11-05 | Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation | Zhiling Yue et.al. | 2411.03551 | translate | read | null |
| 2024-11-05 | SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture | Andrew Heschl et.al. | 2411.03505 | translate | read | link |
| 2024-11-05 | Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need | Qishuai Wen et.al. | 2411.03033 | translate | read | link |
| 2024-11-05 | Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation | Xavier Timoneda et.al. | 2411.02969 | translate | read | null |
| 2024-11-05 | Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery | Mohammad Kakooei et.al. | 2411.02935 | translate | read | null |
| 2024-11-05 | CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation | Jinchao Ge et.al. | 2411.02715 | translate | read | null |
| 2024-11-04 | Deep Learning on 3D Semantic Segmentation: A Detailed Review | Thodoris Betsas et.al. | 2411.02104 | translate | read | null |
| 2024-11-04 | Tree level change detection over Ahmedabad city using very high resolution satellite images and Deep Learning | Jai G Singla et.al. | 2411.02009 | translate | read | null |
| 2024-11-04 | Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models | Sharat Agarwal et.al. | 2411.01925 | translate | read | null |
| 2024-11-04 | DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability | Bo Gao et.al. | 2411.01819 | translate | read | null |
| 2024-11-04 | Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations | Thanh Nguyen Canh et.al. | 2411.01816 | translate | read | null |
| 2024-11-05 | MSTA3D: Multi-scale Twin-attention for 3D Instance Segmentation | Duc Dang Trung Tran et.al. | 2411.01781 | translate | read | null |
| 2024-11-03 | PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation | Xinyu Xu et.al. | 2411.01624 | translate | read | null |
| 2024-11-01 | Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions | Lixiao Yang et.al. | 2411.01039 | translate | read | null |
| 2024-11-01 | Event-guided Low-light Video Semantic Segmentation | Zhen Yao et.al. | 2411.00639 | translate | read | null |
| 2024-11-01 | Automated Classification of Cell Shapes: A Comparative Evaluation of Shape Descriptors | Valentina Vadori et.al. | 2411.00561 | translate | read | null |