Semantic Segmentation - 2024-03
Semantic Segmentation - 2024-03
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-03-29 | ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning | Beomyoung Kim et.al. | 2403.20126 | translate | read | link |
| 2024-03-29 | Modeling Weather Uncertainty for Multi-weather Co-Presence Estimation | Qi Bi et.al. | 2403.20092 | translate | read | null |
| 2024-03-29 | Using Images as Covariates: Measuring Curb Appeal with Deep Learning | Ardyn Nordstrom et.al. | 2403.19915 | translate | read | null |
| 2024-03-29 | MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection | Ali Behrouz et.al. | 2403.19888 | translate | read | link |
| 2024-03-28 | Segmentation Re-thinking Uncertainty Estimation Metrics for Semantic Segmentation | Qitian Ma et.al. | 2403.19826 | translate | read | null |
| 2024-03-28 | ENet-21: An Optimized light CNN Structure for Lane Detection | Seyed Rasoul Hosseini et.al. | 2403.19782 | translate | read | null |
| 2024-03-29 | Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers | Pingcheng Dong et.al. | 2403.19591 | translate | read | link |
| 2024-03-28 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | Donghyun Kim et.al. | 2403.19588 | translate | read | link |
| 2024-03-28 | Learning Multiple Representations with Inconsistency-Guided Detail Regularization for Mask-Guided Matting | Weihao Jiang et.al. | 2403.19213 | translate | read | null |
| 2024-03-27 | Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D | Mukund Varma T et.al. | 2403.18922 | translate | read | null |
| 2024-03-27 | Annolid: Annotate, Segment, and Track Anything You Need | Chen Yang et.al. | 2403.18690 | translate | read | null |
| 2024-03-27 | I2CKD : Intra- and Inter-Class Knowledge Distillation for Semantic Segmentation | Ayoub Karine et.al. | 2403.18490 | translate | read | null |
| 2024-03-28 | ViTAR: Vision Transformer with Any Resolution | Qihang Fan et.al. | 2403.18361 | translate | read | null |
| 2024-03-27 | Generating Diverse Agricultural Data for Vision-Based Farming Applications | Mikolaj Cieslak et.al. | 2403.18351 | translate | read | null |
| 2024-03-27 | Road Obstacle Detection based on Unknown Objectness Scores | Chihiro Noguchi et.al. | 2403.18207 | translate | read | null |
| 2024-03-26 | Spectral Convolutional Transformer: Harmonizing Real vs. Complex Multi-View Spectral Operators for Vision Transformer | Badri N. Patro et.al. | 2403.18063 | translate | read | link |
| 2024-03-26 | The Need for Speed: Pruning Transformers with One Recipe | Samir Khaki et.al. | 2403.17921 | translate | read | link |
| 2024-03-26 | Compressed Multi-task embeddings for Data-Efficient Downstream training and inference in Earth Observation | Carlos Gomes et.al. | 2403.17886 | translate | read | null |
| 2024-03-26 | PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition | Chenhongyi Yang et.al. | 2403.17695 | translate | read | link |
| 2024-03-26 | Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion | Kazi Shahriar Sanjid et.al. | 2403.17432 | translate | read | null |
| 2024-03-25 | Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions | Ye Li et.al. | 2403.17009 | translate | read | link |
| 2024-03-25 | DreamLIP: Language-Image Pre-training with Long Captions | Kecheng Zheng et.al. | 2403.17007 | translate | read | link |
| 2024-03-25 | TwinLiteNetPlus: A Stronger Model for Real-time Drivable Area and Lane Segmentation | Quang-Huy Che et.al. | 2403.16958 | translate | read | null |
| 2024-03-25 | HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation | Linglin Jing et.al. | 2403.16788 | translate | read | null |
| 2024-03-25 | Clustering Propagation for Universal Medical Image Segmentation | Yuhang Ding et.al. | 2403.16646 | translate | read | null |
| 2024-03-25 | SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation | Aysim Toker et.al. | 2403.16605 | translate | read | null |
| 2024-03-25 | Self-Supervised Learning for Medical Image Data with Anatomy-Oriented Imaging Planes | Tianwei Zhang et.al. | 2403.16499 | translate | read | null |
| 2024-03-25 | GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation | Weiming Zhang et.al. | 2403.16370 | translate | read | null |
| 2024-03-24 | AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D Scans | Cedric Perauer et.al. | 2403.16318 | translate | read | null |
| 2024-03-24 | Dual-modal Prior Semantic Guided Infrared and Visible Image Fusion for Intelligent Transportation System | Jing Li et.al. | 2403.16227 | translate | read | null |
| 2024-03-24 | Segment Anything Model for Road Network Graph Extraction | Congrui Hetang et.al. | 2403.16051 | translate | read | link |
| 2024-03-24 | SM2C: Boost the Semi-supervised Segmentation for Medical Image by using Meta Pseudo Labels and Mixed Images | Yifei Wang et.al. | 2403.16009 | translate | read | null |
| 2024-03-22 | Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting | Jun Guo et.al. | 2403.15624 | translate | read | null |
| 2024-03-22 | A2DMN: Anatomy-Aware Dilated Multiscale Network for Breast Ultrasound Semantic Segmentation | Kyle Lucke et.al. | 2403.15560 | translate | read | null |
| 2024-03-22 | InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding | Yi Wang et.al. | 2403.15377 | translate | read | link |
| 2024-03-22 | Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations | Pranav Kulkarni et.al. | 2403.15218 | translate | read | null |
| 2024-03-22 | Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion | Sofia Casarin et.al. | 2403.15194 | translate | read | null |
| 2024-03-22 | IFSENet : Harnessing Sparse Iterations for Interactive Few-shot Segmentation Excellence | Shreyas Chandgothia et.al. | 2403.15089 | translate | read | null |
| 2024-03-22 | Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model using 3D Whole-body CT Scans | Heng Guo et.al. | 2403.15063 | translate | read | null |
| 2024-03-22 | BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation | Jiahao Lu et.al. | 2403.15019 | translate | read | null |
| 2024-03-22 | Improve Cross-domain Mixed Sampling with Guidance Training for Adaptive Segmentation | Wenlve Zhou et.al. | 2403.14995 | translate | read | null |
| 2024-03-21 | WeatherProof: Leveraging Language Guidance for Semantic Segmentation in Adverse Weather | Blake Gella et.al. | 2403.14874 | translate | read | null |
| 2024-03-21 | PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model | Zheng Zhang et.al. | 2403.14598 | translate | read | link |
| 2024-03-21 | Learning to Project for Cross-Task Knowledge Distillation | Dylan Auty et.al. | 2403.14494 | translate | read | null |
| 2024-03-21 | OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation | Bohao Peng et.al. | 2403.14418 | translate | read | link |
| 2024-03-21 | Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models | Pablo Marcos-Manchón et.al. | 2403.14291 | translate | read | link |
| 2024-03-21 | OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation | Kwanyoung Kim et.al. | 2403.14183 | translate | read | null |
| 2024-03-21 | Evidential Semantic Mapping in Off-road Environments with Uncertainty-aware Bayesian Kernel Inference | Junyoung Kim et.al. | 2403.14138 | translate | read | null |
| 2024-03-21 | Soft Masked Transformer for Point Cloud Processing with Skip Attention-Based Upsampling | Yong He et.al. | 2403.14124 | translate | read | null |
| 2024-03-21 | Semantics from Space: Satellite-Guided Thermal Semantic Segmentation Annotation for Aerial Field Robots | Connor Lee et.al. | 2403.14056 | translate | read | null |
| 2024-03-20 | When Cars meet Drones: Hyperbolic Federated Learning for Source-Free Domain Adaptation in Adverse Weather | Giulia Rizzoli et.al. | 2403.13762 | translate | read | null |
| 2024-03-20 | Next day fire prediction via semantic segmentation | Konstantinos Alexis et.al. | 2403.13545 | translate | read | null |
| 2024-03-20 | MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining | Di Wang et.al. | 2403.13430 | translate | read | link |
| 2024-03-20 | AMCO: Adaptive Multimodal Coupling of Vision and Proprioception for Quadruped Robot Navigation in Outdoor Environments | Mohamed Elnoor et.al. | 2403.13235 | translate | read | null |
| 2024-03-20 | Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation | Linshan Wu et.al. | 2403.13225 | translate | read | link |
| 2024-03-19 | Reflectivity Is All You Need!: Advancing LiDAR Semantic Segmentation | Kasi Viswanath et.al. | 2403.13188 | translate | read | null |
| 2024-03-19 | As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks? | Anjun Hu et.al. | 2403.12693 | translate | read | null |
| 2024-03-19 | PCT: Perspective Cue Training Framework for Multi-Camera BEV Segmentation | Haruya Ishikawa et.al. | 2403.12530 | translate | read | null |
| 2024-03-19 | Semantics, Distortion, and Style Matter: Towards Source-free UDA for Panoramic Segmentation | Xu Zheng et.al. | 2403.12505 | translate | read | null |
| 2024-03-19 | CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation | Wenqi Zhu et.al. | 2403.12455 | translate | read | link |
| 2024-03-19 | Multi-Object RANSAC: Efficient Plane Clustering Method in a Clutter | Seunghyeon Lim et.al. | 2403.12449 | translate | read | null |
| 2024-03-18 | EffiPerception: an Efficient Framework for Various Perception Tasks | Xinhao Xiang et.al. | 2403.12317 | translate | read | null |
| 2024-03-18 | Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Yuqi Zhang et.al. | 2403.11812 | translate | read | null |
| 2024-03-18 | Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation | Wangbo Zhao et.al. | 2403.11808 | translate | read | link |
| 2024-03-18 | LSKNet: A Foundation Lightweight Backbone for Remote Sensing | Yuxuan Li et.al. | 2403.11735 | translate | read | null |
| 2024-03-18 | TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models | Lisa Weijler et.al. | 2403.11691 | translate | read | null |
| 2024-03-18 | Better (pseudo-)labels for semi-supervised instance segmentation | François Porcher et.al. | 2403.11675 | translate | read | null |
| 2024-03-18 | Synthesizing multi-log grasp poses | Arvid Fälldin et.al. | 2403.11623 | translate | read | null |
| 2024-03-18 | OurDB: Ouroboric Domain Bridging for Multi-Target Domain Adaptive Semantic Segmentation | Seungbeom Woo et.al. | 2403.11582 | translate | read | null |
| 2024-03-18 | MISS: Memory-efficient Instance Segmentation Framework By Visual Inductive Priors Flow Propagation | Chih-Chung Hsu et.al. | 2403.11576 | translate | read | null |
| 2024-03-18 | Augment Before Copy-Paste: Data and Memory Efficiency-Oriented Instance Segmentation Framework for Sport-scenes | Chih-Chung Hsu et.al. | 2403.11572 | translate | read | null |
| 2024-03-18 | Circle Representation for Medical Instance Object Segmentation | Juming Xiong et.al. | 2403.11507 | translate | read | link |
| 2024-03-18 | MCD: Diverse Large-Scale Multi-Campus Dataset for Robot Perception | Thien-Minh Nguyen et.al. | 2403.11496 | translate | read | null |
| 2024-03-18 | Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting | Mingkui Tan et.al. | 2403.11491 | translate | read | null |
| 2024-03-18 | ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation | Minh Tran et.al. | 2403.11376 | translate | read | null |
| 2024-03-14 | PosSAM: Panoptic Open-vocabulary Segment Anything | Vibashan VS et.al. | 2403.09620 | translate | read | link |
| 2024-03-14 | WeakSurg: Weakly supervised surgical instrument segmentation using temporal equivariance and semantic continuity | Qiyuan Wang et.al. | 2403.09551 | translate | read | null |
| 2024-03-14 | Annotation Free Semantic Segmentation with Vision Foundation Models | Soroush Seifi et.al. | 2403.09307 | translate | read | null |
| 2024-03-14 | StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images | Robert Jewsbury et.al. | 2403.09302 | translate | read | link |
| 2024-03-14 | Customizing Segmentation Foundation Model via Prompt Learning for Instance Segmentation | Hyung-Il Kim et.al. | 2403.09199 | translate | read | null |
| 2024-03-14 | When Semantic Segmentation Meets Frequency Aliasing | Linwei Chen et.al. | 2403.09065 | translate | read | link |
| 2024-03-13 | CART: Caltech Aerial RGB-Thermal Dataset in the Wild | Connor Lee et.al. | 2403.08997 | translate | read | link |
| 2024-03-13 | SLCF-Net: Sequential LiDAR-Camera Fusion for Semantic Scene Completion using a 3D Recurrent U-Net | Helin Cao et.al. | 2403.08885 | translate | read | null |
| 2024-03-13 | Segmentation of Knee Bones for Osteoarthritis Assessment: A Comparative Analysis of Supervised, Few-Shot, and Zero-Shot Learning Approaches | Yun Xin Teoh et.al. | 2403.08761 | translate | read | null |
| 2024-03-13 | Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution | Samuel Sze et.al. | 2403.08748 | translate | read | null |
| 2024-03-13 | Semantic Segmentation of Solar Radio Spikes at Low Frequencies | Pearse C. Murphy et.al. | 2403.08546 | translate | read | null |
| 2024-03-13 | Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation | Zicheng Zhang et.al. | 2403.08426 | translate | read | null |
| 2024-03-13 | LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving | Sicen Guo et.al. | 2403.08215 | translate | read | null |
| 2024-03-13 | Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks | Fuzhi Wu et.al. | 2403.08157 | translate | read | link |
| 2024-03-12 | Mitigating the Impact of Attribute Editing on Face Recognition | Sudipta Banerjee et.al. | 2403.08092 | translate | read | null |
| 2024-03-12 | Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation | Feilong Tang et.al. | 2403.07630 | translate | read | link |
| 2024-03-12 | PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution | Honghao Chen et.al. | 2403.07589 | translate | read | null |
| 2024-03-12 | Open-World Semantic Segmentation Including Class Similarity | Matteo Sodano et.al. | 2403.07532 | translate | read | null |
| 2024-03-11 | Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation | Theodore Barfoot et.al. | 2403.06759 | translate | read | link |
| 2024-03-11 | Forest Inspection Dataset for Aerial Semantic Segmentation and Depth Estimation | Bianca-Cerasela-Zelia Blaga et.al. | 2403.06621 | translate | read | link |
| 2024-03-11 | OMH: Structured Sparsity via Optimally Matched Hierarchy for Unsupervised Semantic Segmentation | Baran Ozaydin et.al. | 2403.06546 | translate | read | null |
| 2024-03-11 | 3D Semantic Segmentation-Driven Representations for 3D Object Detection | Hayeon O et.al. | 2403.06501 | translate | read | link |
| 2024-03-11 | Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy | Jiuming Liu et.al. | 2403.06467 | translate | read | link |
| 2024-03-11 | Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation | Xiaoyang Wang et.al. | 2403.06462 | translate | read | null |
| 2024-03-11 | Refining Segmentation On-the-Fly: An Interactive Framework for Point Cloud Semantic Segmentation | Peng Zhang et.al. | 2403.06401 | translate | read | null |
| 2024-03-10 | Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning | Woo-Jin Ahn et.al. | 2403.06122 | translate | read | link |
| 2024-03-09 | Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation | Hairong Shi et.al. | 2403.05912 | translate | read | null |
| 2024-03-09 | Segmentation Guided Sparse Transformer for Under-Display Camera Image Restoration | Jingyun Xue et.al. | 2403.05906 | translate | read | null |
| 2024-03-08 | Attention-guided Feature Distillation for Semantic Segmentation | Amir M. Mansourian et.al. | 2403.05451 | translate | read | link |
| 2024-03-08 | Generalized Correspondence Matching via Flexible Hierarchical Refinement and Patch Descriptor Distillation | Yu Han et.al. | 2403.05388 | translate | read | null |
| 2024-03-08 | Frequency-Adaptive Dilated Convolution for Semantic Segmentation | Linwei Chen et.al. | 2403.05369 | translate | read | link |
| 2024-03-08 | Embedded Deployment of Semantic Segmentation in Medicine through Low-Resolution Inputs | Erik Ostrowski et.al. | 2403.05340 | translate | read | null |
| 2024-03-08 | LVIC: Multi-modality segmentation by Lifting Visual Info as Cue | Zichao Dong et.al. | 2403.05159 | translate | read | null |
| 2024-03-07 | SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising | Tao Zhou et.al. | 2403.04194 | translate | read | link |
| 2024-03-06 | ECAP: Extensive Cut-and-Paste Augmentation for Unsupervised Domain Adaptive Semantic Segmentation | Erik Brorsson et.al. | 2403.03854 | translate | read | link |
| 2024-03-06 | Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision | Yajie Liu et.al. | 2403.03707 | translate | read | null |
| 2024-03-06 | Causal Prototype-inspired Contrast Adaptation for Unsupervised Domain Adaptive Semantic Segmentation of High-resolution Remote Sensing Imagery | Jingru Zhu et.al. | 2403.03704 | translate | read | null |
| 2024-03-06 | GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding | Zi-Ting Chou et.al. | 2403.03608 | translate | read | null |
| 2024-03-06 | Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator | Wonhyeok Choi et.al. | 2403.03468 | translate | read | null |
| 2024-03-05 | CenterDisks: Real-time instance segmentation with disk covering | Katia Jodogne-Del Litto et.al. | 2403.03296 | translate | read | link |
| 2024-03-05 | Improved LiDAR Odometry and Mapping using Deep Semantic Segmentation and Novel Outliers Detection | Mohamed Afifi et.al. | 2403.03111 | translate | read | null |
| 2024-03-05 | ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving | Han Lu et.al. | 2403.02877 | translate | read | null |
| 2024-03-05 | DDF: A Novel Dual-Domain Image Fusion Strategy for Remote Sensing Image Semantic Segmentation with Unsupervised Domain Adaptation | Lingyan Ran et.al. | 2403.02784 | translate | read | null |
| 2024-03-05 | Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels | Zhuohong Li et.al. | 2403.02746 | translate | read | null |
| 2024-03-05 | FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird’s-Eye View and Perspective View | Jiawei Hou et.al. | 2403.02710 | translate | read | null |
| 2024-03-05 | Deep Common Feature Mining for Efficient Video Semantic Segmentation | Yaoyan Zheng et.al. | 2403.02689 | translate | read | null |
| 2024-03-04 | Self-Supervised Facial Representation Learning with Facial Region Awareness | Zheng Gao et.al. | 2403.02138 | translate | read | null |
| 2024-03-04 | Semi-Supervised Semantic Segmentation Based on Pseudo-Labels: A Survey | Lingyan Ran et.al. | 2403.01909 | translate | read | null |
| 2024-03-04 | Map-aided annotation for pole base detection | Benjamin Missaoui et.al. | 2403.01868 | translate | read | null |
| 2024-03-04 | AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation | Haonan Wang et.al. | 2403.01818 | translate | read | link |
| 2024-03-02 | Benchmarking Segmentation Models with Mask-Preserved Attribute Editing | Zijin Yin et.al. | 2403.01231 | translate | read | link |
| 2024-03-02 | Boosting Box-supervised Instance Segmentation with Pseudo Depth | Xinyi Yu et.al. | 2403.01214 | translate | read | null |
| 2024-03-02 | Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation | Lian Xu et.al. | 2403.01156 | translate | read | null |
| 2024-03-01 | Rethinking Few-shot 3D Point Cloud Semantic Segmentation | Zhaochong An et.al. | 2403.00592 | translate | read | link |
| 2024-03-01 | Small, Versatile and Mighty: A Range-View Perception Framework | Qiang Meng et.al. | 2403.00325 | translate | read | null |
| 2024-03-01 | YOLO-MED : Multi-Task Interaction Network for Biomedical Images | Suizhi Huang et.al. | 2403.00245 | translate | read | null |
| 2024-03-01 | PEM: Prototype-based Efficient MaskFormer for Image Segmentation | Niccolò Cavagnero et.al. | 2402.19422 | translate | read | link |
(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)