Semantic Segmentation - 2024-10
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2024-10-31 | Federated Black-Box Adaptation for Semantic Segmentation | Jay N. Paranjape et.al. | 2410.24181 | translate | read | null |
| 2024-10-31 | COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes | Muhammad Ali et.al. | 2410.24139 | translate | read | link |
| 2024-10-31 | Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model | Hao Zhang et.al. | 2410.23905 | translate | read | link |
| 2024-10-30 | S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving | Maciej K. Wozniak et.al. | 2410.23085 | translate | read | null |
| 2024-10-31 | CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation | Ziyang Gong et.al. | 2410.22629 | translate | read | link |
| 2024-10-29 | Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation | Zhaochong An et.al. | 2410.22489 | translate | read | link |
| 2024-10-29 | Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation | Jintao Tong et.al. | 2410.22135 | translate | read | null |
| 2024-10-29 | Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models | Imad Ali Shah et.al. | 2410.22101 | translate | read | null |
| 2024-10-29 | Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation | Ruihao Xia et.al. | 2410.21708 | translate | read | link |
| 2024-10-28 | Domain Adaptation with a Single Vision-Language Embedding | Mohammad Fahes et.al. | 2410.21361 | translate | read | null |
| 2024-10-28 | IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks | Manjunath D et.al. | 2410.20953 | translate | read | null |
| 2024-10-27 | A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models | Camilo Espinosa-Curilem et.al. | 2410.20595 | translate | read | link |
| 2024-10-27 | Unlocking Comics: The AI4VA Dataset for Visual Understanding | Peter Grönquist et.al. | 2410.20459 | translate | read | link |
| 2024-10-27 | Historical Test-time Prompt Tuning for Vision Foundation Models | Jingyi Zhang et.al. | 2410.20346 | translate | read | null |
| 2024-10-25 | OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery | Philipe Dias et.al. | 2410.19965 | translate | read | null |
| 2024-10-25 | IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation | Kaixian Qu et.al. | 2410.19697 | translate | read | null |
| 2024-10-25 | Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation | Yao Wu et.al. | 2410.19446 | translate | read | link |
| 2024-10-25 | Context-Based Visual-Language Place Recognition | Soojin Woo et.al. | 2410.19341 | translate | read | link |
| 2024-10-24 | Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks | Alexander Jaus et.al. | 2410.18684 | translate | read | null |
| 2024-10-24 | Unsupervised semantic segmentation of urban high-density multispectral point clouds | Oona Oinonen et.al. | 2410.18520 | translate | read | null |
| 2024-10-26 | CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator | Stefanos Pasios et.al. | 2410.18238 | translate | read | link |
| 2024-10-23 | Towards Safer Planetary Exploration: A Hybrid Architecture for Terrain Traversability Analysis in Mars Rovers | Achille Chiuchiarelli et.al. | 2410.17738 | translate | read | null |
| 2024-10-23 | YOLOv11: An Overview of the Key Architectural Enhancements | Rahima Khanam et.al. | 2410.17725 | translate | read | null |
| 2024-10-23 | PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting | Yu Wang et.al. | 2410.17505 | translate | read | null |
| 2024-10-22 | EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding | Zhiyi Pan et.al. | 2410.17207 | translate | read | null |
| 2024-10-22 | LIMIS: Towards Language-based Interactive Medical Image Segmentation | Lena Heinemann et.al. | 2410.16939 | translate | read | null |
| 2024-10-22 | DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Zhixiong Nan et.al. | 2410.16707 | translate | read | null |
| 2024-10-22 | SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments | Jumman Hossain et.al. | 2410.16686 | translate | read | null |
| 2024-10-22 | NucleiMix: Realistic Data Augmentation for Nuclei Instance Segmentation | Jiamu Wang et.al. | 2410.16671 | translate | read | null |
| 2024-10-21 | PlaneSAM: Multimodal Plane Instance Segmentation Using the Segment Anything Model | Zhongchen Deng et.al. | 2410.16545 | translate | read | null |
| 2024-10-21 | TIPS: Text-Image Pretraining with Spatial Awareness | Kevis-Kokitsi Maninis et.al. | 2410.16512 | translate | read | link |
| 2024-10-21 | GenGMM: Generalized Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation | Nazanin Moradinasab et.al. | 2410.16485 | translate | read | null |
| 2024-10-21 | Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation | Ruting Chi et.al. | 2410.16063 | translate | read | null |
| 2024-10-21 | LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training | Thomas Kreutz et.al. | 2410.15833 | translate | read | link |
| 2024-10-21 | TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight | Hyun-Kurl Jang et.al. | 2410.15674 | translate | read | link |
| 2024-10-21 | Deep Learning and Machine Learning – Object Detection and Semantic Segmentation: From Theory to Applications | Jintao Ren et.al. | 2410.15584 | translate | read | null |
| 2024-10-20 | Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation | Fnu Neha et.al. | 2410.15472 | translate | read | null |
| 2024-10-20 | Improving 3D Medical Image Segmentation at Boundary Regions using Local Self-attention and Global Volume Mixing | Daniya Najiha Abdul Kareem et.al. | 2410.15360 | translate | read | null |
| 2024-10-18 | On the Influence of Shape, Texture and Color for Learning Semantic Segmentation | Annika Mütze et.al. | 2410.14878 | translate | read | null |
| 2024-10-18 | Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+ | Arpan Mahara et.al. | 2410.14836 | translate | read | null |
| 2024-10-18 | Impact of imperfect annotations on CNN training and performance for instance segmentation and classification in digital pathology | Laura Gálvez Jiménez et.al. | 2410.14365 | translate | read | null |
| 2024-10-17 | ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding | Guangda Ji et.al. | 2410.13924 | translate | read | link |
| 2024-10-17 | Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks | Clément Playout et.al. | 2410.13822 | translate | read | link |
| 2024-10-18 | Enhanced Prompt-leveraged Weakly Supervised Cancer Segmentation based on Segment Anything | Joonhyeon Song et.al. | 2410.13621 | translate | read | link |
| 2024-10-17 | Day-Night Adaptation: An Innovative Source-free Adaptation Framework for Medical Image Segmentation | Ziyang Chen et.al. | 2410.13472 | translate | read | null |
| 2024-10-17 | SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation in Remote Sensing | Bin Wang et.al. | 2410.13471 | translate | read | link |
| 2024-10-17 | Railway LiDAR semantic segmentation based on intelligent semi-automated data annotation | Florian Wulff et.al. | 2410.13383 | translate | read | null |
| 2024-10-17 | LESS: Label-Efficient and Single-Stage Referring 3D Segmentation | Xuexun Liu et.al. | 2410.13294 | translate | read | link |
| 2024-10-17 | Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation | Houze Liu et.al. | 2410.13099 | translate | read | null |
| 2024-10-16 | Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation | Wenbo Xu et.al. | 2410.13094 | translate | read | null |
| 2024-10-16 | Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation | Anthony Opipari et.al. | 2410.12995 | translate | read | null |
| 2024-10-16 | Risk Assessment for Autonomous Landing in Urban Environments using Semantic Segmentation | Jesús Alejandro Loera-Ponce et.al. | 2410.12988 | translate | read | null |
| 2024-10-16 | VividMed: Vision Language Model with Versatile Visual Grounding for Medicine | Lingxiao Luo et.al. | 2410.12694 | translate | read | null |
| 2024-10-16 | Cascade learning in multi-task encoder-decoder networks for concurrent bone segmentation and glenohumeral joint assessment in shoulder CT scans | Luca Marsilio et.al. | 2410.12641 | translate | read | null |
| 2024-10-16 | Order-Aware Interactive Segmentation | Bin Wang et.al. | 2410.12214 | translate | read | null |
| 2024-10-16 | SAM-Guided Masked Token Prediction for 3D Scene Understanding | Zhimin Chen et.al. | 2410.12158 | translate | read | null |
| 2024-10-15 | WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation | Chenghao Qian et.al. | 2410.12075 | translate | read | link |
| 2024-10-15 | Development and Testing of a Wood Panels Bark Removal Equipment Based on Deep Learning | Rijun Wang et.al. | 2410.11913 | translate | read | null |
| 2024-10-15 | Fractal Calibration for long-tailed object detection | Konstantinos Panagiotis Alexandridis et.al. | 2410.11774 | translate | read | link |
| 2024-10-15 | RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation | Anton Antonov et.al. | 2410.11722 | translate | read | link |
| 2024-10-15 | InvSeg: Test-Time Prompt Inversion for Semantic Segmentation | Jiayi Lin et.al. | 2410.11473 | translate | read | null |
| 2024-10-15 | MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation | Xianping Ma et.al. | 2410.11160 | translate | read | link |
| 2024-10-14 | Locality Alignment Improves Vision-Language Models | Ian Covert et.al. | 2410.11087 | translate | read | null |
| 2024-10-14 | Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes | Tim Broedermann et.al. | 2410.10791 | translate | read | null |
| 2024-10-14 | UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation | Lihe Yang et.al. | 2410.10777 | translate | read | link |
| 2024-10-14 | PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion | Runsong Zhu et.al. | 2410.10659 | translate | read | link |
| 2024-10-14 | Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation | Daniel Fusaro et.al. | 2410.10510 | translate | read | link |
| 2024-10-14 | LKASeg: Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections | Xuezhi Xiang et.al. | 2410.10433 | translate | read | null |
| 2024-10-14 | V2M: Visual 2-Dimensional Mamba for Image Representation Learning | Chengkun Wang et.al. | 2410.10382 | translate | read | link |
| 2024-10-14 | GlobalMamba: Global Image Serialization for Vision Mamba | Chengkun Wang et.al. | 2410.10316 | translate | read | link |
| 2024-10-13 | UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation | Ye Sun et.al. | 2410.09909 | translate | read | null |
| 2024-10-13 | AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model | Yuchen Li et.al. | 2410.09714 | translate | read | null |
| 2024-10-12 | An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation | Wei Liang et.al. | 2410.09443 | translate | read | null |
| 2024-10-11 | Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation | Varduhi Yeghiazaryan et.al. | 2410.08946 | translate | read | null |
| 2024-10-11 | Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation | Hanieh Shojaei et.al. | 2410.08687 | translate | read | null |
| 2024-10-11 | DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention | Nguyen Huu Bao Long et.al. | 2410.08582 | translate | read | link |
| 2024-10-10 | Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? | Samir Abou Haidar et.al. | 2410.08365 | translate | read | null |
| 2024-10-10 | Interactive4D: Interactive 4D LiDAR Segmentation | Ilya Fradlin et.al. | 2410.08206 | translate | read | link |
| 2024-10-10 | Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation | Zhiyi Pan et.al. | 2410.08091 | translate | read | null |
| 2024-10-10 | Shift and matching queries for video semantic segmentation | Tsubasa Mizuno et.al. | 2410.07635 | translate | read | null |
| 2024-10-10 | 3D Vision-Language Gaussian Splatting | Qucheng Peng et.al. | 2410.07577 | translate | read | null |
| 2024-10-09 | Segmenting objects with Bayesian fusion of active contour models and convnet priors | Przemyslaw Polewski et.al. | 2410.07421 | translate | read | null |
| 2024-10-11 | Bridge the Points: Graph-based Few-shot Segment Anything Semantically | Anqi Zhang et.al. | 2410.06964 | translate | read | null |
| 2024-10-09 | Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation | Seungho Lee et.al. | 2410.06893 | translate | read | null |
| 2024-10-09 | Rethinking the Evaluation of Visible and Infrared Image Fusion | Dayan Guan et.al. | 2410.06811 | translate | read | link |
| 2024-10-10 | QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model | Fei Xie et.al. | 2410.06806 | translate | read | link |
| 2024-10-09 | Transesophageal Echocardiography Generation using Anatomical Models | Emmanuel Oladokun et.al. | 2410.06781 | translate | read | null |
| 2024-10-09 | Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy | Qinfeng Zhu et.al. | 2410.06725 | translate | read | null |
| 2024-10-09 | Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments | Meng Yu et.al. | 2410.06626 | translate | read | null |
| 2024-10-09 | Towards Natural Image Matting in the Wild via Real-Scenario Prior | Ruihao Xia et.al. | 2410.06593 | translate | read | link |
| 2024-10-08 | Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions | Mateus Karvat et.al. | 2410.06380 | translate | read | link |
| 2024-10-08 | Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts | Zhiwei Lin et.al. | 2410.05963 | translate | read | null |
| 2024-10-07 | Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation | Vince Zhu et.al. | 2410.04689 | translate | read | null |
| 2024-10-06 | In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding | Shenghao Li et.al. | 2410.04529 | translate | read | null |
| 2024-10-05 | ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments | Lorenzo Terenzi et.al. | 2410.04250 | translate | read | null |
| 2024-10-04 | SpecSAR-Former: A Lightweight Transformer-based Network for Global LULC Mapping Using Integrated Sentinel-1 and Sentinel-2 | Hao Yu et.al. | 2410.03962 | translate | read | null |
| 2024-10-04 | Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features | Benyuan Meng et.al. | 2410.03558 | translate | read | link |
| 2024-10-04 | Semantic Segmentation Based Quality Control of Histopathology Whole Slide Images | Abhijeet Patil et.al. | 2410.03289 | translate | read | link |
| 2024-10-04 | HRVMamba: High-Resolution Visual State Space Model for Dense Prediction | Hao Zhang et.al. | 2410.03174 | translate | read | link |
| 2024-10-03 | HiFiSeg: High-Frequency Information Enhanced Polyp Segmentation with Global-Local Vision Transformer | Jingjing Ren et.al. | 2410.02528 | translate | read | null |
| 2024-10-06 | SynCo: Synthetic Hard Negatives in Contrastive Learning for Better Unsupervised Visual Representations | Nikolaos Giakoumoglou et.al. | 2410.02401 | translate | read | link |
| 2024-10-04 | Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation | Muzhi Zhu et.al. | 2410.02369 | translate | read | link |
| 2024-10-03 | ProtoSeg: A Prototype-Based Point Cloud Instance Segmentation Method | Remco Royen et.al. | 2410.02352 | translate | read | null |
| 2024-10-03 | RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds | Remco Royen et.al. | 2410.02323 | translate | read | link |
| 2024-10-03 | Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network | Yangyang Qiu et.al. | 2410.02224 | translate | read | null |
| 2024-10-03 | Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images | Qingyuan Liu et.al. | 2410.02207 | translate | read | null |
| 2024-10-02 | SegEarth-OV: Towards Traning-Free Open-Vocabulary Segmentation for Remote Sensing Images | Kaiyu Li et.al. | 2410.01768 | translate | read | link |
| 2024-10-02 | One-Shot Robust Imitation Learning for Long-Horizon Visuomotor Tasks from Unsegmented Demonstrations | Shaokang Wu et.al. | 2410.01630 | translate | read | null |
| 2024-10-02 | Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation | Zhaofeng Shi et.al. | 2410.01341 | translate | read | null |
| 2024-10-02 | VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings | Andrea Carrara et.al. | 2410.01336 | translate | read | null |
| 2024-10-01 | RobustEMD: Domain Robust Matching for Cross-domain Few-shot Medical Image Segmentation | Yazhou Zhu et.al. | 2410.01110 | translate | read | null |
| 2024-10-01 | Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer | Vlatko Spasev et.al. | 2410.01092 | translate | read | null |
| 2024-10-01 | Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time | Chiao-An Yang et.al. | 2410.01083 | translate | read | link |
| 2024-10-01 | DeepAerialMapper: Deep Learning-based Semi-automatic HD Map Creation for Highly Automated Vehicles | Robert Krajewski et.al. | 2410.00769 | translate | read | null |
| 2024-10-01 | Optimizing Drug Delivery in Smart Pharmacies: A Novel Framework of Multi-Stage Grasping Network Combined with Adaptive Robotics Mechanism | Rui Tang et.al. | 2410.00753 | translate | read | null |
| 2024-10-01 | Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection | Pengxi Zeng et.al. | 2410.00582 | translate | read | null |
| 2024-10-01 | Get It For Free: Radar Segmentation without Expert Labels and Its Application in Odometry and Localization | Siru Li et.al. | 2409.18434 | translate | read | null |