Semantic Segmentation - 2025-06
Semantic Segmentation - 2025-06
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2025-06-30 | SurgiSR4K: A High-Resolution Endoscopic Video Dataset for Robotic-Assisted Minimally Invasive Procedures | Fengyi Jiang et.al. | 2507.00209 | translate | read | null |
| 2025-06-30 | Controllable Reference-Based Real-World Remote Sensing Image Super-Resolution with Generative Diffusion Priors | Ce Wang et.al. | 2506.23801 | translate | read | null |
| 2025-06-30 | Deep Learning-Based Semantic Segmentation for Real-Time Kidney Imaging and Measurements with Augmented Reality-Assisted Ultrasound | Gijs Luijten et.al. | 2506.23721 | translate | read | null |
| 2025-06-30 | PGOV3D: Open-Vocabulary 3D Semantic Segmentation with Partial-to-Global Curriculum | Shiqi Zhang et.al. | 2506.23607 | translate | read | null |
| 2025-06-30 | Interactive Interface For Semantic Segmentation Dataset Synthesis | Ngoc-Do Tran et.al. | 2506.23470 | translate | read | null |
| 2025-06-30 | Contrastive Learning with Diffusion Features for Weakly Supervised Medical Image Segmentation | Dewen Zeng et.al. | 2506.23460 | translate | read | null |
| 2025-06-29 | Layer Decomposition and Morphological Reconstruction for Task-Oriented Infrared Image Enhancement | Siyuan Chai et.al. | 2506.23353 | translate | read | null |
| 2025-06-29 | FastSeg: Efficient Training-Free Open-Vocabulary Segmentation via Hierarchical Attention Refinement Method | Quang-Huy Che et.al. | 2506.23323 | translate | read | null |
| 2025-06-29 | BPD-Neo: An MRI Dataset for Lung-Trachea Segmentation with Clinical Data for Neonatal Bronchopulmonary Dysplasia | Rachit Saluja et.al. | 2506.23305 | translate | read | null |
| 2025-06-29 | High-quality Pseudo-labeling for Point Cloud Segmentation with Scene-level Annotation | Lunhao Duan et.al. | 2506.23227 | translate | read | null |
| 2025-06-29 | DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation | Jihun Kim et.al. | 2506.23104 | translate | read | null |
| 2025-06-27 | Partial CLIP is Enough: Chimera-Seg for Zero-shot Semantic Segmentation | Jialei Chen et.al. | 2506.22032 | translate | read | null |
| 2025-06-27 | TASeg: Text-aware RGB-T Semantic Segmentation based on Fine-tuning Vision Foundation Models | Meng Yu et.al. | 2506.21975 | translate | read | null |
| 2025-06-27 | SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images | Naftaly Wambugu et.al. | 2506.21945 | translate | read | null |
| 2025-06-26 | Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object Detection | Tobias J. Riedlinger et.al. | 2506.21486 | translate | read | null |
| 2025-06-26 | PanSt3R: Multi-view Consistent Panoptic Segmentation | Lojze Zust et.al. | 2506.21348 | translate | read | null |
| 2025-06-26 | HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation | Diego Biagini et.al. | 2506.21287 | translate | read | null |
| 2025-06-27 | ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation | Xiwei Xuan et.al. | 2506.21233 | translate | read | null |
| 2025-06-26 | Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4 | Jongyeon Park et.al. | 2506.21174 | translate | read | null |
| 2025-06-27 | DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation | Wenzhou Lyu et.al. | 2506.21034 | translate | read | null |
| 2025-06-26 | TSDASeg: A Two-Stage Model with Direct Alignment for Interactive Point Cloud Segmentation | Chade Li et.al. | 2506.20991 | translate | read | null |
| 2025-06-26 | Segment Anything in Pathology Images with Natural Language | Zhixuan Chen et.al. | 2506.20988 | translate | read | null |
| 2025-06-25 | A Deep Learning Approach to Identify Rock Bolts in Complex 3D Point Clouds of Underground Mines Captured Using Mobile Laser Scanners | Dibyayan Patra et.al. | 2506.20464 | translate | read | null |
| 2025-06-26 | Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition | Man Duc Chuc et.al. | 2506.20174 | translate | read | null |
| 2025-06-24 | A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects | Shulan Ruan et.al. | 2506.19769 | translate | read | null |
| 2025-06-24 | USIS16K: High-Quality Dataset for Underwater Salient Instance Segmentation | Lin Hong et.al. | 2506.19472 | translate | read | null |
| 2025-06-24 | A Global-Local Cross-Attention Network for Ultra-high Resolution Remote Sensing Image Semantic Segmentation | Chen Yi et.al. | 2506.19406 | translate | read | null |
| 2025-06-25 | AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation | Ziyan Zhao et.al. | 2506.19269 | translate | read | null |
| 2025-06-23 | Orthogonal Projection Subspace to Aggregate Online Prior-knowledge for Continual Test-time Adaptation | Jinlong Li et.al. | 2506.19022 | translate | read | null |
| 2025-06-23 | Multi-Scale Spectral Attention Module-based Hyperspectral Segmentation in Autonomous Driving Scenarios | Imad Ali Shah et.al. | 2506.18682 | translate | read | null |
| 2025-06-23 | SafeClick: Error-Tolerant Interactive Segmentation of Any Medical Volumes via Hierarchical Expert Consensus | Yifan Gao et.al. | 2506.18404 | translate | read | null |
| 2025-06-23 | Jet Reconstruction with Mamba Networks in Collider Events | Jinmian Li et.al. | 2506.18336 | translate | read | null |
| 2025-06-22 | OSDMamba: Enhancing Oil Spill Detection from Remote Sensing Images Using Selective State Space Model | Shuaiyu Chen et.al. | 2506.18006 | translate | read | null |
| 2025-06-22 | Relation3D: Enhancing Relation Modeling for Point Cloud Instance Segmentation | Jiahao Lu et.al. | 2506.17891 | translate | read | null |
| 2025-06-22 | Cross-modal State Space Modeling for Real-time RGB-thermal Wild Scene Semantic Segmentation | Xiaodong Guo et.al. | 2506.17869 | translate | read | null |
| 2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159 | translate | read | link |
| 2025-06-20 | ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds | Binbin Xiang et.al. | 2506.16991 | translate | read | link |
| 2025-06-20 | LunarLoc: Segment-Based Global Localization on the Moon | Annika Thomas et.al. | 2506.16940 | translate | read | link |
| 2025-06-19 | From Semantic To Instance: A Semi-Self-Supervised Learning Approach | Keyhan Najafian et.al. | 2506.16563 | translate | read | null |
| 2025-06-19 | Structured Semantic 3D Reconstruction (S23DR) Challenge 2025 – Winning solution | Jan Skvrna et.al. | 2506.16421 | translate | read | null |
| 2025-06-19 | LBMamba: Locally Bi-directional Mamba | Jingwei Zhang et.al. | 2506.15976 | translate | read | link |
| 2025-06-19 | Heterogeneous-Modal Unsupervised Domain Adaptation via Latent Space Bridging | Jiawen Yang et.al. | 2506.15971 | translate | read | null |
| 2025-06-19 | Polyline Path Masked Attention for Vision Transformer | Zhongchen Zhao et.al. | 2506.15940 | translate | read | link |
| 2025-06-18 | MapFM: Foundation Model-Driven HD Mapping with Multi-Task Contextual Learning | Leonid Ivanov et.al. | 2506.15313 | translate | read | link |
| 2025-06-18 | Enhancing point cloud analysis via neighbor aggregation correction based on cross-stage structure correlation | Jiaqi Shi et.al. | 2506.15160 | translate | read | link |
| 2025-06-17 | Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset | Nikolaos Dionelis et.al. | 2506.14765 | translate | read | null |
| 2025-06-17 | FocalClick-XL: Towards Unified and High-quality Interactive Segmentation | Xi Chen et.al. | 2506.14686 | translate | read | null |
| 2025-06-17 | VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy | Zhuoyue Tan et.al. | 2506.14525 | translate | read | null |
| 2025-06-17 | DepthSeg: Depth prompting in remote sensing semantic segmentation | Ning Zhou et.al. | 2506.14382 | translate | read | null |
| 2025-06-17 | Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment | Weiming Zhang et.al. | 2506.14271 | translate | read | null |
| 2025-06-16 | HierVL: Semi-Supervised Segmentation leveraging Hierarchical Vision-Language Synergy with Dynamic Text-Spatial Query Alignment | Numair Nadeem et.al. | 2506.13925 | translate | read | null |
| 2025-06-16 | A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects | Guohuan Xie et.al. | 2506.13552 | translate | read | null |
| 2025-06-16 | Open-Set LiDAR Panoptic Segmentation Guided by Uncertainty-Aware Learning | Rohit Mohan et.al. | 2506.13265 | translate | read | null |
| 2025-06-16 | ViewPCL: a point cloud based active learning method for multi-view segmentation | Christian Hilaire et.al. | 2506.13043 | translate | read | null |
| 2025-06-15 | A large-scale, physically-based synthetic dataset for satellite pose estimation | Szabolcs Velkei et.al. | 2506.12782 | translate | read | null |
| 2025-06-15 | Unleashing Diffusion and State Space Models for Medical Image Segmentation | Rong Wu et.al. | 2506.12747 | translate | read | null |
| 2025-06-15 | Combining Self-attention and Dilation Convolutional for Semantic Segmentation of Coal Maceral Groups | Zhenghao Xi et.al. | 2506.12712 | translate | read | null |
| 2025-06-13 | O2Former:Direction-Aware and Multi-Scale Query Enhancement for SAR Ship Instance Segmentation | F. Gao et.al. | 2506.11913 | translate | read | null |
| 2025-06-13 | Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling | Yunhan Ren et.al. | 2506.11661 | translate | read | null |
| 2025-06-13 | A $^2$ LC: Active and Automated Label Correction for Semantic Segmentation | Youjin Jeon et.al. | 2506.11599 | translate | read | null |
| 2025-06-13 | OV-MAP : Open-Vocabulary Zero-Shot 3D Instance Segmentation Map for Robots | Juno Kim et.al. | 2506.11585 | translate | read | null |
| 2025-06-12 | GynSurg: A Comprehensive Gynecology Laparoscopic Surgery Dataset | Sahar Nasirihaghighi et.al. | 2506.11356 | translate | read | null |
| 2025-06-12 | Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes | Masahiro Yasuda et.al. | 2506.10676 | translate | read | link |
| 2025-06-12 | Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models | Francisco Caetano et.al. | 2506.10634 | translate | read | link |
| 2025-06-12 | Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration | Jun Wang et.al. | 2506.10573 | translate | read | null |
| 2025-06-12 | ALBERT: Advanced Localization and Bidirectional Encoder Representations from Transformers for Automotive Damage Evaluation | Teerapong Panboonyuen et.al. | 2506.10524 | translate | read | null |
| 2025-06-12 | Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation | Shuyang Li et.al. | 2506.10503 | translate | read | null |
| 2025-06-12 | Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success | Che Wang et.al. | 2506.10359 | translate | read | null |
| 2025-06-11 | Deep Semantic Segmentation for Multi-Source Localization Using Angle of Arrival Measurements | Mustafa Atahan Nuhoglu et.al. | 2506.10107 | translate | read | null |
| 2025-06-11 | Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation | Siyu Chen et.al. | 2506.09881 | translate | read | link |
| 2025-06-11 | Accurate and efficient zero-shot 6D pose estimation with frozen foundation models | Andrea Caraffa et.al. | 2506.09784 | translate | read | null |
| 2025-06-11 | The Four Color Theorem for Cell Instance Segmentation | Ye Zhang et.al. | 2506.09724 | translate | read | link |
| 2025-06-11 | Enhancing Human-Robot Collaboration: A Sim2Real Domain Adaptation Algorithm for Point Cloud Segmentation in Industrial Environments | Fatemeh Mohammadi Amin et.al. | 2506.09552 | translate | read | null |
| 2025-06-12 | Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20 $^{th}$ century Urban Landscapes with Satellite Imageries | Tianxiang Hao et.al. | 2506.09476 | translate | read | null |
| 2025-06-11 | MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning | Tong Wang et.al. | 2506.09327 | translate | read | null |
| 2025-06-10 | WetCat: Automating Skill Assessment in Wetlab Cataract Surgery Videos | Negin Ghamsarian et.al. | 2506.08896 | translate | read | null |
| 2025-06-11 | RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation | Jiayi Song et.al. | 2506.08772 | translate | read | null |
| 2025-06-10 | ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction | Juan Yeo et.al. | 2506.08678 | translate | read | null |
| 2025-06-10 | ECMNet:Lightweight Semantic Segmentation with Efficient CNN-Mamba Network | Feixiang Du et.al. | 2506.08629 | translate | read | null |
| 2025-06-09 | LogoSP: Local-global Grouping of Superpoints for Unsupervised Semantic Segmentation of 3D Point Clouds | Zihui Zhang et.al. | 2506.07857 | translate | read | null |
| 2025-06-09 | SAM2Auto: Auto Annotation Using FLASH | Arash Rocky et.al. | 2506.07850 | translate | read | null |
| 2025-06-09 | F2Net: A Frequency-Fused Network for Ultra-High Resolution Remote Sensing Segmentation | Hengzhi Chen et.al. | 2506.07847 | translate | read | null |
| 2025-06-09 | Trend-Aware Fashion Recommendation with Visual Segmentation and Semantic Similarity | Mohamed Djilani et.al. | 2506.07773 | translate | read | null |
| 2025-06-09 | OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting | Jens Piekenbrinck et.al. | 2506.07697 | translate | read | null |
| 2025-06-09 | Adapter Naturally Serves as Decoupler for Cross-Domain Few-Shot Semantic Segmentation | Jintao Tong et.al. | 2506.07376 | translate | read | null |
| 2025-06-09 | Multiple Object Stitching for Unsupervised Representation Learning | Chengchao Shen et.al. | 2506.07364 | translate | read | link |
| 2025-06-08 | BRIGHT+: Upgrading the BRIGHT Benchmark with MARCUS, a Multi-Agent RAG Clean-Up Suite | Liyang Chen et.al. | 2506.07116 | translate | read | null |
| 2025-06-08 | Technical Report for ICRA 2025 GOOSE 3D Semantic Segmentation Challenge: Adaptive Point Cloud Understanding for Heterogeneous Robotic Systems | Xiaoya Zhang et.al. | 2506.06995 | translate | read | null |
| 2025-06-07 | Position Prediction Self-Supervised Learning for Multimodal Satellite Imagery Semantic Segmentation | John Waithaka et.al. | 2506.06852 | translate | read | null |
| 2025-06-06 | Rethinking Semi-supervised Segmentation Beyond Accuracy: Reliability and Robustness | Steven Landgraf et.al. | 2506.05917 | translate | read | null |
| 2025-06-06 | You Only Estimate Once: Unified, One-stage, Real-Time Category-level Articulated Object 6D Pose Estimation for Robotic Grasping | Jingshun Huang et.al. | 2506.05719 | translate | read | null |
| 2025-06-05 | FRAME: Pre-Training Video Feature Representations via Anticipation and Memory | Sethuraman TV et.al. | 2506.05543 | translate | read | null |
| 2025-06-05 | U-NetMN and SegNetMN: Modified U-Net and SegNet models for bimodal SAR image segmentation | Marwane Kzadri et.al. | 2506.05444 | translate | read | null |
| 2025-06-05 | Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting | Alfred T. Christiansen et.al. | 2506.05009 | translate | read | null |
| 2025-06-05 | Bringing SAM to new heights: Leveraging elevation data for tree crown segmentation from drone imagery | Mélisande Teng et.al. | 2506.04970 | translate | read | null |
| 2025-06-05 | CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx | Lukas Picek et.al. | 2506.04931 | translate | read | null |
| 2025-06-05 | OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language Model | Kunshen Zhang et.al. | 2506.04837 | translate | read | null |
| 2025-06-05 | Gen-n-Val: Agentic Image Data Generation and Validation | Jing-En Huang et.al. | 2506.04676 | translate | read | null |
| 2025-06-04 | You Only Train Once | Christos Sakaridis et.al. | 2506.04349 | translate | read | null |
| 2025-06-04 | AetherVision-Bench: An Open-Vocabulary RGB-Infrared Benchmark for Multi-Angle Segmentation across Aerial and Ground Perspectives | Aniruddh Sikdar et.al. | 2506.03709 | translate | read | null |
| 2025-06-04 | OV-COAST: Cost Aggregation with Optimal Transport for Open-Vocabulary Semantic Segmentation | Aditya Gandhamal et.al. | 2506.03706 | translate | read | null |
| 2025-06-04 | BiXFormer: A Robust Framework for Maximizing Modality Effectiveness in Multi-Modal Semantic Segmentation | Jialei Chen et.al. | 2506.03675 | translate | read | null |
| 2025-06-03 | Cross-Modal Urban Sensing: Evaluating Sound-Vision Alignment Across Street-Level and Aerial Imagery | Pengyu Chen et.al. | 2506.03388 | translate | read | null |
| 2025-06-03 | Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding | Weiqing Xiao et.al. | 2506.03134 | translate | read | link |
| 2025-06-03 | GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal | Shufan Qing et.al. | 2506.02736 | translate | read | link |
| 2025-06-03 | Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather | Longyu Yang et.al. | 2506.02396 | translate | read | null |
| 2025-06-04 | SAB3R: Semantic-Augmented Backbone in 3D Reconstruction | Xuweiyi Chen et.al. | 2506.02112 | translate | read | null |
| 2025-06-02 | SEMNAV: A Semantic Segmentation-Driven Approach to Visual Semantic Navigation | Rafael Flor-Rodríguez et.al. | 2506.01418 | translate | read | null |
| 2025-06-01 | Perceptual Inductive Bias Is What You Need Before Contrastive Learning | Tianqin Li et.al. | 2506.01201 | translate | read | null |
| 2025-06-01 | GeoChain: Multimodal Chain-of-Thought for Geographic Reasoning | Sahiti Yerramilli et.al. | 2506.00785 | translate | read | null |
| 2025-06-02 | NUC-Net: Non-uniform Cylindrical Partition Network for Efficient LiDAR Semantic Segmentation | Xuzhi Wang et.al. | 2505.24634 | translate | read | null |
(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)