Semantic Segmentation - 2025-07
Semantic Segmentation - 2025-07
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2025-07-25 | Co-Win: Joint Object Detection and Instance Segmentation in LiDAR Point Clouds via Collaborative Window Processing | Haichuan Li et.al. | 2507.19691 | translate | read | null |
| 2025-07-25 | SurgPIS: Surgical-instrument-level Instances and Part-level Semantics for Weakly-supervised Part-aware Instance Segmentation | Meng Wei et.al. | 2507.19592 | translate | read | null |
| 2025-07-24 | HybridTM: Combining Transformer and Mamba for 3D Semantic Segmentation | Xinyu Wang et.al. | 2507.18575 | translate | read | null |
| 2025-07-24 | Synthetic Data Augmentation for Enhanced Chicken Carcass Instance Segmentation | Yihong Feng et.al. | 2507.18558 | translate | read | null |
| 2025-07-24 | Iwin Transformer: Hierarchical Vision Transformer using Interleaved Windows | Simin Huo et.al. | 2507.18405 | translate | read | link |
| 2025-07-24 | GVCCS: A Dataset for Contrail Identification and Tracking on Visible Whole Sky Camera Sequences | Gabriel Jarry et.al. | 2507.18330 | translate | read | null |
| 2025-07-24 | SemiSegECG: A Multi-Dataset Benchmark for Semi-Supervised Semantic Segmentation in ECG Delineation | Minje Park et.al. | 2507.18323 | translate | read | link |
| 2025-07-24 | Unsupervised Domain Adaptation for 3D LiDAR Semantic Segmentation Using Contrastive Learning and Multi-Model Pseudo Labeling | Abhishek Kaushik et.al. | 2507.18176 | translate | read | null |
| 2025-07-23 | AFRDA: Attentive Feature Refinement for Domain Adaptive Semantic Segmentation | Md. Al-Masrur Khan et.al. | 2507.17957 | translate | read | link |
| 2025-07-23 | Exploring Spatial Diversity for Region-based Active Learning | Lile Cai et.al. | 2507.17367 | translate | read | null |
| 2025-07-23 | Exploring Active Learning for Semiconductor Defect Segmentation | Lile Cai et.al. | 2507.17359 | translate | read | null |
| 2025-07-23 | Swin-TUNA : A Novel PEFT Approach for Accurate Food Image Segmentation | Haotian Chen et.al. | 2507.17347 | translate | read | null |
| 2025-07-23 | On Temporal Guidance and Iterative Refinement in Audio Source Separation | Tobias Morocutti et.al. | 2507.17297 | translate | read | null |
| 2025-07-23 | ScSAM: Debiasing Morphology and Distributional Variability in Subcellular Semantic Segmentation | Bo Fang et.al. | 2507.17149 | translate | read | null |
| 2025-07-22 | MultiTaskDeltaNet: Change Detection-based Image Segmentation for Operando ETEM with Application to Carbon Gasification Kinetics | Yushuo Niu et.al. | 2507.16803 | translate | read | null |
| 2025-07-22 | A2Mamba: Attention-augmented State Space Models for Visual Recognition | Meng Lou et.al. | 2507.16624 | translate | read | link |
| 2025-07-22 | Semantic Segmentation for Preoperative Planning in Transcatheter Aortic Valve Replacement | Cedric Zöllner et.al. | 2507.16573 | translate | read | null |
| 2025-07-22 | Comparative validation of surgical phase recognition, instrument keypoint estimation, and instrument instance segmentation in endoscopy: Results of the PhaKIR 2024 challenge | Tobias Rueckert et.al. | 2507.16559 | translate | read | null |
| 2025-07-23 | EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion | Shang Liu et.al. | 2507.16535 | translate | read | null |
| 2025-07-22 | Advancing Visual Large Language Model for Multi-granular Versatile Perception | Wentao Xiang et.al. | 2507.16213 | translate | read | null |
| 2025-07-22 | AMMNet: An Asymmetric Multi-Modal Network for Remote Sensing Semantic Segmentation | Hui Ye et.al. | 2507.16158 | translate | read | null |
| 2025-07-21 | Improved Semantic Segmentation from Ultra-Low-Resolution RGB Images Applied to Privacy-Preserving Object-Goal Navigation | Xuying Huang et.al. | 2507.16034 | translate | read | null |
| 2025-07-21 | ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction | Danhui Chen et.al. | 2507.15803 | translate | read | null |
| 2025-07-21 | ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting | Ruijie Zhu et.al. | 2507.15454 | translate | read | link |
| 2025-07-21 | Rethinking Occlusion in FER: A Semantic-Aware Perspective and Go Beyond | Huiyu Zhai et.al. | 2507.15401 | translate | read | null |
| 2025-07-20 | Towards Geometric and Textural Consistency 3D Scene Generation via Single Image-guided Model Generation and Layout Optimization | Xiang Tang et.al. | 2507.14841 | translate | read | null |
| 2025-07-20 | A Novel Downsampling Strategy Based on Information Complementarity for Medical Image Segmentation | Wenbo Yue et.al. | 2507.14790 | translate | read | null |
| 2025-07-19 | GTPBD: A Fine-Grained Global Terraced Parcel and Boundary Dataset | Zhiwei Zhang et.al. | 2507.14697 | translate | read | null |
| 2025-07-19 | Artificial Intelligence in the Food Industry: Food Waste Estimation based on Computer Vision, a Brief Case Study in a University Dining Hall | Shayan Rokhva et.al. | 2507.14662 | translate | read | null |
| 2025-07-19 | Multispectral State-Space Feature Fusion: Bridging Shared and Cross-Parametric Interactions for Object Detection | Jifeng Shen et.al. | 2507.14643 | translate | read | null |
| 2025-07-19 | DiSCO-3D : Discovering and segmenting Sub-Concepts from Open-vocabulary queries in NeRF | Doriand Petit et.al. | 2507.14596 | translate | read | null |
| 2025-07-18 | Semantic Segmentation based Scene Understanding in Autonomous Vehicles | Ehsan Rassekh et.al. | 2507.14303 | translate | read | null |
| 2025-07-18 | Leveraging Pathology Foundation Models for Panoptic Segmentation of Melanoma in H&E Images | Jiaqi Lv et.al. | 2507.13974 | translate | read | null |
| 2025-07-17 | SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation | Shiqi Huang et.al. | 2507.12857 | translate | read | null |
| 2025-07-17 | A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique | Homare Sueyoshi et.al. | 2507.12730 | translate | read | null |
| 2025-07-16 | VolSegGS: Segmentation and Tracking in Dynamic Volumetric Scenes via Deformable 3D Gaussians | Siyuan Yao et.al. | 2507.12667 | translate | read | null |
| 2025-07-16 | NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting | Kuangshi Ai et.al. | 2507.12621 | translate | read | null |
| 2025-07-16 | Out-of-distribution data supervision towards biomedical semantic segmentation | Yiquan Gao et.al. | 2507.12105 | translate | read | null |
| 2025-07-16 | Tree-SLAM: semantic object SLAM for efficient mapping of individual trees in orchards | David Rapado-Rincon et.al. | 2507.12093 | translate | read | null |
| 2025-07-16 | Frequency-Dynamic Attention Modulation for Dense Prediction | Linwei Chen et.al. | 2507.12006 | translate | read | null |
| 2025-07-16 | SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation | Jun Yin et.al. | 2507.11994 | translate | read | null |
| 2025-07-16 | Prototypical Progressive Alignment and Reweighting for Generalizable Semantic Segmentation | Yuhang Zhang et.al. | 2507.11955 | translate | read | null |
| 2025-07-16 | Spatial Frequency Modulation for Semantic Segmentation | Linwei Chen et.al. | 2507.11893 | translate | read | link |
| 2025-07-15 | SToFM: a Multi-scale Foundation Model for Spatial Transcriptomics | Suyuan Zhao et.al. | 2507.11588 | translate | read | null |
| 2025-07-15 | Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping | Yujie Zhang et.al. | 2507.11279 | translate | read | null |
| 2025-07-15 | Personalized OVSS: Understanding Personal Concept in Open-Vocabulary Semantic Segmentation | Sunghyun Park et.al. | 2507.11030 | translate | read | null |
| 2025-07-15 | Graph Aggregation Prototype Learning for Semantic Change Detection in Remote Sensing | Zhengyi Xu et.al. | 2507.10938 | translate | read | null |
| 2025-07-14 | Static or Temporal? Semantic Scene Simplification to Aid Wayfinding in Immersive Simulations of Bionic Vision | Justin M. Kasowski et.al. | 2507.10813 | translate | read | null |
| 2025-07-14 | rt-RISeg: Real-Time Model-Free Robot Interactive Segmentation for Active Instance-Level Object Understanding | Howard H. Qian et.al. | 2507.10776 | translate | read | null |
| 2025-07-14 | FGSSNet: Feature-Guided Semantic Segmentation of Real World Floorplans | Hugo Norrby et.al. | 2507.10343 | translate | read | null |
| 2025-07-14 | Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks | Ben Hamscher et.al. | 2507.10239 | translate | read | null |
| 2025-07-14 | Spatial Lifting for Dense Prediction | Mingzhi Xu et.al. | 2507.10222 | translate | read | null |
| 2025-07-14 | DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation | Ivan Martinović et.al. | 2507.10118 | translate | read | null |
| 2025-07-13 | MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression | Ofir Gordon et.al. | 2507.09616 | translate | read | null |
| 2025-07-13 | Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive | You Huang et.al. | 2507.09612 | translate | read | null |
| 2025-07-13 | SegVec3D: A Method for Vector Embedding of 3D Objects Oriented Towards Robot manipulation | Zhihan Kang et.al. | 2507.09459 | translate | read | null |
| 2025-07-11 | Multimodal HD Mapping for Intersections by Intelligent Roadside Units | Zhongzhang Chen et.al. | 2507.08903 | translate | read | null |
| 2025-07-11 | Image Translation with Kernel Prediction Networks for Semantic Segmentation | Cristina Mata et.al. | 2507.08554 | translate | read | null |
| 2025-07-11 | From Enhancement to Understanding: Build a Generalized Bridge for Low-light Vision via Semantically Consistent Unsupervised Fine-tuning | Sen Wang et.al. | 2507.08380 | translate | read | null |
| 2025-07-11 | SurfDist: Interpretable Three-Dimensional Instance Segmentation Using Curved Surface Patches | Jackson Borchardt et.al. | 2507.08223 | translate | read | null |
| 2025-07-10 | RAPS-3D: Efficient interactive segmentation for 3D radiological imaging | Théo Danielou et.al. | 2507.07730 | translate | read | null |
| 2025-07-10 | LOSC: LiDAR Open-voc Segmentation Consolidator | Nermin Samet et.al. | 2507.07605 | translate | read | null |
| 2025-07-10 | Diffusion-Guided Knowledge Distillation for Weakly-Supervised Low-Light Semantic Segmentation | Chunyan Wang et.al. | 2507.07578 | translate | read | null |
| 2025-07-10 | Seg-Wild: Interactive Segmentation based on 3D Gaussian Splatting for Unconstrained Image Collections | Yongtang Bao et.al. | 2507.07395 | translate | read | null |
| 2025-07-08 | CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings | Cristina Mata et.al. | 2507.07125 | translate | read | null |
| 2025-07-09 | A multi-modal dataset for insect biodiversity with imagery and DNA at the trap and individual level | Johanna Orsholm et.al. | 2507.06972 | translate | read | null |
| 2025-07-09 | SemRaFiner: Panoptic Segmentation in Sparse and Noisy Radar Point Clouds | Matthias Zeller et.al. | 2507.06906 | translate | read | null |
| 2025-07-09 | Know Your Attention Maps: Class-specific Token Masking for Weakly Supervised Semantic Segmentation | Joelle Hanna et.al. | 2507.06848 | translate | read | null |
| 2025-07-09 | Ambiguity-aware Point Cloud Segmentation by Adaptive Margin Contrastive Learning | Yang Chen et.al. | 2507.06592 | translate | read | null |
| 2025-07-08 | Centralized Copy-Paste: Enhanced Data Augmentation Strategy for Wildland Fire Semantic Segmentation | Joon Tai Kim et.al. | 2507.06321 | translate | read | null |
| 2025-07-08 | FineGrasp: Towards Robust Grasping for Delicate Objects | Yun Du et.al. | 2507.05978 | translate | read | null |
| 2025-07-08 | Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation | Quanzhu Niu et.al. | 2507.05948 | translate | read | link |
| 2025-07-08 | I $^2$ R: Inter and Intra-image Refinement in Few Shot Segmentation | Ourui Fu et.al. | 2507.05838 | translate | read | null |
| 2025-07-09 | Empowering Bridge Digital Twins by Bridging the Data Gap with a Unified Synthesis Framework | Wang Wang et.al. | 2507.05814 | translate | read | null |
| 2025-07-08 | SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning | Xin Hu et.al. | 2507.05798 | translate | read | null |
| 2025-07-08 | DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation | Young Hun Kim et.al. | 2507.05627 | translate | read | null |
| 2025-07-07 | OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts | Shiting Xiao et.al. | 2507.05427 | translate | read | null |
| 2025-07-07 | Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations | Xiang Xu et.al. | 2507.05260 | translate | read | null |
| 2025-07-07 | All in One: Visual-Description-Guided Unified Point Cloud Segmentation | Zongyan Han et.al. | 2507.05211 | translate | read | null |
| 2025-07-07 | RAM-W600: A Multi-Task Wrist Dataset and Benchmark for Rheumatoid Arthritis | Songxiao Yang et.al. | 2507.05193 | translate | read | null |
| 2025-07-07 | MOSU: Autonomous Long-range Robot Navigation with Multi-modal Scene Understanding | Jing Liang et.al. | 2507.04686 | translate | read | null |
| 2025-07-06 | Street design and driving behavior: evidence from a large-scale study in Milan, Amsterdam, and Dubai | Giacomo Orsi et.al. | 2507.04434 | translate | read | null |
| 2025-07-06 | CLIP-RL: Surgical Scene Segmentation Using Contrastive Language-Vision Pretraining & Reinforcement Learning | Fatmaelzahraa Ali Ahmed et.al. | 2507.04317 | translate | read | null |
| 2025-07-06 | Surg-SegFormer: A Dual Transformer-Based Model for Holistic Surgical Scene Segmentation | Fatimaelzahraa Ahmed et.al. | 2507.04304 | translate | read | null |
| 2025-07-05 | Differentiable High-Performance Ray Tracing-Based Simulation of Radio Propagation with Point Clouds | Niklas Vaara et.al. | 2507.04021 | translate | read | null |
| 2025-07-05 | NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World Models | Siyu Li et.al. | 2507.04002 | translate | read | null |
| 2025-07-05 | CoT-Segmenter: Enhancing OOD Detection in Dense Road Scenes via Chain-of-Thought Reasoning | Jeonghyo Song et.al. | 2507.03984 | translate | read | null |
| 2025-07-03 | LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion | Fangfu Liu et.al. | 2507.02813 | translate | read | link |
| 2025-07-03 | No time to train! Training-Free Reference-Based Instance Segmentation | Miguel Espinosa et.al. | 2507.02798 | translate | read | link |
| 2025-07-03 | From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images | Danrong Zhang et.al. | 2507.02781 | translate | read | null |
| 2025-07-03 | MedFormer: Hierarchical Medical Vision Transformer with Content-Aware Dual Sparse Selection Attention | Zunhui Xia et.al. | 2507.02488 | translate | read | null |
| 2025-07-03 | Continual Multiple Instance Learning with Enhanced Localization for Histopathological Whole Slide Image Analysis | Byung Hyun Lee et.al. | 2507.02395 | translate | read | null |
| 2025-07-03 | Perception Activator: An intuitive and portable framework for brain cognitive exploration | Le Xu et.al. | 2507.02311 | translate | read | null |
| 2025-07-02 | How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks | Rahul Ramachandran et.al. | 2507.01955 | translate | read | link |
| 2025-07-02 | 3D Reconstruction and Information Fusion between Dormant and Canopy Seasons in Commercial Orchards Using Deep Learning and Fast GICP | Ranjan Sapkota et.al. | 2507.01912 | translate | read | null |
| 2025-07-02 | A Gift from the Integration of Discriminative and Diffusion-based Generative Learning: Boundary Refinement Remote Sensing Semantic Segmentation | Hao Wang et.al. | 2507.01573 | translate | read | null |
| 2025-07-02 | NOCTIS: Novel Object Cyclic Threshold based Instance Segmentation | Max Gandyra et.al. | 2507.01463 | translate | read | null |
| 2025-07-01 | Towards Open-World Human Action Segmentation Using Graph Convolutional Networks | Hao Xing et.al. | 2507.00756 | translate | read | null |
| 2025-07-01 | Rectifying Magnitude Neglect in Linear Attention | Qihang Fan et.al. | 2507.00698 | translate | read | link |
| 2025-07-02 | ExPaMoE: An Expandable Parallel Mixture of Experts for Continual Test-Time Adaptation | JianChao Zhao et.al. | 2507.00502 | translate | read | null |
| 2025-07-01 | Process-aware and high-fidelity microstructure generation using stable diffusion | Hoang Cuong Phan et.al. | 2507.00459 | translate | read | null |
| 2025-07-01 | PlantSegNeRF: A few-shot, cross-dataset method for plant 3D instance point cloud reconstruction via joint-channel NeRF with multi-view image instance matching | Xin Yang et.al. | 2507.00371 | translate | read | null |
(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)