Semantic Segmentation - 2025-12
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2025-12-29 | Motion-Compensated Latent Semantic Canvases for Visual Situational Awareness on Edge | Igor Lodin et.al. | 2601.00854 | translate | read | null |
| 2025-12-31 | UniC-Lift: Unified 3D Instance Segmentation via Contrastive Learning | Ankit Dhiman et.al. | 2512.24763 | translate | read | null |
| 2025-12-31 | 3D Semantic Segmentation for Post-Disaster Assessment | Nhut Le et.al. | 2512.24593 | translate | read | null |
| 2025-12-30 | From Static to Dynamic: Evaluating the Perceptual Impact of Dynamic Elements in Urban Scenes via MLLM-Guided Generative Inpainting | Zhiwei Wei et.al. | 2512.24513 | translate | read | null |
| 2025-12-30 | MambaSeg: Harnessing Mamba for Accurate and Efficient Image-Event Semantic Segmentation | Fuqiang Gu et.al. | 2512.24243 | translate | read | null |
| 2025-12-30 | ARM: A Learnable, Plug-and-Play Module for CLIP-based Open-vocabulary Semantic Segmentation | Ziquan Liu et.al. | 2512.24224 | translate | read | null |
| 2025-12-30 | BATISNet: Instance Segmentation of Tooth Point Clouds with Boundary Awareness | Yating Cai et.al. | 2512.24201 | translate | read | null |
| 2025-12-30 | Targeted Semantic Segmentation of Himalayan Glacial Lakes Using Time-Series SAR: Towards Automated GLOF Early Warning | Pawan Adhikari et.al. | 2512.24117 | translate | read | null |
| 2025-12-30 | Bridging Structure and Appearance: Topological Features for Robust Self-Supervised Segmentation | Haotang Li et.al. | 2512.23997 | translate | read | null |
| 2025-12-29 | SOFTooth: Semantics-Enhanced Order-Aware Fusion for Tooth Instance Segmentation | Xiaolan Li et.al. | 2512.23411 | translate | read | null |
| 2025-12-29 | PCR-ORB: Enhanced ORB-SLAM3 with Point Cloud Refinement Using Deep Learning-Based Dynamic Object Filtering | Sheng-Kai Chen et.al. | 2512.23318 | translate | read | null |
| 2025-12-29 | AVOID: The Adverse Visual Conditions Dataset with Obstacles for Driving Scene Understanding | Jongoh Jeong et.al. | 2512.23215 | translate | read | null |
| 2025-12-28 | 3D sans 3D Scans: Scalable Pre-training from Video-Generated Point Clouds | Ryousuke Yamada et.al. | 2512.23042 | translate | read | null |
| 2025-12-28 | Toward Stable Semi-Supervised Remote Sensing Segmentation via Co-Guidance and Co-Fusion | Yi Zhou et.al. | 2512.23035 | translate | read | null |
| 2025-12-28 | Next Best View Selections for Semantic and Dynamic 3D Gaussian Splatting | Yiqian Li et.al. | 2512.22771 | translate | read | null |
| 2025-12-26 | iOSPointMapper: RealTime Pedestrian and Accessibility Mapping with Mobile AI | Himanshu Naidu et.al. | 2512.22392 | translate | read | null |
| 2025-12-26 | A Lightweight Multi-Scale Attention Framework for Real-Time Spinal Endoscopic Instance Segmentation | Qi Lai et.al. | 2512.21984 | translate | read | null |
| 2025-12-24 | Self-supervised Multiplex Consensus Mamba for General Image Fusion | Yingying Wang et.al. | 2512.20921 | translate | read | null |
| 2025-12-23 | Learning to Sense for Driving: Joint Optics-Sensor-Model Co-Design for Semantic Segmentation | Reeshad Khan et.al. | 2512.20815 | translate | read | null |
| 2025-12-23 | BiCoR-Seg: Bidirectional Co-Refinement Framework for High-Resolution Remote Sensing Image Segmentation | Jinghao Shi et.al. | 2512.20255 | translate | read | null |
| 2025-12-22 | Retrieving Objects from 3D Scenes with Box-Guided Open-Vocabulary Instance Segmentation | Khanh Nguyen et.al. | 2512.19088 | translate | read | null |
| 2025-12-22 | ICP-4D: Bridging Iterative Closest Point and LiDAR Panoptic Segmentation | Gyeongrok Oh et.al. | 2512.18991 | translate | read | null |
| 2025-12-22 | VOIC: Visible-Occluded Decoupling for Monocular 3D Semantic Scene Completion | Zaidao Han et.al. | 2512.18954 | translate | read | null |
| 2025-12-20 | Multifaceted Exploration of Spatial Openness in Rental Housing: A Big Data Analysis in Tokyo’s 23 Wards | Takuya OKi et.al. | 2512.18226 | translate | read | null |
| 2025-12-19 | Uncertainty-Gated Region-Level Retrieval for Robust Semantic Segmentation | Shreshth Rajan et.al. | 2512.18082 | translate | read | null |
| 2025-12-19 | Chorus: Multi-Teacher Pretraining for Holistic 3D Gaussian Scene Encoding | Yue Li et.al. | 2512.17817 | translate | read | null |
| 2025-12-19 | SAVeD: A First-Person Social Media Video Dataset for ADAS-equipped vehicle Near-Miss and Crash Event Analyses | Shaoyan Zhai et.al. | 2512.17724 | translate | read | null |
| 2025-12-19 | A 28nm 0.22μJ/token memory-compute-intensity-aware CNN-Transformer accelerator with hybrid-attention-based layer-fusion and cascaded pruning for semantic-segmentation | Pingcheng Dong et.al. | 2512.17555 | translate | read | null |
| 2025-12-19 | MULTIAQUA: A multimodal maritime dataset and robust training strategies for multimodal semantic segmentation | Jon Muhovič et.al. | 2512.17450 | translate | read | null |
| 2025-12-19 | AIFloodSense: A Global Aerial Imagery Dataset for Semantic Segmentation and Understanding of Flooded Environments | Georgios Simantiris et.al. | 2512.17432 | translate | read | null |
| 2025-12-18 | Next-Embedding Prediction Makes Strong Vision Learners | Sihan Xu et.al. | 2512.16922 | translate | read | null |
| 2025-12-18 | Task-Oriented Data Synthesis and Control-Rectify Sampling for Remote Sensing Semantic Segmentation | Yunkai Yang et.al. | 2512.16740 | translate | read | null |
| 2025-12-18 | Causal-Tune: Mining Causal Factors from Vision Foundation Models for Domain Generalized Semantic Segmentation | Yin Zhang et.al. | 2512.16567 | translate | read | null |
| 2025-12-18 | PixelArena: A benchmark for Pixel-Precision Visual Intelligence | Feng Liang et.al. | 2512.16303 | translate | read | null |
| 2025-12-17 | In Pursuit of Pixel Supervision for Visual Pre-training | Lihe Yang et.al. | 2512.15715 | translate | read | null |
| 2025-12-17 | MoonSeg3R: Monocular Online Zero-Shot Segment Anything in 3D with Reconstructive Foundation Priors | Zhipeng Du et.al. | 2512.15577 | translate | read | null |
| 2025-12-17 | SemanticBridge - A Dataset for 3D Semantic Segmentation of Bridges and Domain Gap Analysis | Maximilian Kellner et.al. | 2512.15369 | translate | read | null |
| 2025-12-17 | Vision-based module for accurately reading linear scales in a laboratory | Parvesh Saini et.al. | 2512.15327 | translate | read | null |
| 2025-12-17 | SynthSeg-Agents: Multi-Agent Synthetic Data Generation for Zero-Shot Weakly Supervised Semantic Segmentation | Wangyu Wu et.al. | 2512.15310 | translate | read | null |
| 2025-12-16 | Segmental Attention Decoding With Long Form Acoustic Encodings | Pawel Swietojanski et.al. | 2512.14652 | translate | read | null |
| 2025-12-16 | S2D: Sparse-To-Dense Keymask Distillation for Unsupervised Video Instance Segmentation | Leon Sick et.al. | 2512.14440 | translate | read | null |
| 2025-12-16 | DriverGaze360: OmniDirectional Driver Attention with Object-Level Guidance | Shreedhar Govil et.al. | 2512.14266 | translate | read | null |
| 2025-12-16 | Consistent Instance Field for Dynamic Scene Understanding | Junyi Wu et.al. | 2512.14126 | translate | read | null |
| 2025-12-16 | ChartAgent: A Chart Understanding Framework with Tool Integrated Reasoning | Boran Wang et.al. | 2512.14040 | translate | read | null |
| 2025-12-16 | Deep Learning Perspective of Scene Understanding in Autonomous Robots | Afia Maham et.al. | 2512.14020 | translate | read | null |
| 2025-12-15 | Seeing the Whole Picture: Distribution-Guided Data-Free Distillation for Semantic Segmentation | Hongxuan Sun et.al. | 2512.13175 | translate | read | null |
| 2025-12-15 | JoDiffusion: Jointly Diffusing Image with Pixel-Level Annotations for Semantic Segmentation Promotion | Haoyu Wang et.al. | 2512.13014 | translate | read | null |
| 2025-12-15 | TWLR: Text-Guided Weakly-Supervised Lesion Localization and Severity Regression for Explainable Diabetic Retinopathy Grading | Xi Luo et.al. | 2512.13008 | translate | read | null |
| 2025-12-13 | OMUDA: Omni-level Masking for Unsupervised Domain Adaptation in Semantic Segmentation | Yang Ou et.al. | 2512.12303 | translate | read | null |
| 2025-12-12 | Enhancing deep learning performance on burned area delineation from SPOT-6/7 imagery for emergency management | Maria Rodriguez et.al. | 2512.12056 | translate | read | null |
| 2025-12-09 | Generalization vs. Specialization: Evaluating Segment Anything Model (SAM3) Zero-Shot Segmentation Against Fine-Tuned YOLO Detectors | Ranjan Sapkota et.al. | 2512.11884 | translate | read | null |
| 2025-12-07 | Pseudo-Label Refinement for Robust Wheat Head Segmentation via Two-Stage Hybrid Training | Jiahao Jiang et.al. | 2512.11874 | translate | read | null |
| 2025-12-12 | Referring Change Detection in Remote Sensing Imagery | Yilmaz Korkmaz et.al. | 2512.11719 | translate | read | null |
| 2025-12-12 | DOS: Distilling Observable Softmaps of Zipfian Prototypes for Self-Supervised Point Representation | Mohamed Abdelsamad et.al. | 2512.11465 | translate | read | null |
| 2025-12-12 | Out-of-Distribution Segmentation via Wasserstein-Based Evidential Uncertainty | Arnold Brosch et.al. | 2512.11373 | translate | read | null |
| 2025-12-12 | VFMF: World Modeling by Forecasting Vision Foundation Model Features | Gabrijel Boduljak et.al. | 2512.11225 | translate | read | null |
| 2025-12-11 | Take a Peek: Efficient Encoder Adaptation for Few-Shot Semantic Segmentation via LoRA | Pasquale De Marinis et.al. | 2512.10521 | translate | read | null |
| 2025-12-11 | Hybrid Transformer-Mamba Architecture for Weakly Supervised Volumetric Medical Segmentation | Yiheng Lyu et.al. | 2512.10353 | translate | read | null |
| 2025-12-11 | ConStruct: Structural Distillation of Foundation Models for Prototype-Based Weakly Supervised Histopathology Segmentation | Khang Le et.al. | 2512.10316 | translate | read | null |
| 2025-12-11 | DualProtoSeg: Simple and Efficient Design with Text- and Image-Guided Prototype Learning for Weakly Supervised Histopathology Image Segmentation | Anh M. Vu et.al. | 2512.10314 | translate | read | null |
| 2025-12-10 | NordFKB: a fine-grained benchmark dataset for geospatial AI in Norway | Sander Riisøen Jyhne et.al. | 2512.09913 | translate | read | null |
| 2025-12-10 | ASSIST-3D: Adapted Scene Synthesis for Class-Agnostic 3D Instance Segmentation | Shengchao Zhou et.al. | 2512.09364 | translate | read | null |
| 2025-12-10 | ROI-Packing: Efficient Region-Based Compression for Machine Vision | Md Eimran Hossain Eimon et.al. | 2512.09258 | translate | read | null |
| 2025-12-09 | SIP: Site in Pieces- A Dataset of Disaggregated Construction-Phase 3D Scans for Semantic Segmentation and Scene Understanding | Seongyong Kim et.al. | 2512.09062 | translate | read | null |
| 2025-12-09 | Persistent Homology for Labeled Datasets: Gromov-Hausdorff Stability and Generalized Landscapes | Yaoying Fu et.al. | 2512.08794 | translate | read | null |
| 2025-12-09 | SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images | Kaiyu Li et.al. | 2512.08730 | translate | read | null |
| 2025-12-09 | Instance-Aware Test-Time Segmentation for Continual Domain Shifts | Seunghwan Lee et.al. | 2512.08569 | translate | read | null |
| 2025-12-09 | Query-aware Hub Prototype Learning for Few-Shot 3D Point Cloud Semantic Segmentation | YiLin Zhou et.al. | 2512.08253 | translate | read | null |
| 2025-12-08 | Restrictive Hierarchical Semantic Segmentation for Stratified Tooth Layer Detection | Ryan Banks et.al. | 2512.07984 | translate | read | null |
| 2025-12-08 | Structure-Aware Feature Rectification with Region Adjacency Graphs for Training-Free Open-Vocabulary Semantic Segmentation | Qiming Huang et.al. | 2512.07360 | translate | read | null |
| 2025-12-08 | Generalized Referring Expression Segmentation on Aerial Photos | Luís Marnoto et.al. | 2512.07338 | translate | read | null |
| 2025-12-08 | A graph generation pipeline for critical infrastructures based on heuristics, images and depth data | Mike Diessner et.al. | 2512.07269 | translate | read | null |
| 2025-12-07 | Power of Boundary and Reflection: Semantic Transparent Object Segmentation using Pyramid Vision Transformer with Transparent Cues | Tuan-Anh Vu et.al. | 2512.07034 | translate | read | null |
| 2025-12-07 | Selective Masking based Self-Supervised Learning for Image Semantic Segmentation | Yuemin Wang et.al. | 2512.06981 | translate | read | null |
| 2025-12-07 | Balanced Learning for Domain Adaptive Semantic Segmentation | Wangkai Li et.al. | 2512.06886 | translate | read | null |
| 2025-12-07 | Hierarchical Image-Guided 3D Point Cloud Segmentation in Industrial Scenes via Multi-View Bayesian Fusion | Yu Zhu et.al. | 2512.06882 | translate | read | null |
| 2025-12-07 | Towards Robust Pseudo-Label Learning in Semantic Segmentation: An Encoding Perspective | Wangkai Li et.al. | 2512.06870 | translate | read | null |
| 2025-12-07 | Boosting Unsupervised Video Instance Segmentation with Automatic Quality-Guided Self-Training | Kaixuan Lu et.al. | 2512.06864 | translate | read | null |
| 2025-12-07 | FedDSR: Federated Deep Supervision and Regularization Towards Autonomous Driving | Wei-Bin Kou et.al. | 2512.06676 | translate | read | null |
| 2025-12-07 | Statistic-Augmented, Decoupled MoE Routing and Aggregating in Autonomous Driving | Wei-Bin Kou et.al. | 2512.06664 | translate | read | null |
| 2025-12-07 | CoT4Det: A Chain-of-Thought Framework for Perception-Oriented Vision-Language Tasks | Yu Qi et.al. | 2512.06663 | translate | read | null |
| 2025-12-06 | Are AI-Generated Driving Videos Ready for Autonomous Driving? A Diagnostic Evaluation Framework | Xinhao Xiang et.al. | 2512.06376 | translate | read | null |
| 2025-12-03 | Fast and Flexible Robustness Certificates for Semantic Segmentation | Thomas Massena et.al. | 2512.06010 | translate | read | null |
| 2025-12-05 | LPD: Learnable Prototypes with Diversity Regularization for Weakly Supervised Histopathology Segmentation | Khang Le et.al. | 2512.05922 | translate | read | null |
| 2025-12-05 | Label-Efficient Point Cloud Segmentation with Active Learning | Johannes Meyer et.al. | 2512.05759 | translate | read | null |
| 2025-12-05 | DistillFSS: Synthesizing Few-Shot Knowledge into a Lightweight Segmentation Model | Pasquale De Marinis et.al. | 2512.05613 | translate | read | null |
| 2025-12-01 | FlowEO: Generative Unsupervised Domain Adaptation for Earth Observation | Georges Le Bellier et.al. | 2512.05140 | translate | read | null |
| 2025-12-04 | GeoPE: A Unified Geometric Positional Embedding for Structured Tensors | Yupu Yao et.al. | 2512.04963 | translate | read | null |
| 2025-12-04 | MT-Depth: Multi-task Instance feature analysis for the Depth Completion | Abdul Haseeb Nizamani et.al. | 2512.04734 | translate | read | null |
| 2025-12-04 | DuGI-MAE: Improving Infrared Mask Autoencoders via Dual-Domain Guidance | Yinghui Xing et.al. | 2512.04511 | translate | read | null |
| 2025-12-03 | Diminishing Returns in Self-Supervised Learning | Oli Bridge et.al. | 2512.03862 | translate | read | null |
| 2025-12-03 | A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection | Shahid Ansari et.al. | 2512.03684 | translate | read | null |
| 2025-12-03 | OpenTrack3D: Towards Accurate and Generalizable Open-Vocabulary 3D Instance Segmentation | Zhishan Zhou et.al. | 2512.03532 | translate | read | null |
| 2025-12-03 | Exploiting Domain Properties in Language-Driven Domain Generalization for Semantic Segmentation | Seogkyu Jeon et.al. | 2512.03508 | translate | read | null |
| 2025-12-02 | Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks | Matthew Dutson et.al. | 2512.03014 | translate | read | null |
| 2025-12-02 | Enhancing Floor Plan Recognition: A Hybrid Mix-Transformer and U-Net Approach for Precise Wall Segmentation | Dmitriy Parashchuk et.al. | 2512.02413 | translate | read | null |
| 2025-12-02 | Reproducing and Extending RaDelft 4D Radar with Camera-Assisted Labels | Kejia Hu et.al. | 2512.02394 | translate | read | null |
| 2025-12-02 | SAGE: Style-Adaptive Generalization for Privacy-Constrained Semantic Segmentation Across Domains | Qingmei Li et.al. | 2512.02369 | translate | read | null |
| 2025-12-01 | Multifractal Recalibration of Neural Networks for Medical Imaging Segmentation | Miguel L. Martins et.al. | 2512.02198 | translate | read | null |
| 2025-12-01 | Evaluating SAM2 for Video Semantic Segmentation | Syed Hesham Syed Ariff et.al. | 2512.01774 | translate | read | null |
| 2025-12-01 | SSR: Semantic and Spatial Rectification for CLIP-based Weakly Supervised Segmentation | Xiuli Bi et.al. | 2512.01701 | translate | read | null |
| 2025-12-01 | ViT$^3$: Unlocking Test-Time Training in Vision | Dongchen Han et.al. | 2512.01643 | translate | read | null |
| 2025-12-01 | Toward Content-based Indexing and Retrieval of Head and Neck CT with Abscess Segmentation | Thao Thi Phuong Dao et.al. | 2512.01589 | translate | read | null |
| 2025-12-01 | ELVIS: Enhance Low-Light for Video Instance Segmentation in the Dark | Joanne Lin et.al. | 2512.01495 | translate | read | null |
| 2025-12-01 | Panda: Self-distillation of Reusable Sensor-level Representations for High Energy Physics | Samuel Young et.al. | 2512.01324 | translate | read | null |
| 2025-12-01 | TabletopGen: Instance-Level Interactive 3D Tabletop Scene Generation from Text or Single Image | Ziqian Wang et.al. | 2512.01204 | translate | read | null |