Semantic Segmentation - 2024-05
Semantic Segmentation - 2024-05
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-05-31 | Uncertainty Quantification for Bird’s Eye View Semantic Segmentation: Methods and Benchmarks | Linlin Yu et.al. | 2405.20986 | translate | read | null |
| 2024-05-31 | Extreme Point Supervised Instance Segmentation | Hyeonjun Lee et.al. | 2405.20729 | translate | read | null |
| 2024-05-31 | Revisiting and Maximizing Temporal Knowledge in Semi-supervised Semantic Segmentation | Wooseok Shin et.al. | 2405.20610 | translate | read | link |
| 2024-05-30 | P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation | Qi Zhang et.al. | 2405.20443 | translate | read | null |
| 2024-05-30 | SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow | Chaoyang Wang et.al. | 2405.20282 | translate | read | link |
| 2024-05-30 | MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion | Angel Villar-Corrales et.al. | 2405.19921 | translate | read | link |
| 2024-05-30 | Open-Set Domain Adaptation for Semantic Segmentation | Seun-An Choe et.al. | 2405.19899 | translate | read | link |
| 2024-05-30 | DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation | Ron Keuth et.al. | 2405.19746 | translate | read | link |
| 2024-05-30 | Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes | Yong-Qiang Mao et.al. | 2405.19735 | translate | read | null |
| 2024-05-30 | CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation | Ankush Gajanan Arudkar et.al. | 2405.19672 | translate | read | null |
| 2024-05-29 | Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation | Lianlei Shan et.al. | 2405.19568 | translate | read | null |
| 2024-05-29 | Enabling Visual Recognition at Radio Frequency | Haowen Lai et.al. | 2405.19516 | translate | read | null |
| 2024-05-29 | Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models | Tianrun Chen et.al. | 2405.19326 | translate | read | null |
| 2024-05-29 | A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation | Niclas Vödisch et.al. | 2405.19035 | translate | read | link |
| 2024-05-29 | Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation | Zelin Peng et.al. | 2405.18840 | translate | read | null |
| 2024-05-29 | FocSAM: Delving Deeply into Focused Objects in Segmenting Anything | You Huang et.al. | 2405.18706 | translate | read | null |
| 2024-05-28 | Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation | JuneHyoung Kwon et.al. | 2405.18148 | translate | read | null |
| 2024-05-28 | Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images | Lianlei Shan et.al. | 2405.18078 | translate | read | null |
| 2024-05-28 | RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields | Mihnea-Bogdan Jurca et.al. | 2405.18033 | translate | read | null |
| 2024-05-28 | DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture | Shentong Mo et.al. | 2405.17995 | translate | read | link |
| 2024-05-28 | Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation | Yangxiao Lu et.al. | 2405.17859 | translate | read | link |
| 2024-05-28 | The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention | Xingyu Ding et.al. | 2405.17776 | translate | read | null |
| 2024-05-27 | Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation | Steven Landgraf et.al. | 2405.17097 | translate | read | null |
| 2024-05-27 | DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking | Hongtao Wang et.al. | 2405.16980 | translate | read | null |
| 2024-05-27 | Collective Perception Datasets for Autonomous Driving: A Comprehensive Review | Sven Teufel et.al. | 2405.16973 | translate | read | null |
| 2024-05-27 | Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models | Qian Wang et.al. | 2405.16947 | translate | read | null |
| 2024-05-27 | A re-calibration method for object detection with multi-modal alignment bias in autonomous driving | Zhihang Song et.al. | 2405.16848 | translate | read | null |
| 2024-05-26 | Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning | Neha Kalibhat et.al. | 2405.16401 | translate | read | null |
| 2024-05-25 | Video Prediction Models as General Visual Encoders | James Maier et.al. | 2405.16382 | translate | read | null |
| 2024-05-25 | BOLD: Boolean Logic Deep Learning | Van Minh Nguyen et.al. | 2405.16339 | translate | read | null |
| 2024-05-25 | Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation | Huizhou Chen et.al. | 2405.16099 | translate | read | null |
| 2024-05-25 | Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality | Hakim Ikebayashi et.al. | 2405.16008 | translate | read | null |
| 2024-05-24 | Visualize and Paint GAN Activations | Rudolf Herdt et.al. | 2405.15636 | translate | read | null |
| 2024-05-24 | Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets | Hoàng-Ân Lê et.al. | 2405.15394 | translate | read | null |
| 2024-05-24 | Autonomous Quilt Spreading for Caregiving Robots | Yuchun Guo et.al. | 2405.15373 | translate | read | null |
| 2024-05-24 | U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation | Bingyu Li et.al. | 2405.15365 | translate | read | link |
| 2024-05-24 | Cross-Domain Few-Shot Semantic Segmentation via Doubly Matching Transformation | Jiayi Chen et.al. | 2405.15265 | translate | read | null |
| 2024-05-23 | Mamba-R: Vision Mamba ALSO Needs Registers | Feng Wang et.al. | 2405.14858 | translate | read | null |
| 2024-05-23 | Efficient Robot Learning for Perception and Mapping | Niclas Vödisch et.al. | 2405.14688 | translate | read | null |
| 2024-05-23 | Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation | Daniel Kienzle et.al. | 2405.14467 | translate | read | null |
| 2024-05-23 | MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models | Jiuming Liu et.al. | 2405.14338 | translate | read | null |
| 2024-05-23 | Tuning-free Universally-Supervised Semantic Segmentation | Xiaobo Yang et.al. | 2405.14294 | translate | read | null |
| 2024-05-23 | SCMix: Stochastic Compound Mixing for Open Compound Domain Adaptation in Semantic Segmentation | Kai Yao et.al. | 2405.14278 | translate | read | null |
| 2024-05-23 | Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations | Mohammed Baharoon et.al. | 2405.14239 | translate | read | null |
| 2024-05-23 | Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification | Taylor Archibald et.al. | 2405.14162 | translate | read | null |
| 2024-05-23 | Skip-SCAR: A Modular Approach to ObjectGoal Navigation with Sparsity and Adaptive Skips | Yaotian Liu et.al. | 2405.14154 | translate | read | null |
| 2024-05-22 | TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System | Diogo Lavado et.al. | 2405.13989 | translate | read | null |
| 2024-05-21 | Transparency Distortion Robustness for SOTA Image Segmentation Tasks | Volker Knauthe et.al. | 2405.12864 | translate | read | null |
| 2024-05-20 | A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation | Sushmita Sarker et.al. | 2405.11903 | translate | read | null |
| 2024-05-20 | Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments | Jooyong Park et.al. | 2405.11855 | translate | read | null |
| 2024-05-20 | Improving the Explain-Any-Concept by Introducing Nonlinearity to the Trainable Surrogate Model | Mounes Zaval et.al. | 2405.11837 | translate | read | null |
| 2024-05-20 | Universal Organizer of SAM for Unsupervised Semantic Segmentation | Tingting Li et.al. | 2405.11742 | translate | read | null |
| 2024-05-19 | Interpreting a Semantic Segmentation Model for Coastline Detection | Conor O’Sullivan et.al. | 2405.11500 | translate | read | null |
| 2024-05-19 | Unifying 3D Vision-Language Understanding via Promptable Queries | Ziyu Zhu et.al. | 2405.11442 | translate | read | null |
| 2024-05-18 | PS6D: Point Cloud Based Symmetry-Aware 6D Object Pose Estimation in Robot Bin-Picking | Yifan Yang et.al. | 2405.11257 | translate | read | null |
| 2024-05-17 | CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation | Mushui Liu et.al. | 2405.10530 | translate | read | link |
| 2024-05-16 | 4D Panoptic Scene Graph Generation | Jingkang Yang et.al. | 2405.10305 | translate | read | link |
| 2024-05-16 | Towards Task-Compatible Compressible Representations | Anderson de Andrade et.al. | 2405.10244 | translate | read | link |
| 2024-05-16 | DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data | Chengxiang Fan et.al. | 2405.10185 | translate | read | link |
| 2024-05-16 | An Integrated Framework for Multi-Granular Explanation of Video Summarization | Konstantinos Tsigos et.al. | 2405.10082 | translate | read | null |
| 2024-05-16 | A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance | Andrea Matteazzi et.al. | 2405.10046 | translate | read | null |
| 2024-05-16 | Towards Realistic Incremental Scenario in Class Incremental Semantic Segmentation | Jihwan Kwak et.al. | 2405.09858 | translate | read | null |
| 2024-05-15 | Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation | Guo Yachan et.al. | 2405.09682 | translate | read | null |
| 2024-05-14 | CLIP with Quality Captions: A Strong Pretraining for Vision Tasks | Pavan Kumar Anasosalu Vasu et.al. | 2405.08911 | translate | read | null |
| 2024-05-14 | Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study | Qinfeng Zhu et.al. | 2405.08493 | translate | read | null |
| 2024-05-14 | TEDNet: Twin Encoder Decoder Neural Network for 2D Camera and LiDAR Road Detection | Martín Bayón-Gutiérrez et.al. | 2405.08429 | translate | read | link |
| 2024-05-13 | IMAFD: An Interpretable Multi-stage Approach to Flood Detection from time series Multispectral Data | Ziyang Zhang et.al. | 2405.07916 | translate | read | null |
| 2024-05-13 | PLUTO: Pathology-Universal Transformer | Dinkar Juyal et.al. | 2405.07905 | translate | read | null |
| 2024-05-12 | PotatoGANs: Utilizing Generative Adversarial Networks, Instance Segmentation, and Explainable AI for Enhanced Potato Disease Identification and Classification | Mohammad Shafiul Alam et.al. | 2405.07332 | translate | read | link |
| 2024-05-12 | Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception | Haoming Chen et.al. | 2405.07201 | translate | read | null |
| 2024-05-11 | Global Motion Understanding in Large-Scale Video Object Segmentation | Volodymyr Fedynyak et.al. | 2405.07031 | translate | read | null |
| 2024-05-10 | GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs | Mustafa Munir et.al. | 2405.06849 | translate | read | link |
| 2024-05-10 | Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach | Elham Ravanbakhsh et.al. | 2405.06586 | translate | read | null |
| 2024-05-10 | Semantic and Spatial Adaptive Pixel-level Classifier for Semantic Segmentation | Xiaowen Ma et.al. | 2405.06525 | translate | read | link |
| 2024-05-10 | Multi-Target Unsupervised Domain Adaptation for Semantic Segmentation without External Data | Yonghao Xu et.al. | 2405.06502 | translate | read | null |
| 2024-05-10 | Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data | Rongyu Zhang et.al. | 2405.06413 | translate | read | null |
| 2024-05-10 | Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation | Zhenliang Ni et.al. | 2405.06228 | translate | read | link |
| 2024-05-10 | Zero-shot Degree of Ill-posedness Estimation for Active Small Object Change Detection | Koji Takeda et.al. | 2405.06185 | translate | read | null |
| 2024-05-10 | Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging | Zhuchen Shao et.al. | 2405.06175 | translate | read | null |
| 2024-05-09 | Mask-TS Net: Mask Temperature Scaling Uncertainty Calibration for Polyp Segmentation | Yudian Zhang et.al. | 2405.05830 | translate | read | null |
| 2024-05-09 | CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks | Nick et.al. | 2405.05755 | translate | read | null |
| 2024-05-08 | OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies | Lingdong Kong et.al. | 2405.05259 | translate | read | link |
| 2024-05-08 | Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving | Lingdong Kong et.al. | 2405.05258 | translate | read | link |
| 2024-05-08 | Weakly-supervised Semantic Segmentation via Dual-stream Contrastive Learning of Cross-image Contextual Information | Qi Lai et.al. | 2405.04913 | translate | read | null |
| 2024-05-08 | DeepDamageNet: A two-step deep-learning model for multi-disaster building damage segmentation and classification using satellite imagery | Irene Alisjahbana et.al. | 2405.04800 | translate | read | null |
| 2024-05-07 | A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images | László Kopácsi et.al. | 2405.04650 | translate | read | null |
| 2024-05-07 | FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse Landscapes | Charles Gaydon et.al. | 2405.04634 | translate | read | link |
| 2024-05-07 | AugmenTory: A Fast and Flexible Polygon Augmentation Library | Tanaz Ghahremani et.al. | 2405.04442 | translate | read | null |
| 2024-05-07 | A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields | Raiyan Rahman et.al. | 2405.04305 | translate | read | null |
| 2024-05-07 | ELiTe: Efficient Image-to-LiDAR Knowledge Transfer for Semantic Segmentation | Zhibo Zhang et.al. | 2405.04121 | translate | read | null |
| 2024-05-07 | Structured Click Control in Transformer-based Interactive Segmentation | Long Xu et.al. | 2405.04009 | translate | read | link |
| 2024-05-06 | PTQ4SAM: Post-Training Quantization for Segment Anything | Chengtao Lv et.al. | 2405.03144 | translate | read | link |
| 2024-05-04 | MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning | Vishal Nedungadi et.al. | 2405.02771 | translate | read | null |
| 2024-05-04 | Few-Shot Fruit Segmentation via Transfer Learning | Jordan A. James et.al. | 2405.02556 | translate | read | null |
| 2024-05-03 | Panoptic-SLAM: Visual SLAM in Dynamic Environments using Panoptic Segmentation | Gabriel Fischer Abati et.al. | 2405.02177 | translate | read | null |
| 2024-05-03 | Towards general deep-learning-based tree instance segmentation models | Jonathan Henrich et.al. | 2405.02061 | translate | read | null |
| 2024-05-03 | DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model | Peijin Jia et.al. | 2405.02008 | translate | read | null |
| 2024-05-02 | Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey | Guoping Xu et.al. | 2405.01725 | translate | read | link |
| 2024-05-02 | Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey | Rokas Gipiškis et.al. | 2405.01636 | translate | read | null |
| 2024-05-02 | CromSS: Cross-modal pre-training with noisy labels for remote sensing image segmentation | Chenying Liu et.al. | 2405.01217 | translate | read | null |
| 2024-05-02 | Uncertainty-aware self-training with expectation maximization basis transformation | Zijia Wang et.al. | 2405.01175 | translate | read | null |
| 2024-05-01 | GraCo: Granularity-Controllable Interactive Segmentation | Yian Zhao et.al. | 2405.00587 | translate | read | null |
| 2024-05-01 | Exploring Self-Supervised Vision Transformers for Deepfake Detection: A Comparative Analysis | Huy H. Nguyen et.al. | 2405.00355 | translate | read | null |
(<a href=../Semantic_Segmentation.md>back to Semantic Segmentation</a>)