Object Detection - 2026-03
Object Detection - 2026-03
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2026-03-31 | Conditional Polarization Guidance for Camouflaged Object Detection | QIfan Zhang et.al. | 2603.30008 | translate | read | null |
| 2026-03-31 | Detecting Unknown Objects via Energy-based Separation for Open World Object Detection | Jun-Woo Heo et.al. | 2603.29954 | translate | read | null |
| 2026-03-31 | Toward Generalizable Whole Brain Representations with High-Resolution Light-Sheet Data | Minyoung E. Kim et.al. | 2603.29842 | translate | read | null |
| 2026-03-30 | Sim-to-Real Fruit Detection Using Synthetic Data: Quantitative Evaluation and Embedded Deployment with Isaac Sim | Martina Hutter-Mironovova et.al. | 2603.28670 | translate | read | null |
| 2026-03-30 | ORSIFlow: Saliency-Guided Rectified Flow for Optical Remote Sensing Salient Object Detection | Haojing Chen et.al. | 2603.28584 | translate | read | null |
| 2026-03-30 | AceleradorSNN: A Neuromorphic Cognitive System Integrating Spiking Neural Networks and DynamicImage Signal Processing on FPGA | Daniel Gutierrez et.al. | 2603.28429 | translate | read | null |
| 2026-03-30 | Ghost-FWL: A Large-Scale Full-Waveform LiDAR Dataset for Ghost Detection and Removal | Kazuma Ikeda et.al. | 2603.28224 | translate | read | null |
| 2026-03-30 | A Closer Look at Cross-Domain Few-Shot Object Detection: Fine-Tuning Matters and Parallel Decoder Helps | Xuanlong Yu et.al. | 2603.28182 | translate | read | null |
| 2026-03-30 | BlankSkip: Early-exit Object Detection onboard Nano-drones | Carlo Marra et.al. | 2603.28149 | translate | read | null |
| 2026-03-30 | Object Detection Based on Distributed Convolutional Neural Networks | Liang Sun et.al. | 2603.28050 | translate | read | null |
| 2026-03-30 | UniDA3D: A Unified Domain-Adaptive Framework for Multi-View 3D Object Detection | Hongjing Wu et.al. | 2603.27995 | translate | read | null |
| 2026-03-30 | EnsemJudge: Enhancing Reliability in Chinese LLM-Generated Text Detection through Diverse Model Ensembles | Zhuoshang Wang et.al. | 2603.27949 | translate | read | null |
| 2026-03-25 | Language-Guided Structure-Aware Network for Camouflaged Object Detection | Min Zhang et.al. | 2603.24355 | translate | read | null |
| 2026-03-25 | VERIA: Verification-Centric Multimodal Instance Augmentation for Long-Tailed 3D Object Detection | Jumin Lee et.al. | 2603.24294 | translate | read | null |
| 2026-03-25 | Heuristic-inspired Reasoning Priors Facilitate Data-Efficient Referring Object Detection | Xu Zhang et.al. | 2603.24166 | translate | read | null |
| 2026-03-24 | Mind the Hitch: Dynamic Calibration and Articulated Perception for Autonomous Trucks | Morui Zhu et.al. | 2603.23711 | translate | read | null |
| 2026-03-24 | DetPO: In-Context Learning with Multi-Modal LLMs for Few-Shot Object Detection | Gautam Rajendrakumar Gare et.al. | 2603.23455 | translate | read | null |
| 2026-03-24 | CCF: Complementary Collaborative Fusion for Domain Generalized Multi-Modal 3D Object Detection | Yuchen Wu et.al. | 2603.23276 | translate | read | null |
| 2026-03-24 | Why AI-Generated Text Detection Fails: Evidence from Explainable AI Beyond Benchmark Accuracy | Shushanta Pudasaini et.al. | 2603.23146 | translate | read | null |
| 2026-03-24 | YOLOv10 with Kolmogorov-Arnold networks and vision-language foundation models for interpretable object detection and trustworthy multimodal AI in computer vision perception | Marios Impraimakis et.al. | 2603.23037 | translate | read | null |
| 2026-03-24 | Concept-based explanations of Segmentation and Detection models in Natural Disaster Management | Samar Heydari et.al. | 2603.23020 | translate | read | null |
| 2026-03-24 | FCL-COD: Weakly Supervised Camouflaged Object Detection with Frequency-aware and Contrastive Learning | Jingchen Ni et.al. | 2603.22969 | translate | read | null |
| 2026-03-24 | TorR: Towards Brain-Inspired Task-Oriented Reasoning via Cache-Oriented Algorithm-Architecture Co-design | Hyunwoo Oh et.al. | 2603.22855 | translate | read | null |
| 2026-03-24 | UAV-DETR: DETR for Anti-Drone Target Detection | Jun Yang et.al. | 2603.22841 | translate | read | null |
| 2026-03-24 | From Pixels to Semantics: A Multi-Stage AI Framework for Structural Damage Detection in Satellite Imagery | Bijay Shakya et.al. | 2603.22768 | translate | read | null |
| 2026-03-24 | Human vs. NAO: A Computational-Behavioral Framework for Quantifying Social Orienting in Autism and Typical Development | Vartika Narayani Srinet et.al. | 2603.22759 | translate | read | null |
| 2026-03-23 | STENet: Superpixel Token Enhancing Network for RGB-D Salient Object Detection | Jianlin Chen et.al. | 2603.21999 | translate | read | null |
| 2026-03-23 | Group3D: MLLM-Driven Semantic Grouping for Open-Vocabulary 3D Object Detection | Youbin Kim et.al. | 2603.21944 | translate | read | null |
| 2026-03-23 | Benchmarking Recurrent Event-Based Object Detection for Industrial Multi-Class Recognition on MTEvent | Lokeshwaran Manohar et.al. | 2603.21787 | translate | read | null |
| 2026-03-23 | No Dense Tensors Needed: Fully Sparse Object Detection on Event-Camera Voxel Grids | Mohamad Yazan Sadoun et.al. | 2603.21638 | translate | read | null |
| 2026-03-22 | NoOVD: Novel Category Discovery and Embedding for Open-Vocabulary Object Detection | Yupeng Zhang et.al. | 2603.21069 | translate | read | null |
| 2026-03-22 | Single-Eye View: Monocular Real-time Perception Package for Autonomous Driving | Haixi Zhang et.al. | 2603.21061 | translate | read | null |
| 2026-03-20 | Deterministic Mode Proposals: An Efficient Alternative to Generative Sampling for Ambiguous Segmentation | Sebastian Gerard et.al. | 2603.20191 | translate | read | null |
| 2026-03-20 | MFil-Mamba: Multi-Filter Scanning for Spatial Redundancy-Aware Visual State Space Models | Puskal Khadka et.al. | 2603.20074 | translate | read | null |
| 2026-03-20 | Offshore oil and gas platform dynamics in the North Sea, Gulf of Mexico, and Persian Gulf: Exploiting the Sentinel-1 archive | Robin Spanier et.al. | 2603.19801 | translate | read | null |
| 2026-03-20 | Template-based Object Detection Using a Foundation Model | Valentin Braeutigam et.al. | 2603.19773 | translate | read | null |
| 2026-03-20 | MoCA3D: Monocular 3D Bounding Box Prediction in the Image Plane | Changwoo Jeon et.al. | 2603.19538 | translate | read | null |
| 2026-03-19 | Reconstruction Matters: Learning Geometry-Aligned BEV Representation through 3D Gaussian Splatting | Yiren Lu et.al. | 2603.19193 | translate | read | null |
| 2026-03-19 | DA-Mamba: Learning Domain-Aware State Space Model for Global-Local Alignment in Domain Adaptive Object Detection | Haochen Li et.al. | 2603.18757 | translate | read | null |
| 2026-03-19 | Automatic detection of Gen-AI texts: A comparative framework of neural models | Cristian Buttaro et.al. | 2603.18750 | translate | read | null |
| 2026-03-19 | EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation | Longfei Liu et.al. | 2603.18739 | translate | read | null |
| 2026-03-19 | Remedying Target-Domain Astigmatism for Cross-Domain Few-Shot Object Detection | Yongwei Jiang et.al. | 2603.18541 | translate | read | null |
| 2026-03-19 | Robotic Agentic Platform for Intelligent Electric Vehicle Disassembly | Zachary Allen et.al. | 2603.18520 | translate | read | null |
| 2026-03-18 | PeriphAR: Fast and Accurate Real-World Object Selection with Peripheral Augmented Reality Displays | Yutong Ren et.al. | 2603.18350 | translate | read | null |
| 2026-03-18 | MicroVision: An Open Dataset and Benchmark Models for Detecting Vulnerable Road Users and Micromobility Vehicles | Alexander Rasch et.al. | 2603.18192 | translate | read | null |
| 2026-03-18 | Prompt-Free Universal Region Proposal Network | Qihong Tang et.al. | 2603.17554 | translate | read | null |
| 2026-03-18 | VirPro: Visual-referred Probabilistic Prompt Learning for Weakly-Supervised Monocular 3D Detection | Chupeng Liu et.al. | 2603.17470 | translate | read | null |
| 2026-03-17 | PhysQuantAgent: An Inference Pipeline of Mass Estimation for Vision-Language Models | Hisayuki Yokomizo et.al. | 2603.16958 | translate | read | null |
| 2026-03-17 | GAP-MLLM: Geometry-Aligned Pre-training for Activating 3D Spatial Perception in Multimodal Large Language Models | Jiaxin Zhang et.al. | 2603.16461 | translate | read | null |
| 2026-03-17 | CD-FKD: Cross-Domain Feature Knowledge Distillation for Robust Single-Domain Generalization in Object Detection | Junseok Lee et.al. | 2603.16439 | translate | read | null |
| 2026-03-17 | SF-Mamba: Rethinking State Space Model for Vision | Masakazu Yoshimura et.al. | 2603.16423 | translate | read | null |
| 2026-03-17 | PKINet-v2: Towards Powerful and Efficient Poly-Kernel Remote Sensing Object Detection | Xinhao Cai et.al. | 2603.16341 | translate | read | null |
| 2026-03-17 | Toward Deep Representation Learning for Event-Enhanced Visual Autonomous Perception: the eAP Dataset | Jinghang Li et.al. | 2603.16303 | translate | read | null |
| 2026-03-17 | AW-MoE: All-Weather Mixture of Experts for Robust Multi-Modal 3D Object Detection | Hongwei Lin et.al. | 2603.16261 | translate | read | null |
| 2026-03-17 | KidsNanny: A Two-Stage Multimodal Content Moderation Pipeline Integrating Visual Classification, Object Detection, OCR, and Contextual Reasoning for Child Safety | Viraj Panchal et.al. | 2603.16181 | translate | read | null |
| 2026-03-17 | Out-of-Distribution Object Detection in Street Scenes via Synthetic Outlier Exposure and Transfer Learning | Sadia Ilyas et.al. | 2603.16122 | translate | read | null |
| 2026-03-16 | Robust Dynamic Object Detection in Cluttered Indoor Scenes via Learned Spatiotemporal Cues | Juan Rached et.al. | 2603.15826 | translate | read | null |
| 2026-03-16 | GLANCE: Gaze-Led Attention Network for Compressed Edge-inference | Neeraj Solanki et.al. | 2603.15717 | translate | read | null |
| 2026-03-16 | Real-Time Oriented Object Detection Transformer in Remote Sensing Images | Zeyu Ding et.al. | 2603.15497 | translate | read | null |
| 2026-03-16 | RSGen: Enhancing Layout-Driven Remote Sensing Image Generation with Diverse Edge Guidance | Xianbao Hou et.al. | 2603.15484 | translate | read | null |
| 2026-03-16 | Detection of Autonomous Shuttles in Urban Traffic Images Using Adaptive Residual Context | Mohamed Aziz Younes et.al. | 2603.15404 | translate | read | null |
| 2026-03-16 | Pointing-Based Object Recognition | Lukáš Hajdúch et.al. | 2603.15403 | translate | read | null |
| 2026-03-16 | Multi-Objective Load Balancing for Heterogeneous Edge-Based Object Detection Systems | Daghash K. Alqahtani et.al. | 2603.15400 | translate | read | null |
| 2026-03-16 | A PPO-Based Bitrate Allocation Conditional Diffusion Model for Remote Sensing Image Compression | Yuming Han et.al. | 2603.15365 | translate | read | null |
| 2026-03-16 | Exemplar Diffusion: Improving Medical Object Detection with Opportunistic Labels | Victor Wåhlstrand et.al. | 2603.15267 | translate | read | null |
| 2026-03-16 | PrototypeNAS: Rapid Design of Deep Neural Networks for Microcontroller Units | Mark Deutel et.al. | 2603.15106 | translate | read | null |
| 2026-03-16 | Interpretable Predictability-Based AI Text Detection: A Replication Study | Adam Skurla et.al. | 2603.15034 | translate | read | null |
| 2026-03-16 | PASTE: Physics-Aware Scattering Topology Embedding Framework for SAR Object Detection | Jiacheng Chen et.al. | 2603.14886 | translate | read | null |
| 2026-03-16 | SpiralDiff: Spiral Diffusion with LoRA for RGB-to-RAW Conversion Across Cameras | Huanjing Yue et.al. | 2603.14885 | translate | read | null |
| 2026-03-16 | Video Detector: A Dual-Phase Vision-Based System for Real-Time Traffic Intersection Control and Intelligent Transportation Analysis | Mustafa Fatih Şen et.al. | 2603.14861 | translate | read | null |
| 2026-03-16 | RadarXFormer: Robust Object Detection via Cross-Dimension Fusion of 4D Radar Spectra and Images for Autonomous Driving | Yue Sun et.al. | 2603.14822 | translate | read | null |
| 2026-03-15 | Medical Image Spatial Grounding with Semantic Sampling | Andrew Seohwan Yu et.al. | 2603.14579 | translate | read | null |
| 2026-03-15 | Covariance-Guided Resource Adaptive Learning for Efficient Edge Inference | Ahmad N. L. Nabhaan et.al. | 2603.14577 | translate | read | null |
| 2026-03-12 | RDNet: Region Proportion-Aware Dynamic Adaptive Salient Object Detection Network in Optical Remote Sensing Images | Bin Wan et.al. | 2603.12215 | translate | read | null |
| 2026-03-12 | R4Det: 4D Radar-Camera Fusion for High-Performance 3D Object Detection | Zhongyu Xia et.al. | 2603.11566 | translate | read | null |
| 2026-03-12 | TornadoNet: Real-Time Building Damage Detection with Ordinal Supervision | Robinson Umeike et.al. | 2603.11557 | translate | read | null |
| 2026-03-12 | One Supervisor, Many Modalities: Adaptive Tool Orchestration for Autonomous Queries | Mayank Saini Arit Kumar Bishwas et.al. | 2603.11545 | translate | read | null |
| 2026-03-12 | EReCu: Pseudo-label Evolution Fusion and Refinement with Multi-Cue Learning for Unsupervised Camouflage Detection | Shuo Jiang et.al. | 2603.11521 | translate | read | null |
| 2026-03-11 | Does AI See like Art Historians? Interpreting How Vision Language Models Recognize Artistic Style | Marvin Limpijankit et.al. | 2603.11024 | translate | read | null |
| 2026-03-11 | GroundCount: Grounding Vision-Language Models with Object Detection for Mitigating Counting Hallucinations | Boyuan Chen et.al. | 2603.10978 | translate | read | null |
| 2026-03-11 | Evaluating Few-Shot Pill Recognition Under Visual Domain Shift | W. I. Chu et.al. | 2603.10833 | translate | read | null |
| 2026-03-10 | A Robust Deep Learning Framework for Bangla License Plate Recognition Using YOLO and Vision-Language OCR | Nayeb Hasin et.al. | 2603.10267 | translate | read | null |
| 2026-03-10 | From Semantics to Pixels: Coarse-to-Fine Masked Autoencoders for Hierarchical Visual Understanding | Wenzhao Xiang et.al. | 2603.09955 | translate | read | null |
| 2026-03-10 | DRIFT: Dual-Representation Inter-Fusion Transformer for Automated Driving Perception with 4D Radar Point Clouds | Siqi Pei et.al. | 2603.09695 | translate | read | null |
| 2026-03-10 | X-GS: An Extensible Open Framework for Perceiving and Thinking via 3D Gaussian Splatting | Yueen Ma et.al. | 2603.09632 | translate | read | null |
| 2026-03-10 | Decoder-Free Distillation for Quantized Image Restoration | S. M. A. Sharif et.al. | 2603.09624 | translate | read | null |
| 2026-03-10 | RiO-DETR: DETR for Real-time Oriented Object Detection | Zhangchi Hu et.al. | 2603.09411 | translate | read | null |
| 2026-03-10 | YOLO-NAS-Bench: A Surrogate Benchmark with Self-Evolving Predictors for YOLO Architecture Search | Zhe Li et.al. | 2603.09405 | translate | read | null |
| 2026-03-10 | SpaceSense-Bench: A Large-Scale Multi-Modal Benchmark for Spacecraft Perception and Pose Estimation | Aodi Wu et.al. | 2603.09320 | translate | read | null |
| 2026-03-10 | Multi-model approach for autonomous driving: A comprehensive study on traffic sign-, vehicle- and lane detection and behavioral cloning | Kanishkha Jaisankar et.al. | 2603.09255 | translate | read | null |
| 2026-03-10 | Distributed Convolutional Neural Networks for Object Recognition | Liang Sun et.al. | 2603.09220 | translate | read | null |
| 2026-03-10 | Intelligent Spatial Estimation for Fire Hazards in Engineering Sites: An Enhanced YOLOv8-Powered Proximity Analysis Framework | Ammar K. AlMhdawi et.al. | 2603.09069 | translate | read | null |
| 2026-03-09 | Comparative Analysis of Patch Attack on VLM-Based Autonomous Driving Architectures | David Fernandez et.al. | 2603.08897 | translate | read | null |
| 2026-03-09 | Computer Vision-Based Vehicle Allotment System using Perspective Mapping | Prachi Nandi et.al. | 2603.08827 | translate | read | null |
| 2026-03-09 | ER-Pose: Rethinking Keypoint-Driven Representation Learning for Real-Time Human Pose Estimation | Nanjun Li et.al. | 2603.08681 | translate | read | null |
| 2026-03-09 | FOMO-3D: Using Vision Foundation Models for Long-Tailed 3D Object Detection | Anqi Joyce Yang et.al. | 2603.08611 | translate | read | null |
| 2026-03-09 | Beyond Hungarian: Match-Free Supervision for End-to-End Object Detection | Shoumeng Qiu et.al. | 2603.08514 | translate | read | null |
| 2026-03-09 | Alignment-Aware and Reliability-Gated Multimodal Fusion for Unmanned Aerial Vehicle Detection Across Heterogeneous Thermal-Visual Sensors | Ishrat Jahan et.al. | 2603.08208 | translate | read | null |
| 2026-03-09 | ALOOD: Exploiting Language Representations for LiDAR-based Out-of-Distribution Object Detection | Michael Kösel et.al. | 2603.08180 | translate | read | null |
| 2026-03-09 | On the Feasibility and Opportunity of Autoregressive 3D Object Detection | Zanming Huang et.al. | 2603.07985 | translate | read | null |
| 2026-03-08 | Overthinking Causes Hallucination: Tracing Confounder Propagation in Vision Language Models | Abin Shoby et.al. | 2603.07619 | translate | read | null |
| 2026-03-08 | Fast Attention-Based Simplification of LiDAR Point Clouds for Object Detection and Classification | Z. Rozsa et.al. | 2603.07593 | translate | read | null |
| 2026-03-08 | RayD3D: Distilling Depth Knowledge Along the Ray for Robust Multi-View 3D Object Detection | Rui Ding et.al. | 2603.07493 | translate | read | null |
| 2026-03-08 | Multi-Modal Decouple and Recouple Network for Robust 3D Object Detection | Rui Ding et.al. | 2603.07486 | translate | read | null |
| 2026-03-08 | Selective Transfer Learning of Cross-Modality Distillation for Monocular 3D Object Detection | Rui Ding et.al. | 2603.07464 | translate | read | null |
| 2026-03-08 | Machine Learning for the Internet of Underwater Things: From Fundamentals to Implementation | Kenechi Omeke et.al. | 2603.07413 | translate | read | null |
| 2026-03-07 | A Lightweight Digital-Twin-Based Framework for Edge-Assisted Vehicle Tracking and Collision Prediction | Murat Arda Onsu et.al. | 2603.07338 | translate | read | null |
| 2026-03-07 | OV-DEIM: Real-time DETR-Style Open-Vocabulary Object Detection with GridSynthetic Augmentation | Leilei Wang et.al. | 2603.07022 | translate | read | null |
| 2026-03-06 | DLRMamba: Distilling Low-Rank Mamba for Edge Multispectral Fusion Object Detection | Qianqian Zhang et.al. | 2603.06920 | translate | read | null |
| 2026-03-06 | PaQ-DETR: Learning Pattern and Quality-Aware Dynamic Queries for Object Detection | Zhengjian Kang et.al. | 2603.06917 | translate | read | null |
| 2026-03-06 | BEVLM: Distilling Semantic Knowledge from LLMs into Bird’s-Eye View Representations | Thomas Monninger et.al. | 2603.06576 | translate | read | null |
| 2026-03-06 | Modeling and Measuring Redundancy in Multisource Multimodal Data for Autonomous Driving | Yuhan Zhou et.al. | 2603.06544 | translate | read | null |
| 2026-03-06 | REACT++: Efficient Cross-Attention for Real-Time Scene Graph Generation | Maëlic Neau et.al. | 2603.06386 | translate | read | null |
| 2026-03-06 | Low-latency Event-based Object Detection with Spatially-Sparse Linear Attention | Haiqing Hao et.al. | 2603.06228 | translate | read | null |
| 2026-03-06 | CR-QAT: Curriculum Relational Quantization-Aware Training for Open-Vocabulary Object Detection | Jinyeong Park et.al. | 2603.05964 | translate | read | null |
| 2026-03-06 | CollabOD: Collaborative Multi-Backbone with Cross-scale Vision for UAV Small Object Detection | Xuecheng Bai et.al. | 2603.05905 | translate | read | null |
| 2026-03-05 | Post Fusion Bird’s Eye View Feature Stabilization for Robust Multimodal 3D Detection | Trung Tien Dong et.al. | 2603.05623 | translate | read | null |
| 2026-03-05 | NOTAI.AI: Explainable Detection of Machine-Generated Text via Curvature and Feature Attribution | Oleksandr Marchenko Breneur et.al. | 2603.05617 | translate | read | null |
| 2026-03-05 | Fusion4CA: Boosting 3D Object Detection via Comprehensive Image Exploitation | Kang Luo et.al. | 2603.05305 | translate | read | null |
| 2026-03-05 | Digital Twin Driven Textile Classification and Foreign Object Recognition in Automated Sorting Systems | Serkan Ergun et.al. | 2603.05230 | translate | read | null |
| 2026-03-05 | CoIn3D: Revisiting Configuration-Invariant Multi-Camera 3D Object Detection | Zhaonian Kuang et.al. | 2603.05042 | translate | read | null |
| 2026-03-05 | RMK RetinaNet: Rotated Multi-Kernel RetinaNet for Robust Oriented Object Detection in Remote Sensing Imagery | Huiran Sun et.al. | 2603.04793 | translate | read | null |
| 2026-03-04 | Recognition of Daily Activities through Multi-Modal Deep Learning: A Video, Pose, and Object-Aware Approach for Ambient Assisted Living | Kooshan Hashemifard et.al. | 2603.04509 | translate | read | null |
| 2026-03-04 | VANGUARD: Vehicle-Anchored Ground Sample Distance Estimation for UAVs in GPS-Denied Environments | Yifei Chen et.al. | 2603.04277 | translate | read | null |
| 2026-03-04 | When Visual Evidence is Ambiguous: Pareidolia as a Diagnostic Probe for Vision Models | Qianpu Chen et.al. | 2603.03989 | translate | read | null |
| 2026-03-04 | Point Cloud Feature Coding for Object Detection over an Error-Prone Cloud-Edge Collaborative System | Chongzhen Tian et.al. | 2603.03890 | translate | read | null |
| 2026-03-04 | Adaptive Enhancement and Dual-Pooling Sequential Attention for Lightweight Underwater Object Detection with YOLOv10 | Md. Mushibur Rahman et.al. | 2603.03807 | translate | read | null |
| 2026-03-04 | Small Object Detection in Complex Backgrounds with Multi-Scale Attention and Global Relation Modeling | Wenguang Tao et.al. | 2603.03788 | translate | read | null |
| 2026-03-03 | IoUCert: Robustness Verification for Anchor-based Object Detectors | Benedikt Brückner et.al. | 2603.03043 | translate | read | null |
| 2026-03-03 | HDINO: A Concise and Efficient Open-Vocabulary Detector | Hao Zhang et.al. | 2603.02924 | translate | read | null |
| 2026-03-03 | CAWM-Mamba: A unified model for infrared-visible image fusion and compound adverse weather restoration | Huichun Liu et.al. | 2603.02560 | translate | read | null |
| 2026-03-03 | ForestPersons: A Large-Scale Dataset for Under-Canopy Missing Person Detection | Deokyun Kim et.al. | 2603.02541 | translate | read | null |
| 2026-03-03 | ModalPatch: A Plug-and-Play Module for Robust Multi-Modal 3D Object Detection under Modality Drop | Shuangzhi Li et.al. | 2603.02481 | translate | read | null |
| 2026-03-02 | From Leaderboard to Deployment: Code Quality Challenges in AV Perception Repositories | Mateus Karvat et.al. | 2603.02194 | translate | read | null |
| 2026-03-02 | Is Bigger Always Better? Efficiency Analysis in Resource-Constrained Small Object Detection | Kwame Mbobda-Kuate et.al. | 2603.02142 | translate | read | null |
| 2026-03-02 | physfusion: A Transformer-based Dual-Stream Radar and Vision Fusion Framework for Open Water Surface Object Detection | Yuting Wan et.al. | 2603.01947 | translate | read | null |
| 2026-03-02 | GroupEnsemble: Efficient Uncertainty Estimation for DETR-based Object Detection | Yutong Yang et.al. | 2603.01847 | translate | read | null |
| 2026-03-02 | Downstream Task Inspired Underwater Image Enhancement: A Perception-Aware Study from Dataset Construction to Network Design | Bosen Lin et.al. | 2603.01767 | translate | read | null |
| 2026-03-02 | Unifying Heterogeneous Multi-Modal Remote Sensing Detection Via Language-Pivoted Pretraining | Yuxuan Li et.al. | 2603.01758 | translate | read | null |
| 2026-03-02 | YCDa: YCbCr Decoupled Attention for Real-time Realistic Camouflaged Object Detection | PeiHuang Zheng et.al. | 2603.01602 | translate | read | null |
| 2026-03-02 | PPEDCRF: Privacy-Preserving Enhanced Dynamic CRF for Location-Privacy Protection for Sequence Videos with Minimal Detection Degradation | Bo Ma et.al. | 2603.01593 | translate | read | null |
| 2026-03-02 | Boosting AI Reliability with an FSM-Driven Streaming Inference Pipeline: An Industrial Case | Yutian Zhang et.al. | 2603.01528 | translate | read | null |
| 2026-03-02 | Better Matching, Less Forgetting: A Quality-Guided Matcher for Transformer-based Incremental Object Detection | Qirui Wu et.al. | 2603.01524 | translate | read | null |
| 2026-03-01 | Open-Vocabulary vs Supervised Learning Methods for Post-Disaster Visual Scene Understanding | Anna Michailidou et.al. | 2603.01324 | translate | read | null |
| 2026-03-01 | Monocular 3D Object Position Estimation with VLMs for Human-Robot Interaction | Ari Wahl et.al. | 2603.01224 | translate | read | null |
| 2026-03-01 | SMR-Net:Robot Snap Detection Based on Multi-Scale Features and Self-Attention Network | Kuanxu Hou et.al. | 2603.01036 | translate | read | null |
| 2026-03-01 | Accelerating Multi-Scale Deformable Attention Using Near-Memory-Processing Architecture | Huize Li et.al. | 2603.00959 | translate | read | null |
| 2026-03-01 | VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection | Yang Cao et.al. | 2603.00912 | translate | read | link |
(<a href=../Object_Detection.md>back to Object Detection</a>)