Object Detection - 2025-12

Publish Date Title Authors PDF Translate Read Code
2025-12-31 Compressed Map Priors for 3D Perception Brady Zhou et.al. 2601.00139 translate read null
2025-12-31 Automated electrostatic characterization of quantum dot devices in single- and bilayer heterostructures Merritt P. R. Losert et.al. 2601.00067 translate read null
2025-12-31 Semi-Supervised Diversity-Aware Domain Adaptation for 3D Object detection Bartłomiej Olber et.al. 2512.24922 translate read null
2025-12-31 Semi-Automated Data Annotation in Multisensor Datasets for Autonomous Vehicle Testing Andrii Gamalii et.al. 2512.24896 translate read null
2025-12-31 FireRescue: A UAV-Based Dataset and Enhanced YOLO Model for Object Detection in Fire Rescue Scenes Qingyu Xu et.al. 2512.24622 translate read null
2025-12-30 AI-Driven Evaluation of Surgical Skill via Action Recognition Yan Meng et.al. 2512.24411 translate read null
2025-12-30 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems Song Wang et.al. 2512.24385 translate read null
2025-12-30 Balanced Hierarchical Contrastive Learning with Decoupled Queries for Fine-grained Object Detection in Remote Sensing Images Jingzhou Chen et.al. 2512.24074 translate read null
2025-12-29 Automated river gauge plate reading using a hybrid object detection and generative AI framework in the Limpopo River Basin Kayathri Vigneswaran et.al. 2512.23454 translate read null
2025-12-29 YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection Xu Lin et.al. 2512.23273 translate read null
2025-12-29 LIMO: Low-Power In-Memory-Annealer and Matrix-Multiplication Primitive for Edge Computing Amod Holla et.al. 2512.23212 translate read null
2025-12-29 Exploring Syn-to-Real Domain Adaptation for Military Target Detection Jongoh Jeong et.al. 2512.23208 translate read null
2025-12-29 GVSynergy-Det: Synergistic Gaussian-Voxel Representations for Multi-View 3D Object Detection Yi Zhang et.al. 2512.23176 translate read null
2025-12-29 GeoTeacher: Geometry-Guided Semi-Supervised 3D Object Detection Jingyu Li et.al. 2512.23147 translate read null
2025-12-28 RealCamo: Boosting Real Camouflage Synthesis with Layout Controls and Textual-Visual Guidance Chunyuan Chen et.al. 2512.22974 translate read null
2025-12-28 YOLO-IOD: Towards Real Time Incremental Object Detection Shizhou Zhang et.al. 2512.22973 translate read null
2025-12-28 Wavelet-based Multi-View Fusion of 4D Radar Tensor and Camera for Robust 3D Object Detection Runwei Guan et.al. 2512.22972 translate read null
2025-12-28 Evaluating the Performance of Open-Vocabulary Object Detection in Low-quality Image Po-Chih Wu et.al. 2512.22801 translate read null
2025-12-27 SCAFusion: A Multimodal 3D Detection Framework for Small Object Detection in Lunar Surface Exploration Xin Chen et.al. 2512.22503 translate read null
2025-12-27 Scalpel-SAM: A Semi-Supervised Paradigm for Adapting SAM to Infrared Small Object Detection Zihan Liu et.al. 2512.22483 translate read null
2025-12-27 Comparing Object Detection Models for Electrical Substation Component Mapping Haley Mody et.al. 2512.22454 translate read null
2025-12-27 SonoVision: A Computer Vision Approach for Helping Visually Challenged Individuals Locate Objects with the Help of Sound Cues Md Abu Obaida Zishan et.al. 2512.22449 translate read null
2025-12-27 Towards Robust Optical-SAR Object Detection under Missing Modalities: A Dynamic Quality-Aware Fusion Framework Zhicheng Zhao et.al. 2512.22447 translate read null
2025-12-26 DeFloMat: Detection with Flow Matching for Stable and Efficient Generative Object Localization Hansang Lee et.al. 2512.22406 translate read null
2025-12-23 Failure Analysis of Safety Controllers in Autonomous Vehicles Under Object-Based LiDAR Attacks Daniyal Ganiuly et.al. 2512.22244 translate read null
2025-12-26 Breaking Alignment Barriers: TPS-Driven Semantic Correlation Learning for Alignment-Free RGB-T Salient Object Detection Lupiao Hu et.al. 2512.21856 translate read null
2025-12-25 Detecting AI-Generated Paraphrases in Bengali: A Comparative Study of Zero-Shot and Fine-Tuned Transformers Md. Rakibul Islam et.al. 2512.21709 translate read null
2025-12-25 Comparative Analysis of Deep Learning Models for Perception in Autonomous Vehicles Jalal Khan et.al. 2512.21673 translate read null
2025-12-24 ORCA: Object Recognition and Comprehension for Archiving Marine Species Yuk-Kwan Wong et.al. 2512.21150 translate read null
2025-12-24 Self-supervised Multiplex Consensus Mamba for General Image Fusion Yingying Wang et.al. 2512.20921 translate read null
2025-12-23 Real-World Adversarial Attacks on RF-Based Drone Detectors Omer Gazit et.al. 2512.20712 translate read null
2025-12-23 Bridging Modalities and Transferring Knowledge: Enhanced Multimodal Understanding and Recognition Gorjan Radevski et.al. 2512.20501 translate read null
2025-12-23 ${D}^{3}${ETOR}: ${D}$ebate-Enhanced Pseudo Labeling and Frequency-Aware Progressive ${D}$ebiasing for Weakly-Supervised Camouflaged Object ${D}$ etection with Scribble Annotations Jiawei Ge et.al. 2512.20260 translate read null
2025-12-23 LiteFusion: Taming 3D Object Detectors from Vision-Based to Multi-Modal with Minimal Adaptation Xiangxuan Ren et.al. 2512.20217 translate read null
2025-12-23 Gaussian Process Assisted Meta-learning for Image Classification and Object Detection Models Anna R. Flowers et.al. 2512.20021 translate read null
2025-12-23 PaveSync: A Unified and Comprehensive Dataset for Pavement Distress Analysis and Classification Blessing Agyei Kyem et.al. 2512.20011 translate read null
2025-12-22 Photonic Spiking Graph Neural Network for Energy-Efficient Structured Data Processing Wanting Yu et.al. 2512.19182 translate read null
2025-12-20 The size of 3I/ATLAS from non-gravitational acceleration John C. Forbes et.al. 2512.18341 translate read null
2025-12-20 Pyramidal Adaptive Cross-Gating for Multimodal Detection Zidong Gu et.al. 2512.18291 translate read null
2025-12-20 Building UI/UX Dataset for Dark Pattern Detection and YOLOv12x-based Real-Time Object Recognition Detection System Se-Young Jang et.al. 2512.18269 translate read null
2025-12-20 Spectral Discrepancy and Cross-modal Semantic Consistency Learning for Object Detection in Hyperspectral Image Xiao He et.al. 2512.18245 translate read null
2025-12-20 ALIGN: Advanced Query Initialization with LiDAR-Image Guidance for Occlusion-Robust 3D Object Detection Janghyun Baek et.al. 2512.18187 translate read null
2025-12-19 YolovN-CBi: A Lightweight and Efficient Architecture for Real-Time Detection of Small UAVs Ami Pandat et.al. 2512.18046 translate read null
2025-12-19 StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection Di Wu et.al. 2512.17620 translate read null
2025-12-19 Foundation Model Priors Enhance Object Focus in Feature Space for Source-Free Object Detection Sairam VCR et.al. 2512.17514 translate read null
2025-12-19 PILAR: Personalizing Augmented Reality Interactions with LLM-based Human-Centric and Trustworthy Explanations for Daily Use Cases Ripan Kumar Kundu et.al. 2512.17172 translate read null
2025-12-18 DenseBEV: Transforming BEV Grid Cells into 3D Objects Marius Dähling et.al. 2512.16818 translate read null
2025-12-18 FlowDet: Unifying Object Detection and Generative Transport Flows Enis Baty et.al. 2512.16771 translate read null
2025-12-18 YOLO11-4K: An Efficient Architecture for Real-Time Small Object Detection in 4K Panoramic Images Huma Hafeez et.al. 2512.16493 translate read null
2025-12-18 Autoencoder-based Denoising Defense against Adversarial Attacks on Object Detection Min Geun Song et.al. 2512.16123 translate read null
2025-12-18 Auto-Vocabulary 3D Object Detection Haomeng Zhang et.al. 2512.16077 translate read null
2025-12-17 From Words to Wavelengths: VLMs for Few-Shot Multispectral Object Detection Manuel Nkegoum et.al. 2512.15971 translate read null
2025-12-13 Two-Step Data Augmentation for Masked Face Detection and Recognition: Turning Fake Masks to Real Yan Yang et.al. 2512.15774 translate read null
2025-12-17 IMKD: Intensity-Aware Multi-Level Knowledge Distillation for Camera-Radar Fusion Shashank Mishra et.al. 2512.15581 translate read null
2025-12-17 Evaluation of deep learning architectures for wildlife object detection: A comparative study of ResNet and Inception Malach Obisa Amonga et.al. 2512.15480 translate read null
2025-12-17 Vision-based module for accurately reading linear scales in a laboratory Parvesh Saini et.al. 2512.15327 translate read null
2025-12-17 EPSM: A Novel Metric to Evaluate the Safety of Environmental Perception in Autonomous Driving Jörg Gamerdinger et.al. 2512.15195 translate read null
2025-12-17 Criticality Metrics for Relevance Classification in Safety Evaluation of Object Detection in Automated Driving Jörg Gamerdinger et.al. 2512.15181 translate read null
2025-12-17 Beyond Proximity: A Keypoint-Trajectory Framework for Classifying Affiliative and Agonistic Social Networks in Dairy Cattle Sibi Parivendan et.al. 2512.14998 translate read null
2025-12-16 TUMTraf EMOT: Event-Based Multi-Object Tracking Dataset and Baseline for Traffic Scenarios Mengyu Li et.al. 2512.14595 translate read null
2025-12-16 4D-RaDiff: Latent Diffusion for 4D Radar Point Cloud Generation Jimmie Kwok et.al. 2512.14235 translate read null
2025-12-16 CIS-BA: Continuous Interaction Space Based Backdoor Attack for Object Detection in the Real-World Shuxin Zhao et.al. 2512.14158 translate read null
2025-12-16 Neurosymbolic Inference On Foundation Models For Remote Sensing Text-to-image Retrieval With Complex Queries Emanuele Mezzi et.al. 2512.14102 translate read null
2025-12-16 Deep Learning Perspective of Scene Understanding in Autonomous Robots Afia Maham et.al. 2512.14020 translate read null
2025-12-16 Real-Time Service Subscription and Adaptive Offloading Control in Vehicular Edge Computing Chuanchao Gao et.al. 2512.14002 translate read null
2025-12-16 FocalComm: Hard Instance-Aware Multi-Agent Perception Dereje Shenkut et.al. 2512.13982 translate read null
2025-12-15 Route-DETR: Pairwise Query Routing in Transformers for Object Detection Ye Zhang et.al. 2512.13876 translate read null
2025-12-15 VajraV1 – The most accurate Real Time Object Detector of the YOLO family Naman Balbir Singh Makkar et.al. 2512.13834 translate read null
2025-12-15 Near-Field Perception for Safety Enhancement of Autonomous Mobile Robots in Manufacturing Environments Li-Wei Shih et.al. 2512.13561 translate read null
2025-12-15 On the Ability of Deep Learning to Detect Signals with Unknown Parameters Tom Anders et.al. 2512.13542 translate read null
2025-12-15 Computer vision training dataset generation for robotic environments using Gaussian splatting Patryk Niżeniec et.al. 2512.13411 translate read null
2025-12-15 Diffusion-Based Restoration for Multi-Modal 3D Object Detection in Adverse Weather Zhijian He et.al. 2512.13107 translate read null
2025-12-14 Cross-Level Sensor Fusion with Object Lists via Transformer for 3D Object Detection Xiangzhong Liu et.al. 2512.12884 translate read null
2025-12-13 INDOOR-LiDAR: Bridging Simulation and Reality for Robot-Centric 360 degree Indoor LiDAR Perception – A Robot-Centric Hybrid Dataset Haichuan Li et.al. 2512.12377 translate read null
2025-12-13 WeDetect: Fast Open-Vocabulary Object Detection as Retrieval Shenghao Fu et.al. 2512.12309 translate read null
2025-12-13 Cognitive-YOLO: LLM-Driven Architecture Synthesis from First Principles of Data for Object Detection Jiahao Zhao et.al. 2512.12281 translate read null
2025-12-13 AI-Augmented Pollen Recognition in Optical and Holographic Microscopy for Veterinary Imaging Swarn S. Warshaneyan et.al. 2512.12101 translate read null
2025-12-12 TransBridge: Boost 3D Object Detection by Scene-Level Completion with Transformer Decoder Qinghao Meng et.al. 2512.11926 translate read null
2025-12-12 Depth-Copy-Paste: Multimodal and Depth-Aware Compositing for Robust Face Detection Qiushi Guo et.al. 2512.11683 translate read null
2025-12-12 DOS: Distilling Observable Softmaps of Zipfian Prototypes for Self-Supervised Point Representation Mohamed Abdelsamad et.al. 2512.11465 translate read null
2025-12-12 Assisted Refinement Network Based on Channel Information Interaction for Camouflaged and Salient Object Detection Kuan Wang et.al. 2512.11369 translate read null
2025-12-12 Reliable Detection of Minute Targets in High-Resolution Aerial Imagery across Temporal Shifts Mohammad Sadegh Gholizadeh et.al. 2512.11360 translate read null
2025-12-11 VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction Weitai Kang et.al. 2512.11099 translate read null
2025-12-11 Salient Object Detection in Complex Weather Conditions via Noise Indicators Quan Chen et.al. 2512.10592 translate read null
2025-12-11 Adaptive Dual-Weighted Gravitational Point Cloud Denoising Method Ge Zhang et.al. 2512.10386 translate read null
2025-12-10 ABBSPO: Adaptive Bounding Box Scaling and Symmetric Prior based Orientation Prediction for Detecting Aerial Image Objects Woojin Lee et.al. 2512.10031 translate read null
2025-12-10 NordFKB: a fine-grained benchmark dataset for geospatial AI in Norway Sander Riisøen Jyhne et.al. 2512.09913 translate read null
2025-12-10 Hands-on Evaluation of Visual Transformers for Object Recognition and Detection Dimitrios N. Vlachogiannis et.al. 2512.09579 translate read null
2025-12-10 MODA: The First Challenging Benchmark for Multispectral Object Detection in Aerial Images Shuaihao Han et.al. 2512.09489 translate read null
2025-12-10 A Hierarchical, Model-Based System for High-Performance Humanoid Soccer Quanyou Wang et.al. 2512.09431 translate read null
2025-12-10 Identifying Bias in Machine-generated Text Detection Kevin Stowe et.al. 2512.09292 translate read null
2025-12-10 ROI-Packing: Efficient Region-Based Compression for Machine Vision Md Eimran Hossain Eimon et.al. 2512.09258 translate read null
2025-12-09 Automated Pollen Recognition in Optical and Holographic Microscopy Images Swarn Singh Warshaneyan et.al. 2512.08589 translate read null
2025-12-09 SSCATeR: Sparse Scatter-Based Convolution Algorithm with Temporal Data Recycling for Real-Time 3D Object Detection in LiDAR Point Clouds Alexander Dow et.al. 2512.08557 translate read null
2025-12-09 Distilling Future Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection Haowen Zheng et.al. 2512.08247 translate read null
2025-12-09 SOP^2: Transfer Learning with Scene-Oriented Prompt Pool on 3D Object Detection Ching-Hung Cheng et.al. 2512.08223 translate read null
2025-12-09 Metasurfaces Enable Active-Like Passive Radar Mingyi Li et.al. 2512.08208 translate read null
2025-12-08 An AI-Powered Autonomous Underwater System for Sea Exploration and Scientific Research Hamad Almazrouei et.al. 2512.07652 translate read null
2025-12-08 Towards Robust DeepFake Detection under Unstable Face Sequences: Adaptive Sparse Graph Embedding with Order-Free Representation and Explicit Laplacian Spectral Prior Chih-Chung Hsu et.al. 2512.07498 translate read null
2025-12-08 Enhancing Small Object Detection with YOLO: A Novel Framework for Improved Accuracy and Efficiency Mahila Moghadami et.al. 2512.07379 translate read null
2025-12-08 A graph generation pipeline for critical infrastructures based on heuristics, images and depth data Mike Diessner et.al. 2512.07269 translate read null
2025-12-08 DART: Leveraging Multi-Agent Disagreement for Tool Recruitment in Multimodal Reasoning Nithin Sivakumaran et.al. 2512.07132 translate read null
2025-12-08 DFIR-DETR: Frequency Domain Enhancement and Dynamic Feature Aggregation for Cross-Scene Small Object Detection Bo Gao et.al. 2512.07078 translate read null
2025-12-07 Large Language Models and Forensic Linguistics: Navigating Opportunities and Threats in the Age of Generative AI George Mikros et.al. 2512.06922 translate read null
2025-12-07 Spatial Retrieval Augmented Autonomous Driving Xiaosong Jia et.al. 2512.06865 translate read null
2025-12-07 CoT4Det: A Chain-of-Thought Framework for Perception-Oriented Vision-Language Tasks Yu Qi et.al. 2512.06663 translate read null
2025-12-07 TextMamba: Scene Text Detector with Mamba Qiyan Zhao et.al. 2512.06657 translate read null
2025-12-06 Neural expressiveness for beyond importance model compression Angelos-Christos Maroudis et.al. 2512.06440 translate read null
2025-12-06 Are AI-Generated Driving Videos Ready for Autonomous Driving? A Diagnostic Evaluation Framework Xinhao Xiang et.al. 2512.06376 translate read null
2025-12-05 OWL: Unsupervised 3D Object Detection by Occupancy Guided Warm-up and Large Model Priors Reasoning Xusheng Guo et.al. 2512.05698 translate read null
2025-12-05 LeAD-M3D: Leveraging Asymmetric Distillation for Real-time Monocular 3D Detection Johannes Meier et.al. 2512.05663 translate read null
2025-12-05 An Integrated System for WEEE Sorting Employing X-ray Imaging, AI-based Object Detection and Segmentation, and Delta Robot Manipulation Panagiotis Giannikos et.al. 2512.05599 translate read null
2025-12-05 Concept-based Explainable Data Mining with VLM for 3D Detection Mai Tsujimoto et.al. 2512.05482 translate read null
2025-12-05 Moving object detection from multi-depth images with an attention-enhanced CNN Masato Shibukawa et.al. 2512.05415 translate read null
2025-12-05 YOLO and SGBM Integration for Autonomous Tree Branch Detection and Depth Estimation in Radiata Pine Pruning Applications Yida Lin et.al. 2512.05412 translate read null
2025-12-04 GeoPE:A Unified Geometric Positional Embedding for Structured Tensors Yupu Yao et.al. 2512.04963 translate read null
2025-12-04 ZeBROD: Zero-Retraining Based Recognition and Object Detection Framework Priyanto Hidayatullah et.al. 2512.04888 translate read null
2025-12-04 DuGI-MAE: Improving Infrared Mask Autoencoders via Dual-Domain Guidance Yinghui Xing et.al. 2512.04511 translate read null
2025-12-04 Dual-Stream Spectral Decoupling Distillation for Remote Sensing Object Detection Xiangyi Gao et.al. 2512.04413 translate read null
2025-12-03 Real-time Cricket Sorting By Sex Juan Manuel Cantarero Angulo et.al. 2512.04311 translate read null
2025-12-03 Fast & Efficient Normalizing Flows and Applications of Image Generative Models Sandeep Nagar et.al. 2512.04039 translate read null
2025-12-03 MKSNet: Advanced Small Object Detection in Remote Sensing Imagery with Multi-Kernel and Dual Attention Mechanisms Jiahao Zhang et.al. 2512.03640 translate read null
2025-12-03 Real-Time Control and Automation Framework for Acousto-Holographic Microscopy Hasan Berkay Abdioğlu et.al. 2512.03539 translate read null
2025-12-03 YOLOA: Real-Time Affordance Detection via LLM Adapter Yuqi Ji et.al. 2512.03418 translate read null
2025-12-02 GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection Md Sohag Mia et.al. 2512.02991 translate read null
2025-12-02 BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection Guowen Zhang et.al. 2512.02972 translate read null
2025-12-02 MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image Understanding Fan Yang et.al. 2512.02906 translate read null
2025-12-02 ALDI-ray: Adapting the ALDI Framework for Security X-ray Object Detection Omid Reza Heidari et.al. 2512.02696 translate read null
2025-12-02 SAM2Grasp: Resolve Multi-modal Grasping via Prompt-conditioned Temporal Action Prediction Shengkai Wu et.al. 2512.02609 translate read null
2025-12-02 GeoDiT: A Diffusion-based Vision-Language Model for Geospatial Understanding Jiaqi Liu et.al. 2512.02505 translate read null
2025-12-02 Temporal Dynamics Enhancer for Directly Trained Spiking Object Detectors Fan Luo et.al. 2512.02447 translate read null
2025-12-01 Physical ID-Transfer Attacks against Multi-Object Tracking via Adversarial Trajectory Chenyi Wang et.al. 2512.01934 translate read null
2025-12-01 SAM3-UNet: Simplified Adaptation of Segment Anything Model 3 Xinyu Xiong et.al. 2512.01789 translate read null
2025-12-01 Bridging the Scale Gap: Balanced Tiny and General Object Detection in Remote Sensing Imagery Zhicheng Zhao et.al. 2512.01665 translate read null
2025-12-01 ViT $^3$ : Unlocking Test-Time Training in Vision Dongchen Han et.al. 2512.01643 translate read null
2025-12-01 OpenBox: Annotate Any Bounding Boxes in 3D In-Jae Lee et.al. 2512.01352 translate read null
2025-12-01 FOD-S2R: A FOD Dataset for Sim2Real Transfer Learning based Object Detection Ashish Vashist et.al. 2512.01315 translate read null
2025-12-01 Supervised Contrastive Machine Unlearning of Background Bias in Sonar Image Classification with Fine-Grained Explainable AI Kamal Basha S et.al. 2512.01291 translate read null
2025-12-01 VSRD++: Autolabeling for 3D Object Detection via Instance-Aware Volumetric Silhouette Rendering Zihua Liu et.al. 2512.01178 translate read null
2025-12-01 Real-Time On-the-Go Annotation Framework Using YOLO for Automated Dataset Generation Mohamed Abdallah Salem et.al. 2512.01165 translate read null

(<a href=../Object_Detection.md>back to Object Detection</a>)