Object Detection - 2025-11

Publish Date Title Authors PDF Translate Read Code
2025-11-27 Semi-Supervised Contrastive Learning with Orthonormal Prototypes Huanran Li et.al. 2512.07880 translate read null
2025-11-30 Autonomous Grasping On Quadruped Robot With Task Level Interaction Muhtadin et.al. 2512.01052 translate read null
2025-11-30 Med-CMR: A Fine-Grained Benchmark Integrating Visual Evidence and Clinical Logic for Medical Complex Multimodal Reasoning Haozhen Gong et.al. 2512.00818 translate read null
2025-11-30 DEJIMA: A Novel Large-scale Japanese Dataset for Image Captioning and Visual Question Answering Toshiki Katsube et.al. 2512.00773 translate read null
2025-11-29 MM-DETR: An Efficient Multimodal Detection Transformer with Mamba-Driven Dual-Granularity Fusion and Frequency-Aware Modality Adapters Jianhong Han et.al. 2512.00363 translate read null
2025-11-28 Hybrid Synthetic Data Generation with Domain Randomization Enables Zero-Shot Vision-Based Part Inspection Under Extreme Class Imbalance Ruo-Syuan Mei et.al. 2512.00125 translate read null
2025-11-25 Diffusion-Based Synthetic Brightfield Microscopy Images for Enhanced Single Cell Detection Mario de Jesus da Graca et.al. 2512.00078 translate read null
2025-11-24 ProvRain: Rain-Adaptive Denoising and Vehicle Detection via MobileNet-UNet and Faster R-CNN Aswinkumar Varathakumaran et.al. 2512.00073 translate read null
2025-11-23 PEFT-DML: Parameter-Efficient Fine-Tuning Deep Metric Learning for Robust Multi-Modal 3D Object Detection in Autonomous Driving Abdolazim Rezaei et.al. 2512.00060 translate read null
2025-11-28 Object-Centric Data Synthesis for Category-level Object Detection Vikhyat Agarwal et.al. 2511.23450 translate read null
2025-11-28 Toward Automatic Safe Driving Instruction: A Large-Scale Vision Language Model Approach Haruki Sakajo et.al. 2511.23311 translate read null
2025-11-28 Synthetic Industrial Object Detection: GenAI vs. Feature-Based Methods Jose Moises Araya-Martinez et.al. 2511.23241 translate read null
2025-11-28 Zero-Shot Multi-Criteria Visual Quality Inspection for Semi-Controlled Industrial Environments via Real-Time 3D Digital Twin Simulation Jose Moises Araya-Martinez et.al. 2511.23214 translate read null
2025-11-28 Bharat Scene Text: A Novel Comprehensive Dataset and Benchmark for Indian Language Scene Text Understanding Anik De et.al. 2511.23071 translate read null
2025-11-28 Barcode and QR Code Object Detection: An Experimental Study on YOLOv8 Models Kushagra Pandya et.al. 2511.22937 translate read null
2025-11-28 DM $^3$ T: Harmonizing Modalities via Diffusion for Multi-Object Tracking Weiran Li et.al. 2511.22896 translate read null
2025-11-27 DocVAL: Validated Chain-of-Thought Distillation for Grounded Document VQA Ahmad Mohammadshirazi et.al. 2511.22521 translate read null
2025-11-27 Small Object Detection for Birds with Swin Transformer Da Huo et.al. 2511.22310 translate read null
2025-11-27 Simplex-Optimized Hybrid Ensemble for Large Language Model Text Detection Under Generative Distribution Drif Sepyan Purnama Kristanto et.al. 2511.22153 translate read null
2025-11-27 Bistatic Passive Tracking via CSI Power Zhongqin Wang et.al. 2511.22144 translate read null
2025-11-27 SemOD: Semantic Enabled Object Detection Network under Various Weather Conditions Aiyinsi Zuo et.al. 2511.22142 translate read null
2025-11-27 PAGen: Phase-guided Amplitude Generation for Domain-adaptive Object Detection Shuchen Du et.al. 2511.22029 translate read null
2025-11-22 A Lightweight Approach to Detection of AI-Generated Texts Using Stylometric Features Sergey K. Aityan et.al. 2511.21744 translate read null
2025-11-26 Continual Error Correction on Low-Resource Devices Kirill Paramonov et.al. 2511.21652 translate read null
2025-11-26 CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation Shizhe Sun et.al. 2511.21503 translate read null
2025-11-26 Co-Training Vision Language Models for Remote Sensing Multi-task Learning Qingyun Li et.al. 2511.21272 translate read null
2025-11-26 OVOD-Agent: A Markov-Bandit Framework for Proactive Visual Reasoning and Self-Evolving Detection Chujie Wang et.al. 2511.21064 translate read null
2025-11-26 AerialMind: Towards Referring Multi-Object Tracking in UAV Scenarios Chenglizhao Chen et.al. 2511.21053 translate read null
2025-11-26 Wavefront-Constrained Passive Obscured Object Detection Zhiwen Zheng et.al. 2511.20991 translate read null
2025-11-26 RefOnce: Distilling References into a Prototype Memory for Referring Camouflaged Object Detection Yu-Huan Wu et.al. 2511.20989 translate read null
2025-11-25 Video Object Recognition in Mobile Edge Networks: Local Tracking or Edge Detection? Kun Guo et.al. 2511.20716 translate read null
2025-11-25 MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities Tooba Tehreem Sheikh et.al. 2511.20650 translate read null
2025-11-25 Zoo3D: Zero-Shot 3D Object Detection at Scene Level Andrey Lemeshko et.al. 2511.20253 translate read null
2025-11-25 Intelligent Image Search Algorithms Fusing Visual Large Models Kehan Wang et.al. 2511.19920 translate read null
2025-11-24 Maritime Small Object Detection from UAVs using Deep Learning with Altitude-Aware Dynamic Tiling Sakib Ahmed et.al. 2511.19728 translate read null
2025-11-24 Studying Maps at Scale: A Digital Investigation of Cartography and the Evolution of Figuration Remi Petitpierre et.al. 2511.19538 translate read null
2025-11-24 SAM3-Adapter: Efficient Adaptation of Segment Anything 3 for Camouflage Object Segmentation, Shadow Detection, and Medical Image Segmentation Tianrun Chen et.al. 2511.19425 translate read null
2025-11-24 IDEAL-M3D: Instance Diversity-Enriched Active Learning for Monocular 3D Detection Johannes Meier et.al. 2511.19301 translate read null
2025-11-24 SpectraNet: FFT-assisted Deep Learning Classifier for Deepfake Face Detection Nithira Jayarathne et.al. 2511.19187 translate read null
2025-11-24 MambaRefine-YOLO: A Dual-Modality Small Object Detector for UAV Imagery Shuyu Cao et.al. 2511.19134 translate read null
2025-11-24 3M-TI: High-Quality Mobile Thermal Imaging via Calibration-free Multi-Camera Cross-Modal Diffusion Minchong Chen et.al. 2511.19117 translate read null
2025-11-24 LLMAID: Identifying AI Capabilities in Android Apps with LLMs Pei Liu et.al. 2511.19059 translate read null
2025-11-24 LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space Hai Wu et.al. 2511.19057 translate read null
2025-11-24 Enhancing Fast Radio Transient Detection with Mask R-CNN Image Segmentation Sergio Belmonte Diaz et.al. 2511.19014 translate read null
2025-11-24 Peregrine: One-Shot Fine-Tuning for FHE Inference of General Deep CNNs Huaming Ling et.al. 2511.18976 translate read null
2025-11-24 DualGazeNet: A Biologically Inspired Dual-Gaze Query Network for Salient Object Detection Yu Zhang et.al. 2511.18865 translate read null
2025-11-24 DetAny4D: Detect Anything 4D Temporally in a Streaming RGB Video Jiawei Hou et.al. 2511.18814 translate read null
2025-11-24 StereoDETR: Stereo-based Transformer for 3D Object Detection Shiyi Mu et.al. 2511.18788 translate read null
2025-11-24 DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous Driving Hongbin Lin et.al. 2511.18713 translate read null
2025-11-24 Dendritic Convolution for Noise Image Recognition Jiarui Xue et.al. 2511.18699 translate read null
2025-11-24 Multimodal Real-Time Anomaly Detection and Industrial Applications Aman Verma et.al. 2511.18698 translate read null
2025-11-24 Exploring Surround-View Fisheye Camera 3D Object Detection Changcai Li et.al. 2511.18695 translate read null
2025-11-23 UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization Siyi Li et.al. 2511.18254 translate read null
2025-11-22 VK-Det: Visual Knowledge Guided Prototype Learning for Open-Vocabulary Aerial Object Detection Jianhang Yao et.al. 2511.18075 translate read null
2025-11-22 Diverse Instance Generation via Diffusion Models for Enhanced Few-Shot Object Detection in Remote Sensing Images Yanxing Liu et.al. 2511.18031 translate read null
2025-11-22 State and Scene Enhanced Prototypes for Weakly Supervised Open-Vocabulary Object Detection Jiaying Zhou et.al. 2511.18012 translate read null
2025-11-21 REXO: Indoor Multi-View Radar Object Detection via 3D Bounding Box Diffusion Ryoma Yataka et.al. 2511.17806 translate read null
2025-11-21 PUCP-Metrix: An Open-source and Comprehensive Toolkit for Linguistic Analysis of Spanish Texts Javier Alonso Villegas Luis et.al. 2511.17402 translate read null
2025-11-21 A lightweight detector for real-time detection of remote sensing images Qianyi Wang et.al. 2511.17147 translate read null
2025-11-21 OmniPT: Unleashing the Potential of Large Vision Language Models for Pedestrian Tracking and Understanding Teng Fu et.al. 2511.17053 translate read null
2025-11-20 Integrating Deep Learning and Spatial Statistics in Marine Ecosystem Monitoring Gian Mario Sangiovanni et.al. 2511.16447 translate read null
2025-11-20 StreetView-Waste: A Multi-Task Dataset for Urban Waste Management Diogo J. Paulo et.al. 2511.16440 translate read null
2025-11-04 In-Context Adaptation of VLMs for Few-Shot Cell Detection in Optical Microscopy Shreyan Ganguly et.al. 2511.05565 translate read null
2025-11-03 Compressing Multi-Task Model for Autonomous Driving via Pruning and Knowledge Distillation Jiayuan Wang et.al. 2511.05557 translate read null
2025-11-06 NovisVQ: A Streaming Convolutional Neural Network for No-Reference Opinion-Unaware Frame Quality Assessment Kylie Cancilla et.al. 2511.04628 translate read null
2025-11-06 Evaluating the Impact of Weather-Induced Sensor Occlusion on BEVFusion for 3D Object Detection Sanjay Kumar et.al. 2511.04347 translate read null
2025-11-06 Comparative Study of CNN Architectures for Binary Classification of Horses and Motorcycles in the VOC 2008 Dataset Muhammad Annas Shaikh et.al. 2511.04344 translate read null
2025-11-06 Deep learning-based object detection of offshore platforms on Sentinel-1 Imagery and the impact of synthetic training data Robin Spanier et.al. 2511.04304 translate read null
2025-11-06 DMSORT: An efficient parallel maritime multi-object tracking architecture for unmanned vessel platforms Shengyu Tang et.al. 2511.04128 translate read null
2025-11-05 Desert Waste Detection and Classification Using Data-Based and Model-Based Enhanced YOLOv12 DL Model Abdulmumin Sa’ad et.al. 2511.03888 translate read null
2025-11-05 ISC-Perception: A Hybrid Computer Vision Dataset for Object Detection in Novel Steel Assembly Miftahur Rahman et.al. 2511.03098 translate read null
2025-11-05 A Computer Vision Based Proxy for Political Polarization in Religious Countries: A Turkiye Case Study Liangze Ke et.al. 2511.03088 translate read null
2025-11-04 Diffusion Models are Robust Pretrainers Mika Yagoda et.al. 2511.02793 translate read null
2025-11-04 DetectiumFire: A Comprehensive Multi-modal Dataset Bridging Vision and Language for Fire Understanding Zixuan Liu et.al. 2511.02495 translate read null
2025-11-04 Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization Tao Liu et.al. 2511.02489 translate read null
2025-11-04 Facial Expression Recognition System Using DNN Accelerator with Multi-threading on FPGA Takuto Ando et.al. 2511.02408 translate read null
2025-11-04 3D Point Cloud Object Detection on Edge Devices for Split Computing Taisuke Noguchi et.al. 2511.02293 translate read null
2025-11-04 Autobiasing Event Cameras for Flickering Mitigation Mehdi Sefidgar Dilmaghani et.al. 2511.02180 translate read null
2025-11-03 UniLION: Towards Unified Autonomous Driving Model with Linear Group RNNs Zhe Liu et.al. 2511.01768 translate read null
2025-11-03 CGF-DETR: Cross-Gated Fusion DETR for Enhanced Pneumonia Detection in Chest X-rays Yefeng Wu et.al. 2511.01730 translate read null
2025-11-03 Contrast-Guided Cross-Modal Distillation for Thermal Object Detection SiWoo Kim et.al. 2511.01435 translate read null
2025-11-03 Eyes on Target: Gaze-Aware Object Detection in Egocentric Video Vishakha Lall et.al. 2511.01237 translate read null
2025-11-03 DEER: Disentangled Mixture of Experts with Instance-Adaptive Routing for Generalizable Machine-Generated Text Detection Guoxin Ma et.al. 2511.01192 translate read null
2025-11-02 Advancing Machine-Generated Text Detection from an Easy to Hard Supervision Perspective Chenwang Wu et.al. 2511.00988 translate read null
2025-11-02 A Hybrid YOLOv5-SSD IoT-Based Animal Detection System for Durian Plantation Protection Anis Suttan Shahrir et.al. 2511.00777 translate read null

(<a href=../Object_Detection.md>back to Object Detection</a>)