Object Detection - 2025-11 | Paper Arxiv Daily

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-11-27	Semi-Supervised Contrastive Learning with Orthonormal Prototypes	Huanran Li et.al.	2512.07880	translate	read	null
2025-11-30	Autonomous Grasping On Quadruped Robot With Task Level Interaction	Muhtadin et.al.	2512.01052	translate	read	null
2025-11-30	Med-CMR: A Fine-Grained Benchmark Integrating Visual Evidence and Clinical Logic for Medical Complex Multimodal Reasoning	Haozhen Gong et.al.	2512.00818	translate	read	null
2025-11-30	DEJIMA: A Novel Large-scale Japanese Dataset for Image Captioning and Visual Question Answering	Toshiki Katsube et.al.	2512.00773	translate	read	null
2025-11-29	MM-DETR: An Efficient Multimodal Detection Transformer with Mamba-Driven Dual-Granularity Fusion and Frequency-Aware Modality Adapters	Jianhong Han et.al.	2512.00363	translate	read	null
2025-11-28	Hybrid Synthetic Data Generation with Domain Randomization Enables Zero-Shot Vision-Based Part Inspection Under Extreme Class Imbalance	Ruo-Syuan Mei et.al.	2512.00125	translate	read	null
2025-11-25	Diffusion-Based Synthetic Brightfield Microscopy Images for Enhanced Single Cell Detection	Mario de Jesus da Graca et.al.	2512.00078	translate	read	null
2025-11-24	ProvRain: Rain-Adaptive Denoising and Vehicle Detection via MobileNet-UNet and Faster R-CNN	Aswinkumar Varathakumaran et.al.	2512.00073	translate	read	null
2025-11-23	PEFT-DML: Parameter-Efficient Fine-Tuning Deep Metric Learning for Robust Multi-Modal 3D Object Detection in Autonomous Driving	Abdolazim Rezaei et.al.	2512.00060	translate	read	null
2025-11-28	Object-Centric Data Synthesis for Category-level Object Detection	Vikhyat Agarwal et.al.	2511.23450	translate	read	null
2025-11-28	Toward Automatic Safe Driving Instruction: A Large-Scale Vision Language Model Approach	Haruki Sakajo et.al.	2511.23311	translate	read	null
2025-11-28	Synthetic Industrial Object Detection: GenAI vs. Feature-Based Methods	Jose Moises Araya-Martinez et.al.	2511.23241	translate	read	null
2025-11-28	Zero-Shot Multi-Criteria Visual Quality Inspection for Semi-Controlled Industrial Environments via Real-Time 3D Digital Twin Simulation	Jose Moises Araya-Martinez et.al.	2511.23214	translate	read	null
2025-11-28	Bharat Scene Text: A Novel Comprehensive Dataset and Benchmark for Indian Language Scene Text Understanding	Anik De et.al.	2511.23071	translate	read	null
2025-11-28	Barcode and QR Code Object Detection: An Experimental Study on YOLOv8 Models	Kushagra Pandya et.al.	2511.22937	translate	read	null
2025-11-28	DM$^3$T: Harmonizing Modalities via Diffusion for Multi-Object Tracking	Weiran Li et.al.	2511.22896	translate	read	null
2025-11-27	DocVAL: Validated Chain-of-Thought Distillation for Grounded Document VQA	Ahmad Mohammadshirazi et.al.	2511.22521	translate	read	null
2025-11-27	Small Object Detection for Birds with Swin Transformer	Da Huo et.al.	2511.22310	translate	read	null
2025-11-27	Simplex-Optimized Hybrid Ensemble for Large Language Model Text Detection Under Generative Distribution Drift	Sepyan Purnama Kristanto et.al.	2511.22153	translate	read	null
2025-11-27	Bistatic Passive Tracking via CSI Power	Zhongqin Wang et.al.	2511.22144	translate	read	null
2025-11-27	SemOD: Semantic Enabled Object Detection Network under Various Weather Conditions	Aiyinsi Zuo et.al.	2511.22142	translate	read	null
2025-11-27	PAGen: Phase-guided Amplitude Generation for Domain-adaptive Object Detection	Shuchen Du et.al.	2511.22029	translate	read	null
2025-11-22	A Lightweight Approach to Detection of AI-Generated Texts Using Stylometric Features	Sergey K. Aityan et.al.	2511.21744	translate	read	null
2025-11-26	Continual Error Correction on Low-Resource Devices	Kirill Paramonov et.al.	2511.21652	translate	read	null
2025-11-26	CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation	Shizhe Sun et.al.	2511.21503	translate	read	null
2025-11-26	Co-Training Vision Language Models for Remote Sensing Multi-task Learning	Qingyun Li et.al.	2511.21272	translate	read	null
2025-11-26	OVOD-Agent: A Markov-Bandit Framework for Proactive Visual Reasoning and Self-Evolving Detection	Chujie Wang et.al.	2511.21064	translate	read	null
2025-11-26	AerialMind: Towards Referring Multi-Object Tracking in UAV Scenarios	Chenglizhao Chen et.al.	2511.21053	translate	read	null
2025-11-26	Wavefront-Constrained Passive Obscured Object Detection	Zhiwen Zheng et.al.	2511.20991	translate	read	null
2025-11-26	RefOnce: Distilling References into a Prototype Memory for Referring Camouflaged Object Detection	Yu-Huan Wu et.al.	2511.20989	translate	read	null
2025-11-25	Video Object Recognition in Mobile Edge Networks: Local Tracking or Edge Detection?	Kun Guo et.al.	2511.20716	translate	read	null
2025-11-25	MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities	Tooba Tehreem Sheikh et.al.	2511.20650	translate	read	null
2025-11-25	Zoo3D: Zero-Shot 3D Object Detection at Scene Level	Andrey Lemeshko et.al.	2511.20253	translate	read	null
2025-11-25	Intelligent Image Search Algorithms Fusing Visual Large Models	Kehan Wang et.al.	2511.19920	translate	read	null
2025-11-24	Maritime Small Object Detection from UAVs using Deep Learning with Altitude-Aware Dynamic Tiling	Sakib Ahmed et.al.	2511.19728	translate	read	null
2025-11-24	Studying Maps at Scale: A Digital Investigation of Cartography and the Evolution of Figuration	Remi Petitpierre et.al.	2511.19538	translate	read	null
2025-11-24	SAM3-Adapter: Efficient Adaptation of Segment Anything 3 for Camouflage Object Segmentation, Shadow Detection, and Medical Image Segmentation	Tianrun Chen et.al.	2511.19425	translate	read	null
2025-11-24	IDEAL-M3D: Instance Diversity-Enriched Active Learning for Monocular 3D Detection	Johannes Meier et.al.	2511.19301	translate	read	null
2025-11-24	SpectraNet: FFT-assisted Deep Learning Classifier for Deepfake Face Detection	Nithira Jayarathne et.al.	2511.19187	translate	read	null
2025-11-24	MambaRefine-YOLO: A Dual-Modality Small Object Detector for UAV Imagery	Shuyu Cao et.al.	2511.19134	translate	read	null
2025-11-24	3M-TI: High-Quality Mobile Thermal Imaging via Calibration-free Multi-Camera Cross-Modal Diffusion	Minchong Chen et.al.	2511.19117	translate	read	null
2025-11-24	LLMAID: Identifying AI Capabilities in Android Apps with LLMs	Pei Liu et.al.	2511.19059	translate	read	null
2025-11-24	LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space	Hai Wu et.al.	2511.19057	translate	read	null
2025-11-24	Enhancing Fast Radio Transient Detection with Mask R-CNN Image Segmentation	Sergio Belmonte Diaz et.al.	2511.19014	translate	read	null
2025-11-24	Peregrine: One-Shot Fine-Tuning for FHE Inference of General Deep CNNs	Huaming Ling et.al.	2511.18976	translate	read	null
2025-11-24	DualGazeNet: A Biologically Inspired Dual-Gaze Query Network for Salient Object Detection	Yu Zhang et.al.	2511.18865	translate	read	null
2025-11-24	DetAny4D: Detect Anything 4D Temporally in a Streaming RGB Video	Jiawei Hou et.al.	2511.18814	translate	read	null
2025-11-24	StereoDETR: Stereo-based Transformer for 3D Object Detection	Shiyi Mu et.al.	2511.18788	translate	read	null
2025-11-24	DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous Driving	Hongbin Lin et.al.	2511.18713	translate	read	null
2025-11-24	Dendritic Convolution for Noise Image Recognition	Jiarui Xue et.al.	2511.18699	translate	read	null
2025-11-24	Multimodal Real-Time Anomaly Detection and Industrial Applications	Aman Verma et.al.	2511.18698	translate	read	null
2025-11-24	Exploring Surround-View Fisheye Camera 3D Object Detection	Changcai Li et.al.	2511.18695	translate	read	null
2025-11-23	UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization	Siyi Li et.al.	2511.18254	translate	read	null
2025-11-22	VK-Det: Visual Knowledge Guided Prototype Learning for Open-Vocabulary Aerial Object Detection	Jianhang Yao et.al.	2511.18075	translate	read	null
2025-11-22	Diverse Instance Generation via Diffusion Models for Enhanced Few-Shot Object Detection in Remote Sensing Images	Yanxing Liu et.al.	2511.18031	translate	read	null
2025-11-22	State and Scene Enhanced Prototypes for Weakly Supervised Open-Vocabulary Object Detection	Jiaying Zhou et.al.	2511.18012	translate	read	null
2025-11-21	REXO: Indoor Multi-View Radar Object Detection via 3D Bounding Box Diffusion	Ryoma Yataka et.al.	2511.17806	translate	read	null
2025-11-21	PUCP-Metrix: An Open-source and Comprehensive Toolkit for Linguistic Analysis of Spanish Texts	Javier Alonso Villegas Luis et.al.	2511.17402	translate	read	null
2025-11-21	A lightweight detector for real-time detection of remote sensing images	Qianyi Wang et.al.	2511.17147	translate	read	null
2025-11-21	OmniPT: Unleashing the Potential of Large Vision Language Models for Pedestrian Tracking and Understanding	Teng Fu et.al.	2511.17053	translate	read	null
2025-11-20	Integrating Deep Learning and Spatial Statistics in Marine Ecosystem Monitoring	Gian Mario Sangiovanni et.al.	2511.16447	translate	read	null
2025-11-20	StreetView-Waste: A Multi-Task Dataset for Urban Waste Management	Diogo J. Paulo et.al.	2511.16440	translate	read	null
2025-11-04	In-Context Adaptation of VLMs for Few-Shot Cell Detection in Optical Microscopy	Shreyan Ganguly et.al.	2511.05565	translate	read	null
2025-11-03	Compressing Multi-Task Model for Autonomous Driving via Pruning and Knowledge Distillation	Jiayuan Wang et.al.	2511.05557	translate	read	null
2025-11-06	NovisVQ: A Streaming Convolutional Neural Network for No-Reference Opinion-Unaware Frame Quality Assessment	Kylie Cancilla et.al.	2511.04628	translate	read	null
2025-11-06	Evaluating the Impact of Weather-Induced Sensor Occlusion on BEVFusion for 3D Object Detection	Sanjay Kumar et.al.	2511.04347	translate	read	null
2025-11-06	Comparative Study of CNN Architectures for Binary Classification of Horses and Motorcycles in the VOC 2008 Dataset	Muhammad Annas Shaikh et.al.	2511.04344	translate	read	null
2025-11-06	Deep learning-based object detection of offshore platforms on Sentinel-1 Imagery and the impact of synthetic training data	Robin Spanier et.al.	2511.04304	translate	read	null
2025-11-06	DMSORT: An efficient parallel maritime multi-object tracking architecture for unmanned vessel platforms	Shengyu Tang et.al.	2511.04128	translate	read	null
2025-11-05	Desert Waste Detection and Classification Using Data-Based and Model-Based Enhanced YOLOv12 DL Model	Abdulmumin Sa’ad et.al.	2511.03888	translate	read	null
2025-11-05	ISC-Perception: A Hybrid Computer Vision Dataset for Object Detection in Novel Steel Assembly	Miftahur Rahman et.al.	2511.03098	translate	read	null
2025-11-05	A Computer Vision Based Proxy for Political Polarization in Religious Countries: A Turkiye Case Study	Liangze Ke et.al.	2511.03088	translate	read	null
2025-11-04	Diffusion Models are Robust Pretrainers	Mika Yagoda et.al.	2511.02793	translate	read	null
2025-11-04	DetectiumFire: A Comprehensive Multi-modal Dataset Bridging Vision and Language for Fire Understanding	Zixuan Liu et.al.	2511.02495	translate	read	null
2025-11-04	Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization	Tao Liu et.al.	2511.02489	translate	read	null
2025-11-04	Facial Expression Recognition System Using DNN Accelerator with Multi-threading on FPGA	Takuto Ando et.al.	2511.02408	translate	read	null
2025-11-04	3D Point Cloud Object Detection on Edge Devices for Split Computing	Taisuke Noguchi et.al.	2511.02293	translate	read	null
2025-11-04	Autobiasing Event Cameras for Flickering Mitigation	Mehdi Sefidgar Dilmaghani et.al.	2511.02180	translate	read	null
2025-11-03	UniLION: Towards Unified Autonomous Driving Model with Linear Group RNNs	Zhe Liu et.al.	2511.01768	translate	read	null
2025-11-03	CGF-DETR: Cross-Gated Fusion DETR for Enhanced Pneumonia Detection in Chest X-rays	Yefeng Wu et.al.	2511.01730	translate	read	null
2025-11-03	Contrast-Guided Cross-Modal Distillation for Thermal Object Detection	SiWoo Kim et.al.	2511.01435	translate	read	null
2025-11-03	Eyes on Target: Gaze-Aware Object Detection in Egocentric Video	Vishakha Lall et.al.	2511.01237	translate	read	null
2025-11-03	DEER: Disentangled Mixture of Experts with Instance-Adaptive Routing for Generalizable Machine-Generated Text Detection	Guoxin Ma et.al.	2511.01192	translate	read	null
2025-11-02	Advancing Machine-Generated Text Detection from an Easy to Hard Supervision Perspective	Chenwang Wu et.al.	2511.00988	translate	read	null
2025-11-02	A Hybrid YOLOv5-SSD IoT-Based Animal Detection System for Durian Plantation Protection	Anis Suttan Shahrir et.al.	2511.00777	translate	read	null
