Object Detection - 2025-12 | Paper Arxiv Daily

Object Detection - 2025-12

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-12-31	Compressed Map Priors for 3D Perception	Brady Zhou et.al.	2601.00139	translate	read	null
2025-12-31	Automated electrostatic characterization of quantum dot devices in single- and bilayer heterostructures	Merritt P. R. Losert et.al.	2601.00067	translate	read	null
2025-12-31	Semi-Supervised Diversity-Aware Domain Adaptation for 3D Object detection	Bartłomiej Olber et.al.	2512.24922	translate	read	null
2025-12-31	Semi-Automated Data Annotation in Multisensor Datasets for Autonomous Vehicle Testing	Andrii Gamalii et.al.	2512.24896	translate	read	null
2025-12-31	FireRescue: A UAV-Based Dataset and Enhanced YOLO Model for Object Detection in Fire Rescue Scenes	Qingyu Xu et.al.	2512.24622	translate	read	null
2025-12-30	AI-Driven Evaluation of Surgical Skill via Action Recognition	Yan Meng et.al.	2512.24411	translate	read	null
2025-12-30	Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems	Song Wang et.al.	2512.24385	translate	read	null
2025-12-30	Balanced Hierarchical Contrastive Learning with Decoupled Queries for Fine-grained Object Detection in Remote Sensing Images	Jingzhou Chen et.al.	2512.24074	translate	read	null
2025-12-29	Automated river gauge plate reading using a hybrid object detection and generative AI framework in the Limpopo River Basin	Kayathri Vigneswaran et.al.	2512.23454	translate	read	null
2025-12-29	YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection	Xu Lin et.al.	2512.23273	translate	read	null
2025-12-29	LIMO: Low-Power In-Memory-Annealer and Matrix-Multiplication Primitive for Edge Computing	Amod Holla et.al.	2512.23212	translate	read	null
2025-12-29	Exploring Syn-to-Real Domain Adaptation for Military Target Detection	Jongoh Jeong et.al.	2512.23208	translate	read	null
2025-12-29	GVSynergy-Det: Synergistic Gaussian-Voxel Representations for Multi-View 3D Object Detection	Yi Zhang et.al.	2512.23176	translate	read	null
2025-12-29	GeoTeacher: Geometry-Guided Semi-Supervised 3D Object Detection	Jingyu Li et.al.	2512.23147	translate	read	null
2025-12-28	RealCamo: Boosting Real Camouflage Synthesis with Layout Controls and Textual-Visual Guidance	Chunyuan Chen et.al.	2512.22974	translate	read	null
2025-12-28	YOLO-IOD: Towards Real Time Incremental Object Detection	Shizhou Zhang et.al.	2512.22973	translate	read	null
2025-12-28	Wavelet-based Multi-View Fusion of 4D Radar Tensor and Camera for Robust 3D Object Detection	Runwei Guan et.al.	2512.22972	translate	read	null
2025-12-28	Evaluating the Performance of Open-Vocabulary Object Detection in Low-quality Image	Po-Chih Wu et.al.	2512.22801	translate	read	null
2025-12-27	SCAFusion: A Multimodal 3D Detection Framework for Small Object Detection in Lunar Surface Exploration	Xin Chen et.al.	2512.22503	translate	read	null
2025-12-27	Scalpel-SAM: A Semi-Supervised Paradigm for Adapting SAM to Infrared Small Object Detection	Zihan Liu et.al.	2512.22483	translate	read	null
2025-12-27	Comparing Object Detection Models for Electrical Substation Component Mapping	Haley Mody et.al.	2512.22454	translate	read	null
2025-12-27	SonoVision: A Computer Vision Approach for Helping Visually Challenged Individuals Locate Objects with the Help of Sound Cues	Md Abu Obaida Zishan et.al.	2512.22449	translate	read	null
2025-12-27	Towards Robust Optical-SAR Object Detection under Missing Modalities: A Dynamic Quality-Aware Fusion Framework	Zhicheng Zhao et.al.	2512.22447	translate	read	null
2025-12-26	DeFloMat: Detection with Flow Matching for Stable and Efficient Generative Object Localization	Hansang Lee et.al.	2512.22406	translate	read	null
2025-12-23	Failure Analysis of Safety Controllers in Autonomous Vehicles Under Object-Based LiDAR Attacks	Daniyal Ganiuly et.al.	2512.22244	translate	read	null
2025-12-26	Breaking Alignment Barriers: TPS-Driven Semantic Correlation Learning for Alignment-Free RGB-T Salient Object Detection	Lupiao Hu et.al.	2512.21856	translate	read	null
2025-12-25	Detecting AI-Generated Paraphrases in Bengali: A Comparative Study of Zero-Shot and Fine-Tuned Transformers	Md. Rakibul Islam et.al.	2512.21709	translate	read	null
2025-12-25	Comparative Analysis of Deep Learning Models for Perception in Autonomous Vehicles	Jalal Khan et.al.	2512.21673	translate	read	null
2025-12-24	ORCA: Object Recognition and Comprehension for Archiving Marine Species	Yuk-Kwan Wong et.al.	2512.21150	translate	read	null
2025-12-24	Self-supervised Multiplex Consensus Mamba for General Image Fusion	Yingying Wang et.al.	2512.20921	translate	read	null
2025-12-23	Real-World Adversarial Attacks on RF-Based Drone Detectors	Omer Gazit et.al.	2512.20712	translate	read	null
2025-12-23	Bridging Modalities and Transferring Knowledge: Enhanced Multimodal Understanding and Recognition	Gorjan Radevski et.al.	2512.20501	translate	read	null
2025-12-23	${D}^{3}${ETOR}: ${D}$ebate-Enhanced Pseudo Labeling and Frequency-Aware Progressive ${D}$ebiasing for Weakly-Supervised Camouflaged Object ${D}$ etection with Scribble Annotations	Jiawei Ge et.al.	2512.20260	translate	read	null
2025-12-23	LiteFusion: Taming 3D Object Detectors from Vision-Based to Multi-Modal with Minimal Adaptation	Xiangxuan Ren et.al.	2512.20217	translate	read	null
2025-12-23	Gaussian Process Assisted Meta-learning for Image Classification and Object Detection Models	Anna R. Flowers et.al.	2512.20021	translate	read	null
2025-12-23	PaveSync: A Unified and Comprehensive Dataset for Pavement Distress Analysis and Classification	Blessing Agyei Kyem et.al.	2512.20011	translate	read	null
2025-12-22	Photonic Spiking Graph Neural Network for Energy-Efficient Structured Data Processing	Wanting Yu et.al.	2512.19182	translate	read	null
2025-12-20	The size of 3I/ATLAS from non-gravitational acceleration	John C. Forbes et.al.	2512.18341	translate	read	null
2025-12-20	Pyramidal Adaptive Cross-Gating for Multimodal Detection	Zidong Gu et.al.	2512.18291	translate	read	null
2025-12-20	Building UI/UX Dataset for Dark Pattern Detection and YOLOv12x-based Real-Time Object Recognition Detection System	Se-Young Jang et.al.	2512.18269	translate	read	null
2025-12-20	Spectral Discrepancy and Cross-modal Semantic Consistency Learning for Object Detection in Hyperspectral Image	Xiao He et.al.	2512.18245	translate	read	null
2025-12-20	ALIGN: Advanced Query Initialization with LiDAR-Image Guidance for Occlusion-Robust 3D Object Detection	Janghyun Baek et.al.	2512.18187	translate	read	null
2025-12-19	YolovN-CBi: A Lightweight and Efficient Architecture for Real-Time Detection of Small UAVs	Ami Pandat et.al.	2512.18046	translate	read	null
2025-12-19	StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection	Di Wu et.al.	2512.17620	translate	read	null
2025-12-19	Foundation Model Priors Enhance Object Focus in Feature Space for Source-Free Object Detection	Sairam VCR et.al.	2512.17514	translate	read	null
2025-12-19	PILAR: Personalizing Augmented Reality Interactions with LLM-based Human-Centric and Trustworthy Explanations for Daily Use Cases	Ripan Kumar Kundu et.al.	2512.17172	translate	read	null
2025-12-18	DenseBEV: Transforming BEV Grid Cells into 3D Objects	Marius Dähling et.al.	2512.16818	translate	read	null
2025-12-18	FlowDet: Unifying Object Detection and Generative Transport Flows	Enis Baty et.al.	2512.16771	translate	read	null
2025-12-18	YOLO11-4K: An Efficient Architecture for Real-Time Small Object Detection in 4K Panoramic Images	Huma Hafeez et.al.	2512.16493	translate	read	null
2025-12-18	Autoencoder-based Denoising Defense against Adversarial Attacks on Object Detection	Min Geun Song et.al.	2512.16123	translate	read	null
2025-12-18	Auto-Vocabulary 3D Object Detection	Haomeng Zhang et.al.	2512.16077	translate	read	null
2025-12-17	From Words to Wavelengths: VLMs for Few-Shot Multispectral Object Detection	Manuel Nkegoum et.al.	2512.15971	translate	read	null
2025-12-13	Two-Step Data Augmentation for Masked Face Detection and Recognition: Turning Fake Masks to Real	Yan Yang et.al.	2512.15774	translate	read	null
2025-12-17	IMKD: Intensity-Aware Multi-Level Knowledge Distillation for Camera-Radar Fusion	Shashank Mishra et.al.	2512.15581	translate	read	null
2025-12-17	Evaluation of deep learning architectures for wildlife object detection: A comparative study of ResNet and Inception	Malach Obisa Amonga et.al.	2512.15480	translate	read	null
2025-12-17	Vision-based module for accurately reading linear scales in a laboratory	Parvesh Saini et.al.	2512.15327	translate	read	null
2025-12-17	EPSM: A Novel Metric to Evaluate the Safety of Environmental Perception in Autonomous Driving	Jörg Gamerdinger et.al.	2512.15195	translate	read	null
2025-12-17	Criticality Metrics for Relevance Classification in Safety Evaluation of Object Detection in Automated Driving	Jörg Gamerdinger et.al.	2512.15181	translate	read	null
2025-12-17	Beyond Proximity: A Keypoint-Trajectory Framework for Classifying Affiliative and Agonistic Social Networks in Dairy Cattle	Sibi Parivendan et.al.	2512.14998	translate	read	null
2025-12-16	TUMTraf EMOT: Event-Based Multi-Object Tracking Dataset and Baseline for Traffic Scenarios	Mengyu Li et.al.	2512.14595	translate	read	null
2025-12-16	4D-RaDiff: Latent Diffusion for 4D Radar Point Cloud Generation	Jimmie Kwok et.al.	2512.14235	translate	read	null
2025-12-16	CIS-BA: Continuous Interaction Space Based Backdoor Attack for Object Detection in the Real-World	Shuxin Zhao et.al.	2512.14158	translate	read	null
2025-12-16	Neurosymbolic Inference On Foundation Models For Remote Sensing Text-to-image Retrieval With Complex Queries	Emanuele Mezzi et.al.	2512.14102	translate	read	null
2025-12-16	Deep Learning Perspective of Scene Understanding in Autonomous Robots	Afia Maham et.al.	2512.14020	translate	read	null
2025-12-16	Real-Time Service Subscription and Adaptive Offloading Control in Vehicular Edge Computing	Chuanchao Gao et.al.	2512.14002	translate	read	null
2025-12-16	FocalComm: Hard Instance-Aware Multi-Agent Perception	Dereje Shenkut et.al.	2512.13982	translate	read	null
2025-12-15	Route-DETR: Pairwise Query Routing in Transformers for Object Detection	Ye Zhang et.al.	2512.13876	translate	read	null
2025-12-15	VajraV1 – The most accurate Real Time Object Detector of the YOLO family	Naman Balbir Singh Makkar et.al.	2512.13834	translate	read	null
2025-12-15	Near-Field Perception for Safety Enhancement of Autonomous Mobile Robots in Manufacturing Environments	Li-Wei Shih et.al.	2512.13561	translate	read	null
2025-12-15	On the Ability of Deep Learning to Detect Signals with Unknown Parameters	Tom Anders et.al.	2512.13542	translate	read	null
2025-12-15	Computer vision training dataset generation for robotic environments using Gaussian splatting	Patryk Niżeniec et.al.	2512.13411	translate	read	null
2025-12-15	Diffusion-Based Restoration for Multi-Modal 3D Object Detection in Adverse Weather	Zhijian He et.al.	2512.13107	translate	read	null
2025-12-14	Cross-Level Sensor Fusion with Object Lists via Transformer for 3D Object Detection	Xiangzhong Liu et.al.	2512.12884	translate	read	null
2025-12-13	INDOOR-LiDAR: Bridging Simulation and Reality for Robot-Centric 360 degree Indoor LiDAR Perception – A Robot-Centric Hybrid Dataset	Haichuan Li et.al.	2512.12377	translate	read	null
2025-12-13	WeDetect: Fast Open-Vocabulary Object Detection as Retrieval	Shenghao Fu et.al.	2512.12309	translate	read	null
2025-12-13	Cognitive-YOLO: LLM-Driven Architecture Synthesis from First Principles of Data for Object Detection	Jiahao Zhao et.al.	2512.12281	translate	read	null
2025-12-13	AI-Augmented Pollen Recognition in Optical and Holographic Microscopy for Veterinary Imaging	Swarn S. Warshaneyan et.al.	2512.12101	translate	read	null
2025-12-12	TransBridge: Boost 3D Object Detection by Scene-Level Completion with Transformer Decoder	Qinghao Meng et.al.	2512.11926	translate	read	null
2025-12-12	Depth-Copy-Paste: Multimodal and Depth-Aware Compositing for Robust Face Detection	Qiushi Guo et.al.	2512.11683	translate	read	null
2025-12-12	DOS: Distilling Observable Softmaps of Zipfian Prototypes for Self-Supervised Point Representation	Mohamed Abdelsamad et.al.	2512.11465	translate	read	null
2025-12-12	Assisted Refinement Network Based on Channel Information Interaction for Camouflaged and Salient Object Detection	Kuan Wang et.al.	2512.11369	translate	read	null
2025-12-12	Reliable Detection of Minute Targets in High-Resolution Aerial Imagery across Temporal Shifts	Mohammad Sadegh Gholizadeh et.al.	2512.11360	translate	read	null
2025-12-11	VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction	Weitai Kang et.al.	2512.11099	translate	read	null
2025-12-11	Salient Object Detection in Complex Weather Conditions via Noise Indicators	Quan Chen et.al.	2512.10592	translate	read	null
2025-12-11	Adaptive Dual-Weighted Gravitational Point Cloud Denoising Method	Ge Zhang et.al.	2512.10386	translate	read	null
2025-12-10	ABBSPO: Adaptive Bounding Box Scaling and Symmetric Prior based Orientation Prediction for Detecting Aerial Image Objects	Woojin Lee et.al.	2512.10031	translate	read	null
2025-12-10	NordFKB: a fine-grained benchmark dataset for geospatial AI in Norway	Sander Riisøen Jyhne et.al.	2512.09913	translate	read	null
2025-12-10	Hands-on Evaluation of Visual Transformers for Object Recognition and Detection	Dimitrios N. Vlachogiannis et.al.	2512.09579	translate	read	null
2025-12-10	MODA: The First Challenging Benchmark for Multispectral Object Detection in Aerial Images	Shuaihao Han et.al.	2512.09489	translate	read	null
2025-12-10	A Hierarchical, Model-Based System for High-Performance Humanoid Soccer	Quanyou Wang et.al.	2512.09431	translate	read	null
2025-12-10	Identifying Bias in Machine-generated Text Detection	Kevin Stowe et.al.	2512.09292	translate	read	null
2025-12-10	ROI-Packing: Efficient Region-Based Compression for Machine Vision	Md Eimran Hossain Eimon et.al.	2512.09258	translate	read	null
2025-12-09	Automated Pollen Recognition in Optical and Holographic Microscopy Images	Swarn Singh Warshaneyan et.al.	2512.08589	translate	read	null
2025-12-09	SSCATeR: Sparse Scatter-Based Convolution Algorithm with Temporal Data Recycling for Real-Time 3D Object Detection in LiDAR Point Clouds	Alexander Dow et.al.	2512.08557	translate	read	null
2025-12-09	Distilling Future Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection	Haowen Zheng et.al.	2512.08247	translate	read	null
2025-12-09	SOP^2: Transfer Learning with Scene-Oriented Prompt Pool on 3D Object Detection	Ching-Hung Cheng et.al.	2512.08223	translate	read	null
2025-12-09	Metasurfaces Enable Active-Like Passive Radar	Mingyi Li et.al.	2512.08208	translate	read	null
2025-12-08	An AI-Powered Autonomous Underwater System for Sea Exploration and Scientific Research	Hamad Almazrouei et.al.	2512.07652	translate	read	null
2025-12-08	Towards Robust DeepFake Detection under Unstable Face Sequences: Adaptive Sparse Graph Embedding with Order-Free Representation and Explicit Laplacian Spectral Prior	Chih-Chung Hsu et.al.	2512.07498	translate	read	null
2025-12-08	Enhancing Small Object Detection with YOLO: A Novel Framework for Improved Accuracy and Efficiency	Mahila Moghadami et.al.	2512.07379	translate	read	null
2025-12-08	A graph generation pipeline for critical infrastructures based on heuristics, images and depth data	Mike Diessner et.al.	2512.07269	translate	read	null
2025-12-08	DART: Leveraging Multi-Agent Disagreement for Tool Recruitment in Multimodal Reasoning	Nithin Sivakumaran et.al.	2512.07132	translate	read	null
2025-12-08	DFIR-DETR: Frequency Domain Enhancement and Dynamic Feature Aggregation for Cross-Scene Small Object Detection	Bo Gao et.al.	2512.07078	translate	read	null
2025-12-07	Large Language Models and Forensic Linguistics: Navigating Opportunities and Threats in the Age of Generative AI	George Mikros et.al.	2512.06922	translate	read	null
2025-12-07	Spatial Retrieval Augmented Autonomous Driving	Xiaosong Jia et.al.	2512.06865	translate	read	null
2025-12-07	CoT4Det: A Chain-of-Thought Framework for Perception-Oriented Vision-Language Tasks	Yu Qi et.al.	2512.06663	translate	read	null
2025-12-07	TextMamba: Scene Text Detector with Mamba	Qiyan Zhao et.al.	2512.06657	translate	read	null
2025-12-06	Neural expressiveness for beyond importance model compression	Angelos-Christos Maroudis et.al.	2512.06440	translate	read	null
2025-12-06	Are AI-Generated Driving Videos Ready for Autonomous Driving? A Diagnostic Evaluation Framework	Xinhao Xiang et.al.	2512.06376	translate	read	null
2025-12-05	OWL: Unsupervised 3D Object Detection by Occupancy Guided Warm-up and Large Model Priors Reasoning	Xusheng Guo et.al.	2512.05698	translate	read	null
2025-12-05	LeAD-M3D: Leveraging Asymmetric Distillation for Real-time Monocular 3D Detection	Johannes Meier et.al.	2512.05663	translate	read	null
2025-12-05	An Integrated System for WEEE Sorting Employing X-ray Imaging, AI-based Object Detection and Segmentation, and Delta Robot Manipulation	Panagiotis Giannikos et.al.	2512.05599	translate	read	null
2025-12-05	Concept-based Explainable Data Mining with VLM for 3D Detection	Mai Tsujimoto et.al.	2512.05482	translate	read	null
2025-12-05	Moving object detection from multi-depth images with an attention-enhanced CNN	Masato Shibukawa et.al.	2512.05415	translate	read	null
2025-12-05	YOLO and SGBM Integration for Autonomous Tree Branch Detection and Depth Estimation in Radiata Pine Pruning Applications	Yida Lin et.al.	2512.05412	translate	read	null
2025-12-04	GeoPE:A Unified Geometric Positional Embedding for Structured Tensors	Yupu Yao et.al.	2512.04963	translate	read	null
2025-12-04	ZeBROD: Zero-Retraining Based Recognition and Object Detection Framework	Priyanto Hidayatullah et.al.	2512.04888	translate	read	null
2025-12-04	DuGI-MAE: Improving Infrared Mask Autoencoders via Dual-Domain Guidance	Yinghui Xing et.al.	2512.04511	translate	read	null
2025-12-04	Dual-Stream Spectral Decoupling Distillation for Remote Sensing Object Detection	Xiangyi Gao et.al.	2512.04413	translate	read	null
2025-12-03	Real-time Cricket Sorting By Sex	Juan Manuel Cantarero Angulo et.al.	2512.04311	translate	read	null
2025-12-03	Fast & Efficient Normalizing Flows and Applications of Image Generative Models	Sandeep Nagar et.al.	2512.04039	translate	read	null
2025-12-03	MKSNet: Advanced Small Object Detection in Remote Sensing Imagery with Multi-Kernel and Dual Attention Mechanisms	Jiahao Zhang et.al.	2512.03640	translate	read	null
2025-12-03	Real-Time Control and Automation Framework for Acousto-Holographic Microscopy	Hasan Berkay Abdioğlu et.al.	2512.03539	translate	read	null
2025-12-03	YOLOA: Real-Time Affordance Detection via LLM Adapter	Yuqi Ji et.al.	2512.03418	translate	read	null
2025-12-02	GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection	Md Sohag Mia et.al.	2512.02991	translate	read	null
2025-12-02	BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection	Guowen Zhang et.al.	2512.02972	translate	read	null
2025-12-02	MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image Understanding	Fan Yang et.al.	2512.02906	translate	read	null
2025-12-02	ALDI-ray: Adapting the ALDI Framework for Security X-ray Object Detection	Omid Reza Heidari et.al.	2512.02696	translate	read	null
2025-12-02	SAM2Grasp: Resolve Multi-modal Grasping via Prompt-conditioned Temporal Action Prediction	Shengkai Wu et.al.	2512.02609	translate	read	null
2025-12-02	GeoDiT: A Diffusion-based Vision-Language Model for Geospatial Understanding	Jiaqi Liu et.al.	2512.02505	translate	read	null
2025-12-02	Temporal Dynamics Enhancer for Directly Trained Spiking Object Detectors	Fan Luo et.al.	2512.02447	translate	read	null
2025-12-01	Physical ID-Transfer Attacks against Multi-Object Tracking via Adversarial Trajectory	Chenyi Wang et.al.	2512.01934	translate	read	null
2025-12-01	SAM3-UNet: Simplified Adaptation of Segment Anything Model 3	Xinyu Xiong et.al.	2512.01789	translate	read	null
2025-12-01	Bridging the Scale Gap: Balanced Tiny and General Object Detection in Remote Sensing Imagery	Zhicheng Zhao et.al.	2512.01665	translate	read	null
2025-12-01	ViT $^3$ : Unlocking Test-Time Training in Vision	Dongchen Han et.al.	2512.01643	translate	read	null
2025-12-01	OpenBox: Annotate Any Bounding Boxes in 3D	In-Jae Lee et.al.	2512.01352	translate	read	null
2025-12-01	FOD-S2R: A FOD Dataset for Sim2Real Transfer Learning based Object Detection	Ashish Vashist et.al.	2512.01315	translate	read	null
2025-12-01	Supervised Contrastive Machine Unlearning of Background Bias in Sonar Image Classification with Fine-Grained Explainable AI	Kamal Basha S et.al.	2512.01291	translate	read	null
2025-12-01	VSRD++: Autolabeling for 3D Object Detection via Instance-Aware Volumetric Silhouette Rendering	Zihua Liu et.al.	2512.01178	translate	read	null
2025-12-01	Real-Time On-the-Go Annotation Framework Using YOLO for Automated Dataset Generation	Mohamed Abdallah Salem et.al.	2512.01165	translate	read	null

(<a href=../Object_Detection.md>back to Object Detection</a>)