Object Detection - 2025-06 | Paper Arxiv Daily

Object Detection - 2025-06

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-06-30	Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios	Deng Li et.al.	2506.24063	translate	read	null
2025-06-30	Visual Textualization for Image Prompted Object Detection	Yongjian Wu et.al.	2506.23785	translate	read	null
2025-06-30	PBCAT: Patch-based composite adversarial training against physically realizable attacks on object detection	Xiao Li et.al.	2506.23581	translate	read	null
2025-06-30	Event-based Tiny Object Detection: A Benchmark Dataset and Baseline	Nuo Chen et.al.	2506.23575	translate	read	null
2025-06-30	OcRFDet: Object-Centric Radiance Fields for Multi-View 3D Object Detection in Autonomous Driving	Mingqian Ji et.al.	2506.23565	translate	read	null
2025-06-30	From Sight to Insight: Unleashing Eye-Tracking in Weakly Supervised Video Salient Object Detection	Qi Qin et.al.	2506.23519	translate	read	null
2025-06-30	Improve Underwater Object Detection through YOLOv12 Architecture and Physics-informed Augmentation	Tinh Nguyen et.al.	2506.23505	translate	read	null
2025-06-29	Detecting What Matters: A Novel Approach for Out-of-Distribution 3D Object Detection in Autonomous Vehicles	Menna Taha et.al.	2506.23426	translate	read	null
2025-06-29	Layer Decomposition and Morphological Reconstruction for Task-Oriented Infrared Image Enhancement	Siyuan Chai et.al.	2506.23353	translate	read	null
2025-06-29	GeoProg3D: Compositional Visual Reasoning for City-Scale 3D Language Fields	Shunsuke Yasuki et.al.	2506.23352	translate	read	null
2025-06-27	Attention-disentangled Uniform Orthogonal Feature Space Optimization for Few-shot Object Detection	Taijin Zhao et.al.	2506.22161	translate	read	null
2025-06-27	Evaluating Pointing Gestures for Target Selection in Human-Robot Collaboration	Noora Sassali et.al.	2506.22116	translate	read	null
2025-06-27	CERBERUS: Crack Evaluation & Recognition Benchmark for Engineering Reliability & Urban Stability	Justin Reinman et.al.	2506.21909	translate	read	null
2025-06-27	Visual Content Detection in Educational Videos with Transfer Learning and Dataset Enrichment	Dipayan Biswas et.al.	2506.21903	translate	read	null
2025-06-27	Embodied Domain Adaptation for Object Detection	Xiangyu Shi et.al.	2506.21860	translate	read	null
2025-06-26	PhotonSplat: 3D Scene Reconstruction and Colorization from SPAD Sensors	Sai Sri Teja et.al.	2506.21680	translate	read	null
2025-06-26	Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object Detection	Tobias J. Riedlinger et.al.	2506.21486	translate	read	null
2025-06-26	TITAN: Query-Token based Domain Adaptive Adversarial Learning	Tajamul Ashraf et.al.	2506.21484	translate	read	null
2025-06-26	A Comprehensive Dataset for Underground Miner Detection in Diverse Scenario	Cyrus Addy et.al.	2506.21451	translate	read	null
2025-06-26	DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic	Munish Monga et.al.	2506.21260	translate	read	null
2025-06-26	LASFNet: A Lightweight Attention-Guided Self-Modulation Feature Fusion Network for Multimodal Object Detection	Lei Hao et.al.	2506.21018	translate	read	null
2025-06-26	ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation	Shruti Bansal et.al.	2506.20969	translate	read	null
2025-06-25	Lightweight Multi-Frame Integration for Robust YOLO Object Detection in Videos	Yitong Quan et.al.	2506.20550	translate	read	null
2025-06-25	Learning-based safety lifting monitoring system for cranes on construction sites	Hao Chen et.al.	2506.20475	translate	read	null
2025-06-25	Feature Hallucination for Self-supervised Action Recognition	Lei Wang et.al.	2506.20342	translate	read	null
2025-06-25	From Codicology to Code: A Comparative Study of Transformer and YOLO-based Detectors for Layout Analysis in Historical Documents	Sergio Torres Aguilar et.al.	2506.20326	translate	read	null
2025-06-25	TDiR: Transformer based Diffusion for Image Restoration Tasks	Abbas Anwar et.al.	2506.20302	translate	read	null
2025-06-25	Integrated optomechanical ultrasonic sensors with nano-Pascal-level sensitivity	Xuening Cao et.al.	2506.20219	translate	read	null
2025-06-24	A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects	Shulan Ruan et.al.	2506.19769	translate	read	null
2025-06-24	Semantic Scene Graph for Ultrasound Image Explanation and Scanning Guidance	Xuesong Li et.al.	2506.19683	translate	read	null
2025-06-24	Probabilistic modelling and safety assurance of an agriculture robot providing light-treatment	Mustafa Adam et.al.	2506.19620	translate	read	null
2025-06-24	USIS16K: High-Quality Dataset for Underwater Salient Instance Segmentation	Lin Hong et.al.	2506.19472	translate	read	null
2025-06-23	SpaNN: Detecting Multiple Adversarial Patches on CNNs by Spanning Saliency Thresholds	Mauricio Byrd Victorica et.al.	2506.18591	translate	read	null
2025-06-23	Improvement on LiDAR-Camera Calibration Using Square Targets	Zhongyuan Li et.al.	2506.18294	translate	read	null
2025-06-23	Learning Approach to Efficient Vision-based Active Tracking of a Flying Target by an Unmanned Aerial Vehicle	Jagadeswara PKV Pothuri et.al.	2506.18264	translate	read	null
2025-06-23	Ground tracking for improved landmine detection in a GPR system	Li Tang et.al.	2506.18258	translate	read	null
2025-06-24	Referring Expression Instance Retrieval and A Strong End-to-End Baseline	Xiangzhao Hao et.al.	2506.18246	translate	read	null
2025-06-24	Unfolding the Past: A Comprehensive Deep Learning Approach to Analyzing Incunabula Pages	Klaudia Ropel et.al.	2506.18069	translate	read	null
2025-06-21	YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception	Mengqi Lei et.al.	2506.17733	translate	read	link
2025-06-21	CSDN: A Context-Gated Self-Adaptive Detection Network for Real-Time Object Detection	Wei Haolin et.al.	2506.17679	translate	read	null
2025-06-21	DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For Driving	Mihir Godbole et.al.	2506.17590	translate	read	null
2025-06-20	YASMOT: Yet another stereo image multi-object tracker	Ketil Malde et.al.	2506.17186	translate	read	link
2025-06-20	Class Agnostic Instance-level Descriptor for Visual Instance Search	Qi-Ying Sun et.al.	2506.16745	translate	read	null
2025-06-20	Cross-modal Offset-guided Dynamic Alignment and Fusion for Weakly Aligned UAV Object Detection	Liu Zongzhen et.al.	2506.16737	translate	read	null
2025-06-19	How Hard Is Snow? A Paired Domain Adaptation Dataset for Clear and Snowy Weather: CADC+	Mei Qi Tang et.al.	2506.16531	translate	read	null
2025-06-19	Can AI Dream of Unseen Galaxies? Conditional Diffusion Model for Galaxy Morphology Augmentation	Chenrui Ma et.al.	2506.16233	translate	read	null
2025-06-19	VideoGAN-based Trajectory Proposal for Automated Vehicles	Annajoyce Mariani et.al.	2506.16209	translate	read	null
2025-06-19	BLADE: An Automated Framework for Classifying Light Curves from the Center for Near-Earth Object Studies (CNEOS) Fireball Database	Elizabeth A. Silber et.al.	2506.16099	translate	read	null
2025-06-19	Polyline Path Masked Attention for Vision Transformer	Zhongchen Zhao et.al.	2506.15940	translate	read	link
2025-06-18	PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning	Yuhui Shi et.al.	2506.15683	translate	read	null
2025-06-18	BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion	Yuqing Lan et.al.	2506.15610	translate	read	null
2025-06-18	Retrospective Memory for Camouflaged Object Detection	Chenxi Zhang et.al.	2506.15244	translate	read	null
2025-06-18	Fiber Signal Denoising Algorithm using Hybrid Deep Learning Networks	Linlin Wang et.al.	2506.15125	translate	read	null
2025-06-19	Efficient Retail Video Annotation: A Robust Key Frame Generation Approach for Product and Customer Interaction Analysis	Varun Mannam et.al.	2506.14854	translate	read	null
2025-06-18	YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection Framework	Dahang Wan et.al.	2506.14696	translate	read	null
2025-06-17	VisText-Mosquito: A Multimodal Dataset and Benchmark for AI-Based Mosquito Breeding Site Detection and Reasoning	Md. Adnanul Islam et.al.	2506.14629	translate	read	link
2025-06-17	GAMORA: A Gesture Articulated Meta Operative Robotic Arm for Hazardous Material Handling in Containment-Level Environments	Farha Abdul Wasay et.al.	2506.14513	translate	read	null
2025-06-17	Comparison of Two Methods for Stationary Incident Detection Based on Background Image	Deepak Ghimire et.al.	2506.14256	translate	read	null
2025-06-16	A Point Cloud Completion Approach for the Grasping of Partially Occluded Objects and Its Applications in Robotic Strawberry Harvesting	Ali Abouzeid et.al.	2506.14066	translate	read	link
2025-06-16	FindMeIfYouCan: Bringing Open Set metrics to $\textit{near} $, $ \textit{far} $ and $\textit{farther}$ Out-of-Distribution Object Detection	Daniel Montoya et.al.	2506.14008	translate	read	null
2025-06-16	How Real is CARLAs Dynamic Vision Sensor? A Study on the Sim-to-Real Gap in Traffic Object Detection	Kaiyuan Tan et.al.	2506.13722	translate	read	null
2025-06-17	Lecture Video Visual Objects (LVVO) Dataset: A Benchmark for Visual Object Detection in Educational Videos	Dipayan Biswas et.al.	2506.13657	translate	read	link
2025-06-16	UAV Object Detection and Positioning in a Mining Industrial Metaverse with Custom Geo-Referenced Data	Vasiliki Balaska et.al.	2506.13505	translate	read	null
2025-06-16	Sparse Convolutional Recurrent Learning for Efficient Event-based Neuromorphic Object Detection	Shenqi Wang et.al.	2506.13440	translate	read	null
2025-06-16	Cognitive Synergy Architecture: SEGO for Human-Centric Collaborative Robots	Jaehong Oh et.al.	2506.13149	translate	read	null
2025-06-15	MGDFIS: Multi-scale Global-detail Feature Integration Strategy for Small Object Detection	Yuxiang Wang et.al.	2506.12697	translate	read	null
2025-06-14	UniDet-D: A Unified Dynamic Spectral Attention Model for Object Detection under Adverse Weathers	Yuantao Wang et.al.	2506.12324	translate	read	null
2025-06-14	MatchPlant: An Open-Source Pipeline for UAV-Based Single-Plant Detection and Data Extraction	Worasit Sangjan et.al.	2506.12295	translate	read	link
2025-06-13	Vision-based Lifting of 2D Object Detections for Automated Driving	Hendrik Königshof et.al.	2506.11839	translate	read	null
2025-06-13	Teleoperated Driving: a New Challenge for 3D Object Detection in Compressed Point Clouds	Filippo Bragato et.al.	2506.11804	translate	read	null
2025-06-13	GPLQ: A General, Practical, and Lightning QAT Method for Vision Transformers	Guang Liang et.al.	2506.11784	translate	read	null
2025-06-13	On the Natural Robustness of Vision-Language Models Against Visual Perception Attacks in Autonomous Driving	Pedram MohajerAnsari et.al.	2506.11472	translate	read	null
2025-06-12	Teaching in adverse scenes: a statistically feedback-driven threshold and mask adjustment teacher-student framework for object detection in UAV images under adverse scenes	Hongyu Chen et.al.	2506.11175	translate	read	null
2025-06-12	Discrete Lorenz Attractors in 3D Sinusoidal Maps	Sishu Shankar Muni et.al.	2506.10788	translate	read	null
2025-06-12	Uncertainty-Masked Bernoulli Diffusion for Camouflaged Object Detection Refinement	Yuqi Shen et.al.	2506.10712	translate	read	null
2025-06-12	Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection	Xinyuan Liu et.al.	2506.10601	translate	read	link
2025-06-12	Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration	Jun Wang et.al.	2506.10573	translate	read	null
2025-06-12	FSATFusion: Frequency-Spatial Attention Transformer for Infrared and Visible Image Fusion	Tianpei Zhang et.al.	2506.10366	translate	read	link
2025-06-11	DySS: Dynamic Queries and State-Space Learning for Efficient 3D Object Detection from Multi-Camera Videos	Rajeev Yasarla et.al.	2506.10242	translate	read	null
2025-06-11	CEM-FBGTinyDet: Context-Enhanced Foreground Balance with Gradient Tuning for tiny Objects	Tao Liu et.al.	2506.09897	translate	read	null
2025-06-11	3DGeoDet: General-purpose Geometry-aware Image-based 3D Object Detection	Yi Zhang et.al.	2506.09541	translate	read	null
2025-06-11	MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning	Tong Wang et.al.	2506.09327	translate	read	null
2025-06-10	Efficient Edge Deployment of Quantized YOLOv4-Tiny for Aerial Emergency Object Detection on Raspberry Pi 5	Sindhu Boddu et.al.	2506.09300	translate	read	null
2025-06-10	Lightweight Object Detection Using Quantized YOLOv4-Tiny for Emergency Response in Aerial Imagery	Sindhu Boddu et.al.	2506.09299	translate	read	null
2025-06-10	WD-DETR: Wavelet Denoising-Enhanced Real-Time Object Detection Transformer for Robot Perception with Event Cameras	Yangjie Cui et.al.	2506.09098	translate	read	null
2025-06-11	Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models	Xuanchi Ren et.al.	2506.09042	translate	read	link
2025-06-10	ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations	Amirreza Rouhi et.al.	2506.08968	translate	read	null
2025-06-10	Data Augmentation For Small Object using Fast AutoAugment	DaeEun Yoon et.al.	2506.08956	translate	read	null
2025-06-11	Gaussian2Scene: 3D Scene Representation Learning via Self-supervised Learning with 3D Gaussian Splatting	Keyi Liu et.al.	2506.08777	translate	read	null
2025-06-09	CrosswalkNet: An Optimized Deep Learning Framework for Pedestrian Crosswalk Detection in Aerial Images with High-Performance Computing	Zubin Bhuyan et.al.	2506.07885	translate	read	null
2025-06-09	SAM2Auto: Auto Annotation Using FLASH	Arash Rocky et.al.	2506.07850	translate	read	null
2025-06-09	Design and Evaluation of Deep Learning-Based Dual-Spectrum Image Fusion Methods	Beining Xu et.al.	2506.07779	translate	read	null
2025-06-09	SpikeSMOKE: Spiking Neural Networks for Monocular 3D Object Detection with Cross-Scale Gated Coding	Xuemei Chen et.al.	2506.07737	translate	read	null
2025-06-09	Domain Randomization for Object Detection in Manufacturing Applications using Synthetic Data: A Comprehensive Study	Xiaomeng Zhu et.al.	2506.07539	translate	read	null
2025-06-09	SpatialLM: Training Large Language Models for Structured Indoor Modeling	Yongsen Mao et.al.	2506.07491	translate	read	link
2025-06-09	Happiness Finder: Exploring the Role of AI in Enhancing Well-Being During Four-Leaf Clover Searches	Anna Yokokubo et.al.	2506.07393	translate	read	null
2025-06-09	Multiple Object Stitching for Unsupervised Representation Learning	Chengchao Shen et.al.	2506.07364	translate	read	link
2025-06-09	CBAM-STN-TPS-YOLO: Enhancing Agricultural Object Detection through Spatially Adaptive Attention Mechanisms	Satvik Praveen et.al.	2506.07357	translate	read	null
2025-06-08	UCOD-DPL: Unsupervised Camouflaged Object Detection via Dynamic Pseudo-label Learning	Weiqi Yan et.al.	2506.07087	translate	read	null
2025-06-06	Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection	Yu Li et.al.	2506.05872	translate	read	null
2025-06-06	Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration	Fanhu Zeng et.al.	2506.05709	translate	read	null
2025-06-06	Integer Binary-Range Alignment Neuron for Spiking Neural Networks	Binghao Ye et.al.	2506.05679	translate	read	null
2025-06-05	CL-ISR: A Contrastive Learning and Implicit Stance Reasoning Framework for Misleading Text Detection on Social Media	Tianyi Huang et.al.	2506.05107	translate	read	null
2025-06-05	Synthetic Dataset Generation for Autonomous Mobile Robots Using 3D Gaussian Splatting for Vision Training	Aneesh Deogan et.al.	2506.05092	translate	read	null
2025-06-06	Bridging Annotation Gaps: Transferring Labels to Align Object Detection Datasets	Mikhail Kennerley et.al.	2506.04737	translate	read	null
2025-06-05	Gen-n-Val: Agentic Image Data Generation and Validation	Jing-En Huang et.al.	2506.04676	translate	read	null
2025-06-05	VoxDet: Rethinking 3D Semantic Occupancy Prediction as Dense Object Detection	Wuyang Li et.al.	2506.04623	translate	read	null
2025-06-04	FALO: Fast and Accurate LiDAR 3D Object Detection on Resource-Constrained Devices	Shizhong Han et.al.	2506.04499	translate	read	null
2025-06-04	Neural Object Detection for 4D STEM: High-Throughput Sub-Pixel Electron Diffraction Pattern Recognition	Arda Genc et.al.	2506.04477	translate	read	null
2025-06-04	Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector	Boyong He et.al.	2506.04211	translate	read	link
2025-06-04	FSHNet: Fully Sparse Hybrid Network for 3D Object Detection	Shuai Liu et.al.	2506.03714	translate	read	null
2025-06-04	How PARTs assemble into wholes: Learning the relative composition of images	Melika Ayoughi et.al.	2506.03682	translate	read	null
2025-06-05	MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection	Xiaochun Lei et.al.	2506.03654	translate	read	null
2025-06-04	DiagNet: Detecting Objects using Diagonal Constraints on Adjacency Matrix of Graph Neural Network	Chong Hyun Lee et.al.	2506.03571	translate	read	null
2025-06-03	SportMamba: Adaptive Non-Linear Multi-Object Tracking with State Space Models for Team Sports	Dheeraj Khanna et.al.	2506.03335	translate	read	null
2025-06-03	Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding	Weiqing Xiao et.al.	2506.03134	translate	read	link
2025-06-03	HACo-Det: A Study Towards Fine-Grained Machine-Generated Text Detection under Human-AI Coauthoring	Zhixiong Su et.al.	2506.02959	translate	read	null
2025-06-03	Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection	Yechi Ma et.al.	2506.02914	translate	read	null
2025-06-03	A Dynamic Transformer Network for Vehicle Detection	Chunwei Tian et.al.	2506.02765	translate	read	null
2025-06-03	Open-PMC-18M: A High-Fidelity Large Scale Medical Dataset for Multimodal Representation Learning	Negin Baghbanzadeh et.al.	2506.02738	translate	read	null
2025-06-03	GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal	Shufan Qing et.al.	2506.02736	translate	read	link
2025-06-03	Sight Guide: A Wearable Assistive Perception and Navigation System for the Vision Assistance Race in the Cybathlon 2024	Patrick Pfreundschuh et.al.	2506.02676	translate	read	null
2025-06-03	Probabilistic Online Event Downsampling	Andreu Girbau-Xalabarder et.al.	2506.02547	translate	read	null
2025-06-03	Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning	Kunyu Wang et.al.	2506.02462	translate	read	null
2025-06-03	Auto-Labeling Data for Object Detection	Brent A. Griffin et.al.	2506.02359	translate	read	null

(<a href=../Object_Detection.md>back to Object Detection</a>)