Object Detection - 2025-07 | Paper Arxiv Daily

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-07-31	3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection	Yung-Hsu Yang et.al.	2507.23567	translate	read	link
2025-07-24	Protecting Vulnerable Voices: Synthetic Dataset Generation for Self-Disclosure Detection	Shalini Jangra et.al.	2507.22930	translate	read	null
2025-07-25	Bias Analysis for Synthetic Face Detection: A Case Study of the Impact of Facial Attributes	Asmae Lamsaf et.al.	2507.19705	translate	read	null
2025-07-25	Co-Win: Joint Object Detection and Instance Segmentation in LiDAR Point Clouds via Collaborative Window Processing	Haichuan Li et.al.	2507.19691	translate	read	null
2025-07-25	An OpenSource CI/CD Pipeline for Variant-Rich Software-Defined Vehicles	Matthias Weiß et.al.	2507.19446	translate	read	null
2025-07-25	EffiComm: Bandwidth Efficient Multi Agent Communication	Melih Yazgan et.al.	2507.19354	translate	read	null
2025-07-25	Multistream Network for LiDAR and Camera-based 3D Object Detection in Outdoor Scenes	Muhammad Ibrahim et.al.	2507.19304	translate	read	null
2025-07-25	Cross Spatial Temporal Fusion Attention for Remote Sensing Object Detection via Image Feature Matching	Abu Sadat Mohammad Salehin Amit et.al.	2507.19118	translate	read	null
2025-07-25	Revisiting DETR for Small Object Detection via Noise-Resilient Query Optimization	Xiaocheng Fang et.al.	2507.19059	translate	read	null
2025-07-25	YOLO for Knowledge Extraction from Vehicle Images: A Baseline Study	Saraa Al-Saddik et.al.	2507.18966	translate	read	null
2025-07-25	WiSE-OD: Benchmarking Robustness in Infrared Object Detection	Heitor R. Medeiros et.al.	2507.18925	translate	read	null
2025-07-25	Synthetic-to-Real Camouflaged Object Detection	Zhihao Luo et.al.	2507.18911	translate	read	null
2025-07-24	Towards Large Scale Geostatistical Methane Monitoring with Part-based Object Detection	Adhemar de Senneville et.al.	2507.18513	translate	read	null
2025-07-24	Human Scanpath Prediction in Target-Present Visual Search with Semantic-Foveal Bayesian Attention	João Luzio et.al.	2507.18503	translate	read	null
2025-07-24	A COCO-Formatted Instance-Level Dataset for Plasmodium Falciparum Detection in Giemsa-Stained Blood Smears	Frauke Wilm et.al.	2507.18483	translate	read	null
2025-07-24	Revisiting Physically Realizable Adversarial Object Attack against LiDAR-based Detection: Clarifying Problem Formulation and Experimental Protocols	Luo Cheng et.al.	2507.18457	translate	read	null
2025-07-24	Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction	Runmin Zhang et.al.	2507.18331	translate	read	link
2025-07-24	LMM-Det: Make Large Multimodal Models Excel in Object Detection	Jincheng Li et.al.	2507.18300	translate	read	link
2025-07-24	Evaluation of facial landmark localization performance in a surgical setting	Ines Frajtag et.al.	2507.18248	translate	read	null
2025-07-24	Real-Time Object Detection and Classification using YOLO for Edge FPGAs	Rashed Al Amin et.al.	2507.18174	translate	read	null
2025-07-24	WaveMamba: Wavelet-Driven Mamba Fusion for RGB-Infrared Object Detection	Haodong Zhu et.al.	2507.18173	translate	read	null
2025-07-24	OpenNav: Open-World Navigation with Multimodal Large Language Models	Mingfeng Yuan et.al.	2507.18033	translate	read	null
2025-07-23	Bearded Dragon Activity Recognition Pipeline: An AI-Based Approach to Behavioural Monitoring	Arsen Yermukan et.al.	2507.17987	translate	read	null
2025-07-23	FishDet-M: A Unified Large-Scale Benchmark for Robust Fish Detection and CLIP-Guided Model Selection in Diverse Aquatic Visual Domains	Muayad Abujabal et.al.	2507.17859	translate	read	null
2025-07-23	Perspective-Invariant 3D Object Detection	Ao Liang et.al.	2507.17665	translate	read	null
2025-07-23	Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning	Xinyao Liu et.al.	2507.17539	translate	read	link
2025-07-23	Illicit object detection in X-ray imaging using deep learning techniques: A comparative evaluation	Jorgen Cani et.al.	2507.17508	translate	read	link
2025-07-23	Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection	Yehao Lu et.al.	2507.17436	translate	read	null
2025-07-23	SFUOD: Source-Free Unknown Object Detection	Keon-Hee Park et.al.	2507.17373	translate	read	null
2025-07-23	Optimizing Delivery Logistics: Enhancing Speed and Safety with Drone Technology	Maharshi Shastri et.al.	2507.17253	translate	read	null
2025-07-23	A Low-Cost Machine Learning Approach for Timber Diameter Estimation	Fatemeh Hasanzadeh Fard et.al.	2507.17219	translate	read	null
2025-07-22	Few-Shot Learning in Video and 3D Object Detection: A Survey	Md Meftahul Ferdaus et.al.	2507.17079	translate	read	null
2025-07-22	Transformer Based Building Boundary Reconstruction using Attraction Field Maps	Muhammad Kamran et.al.	2507.17038	translate	read	null
2025-07-22	Divisive Decisions: Improving Salience-Based Training for Generalization in Binary Classification Tasks	Jacob Piland et.al.	2507.17000	translate	read	null
2025-07-22	Task-Specific Zero-shot Quantization-Aware Training for Object Detection	Changhao Li et.al.	2507.16782	translate	read	null
2025-07-22	Screen2AX: Vision-Based Approach for Automatic macOS Accessibility Generation	Viktor Muryn et.al.	2507.16704	translate	read	null
2025-07-22	QRetinex-Net: Quaternion-Valued Retinex Decomposition for Low-Level Computer Vision Applications	Sos Agaian et.al.	2507.16683	translate	read	null
2025-07-22	Benchmarking pig detection and tracking under diverse and challenging conditions	Jonathan Henrich et.al.	2507.16639	translate	read	null
2025-07-22	A2Mamba: Attention-augmented State Space Models for Visual Recognition	Meng Lou et.al.	2507.16624	translate	read	null
2025-07-22	PlantSAM: An Object Detection-Driven Segmentation Pipeline for Herbarium Specimens	Youcef Sklab et.al.	2507.16506	translate	read	null
2025-07-22	Towards Railway Domain Adaptation for LiDAR-based 3D Detection: Road-to-Rail and Sim-to-Real via SynDRA-BBox	Xavier Diaz et.al.	2507.16413	translate	read	null
2025-07-22	Scene Text Detection and Recognition “in light of” Challenging Environmental Conditions using Aria Glasses Egocentric Vision Cameras	Joseph De Mathia et.al.	2507.16330	translate	read	null
2025-07-22	MAN++: Scaling Momentum Auxiliary Network for Supervised Local Learning in Vision Tasks	Junhao Su et.al.	2507.16279	translate	read	null
2025-07-22	Edge-case Synthesis for Fisheye Object Detection: A Data-centric Perspective	Seunghyeon Kim et.al.	2507.16254	translate	read	null
2025-07-21	Experimenting active and sequential learning in a medieval music manuscript	Sachin Sharma et.al.	2507.15633	translate	read	null
2025-07-21	Few-Shot Object Detection via Spatial-Channel State Space Model	Zhimeng Xin et.al.	2507.15308	translate	read	null
2025-07-21	Beyond Easy Wins: A Text Hardness-Aware Benchmark for LLM-generated Text Detection	Navid Ayoobi et.al.	2507.15286	translate	read	null
2025-07-20	Event-based Graph Representation with Spatial and Motion Vectors for Asynchronous Object Detection	Aayush Atul Verma et.al.	2507.15150	translate	read	null
2025-07-20	BleedOrigin: Dynamic Bleeding Source Localization in Endoscopic Submucosal Dissection via Dual-Stage Detection and Tracking	Mengya Xu et.al.	2507.15094	translate	read	null
2025-07-20	InsightX Agent: An LMM-based Agentic Framework with Integrated Tools for Reliable X-ray NDT Analysis	Jiale Liu et.al.	2507.14899	translate	read	null
2025-07-20	An Uncertainty-aware DETR Enhancement Framework for Object Detection	Xingshu Chen et.al.	2507.14855	translate	read	null
2025-07-20	Seeing Through Deepfakes: A Human-Inspired Framework for Multi-Face Detection	Juan Hu et.al.	2507.14807	translate	read	null
2025-07-19	GCC-Spam: Spam Detection via GAN, Contrastive Learning, and Character Similarity Networks	Zixin Xu et.al.	2507.14679	translate	read	null
2025-07-19	Multispectral State-Space Feature Fusion: Bridging Shared and Cross-Parametric Interactions for Object Detection	Jifeng Shen et.al.	2507.14643	translate	read	null
2025-07-18	C-DOG: Training-Free Multi-View Multi-Object Association in Dense Scenes Without Visual Feature via Connected δ-Overlap Graphs	Yung-Hong Sun et.al.	2507.14095	translate	read	null
2025-07-18	Enhancing LiDAR Point Features with Foundation Model Priors for 3D Object Detection	Yujian Mo et.al.	2507.13899	translate	read	null
2025-07-18	Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation	Masahiro Ogawa et.al.	2507.13628	translate	read	null
2025-07-17	NSF-DOE Vera C. Rubin Observatory Observations of Interstellar Comet 3I/ATLAS (C/2025 N1)	Colin Orion Chandler et.al.	2507.13409	translate	read	null
2025-07-17	A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains	Antonio Finocchiaro et.al.	2507.13326	translate	read	null
2025-07-17	RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images	Xiaozheng Jiang et.al.	2507.13120	translate	read	null
2025-07-17	Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection	Riku Inoue et.al.	2507.13085	translate	read	null
2025-07-17	Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis	Saswat Priyadarshi Nayak et.al.	2507.13073	translate	read	null
2025-07-17	SOD-YOLO: Enhancing YOLO-Based Detection of Small Objects in UAV Imagery	Peijun Wang et.al.	2507.12727	translate	read	null
2025-07-16	Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios	Van-Hoang-Anh Phan et.al.	2507.12449	translate	read	null
2025-07-16	InterpIoU: Rethinking Bounding Box Regression with Interpolation-Based IoU Optimization	Haoyuan Liu et.al.	2507.12420	translate	read	null
2025-07-16	AutoVDC: Automated Vision Data Cleaning Using Vision-Language Models	Santosh Vasa et.al.	2507.12414	translate	read	null
2025-07-16	OD-VIRAT: A Large-Scale Benchmark for Object Detection in Realistic Surveillance Environments	Hayat Ullah et.al.	2507.12396	translate	read	null
2025-07-16	Improving Lightweight Weed Detection via Knowledge Distillation	Ahmet Oğuz Saltık et.al.	2507.12344	translate	read	null
2025-07-16	SS-DC: Spatial-Spectral Decoupling and Coupling Across Visible-Infrared Gap for Domain Adaptive Object Detection	Xiwei Zhang et.al.	2507.12017	translate	read	null
2025-07-16	Frequency-Dynamic Attention Modulation for Dense Prediction	Linwei Chen et.al.	2507.12006	translate	read	null
2025-07-15	Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping	Yujie Zhang et.al.	2507.11279	translate	read	null
2025-07-15	Using Continual Learning for Real-Time Detection of Vulnerable Road Users in Complex Traffic Scenarios	Faryal Aurooj Nasir et.al.	2507.11046	translate	read	null
2025-07-15	Combining Transformers and CNNs for Efficient Object Detection in High-Resolution Satellite Imagery	Nicolas Drapier et.al.	2507.11040	translate	read	null
2025-07-14	A Lightweight and Robust Framework for Real-Time Colorectal Polyp Detection Using LOF-Based Preprocessing and YOLO-v11n	Saadat Behzadi et.al.	2507.10864	translate	read	null
2025-07-14	LLM-Guided Agentic Object Detection for Open-World Understanding	Furkan Mumcu et.al.	2507.10844	translate	read	null
2025-07-14	Versatile and Generalizable Manipulation via Goal-Conditioned Reinforcement Learning with Grounded Object Detection	Huiyi Wang et.al.	2507.10814	translate	read	null
2025-07-14	Fine-Grained Zero-Shot Object Detection	Hongxu Ma et.al.	2507.10358	translate	read	null
2025-07-14	BlueGlass: A Framework for Composite AI Safety	Harshal Nandigramwar et.al.	2507.10106	translate	read	null
2025-07-14	SRG/ART-XC All-Sky X-ray Survey: Sensitivity Assessment Based on Aperture Photometry	N. Y. Tyrin et.al.	2507.10060	translate	read	null
2025-07-14	3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving	Yixun Zhang et.al.	2507.09993	translate	read	null
2025-07-14	Measuring the Impact of Rotation Equivariance on Aerial Object Detection	Xiuyu Wu et.al.	2507.09896	translate	read	null
2025-07-14	Secure and Efficient UAV-Based Face Detection via Homomorphic Encryption and Edge Computing	Nguyen Van Duc et.al.	2507.09860	translate	read	null
2025-07-13	MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression	Ofir Gordon et.al.	2507.09616	translate	read	null
2025-07-12	Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline	Shiyi Mu et.al.	2507.09214	translate	read	null
2025-07-12	On the Fragility of Multimodal Perception to Temporal Misalignment in Autonomous Driving	Md Hasan Shahriar et.al.	2507.09095	translate	read	null
2025-07-11	VISTA: A Visual Analytics Framework to Enhance Foundation Model-Generated Data Labels	Xiwei Xuan et.al.	2507.09008	translate	read	null
2025-07-11	RoundaboutHD: High-Resolution Real-World Urban Environment Benchmark for Multi-Camera Vehicle Tracking	Yuqiang Lin et.al.	2507.08729	translate	read	null
2025-07-11	DatasetAgent: A Novel Multi-Agent System for Auto-Constructing Datasets from Real-World Images	Haoran Sun et.al.	2507.08648	translate	read	null
2025-07-11	OnlineBEV: Recurrent Temporal Fusion in Bird’s Eye View Representations for Multi-Camera 3D Perception	Junho Koh et.al.	2507.08644	translate	read	null
2025-07-11	Smelly, dense, and spreaded: The Object Detection for Olfactory References (ODOR) dataset	Mathias Zinnen et.al.	2507.08384	translate	read	null
2025-07-11	Spectroscopic Observations of Four Candidates for Blue Large-Amplitude Pulsators. No BLAPs at High Galactic Latitudes	P. Pietrukowicz et.al.	2507.08372	translate	read	null
2025-07-11	Understanding Driving Risks using Large Language Models: Toward Elderly Driver Assessment	Yuki Yoshihara et.al.	2507.08367	translate	read	null
2025-07-10	An Embedded Real-time Object Alert System for Visually Impaired: A Monocular Depth Estimation based Approach through Computer Vision	Jareen Anjom et.al.	2507.08165	translate	read	null
2025-07-10	Rainbow Artifacts from Electromagnetic Signal Injection Attacks on Image Sensors	Youqian Zhang et.al.	2507.07773	translate	read	null
2025-07-09	Automated Video Segmentation Machine Learning Pipeline	Johannes Merz et.al.	2507.07242	translate	read	null
2025-07-09	Aerial Maritime Vessel Detection and Identification	Antonella Barisic Kulas et.al.	2507.07153	translate	read	null
2025-07-09	DenoiseCP-Net: Efficient Collective Perception in Adverse Weather via Joint LiDAR-Based 3D Object Detection and Denoising	Sven Teufel et.al.	2507.06976	translate	read	null
2025-07-09	A multi-modal dataset for insect biodiversity with imagery and DNA at the trap and individual level	Johanna Orsholm et.al.	2507.06972	translate	read	null
2025-07-09	Dataset and Benchmark for Enhancing Critical Retained Foreign Object Detection	Yuli Wang et.al.	2507.06937	translate	read	null
2025-07-09	Unlocking Thermal Aerial Imaging: Synthetic Enhancement of UAV Datasets	Antonella Barisic Kulas et.al.	2507.06797	translate	read	null
2025-07-09	LOVON: Legged Open-Vocabulary Object Navigator	Daojie Peng et.al.	2507.06747	translate	read	null
2025-07-09	EA: An Event Autoencoder for High-Speed Vision Sensing	Riadul Islam et.al.	2507.06459	translate	read	null
2025-07-08	Hierarchical Multi-Stage Transformer Architecture for Context-Aware Temporal Action Localization	Hayat Ullah et.al.	2507.06411	translate	read	null
2025-07-08	ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge	Daghash K. Alqahtani et.al.	2507.06011	translate	read	null
2025-07-08	R-VLM: Region-Aware Vision Language Model for Precise GUI Grounding	Joonhyung Park et.al.	2507.05673	translate	read	null
2025-07-07	YOLO-APD: Enhancing YOLOv8 for Robust Pedestrian Detection on Complex Road Geometries	Aquino Joctum et.al.	2507.05376	translate	read	null
2025-07-07	From a Different Star: 3I/ATLAS in the context of the Ōtautahi-Oxford interstellar object population model	Matthew J. Hopkins et.al.	2507.05318	translate	read	null
2025-07-07	Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations	Xiang Xu et.al.	2507.05260	translate	read	null
2025-07-07	AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models	Chinnappa Guggilla et.al.	2507.05157	translate	read	null
2025-07-07	LERa: Replanning with Visual Feedback in Instruction Following	Svyatoslav Pchelintsev et.al.	2507.05135	translate	read	null
2025-07-07	Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking	Maria Damanaki et.al.	2507.04762	translate	read	null
2025-07-07	CVFusion: Cross-View Fusion of 4D Radar and Camera for 3D Object Detection	Hanzhi Zhong et.al.	2507.04587	translate	read	null
2025-07-06	MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection	Hanshi Wang et.al.	2507.04369	translate	read	null
2025-07-06	DMAT: An End-to-End Framework for Joint Atmospheric Turbulence Mitigation and Object Detection	Paul Hill et.al.	2507.04323	translate	read	null
2025-07-06	ZERO: Multi-modal Prompt-based Visual Grounding	Sangbum Choi et.al.	2507.04270	translate	read	null
2025-07-05	Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge	Linshen Liu et.al.	2507.04123	translate	read	null
2025-07-04	Zero Memory Overhead Approach for Protecting Vision Transformer Parameters	Fereshteh Baradaran et.al.	2507.03816	translate	read	null
2025-07-03	Partial Weakly-Supervised Oriented Object Detection	Mingxin Liu et.al.	2507.02751	translate	read	null
2025-07-03	Automatic Labelling for Low-Light Pedestrian Detection	Dimitrios Bouzoulas et.al.	2507.02513	translate	read	null
2025-07-03	Weakly-supervised Contrastive Learning with Quantity Prompts for Moving Infrared Small Target Detection	Weiwei Duan et.al.	2507.02454	translate	read	null
2025-07-03	A Late Collaborative Perception Framework for 3D Multi-Object and Multi-Source Association and Fusion	Maryem Fadili et.al.	2507.02430	translate	read	null
2025-07-03	PLOT: Pseudo-Labeling via Video Object Tracking for Scalable Monocular 3D Object Detection	Seokyeong Lee et.al.	2507.02393	translate	read	null
2025-07-03	Two-Steps Neural Networks for an Automated Cerebrovascular Landmark Detection	Rafic Nader et.al.	2507.02349	translate	read	null
2025-07-03	Perception Activator: An intuitive and portable framework for brain cognitive exploration	Le Xu et.al.	2507.02311	translate	read	null
2025-07-03	Understanding Trade offs When Conditioning Synthetic Data	Brandon Trabucco et.al.	2507.02217	translate	read	null
2025-07-02	How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks	Rahul Ramachandran et.al.	2507.01955	translate	read	link
2025-07-02	Survivability of Backdoor Attacks on Unconstrained Face Recognition Systems	Quentin Le Roux et.al.	2507.01607	translate	read	null
2025-07-02	Learning from Random Subspace Exploration: Generalized Test-Time Augmentation with Self-supervised Distillation	Andrei Jelea et.al.	2507.01347	translate	read	null
2025-07-01	Rapid Salient Object Detection with Difference Convolutional Neural Networks	Zhuo Su et.al.	2507.01182	translate	read	null
2025-07-01	Robust Component Detection for Flexible Manufacturing: A Deep Learning Approach to Tray-Free Object Recognition under Variable Lighting	Fatemeh Sadat Daneshmand et.al.	2507.00852	translate	read	null
2025-07-01	UAVD-Mamba: Deformable Token Fusion Vision Mamba for Multimodal UAV Detection	Wei Li et.al.	2507.00849	translate	read	null
2025-07-01	High-Frequency Semantics and Geometric Priors for End-to-End Detection Transformers in Challenging UAV Imagery	Hongxing Peng et.al.	2507.00825	translate	read	null
2025-07-01	Multi-Modal Graph Convolutional Network with Sinusoidal Encoding for Robust Human Action Segmentation	Hao Xing et.al.	2507.00752	translate	read	null
2025-07-01	UPRE: Zero-Shot Domain Adaptation for Object Detection via Unified Prompt and Representation Enhancement	Xiao Zhang et.al.	2507.00721	translate	read	null
2025-07-01	Rectifying Magnitude Neglect in Linear Attention	Qihang Fan et.al.	2507.00698	translate	read	link
