Object Detection - 2025-07

Publish Date Title Authors PDF Translate Read Code
2025-07-31 3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection Yung-Hsu Yang et.al. 2507.23567 translate read link
2025-07-24 Protecting Vulnerable Voices: Synthetic Dataset Generation for Self-Disclosure Detection Shalini Jangra et.al. 2507.22930 translate read null
2025-07-25 Bias Analysis for Synthetic Face Detection: A Case Study of the Impact of Facial Attributes Asmae Lamsaf et.al. 2507.19705 translate read null
2025-07-25 Co-Win: Joint Object Detection and Instance Segmentation in LiDAR Point Clouds via Collaborative Window Processing Haichuan Li et.al. 2507.19691 translate read null
2025-07-25 An OpenSource CI/CD Pipeline for Variant-Rich Software-Defined Vehicles Matthias Weiß et.al. 2507.19446 translate read null
2025-07-25 EffiComm: Bandwidth Efficient Multi Agent Communication Melih Yazgan et.al. 2507.19354 translate read null
2025-07-25 Multistream Network for LiDAR and Camera-based 3D Object Detection in Outdoor Scenes Muhammad Ibrahim et.al. 2507.19304 translate read null
2025-07-25 Cross Spatial Temporal Fusion Attention for Remote Sensing Object Detection via Image Feature Matching Abu Sadat Mohammad Salehin Amit et.al. 2507.19118 translate read null
2025-07-25 Revisiting DETR for Small Object Detection via Noise-Resilient Query Optimization Xiaocheng Fang et.al. 2507.19059 translate read null
2025-07-25 YOLO for Knowledge Extraction from Vehicle Images: A Baseline Study Saraa Al-Saddik et.al. 2507.18966 translate read null
2025-07-25 WiSE-OD: Benchmarking Robustness in Infrared Object Detection Heitor R. Medeiros et.al. 2507.18925 translate read null
2025-07-25 Synthetic-to-Real Camouflaged Object Detection Zhihao Luo et.al. 2507.18911 translate read null
2025-07-24 Towards Large Scale Geostatistical Methane Monitoring with Part-based Object Detection Adhemar de Senneville et.al. 2507.18513 translate read null
2025-07-24 Human Scanpath Prediction in Target-Present Visual Search with Semantic-Foveal Bayesian Attention João Luzio et.al. 2507.18503 translate read null
2025-07-24 A COCO-Formatted Instance-Level Dataset for Plasmodium Falciparum Detection in Giemsa-Stained Blood Smears Frauke Wilm et.al. 2507.18483 translate read null
2025-07-24 Revisiting Physically Realizable Adversarial Object Attack against LiDAR-based Detection: Clarifying Problem Formulation and Experimental Protocols Luo Cheng et.al. 2507.18457 translate read null
2025-07-24 Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction Runmin Zhang et.al. 2507.18331 translate read link
2025-07-24 LMM-Det: Make Large Multimodal Models Excel in Object Detection Jincheng Li et.al. 2507.18300 translate read link
2025-07-24 Evaluation of facial landmark localization performance in a surgical setting Ines Frajtag et.al. 2507.18248 translate read null
2025-07-24 Real-Time Object Detection and Classification using YOLO for Edge FPGAs Rashed Al Amin et.al. 2507.18174 translate read null
2025-07-24 WaveMamba: Wavelet-Driven Mamba Fusion for RGB-Infrared Object Detection Haodong Zhu et.al. 2507.18173 translate read null
2025-07-24 OpenNav: Open-World Navigation with Multimodal Large Language Models Mingfeng Yuan et.al. 2507.18033 translate read null
2025-07-23 Bearded Dragon Activity Recognition Pipeline: An AI-Based Approach to Behavioural Monitoring Arsen Yermukan et.al. 2507.17987 translate read null
2025-07-23 FishDet-M: A Unified Large-Scale Benchmark for Robust Fish Detection and CLIP-Guided Model Selection in Diverse Aquatic Visual Domains Muayad Abujabal et.al. 2507.17859 translate read null
2025-07-23 Perspective-Invariant 3D Object Detection Ao Liang et.al. 2507.17665 translate read null
2025-07-23 Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning Xinyao Liu et.al. 2507.17539 translate read link
2025-07-23 Illicit object detection in X-ray imaging using deep learning techniques: A comparative evaluation Jorgen Cani et.al. 2507.17508 translate read link
2025-07-23 Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection Yehao Lu et.al. 2507.17436 translate read null
2025-07-23 SFUOD: Source-Free Unknown Object Detection Keon-Hee Park et.al. 2507.17373 translate read null
2025-07-23 Optimizing Delivery Logistics: Enhancing Speed and Safety with Drone Technology Maharshi Shastri et.al. 2507.17253 translate read null
2025-07-23 A Low-Cost Machine Learning Approach for Timber Diameter Estimation Fatemeh Hasanzadeh Fard et.al. 2507.17219 translate read null
2025-07-22 Few-Shot Learning in Video and 3D Object Detection: A Survey Md Meftahul Ferdaus et.al. 2507.17079 translate read null
2025-07-22 Transformer Based Building Boundary Reconstruction using Attraction Field Maps Muhammad Kamran et.al. 2507.17038 translate read null
2025-07-22 Divisive Decisions: Improving Salience-Based Training for Generalization in Binary Classification Tasks Jacob Piland et.al. 2507.17000 translate read null
2025-07-22 Task-Specific Zero-shot Quantization-Aware Training for Object Detection Changhao Li et.al. 2507.16782 translate read null
2025-07-22 Screen2AX: Vision-Based Approach for Automatic macOS Accessibility Generation Viktor Muryn et.al. 2507.16704 translate read null
2025-07-22 QRetinex-Net: Quaternion-Valued Retinex Decomposition for Low-Level Computer Vision Applications Sos Agaian et.al. 2507.16683 translate read null
2025-07-22 Benchmarking pig detection and tracking under diverse and challenging conditions Jonathan Henrich et.al. 2507.16639 translate read null
2025-07-22 A2Mamba: Attention-augmented State Space Models for Visual Recognition Meng Lou et.al. 2507.16624 translate read null
2025-07-22 PlantSAM: An Object Detection-Driven Segmentation Pipeline for Herbarium Specimens Youcef Sklab et.al. 2507.16506 translate read null
2025-07-22 Towards Railway Domain Adaptation for LiDAR-based 3D Detection: Road-to-Rail and Sim-to-Real via SynDRA-BBox Xavier Diaz et.al. 2507.16413 translate read null
2025-07-22 Scene Text Detection and Recognition “in light of” Challenging Environmental Conditions using Aria Glasses Egocentric Vision Cameras Joseph De Mathia et.al. 2507.16330 translate read null
2025-07-22 MAN++: Scaling Momentum Auxiliary Network for Supervised Local Learning in Vision Tasks Junhao Su et.al. 2507.16279 translate read null
2025-07-22 Edge-case Synthesis for Fisheye Object Detection: A Data-centric Perspective Seunghyeon Kim et.al. 2507.16254 translate read null
2025-07-21 Experimenting active and sequential learning in a medieval music manuscript Sachin Sharma et.al. 2507.15633 translate read null
2025-07-21 Few-Shot Object Detection via Spatial-Channel State Space Model Zhimeng Xin et.al. 2507.15308 translate read null
2025-07-21 Beyond Easy Wins: A Text Hardness-Aware Benchmark for LLM-generated Text Detection Navid Ayoobi et.al. 2507.15286 translate read null
2025-07-20 Event-based Graph Representation with Spatial and Motion Vectors for Asynchronous Object Detection Aayush Atul Verma et.al. 2507.15150 translate read null
2025-07-20 BleedOrigin: Dynamic Bleeding Source Localization in Endoscopic Submucosal Dissection via Dual-Stage Detection and Tracking Mengya Xu et.al. 2507.15094 translate read null
2025-07-20 InsightX Agent: An LMM-based Agentic Framework with Integrated Tools for Reliable X-ray NDT Analysis Jiale Liu et.al. 2507.14899 translate read null
2025-07-20 An Uncertainty-aware DETR Enhancement Framework for Object Detection Xingshu Chen et.al. 2507.14855 translate read null
2025-07-20 Seeing Through Deepfakes: A Human-Inspired Framework for Multi-Face Detection Juan Hu et.al. 2507.14807 translate read null
2025-07-19 GCC-Spam: Spam Detection via GAN, Contrastive Learning, and Character Similarity Networks Zixin Xu et.al. 2507.14679 translate read null
2025-07-19 Multispectral State-Space Feature Fusion: Bridging Shared and Cross-Parametric Interactions for Object Detection Jifeng Shen et.al. 2507.14643 translate read null
2025-07-18 C-DOG: Training-Free Multi-View Multi-Object Association in Dense Scenes Without Visual Feature via Connected δ-Overlap Graphs Yung-Hong Sun et.al. 2507.14095 translate read null
2025-07-18 Enhancing LiDAR Point Features with Foundation Model Priors for 3D Object Detection Yujian Mo et.al. 2507.13899 translate read null
2025-07-18 Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation Masahiro Ogawa et.al. 2507.13628 translate read null
2025-07-17 NSF-DOE Vera C. Rubin Observatory Observations of Interstellar Comet 3I/ATLAS (C/2025 N1) Colin Orion Chandler et.al. 2507.13409 translate read null
2025-07-17 A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains Antonio Finocchiaro et.al. 2507.13326 translate read null
2025-07-17 RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images Xiaozheng Jiang et.al. 2507.13120 translate read null
2025-07-17 Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection Riku Inoue et.al. 2507.13085 translate read null
2025-07-17 Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis Saswat Priyadarshi Nayak et.al. 2507.13073 translate read null
2025-07-17 SOD-YOLO: Enhancing YOLO-Based Detection of Small Objects in UAV Imagery Peijun Wang et.al. 2507.12727 translate read null
2025-07-16 Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios Van-Hoang-Anh Phan et.al. 2507.12449 translate read null
2025-07-16 InterpIoU: Rethinking Bounding Box Regression with Interpolation-Based IoU Optimization Haoyuan Liu et.al. 2507.12420 translate read null
2025-07-16 AutoVDC: Automated Vision Data Cleaning Using Vision-Language Models Santosh Vasa et.al. 2507.12414 translate read null
2025-07-16 OD-VIRAT: A Large-Scale Benchmark for Object Detection in Realistic Surveillance Environments Hayat Ullah et.al. 2507.12396 translate read null
2025-07-16 Improving Lightweight Weed Detection via Knowledge Distillation Ahmet Oğuz Saltık et.al. 2507.12344 translate read null
2025-07-16 SS-DC: Spatial-Spectral Decoupling and Coupling Across Visible-Infrared Gap for Domain Adaptive Object Detection Xiwei Zhang et.al. 2507.12017 translate read null
2025-07-16 Frequency-Dynamic Attention Modulation for Dense Prediction Linwei Chen et.al. 2507.12006 translate read null
2025-07-15 Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping Yujie Zhang et.al. 2507.11279 translate read null
2025-07-15 Using Continual Learning for Real-Time Detection of Vulnerable Road Users in Complex Traffic Scenarios Faryal Aurooj Nasir et.al. 2507.11046 translate read null
2025-07-15 Combining Transformers and CNNs for Efficient Object Detection in High-Resolution Satellite Imagery Nicolas Drapier et.al. 2507.11040 translate read null
2025-07-14 A Lightweight and Robust Framework for Real-Time Colorectal Polyp Detection Using LOF-Based Preprocessing and YOLO-v11n Saadat Behzadi et.al. 2507.10864 translate read null
2025-07-14 LLM-Guided Agentic Object Detection for Open-World Understanding Furkan Mumcu et.al. 2507.10844 translate read null
2025-07-14 Versatile and Generalizable Manipulation via Goal-Conditioned Reinforcement Learning with Grounded Object Detection Huiyi Wang et.al. 2507.10814 translate read null
2025-07-14 Fine-Grained Zero-Shot Object Detection Hongxu Ma et.al. 2507.10358 translate read null
2025-07-14 BlueGlass: A Framework for Composite AI Safety Harshal Nandigramwar et.al. 2507.10106 translate read null
2025-07-14 SRG/ART-XC All-Sky X-ray Survey: Sensitivity Assessment Based on Aperture Photometry N. Y. Tyrin et.al. 2507.10060 translate read null
2025-07-14 3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving Yixun Zhang et.al. 2507.09993 translate read null
2025-07-14 Measuring the Impact of Rotation Equivariance on Aerial Object Detection Xiuyu Wu et.al. 2507.09896 translate read null
2025-07-14 Secure and Efficient UAV-Based Face Detection via Homomorphic Encryption and Edge Computing Nguyen Van Duc et.al. 2507.09860 translate read null
2025-07-13 MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression Ofir Gordon et.al. 2507.09616 translate read null
2025-07-12 Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline Shiyi Mu et.al. 2507.09214 translate read null
2025-07-12 On the Fragility of Multimodal Perception to Temporal Misalignment in Autonomous Driving Md Hasan Shahriar et.al. 2507.09095 translate read null
2025-07-11 VISTA: A Visual Analytics Framework to Enhance Foundation Model-Generated Data Labels Xiwei Xuan et.al. 2507.09008 translate read null
2025-07-11 RoundaboutHD: High-Resolution Real-World Urban Environment Benchmark for Multi-Camera Vehicle Tracking Yuqiang Lin et.al. 2507.08729 translate read null
2025-07-11 DatasetAgent: A Novel Multi-Agent System for Auto-Constructing Datasets from Real-World Images Haoran Sun et.al. 2507.08648 translate read null
2025-07-11 OnlineBEV: Recurrent Temporal Fusion in Bird’s Eye View Representations for Multi-Camera 3D Perception Junho Koh et.al. 2507.08644 translate read null
2025-07-11 Smelly, dense, and spreaded: The Object Detection for Olfactory References (ODOR) dataset Mathias Zinnen et.al. 2507.08384 translate read null
2025-07-11 Spectroscopic Observations of Four Candidates for Blue Large-Amplitude Pulsators. No BLAPs at High Galactic Latitudes P. Pietrukowicz et.al. 2507.08372 translate read null
2025-07-11 Understanding Driving Risks using Large Language Models: Toward Elderly Driver Assessment Yuki Yoshihara et.al. 2507.08367 translate read null
2025-07-10 An Embedded Real-time Object Alert System for Visually Impaired: A Monocular Depth Estimation based Approach through Computer Vision Jareen Anjom et.al. 2507.08165 translate read null
2025-07-10 Rainbow Artifacts from Electromagnetic Signal Injection Attacks on Image Sensors Youqian Zhang et.al. 2507.07773 translate read null
2025-07-09 Automated Video Segmentation Machine Learning Pipeline Johannes Merz et.al. 2507.07242 translate read null
2025-07-09 Aerial Maritime Vessel Detection and Identification Antonella Barisic Kulas et.al. 2507.07153 translate read null
2025-07-09 DenoiseCP-Net: Efficient Collective Perception in Adverse Weather via Joint LiDAR-Based 3D Object Detection and Denoising Sven Teufel et.al. 2507.06976 translate read null
2025-07-09 A multi-modal dataset for insect biodiversity with imagery and DNA at the trap and individual level Johanna Orsholm et.al. 2507.06972 translate read null
2025-07-09 Dataset and Benchmark for Enhancing Critical Retained Foreign Object Detection Yuli Wang et.al. 2507.06937 translate read null
2025-07-09 Unlocking Thermal Aerial Imaging: Synthetic Enhancement of UAV Datasets Antonella Barisic Kulas et.al. 2507.06797 translate read null
2025-07-09 LOVON: Legged Open-Vocabulary Object Navigator Daojie Peng et.al. 2507.06747 translate read null
2025-07-09 EA: An Event Autoencoder for High-Speed Vision Sensing Riadul Islam et.al. 2507.06459 translate read null
2025-07-08 Hierarchical Multi-Stage Transformer Architecture for Context-Aware Temporal Action Localization Hayat Ullah et.al. 2507.06411 translate read null
2025-07-08 ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge Daghash K. Alqahtani et.al. 2507.06011 translate read null
2025-07-08 R-VLM: Region-Aware Vision Language Model for Precise GUI Grounding Joonhyung Park et.al. 2507.05673 translate read null
2025-07-07 YOLO-APD: Enhancing YOLOv8 for Robust Pedestrian Detection on Complex Road Geometries Aquino Joctum et.al. 2507.05376 translate read null
2025-07-07 From a Different Star: 3I/ATLAS in the context of the Ōtautahi-Oxford interstellar object population model Matthew J. Hopkins et.al. 2507.05318 translate read null
2025-07-07 Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations Xiang Xu et.al. 2507.05260 translate read null
2025-07-07 AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models Chinnappa Guggilla et.al. 2507.05157 translate read null
2025-07-07 LERa: Replanning with Visual Feedback in Instruction Following Svyatoslav Pchelintsev et.al. 2507.05135 translate read null
2025-07-07 Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking Maria Damanaki et.al. 2507.04762 translate read null
2025-07-07 CVFusion: Cross-View Fusion of 4D Radar and Camera for 3D Object Detection Hanzhi Zhong et.al. 2507.04587 translate read null
2025-07-06 MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection Hanshi Wang et.al. 2507.04369 translate read null
2025-07-06 DMAT: An End-to-End Framework for Joint Atmospheric Turbulence Mitigation and Object Detection Paul Hill et.al. 2507.04323 translate read null
2025-07-06 ZERO: Multi-modal Prompt-based Visual Grounding Sangbum Choi et.al. 2507.04270 translate read null
2025-07-05 Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge Linshen Liu et.al. 2507.04123 translate read null
2025-07-04 Zero Memory Overhead Approach for Protecting Vision Transformer Parameters Fereshteh Baradaran et.al. 2507.03816 translate read null
2025-07-03 Partial Weakly-Supervised Oriented Object Detection Mingxin Liu et.al. 2507.02751 translate read null
2025-07-03 Automatic Labelling for Low-Light Pedestrian Detection Dimitrios Bouzoulas et.al. 2507.02513 translate read null
2025-07-03 Weakly-supervised Contrastive Learning with Quantity Prompts for Moving Infrared Small Target Detection Weiwei Duan et.al. 2507.02454 translate read null
2025-07-03 A Late Collaborative Perception Framework for 3D Multi-Object and Multi-Source Association and Fusion Maryem Fadili et.al. 2507.02430 translate read null
2025-07-03 PLOT: Pseudo-Labeling via Video Object Tracking for Scalable Monocular 3D Object Detection Seokyeong Lee et.al. 2507.02393 translate read null
2025-07-03 Two-Steps Neural Networks for an Automated Cerebrovascular Landmark Detection Rafic Nader et.al. 2507.02349 translate read null
2025-07-03 Perception Activator: An intuitive and portable framework for brain cognitive exploration Le Xu et.al. 2507.02311 translate read null
2025-07-03 Understanding Trade offs When Conditioning Synthetic Data Brandon Trabucco et.al. 2507.02217 translate read null
2025-07-02 How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks Rahul Ramachandran et.al. 2507.01955 translate read link
2025-07-02 Survivability of Backdoor Attacks on Unconstrained Face Recognition Systems Quentin Le Roux et.al. 2507.01607 translate read null
2025-07-02 Learning from Random Subspace Exploration: Generalized Test-Time Augmentation with Self-supervised Distillation Andrei Jelea et.al. 2507.01347 translate read null
2025-07-01 Rapid Salient Object Detection with Difference Convolutional Neural Networks Zhuo Su et.al. 2507.01182 translate read null
2025-07-01 Robust Component Detection for Flexible Manufacturing: A Deep Learning Approach to Tray-Free Object Recognition under Variable Lighting Fatemeh Sadat Daneshmand et.al. 2507.00852 translate read null
2025-07-01 UAVD-Mamba: Deformable Token Fusion Vision Mamba for Multimodal UAV Detection Wei Li et.al. 2507.00849 translate read null
2025-07-01 High-Frequency Semantics and Geometric Priors for End-to-End Detection Transformers in Challenging UAV Imagery Hongxing Peng et.al. 2507.00825 translate read null
2025-07-01 Multi-Modal Graph Convolutional Network with Sinusoidal Encoding for Robust Human Action Segmentation Hao Xing et.al. 2507.00752 translate read null
2025-07-01 UPRE: Zero-Shot Domain Adaptation for Object Detection via Unified Prompt and Representation Enhancement Xiao Zhang et.al. 2507.00721 translate read null
2025-07-01 Rectifying Magnitude Neglect in Linear Attention Qihang Fan et.al. 2507.00698 translate read link

(<a href=../Object_Detection.md>back to Object Detection</a>)