Object Detection - 2025-10
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2025-10-28 | Which LiDAR scanning pattern is better for roadside perception: Repetitive or Non-repetitive? | Zhiqi Qi et.al. | 2511.00060 | translate | read | null |
| 2025-10-31 | Gaussian Combined Distance: A Generic Metric for Object Detection | Ziqian Guan et.al. | 2510.27649 | translate | read | null |
| 2025-10-31 | Parameterized Prompt for Incremental Object Detection | Zijia An et.al. | 2510.27316 | translate | read | null |
| 2025-10-31 | C-LEAD: Contrastive Learning for Enhanced Adversarial Defense | Suklav Ghosh et.al. | 2510.27249 | translate | read | null |
| 2025-10-31 | M^3Detection: Multi-Frame Multi-Level Feature Fusion for Multi-Modal 3D Object Detection with Camera and 4D Imaging Radar | Xiaozhi Li et.al. | 2510.27166 | translate | read | null |
| 2025-10-31 | Generating Accurate and Detailed Captions for High-Resolution Images | Hankyeol Lee et.al. | 2510.27164 | translate | read | null |
| 2025-10-31 | MLPerf Automotive | Radoyeh Shojaei et.al. | 2510.27065 | translate | read | null |
| 2025-10-30 | Using Salient Object Detection to Identify Manipulative Cookie Banners that Circumvent GDPR | Riley Grossman et.al. | 2510.26967 | translate | read | null |
| 2025-10-30 | Improving Classification of Occluded Objects through Scene Context | Courtney M. King et.al. | 2510.26681 | translate | read | null |
| 2025-10-30 | All You Need for Object Detection: From Pixels, Points, and Prompts to Next-Gen Fusion and Multimodal LLMs/VLMs in Autonomous Vehicles | Sayed Pedram Haeri Boroujeni et.al. | 2510.26641 | translate | read | null |
| 2025-10-30 | PT-DETR: Small Target Detection Based on Partially-Aware Detail Focus | Bingcong Huo et.al. | 2510.26630 | translate | read | null |
| 2025-10-30 | Spiking Patches: Asynchronous, Sparse, and Efficient Tokens for Event Cameras | Christoffer Koo Øhrstrøm et.al. | 2510.26614 | translate | read | null |
| 2025-10-30 | Detecting Unauthorized Vehicles using Deep Learning for Smart Cities: A Case Study on Bangladesh | Sudipto Das Sukanto et.al. | 2510.26154 | translate | read | null |
| 2025-10-29 | Enhancing Underwater Object Detection through Spatio-Temporal Analysis and Spatial Attention Networks | Sai Likhith Karri et.al. | 2510.25797 | translate | read | null |
| 2025-10-29 | Prototype-Driven Adaptation for Few-Shot Object Detection | Yushen Huang et.al. | 2510.25318 | translate | read | null |
| 2025-10-29 | GaTector+: A Unified Head-free Framework for Gaze Object and Gaze Following Prediction | Yang Jin et.al. | 2510.25301 | translate | read | null |
| 2025-10-29 | RT-DETRv4: Painlessly Furthering Real-Time Object Detection with Vision Foundation Models | Zijun Liao et.al. | 2510.25257 | translate | read | null |
| 2025-10-29 | Test-Time Adaptive Object Detection with Foundation Model | Yingjie Gao et.al. | 2510.25175 | translate | read | null |
| 2025-10-29 | DINO-YOLO: Self-Supervised Pre-training for Data-Efficient Object Detection in Civil Engineering Applications | Malaisree P et.al. | 2510.25140 | translate | read | null |
| 2025-10-28 | Pixels to Signals: A Real-Time Framework for Traffic Demand Estimation | H Mhatre et.al. | 2510.24902 | translate | read | null |
| 2025-10-28 | MIC-BEV: Multi-Infrastructure Camera Bird’s-Eye-View Transformer with Relation-Aware Fusion for 3D Object Detection | Yun Zhang et.al. | 2510.24688 | translate | read | null |
| 2025-10-28 | A Critical Study towards the Detection of Parkinsons Disease using ML Technologies | Vivek Chetia et.al. | 2510.24456 | translate | read | null |
| 2025-10-28 | Delving into Cascaded Instability: A Lipschitz Continuity View on Image Restoration and Object Detection Synergy | Qing Zhao et.al. | 2510.24232 | translate | read | null |
| 2025-10-28 | Mars-Bench: A Benchmark for Evaluating Foundation Models for Mars Science Tasks | Mirali Purohit et.al. | 2510.24010 | translate | read | null |
| 2025-10-27 | A U-Net and Transformer Pipeline for Multilingual Image Translation | Siddharth Sahay et.al. | 2510.23554 | translate | read | null |
| 2025-10-27 | FRBNet: Revisiting Low-Light Vision through Frequency-Domain Radial Basis Network | Fangtong Sun et.al. | 2510.23444 | translate | read | null |
| 2025-10-27 | One-Timestep is Enough: Achieving High-performance ANN-to-SNN Conversion via Scale-and-Fire Neurons | Qiuyang Chen et.al. | 2510.23383 | translate | read | null |
| 2025-10-27 | Spoofing resilience for simple-detection quantum illumination LIDAR | Richard J. Murchie et.al. | 2510.23228 | translate | read | null |
| 2025-10-27 | AG-Fusion: adaptive gated multimodal fusion for 3d object detection in complex scenes | Sixian Liu et.al. | 2510.23151 | translate | read | null |
| 2025-10-27 | DQ3D: Depth-guided Query for Transformer-Based 3D Object Detection in Traffic Scenarios | Ziyu Wang et.al. | 2510.23144 | translate | read | null |
| 2025-10-27 | M$^{3}$T2IBench: A Large-Scale Multi-Category, Multi-Instance, Multi-Relation Text-to-Image Benchmark | Huixuan Zhang et.al. | 2510.23020 | translate | read | null |
| 2025-10-26 | A Comprehensive Dataset for Human vs. AI Generated Text Detection | Rajarshi Roy et.al. | 2510.22874 | translate | read | null |
| 2025-10-26 | A Critical Study on Tea Leaf Disease Detection using Deep Learning Techniques | Nabajyoti Borah et.al. | 2510.22647 | translate | read | null |
| 2025-10-25 | 3D Roadway Scene Object Detection with LIDARs in Snowfall Conditions | Ghazal Farhani et.al. | 2510.22436 | translate | read | null |
| 2025-10-25 | TrajGATFormer: A Graph-Based Transformer Approach for Worker and Obstacle Trajectory Prediction in Off-site Construction Environments | Mohammed Alduais et.al. | 2510.22205 | translate | read | null |
| 2025-10-21 | Comparative Analysis of Object Detection Algorithms for Surface Defect Detection | Arpan Maity et.al. | 2510.21811 | translate | read | null |
| 2025-10-24 | On Thin Ice: Towards Explainable Conservation Monitoring via Attribution and Perturbations | Jiayi Zhou et.al. | 2510.21689 | translate | read | null |
| 2025-10-24 | S3OD: Towards Generalizable Salient Object Detection with Synthetic Data | Orest Kupyn et.al. | 2510.21605 | translate | read | null |
| 2025-10-24 | Scalpel: Automotive Deep Learning Framework Testing via Assembling Model Components | Yinglong Zou et.al. | 2510.21451 | translate | read | null |
| 2025-10-24 | Unveiling the Spatial-temporal Effective Receptive Fields of Spiking Neural Networks | Jieyuan Zhang et.al. | 2510.21403 | translate | read | null |
| 2025-10-24 | WhaleVAD-BPN: Improving Baleen Whale Call Detection with Boundary Proposal Networks and Post-processing Optimisation | Christiaan M. Geldenhuys et.al. | 2510.21280 | translate | read | null |
| 2025-10-23 | BioDet: Boosting Industrial Object Detection with Image Preprocessing Strategies | Jiaqi Hu et.al. | 2510.21000 | translate | read | null |
| 2025-10-23 | BUSTED at AraGenEval Shared Task: A Comparative Study of Transformer-Based Models for Arabic AI-Generated Text Detection | Ali Zain et.al. | 2510.20610 | translate | read | null |
| 2025-10-23 | Synthetic Data for Robust Runway Detection | Estelle Chigot et.al. | 2510.20349 | translate | read | null |
| 2025-10-23 | Physics-Guided Fusion for Robust 3D Tracking of Fast Moving Small Objects | Prithvi Raj Singh et.al. | 2510.20126 | translate | read | null |
| 2025-10-22 | A Unified Detection Pipeline for Robust Object Detection in Fisheye-Based Traffic Surveillance | Neema Jakisa Owor et.al. | 2510.20016 | translate | read | null |
| 2025-10-22 | Can You Trust What You See? Alpha Channel No-Box Attacks on Video Object Detection | Ariana Yi et.al. | 2510.19574 | translate | read | null |
| 2025-10-22 | Machine Text Detectors are Membership Inference Attacks | Ryuto Koike et.al. | 2510.19492 | translate | read | link |
| 2025-10-22 | Towards Single-Source Domain Generalized Object Detection via Causal Visual Prompts | Chen Li et.al. | 2510.19487 | translate | read | null |
| 2025-10-22 | Space Object Detection using Multi-frame Temporal Trajectory Completion Method | Xiaoqing Lan et.al. | 2510.19220 | translate | read | null |
| 2025-10-22 | SFGFusion: Surface Fitting Guided 3D Object Detection with 4D Radar and Camera Fusion | Xiaozhi Li et.al. | 2510.19215 | translate | read | null |
| 2025-10-21 | Kinematic Analysis and Integration of Vision Algorithms for a Mobile Manipulator Employed Inside a Self-Driving Laboratory | Shifa Sulaiman et.al. | 2510.19081 | translate | read | null |
| 2025-10-21 | GBlobs: Local LiDAR Geometry for Improved Sensor Placement Generalization | Dušan Malić et.al. | 2510.18539 | translate | read | null |
| 2025-10-21 | DWaste: Greener AI for Waste Sorting using Mobile and Edge Devices | Suman Kunwar et.al. | 2510.18513 | translate | read | null |
| 2025-10-21 | Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection | Ji Du et.al. | 2510.18437 | translate | read | null |
| 2025-10-21 | ScaleNet: Scaling up Pretrained Neural Networks with Incremental Parameters | Zhiwei Hao et.al. | 2510.18431 | translate | read | null |
| 2025-10-21 | Beyond Frequency: Scoring-Driven Debiasing for Object Detection via Blueprint-Prompted Image Synthesis | Xinhao Cai et.al. | 2510.18229 | translate | read | null |
| 2025-10-20 | Accelerating Vision Transformers with Adaptive Patch Sizes | Rohan Choudhury et.al. | 2510.18091 | translate | read | link |
| 2025-10-20 | Big Data, Tiny Targets: An Exploratory Study in Machine Learning-enhanced Detection of Microplastic from Filters | Paul-Tiberiu Miclea et.al. | 2510.18089 | translate | read | null |
| 2025-10-15 | MUSE: Model-based Uncertainty-aware Similarity Estimation for zero-shot 2D Object Detection and Segmentation | Sungmin Cho et.al. | 2510.17866 | translate | read | null |
| 2025-10-20 | Towards 3D Objectness Learning in an Open World | Taichi Liu et.al. | 2510.17686 | translate | read | null |
| 2025-10-20 | On-the-Fly OVD Adaptation with FLAME: Few-shot Localization via Active Marginal-Samples Exploration | Yehonathan Refael et.al. | 2510.17670 | translate | read | null |
| 2025-10-20 | DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning | Yongxin He et.al. | 2510.17489 | translate | read | link |
| 2025-10-20 | Split-Fuse-Transport: Annotation-Free Saliency via Dual Clustering and Optimal Transport Alignment | Muhammad Umer Ramzan et.al. | 2510.17484 | translate | read | null |
| 2025-10-20 | Monitoring Horses in Stalls: From Object to Event Detection | Dmitrii Galimzianov et.al. | 2510.17409 | translate | read | null |
| 2025-10-20 | Machine Vision-Based Surgical Lighting System: Design and Implementation | Amir Gharghabi et.al. | 2510.17287 | translate | read | null |
| 2025-10-20 | Investigating Adversarial Robustness against Preprocessing used in Blackbox Face Recognition | Roland Croft et.al. | 2510.17169 | translate | read | null |
| 2025-10-20 | Towards a Generalizable Fusion Architecture for Multimodal Object Detection | Jad Berjawi et.al. | 2510.17078 | translate | read | null |
| 2025-10-19 | ArmFormer: Lightweight Transformer Architecture for Real-Time Multi-Class Weapon Segmentation and Classification | Akhila Kambhatla et.al. | 2510.16854 | translate | read | null |
| 2025-10-18 | Towards Intelligent Traffic Signaling in Dhaka City Based on Vehicle Detection and Congestion Optimization | Kazi Ababil Azam et.al. | 2510.16622 | translate | read | null |
| 2025-10-18 | AI-Generated Text Detection in Low-Resource Languages: A Case Study on Urdu | Muhammad Ammar et.al. | 2510.16573 | translate | read | null |
| 2025-10-18 | ReviewGuard: Enhancing Deficient Peer Review Detection via LLM-Driven Data Augmentation | Haoxuan Zhang et.al. | 2510.16549 | translate | read | null |
| 2025-10-18 | OOS-DSD: Improving Out-of-stock Detection in Retail Images using Auxiliary Tasks | Franko Šikić et.al. | 2510.16508 | translate | read | null |
| 2025-10-18 | Enhancing Rotated Object Detection via Anisotropic Gaussian Bounding Box and Bhattacharyya Distance | Chien Thai et.al. | 2510.16445 | translate | read | null |
| 2025-10-17 | Seeing Through the Brain: New Insights from Decoding Visual Stimuli with fMRI | Zheng Huang et.al. | 2510.16196 | translate | read | null |
| 2025-10-17 | ObjectTransforms for Uncertainty Quantification and Reduction in Vision-Based Perception for Autonomous Vehicles | Nishad Sahu et.al. | 2510.16118 | translate | read | null |
| 2025-10-17 | StripRFNet: A Strip Receptive Field and Shape-Aware Network for Road Damage Detection | Jianhan Lin et.al. | 2510.16115 | translate | read | null |
| 2025-10-17 | LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal | Shr-Ruei Tsai et.al. | 2510.15868 | translate | read | link |
| 2025-10-17 | ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection | Haowei Zhu et.al. | 2510.15783 | translate | read | null |
| 2025-10-17 | Valeo Near-Field: a novel dataset for pedestrian intent detection | Antonyo Musabini et.al. | 2510.15673 | translate | read | null |
| 2025-10-17 | FreqPDE: Rethinking Positional Depth Embedding for Multi-View 3D Object Detection Transformers | Haisheng Su et.al. | 2510.15385 | translate | read | null |
| 2025-10-17 | Symmetric Entropy-Constrained Video Coding for Machines | Yuxiao Sun et.al. | 2510.15347 | translate | read | null |
| 2025-10-16 | MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning | Mattia Segu et.al. | 2510.15026 | translate | read | null |
| 2025-10-16 | EdgeNavMamba: Mamba Optimized Object Detection for Energy Efficient Edge Devices | Romina Aalishah et.al. | 2510.14946 | translate | read | null |
| 2025-10-16 | VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation | Han Zhao et.al. | 2510.14902 | translate | read | link |
| 2025-10-16 | CoT-PL: Visual Chain-of-Thought Reasoning Meets Pseudo-Labeling for Open-Vocabulary Object Detection | Hojun Choi et.al. | 2510.14792 | translate | read | null |
| 2025-10-16 | Cross-Layer Feature Self-Attention Module for Multi-Scale Object Detection | Dingzhou Xie et.al. | 2510.14726 | translate | read | null |
| 2025-10-16 | Structured Universal Adversarial Attacks on Object Detection for Video Sequences | Sven Jacob et.al. | 2510.14460 | translate | read | null |
| 2025-10-16 | Beat Tracking as Object Detection | Jaehoon Ahn et.al. | 2510.14391 | translate | read | null |
| 2025-10-15 | How Sampling Affects the Detectability of Machine-written texts: A Comprehensive Study | Matthieu Dubois et.al. | 2510.13681 | translate | read | null |
| 2025-10-15 | A Modular Object Detection System for Humanoid Robots Using YOLO | Nicolas Pottier et.al. | 2510.13625 | translate | read | null |
| 2025-10-15 | Fusion Meets Diverse Conditions: A High-diversity Benchmark and Baseline for UAV-based Multimodal Object Detection with Condition Cues | Chen Chen et.al. | 2510.13620 | translate | read | null |
| 2025-10-15 | Automated document processing system for government agencies using DBNET++ and BART models | Aya Kaysan Bahjat et.al. | 2510.13303 | translate | read | null |
| 2025-10-15 | LLM one-shot style transfer for Authorship Attribution and Verification | Pablo Miralles-González et.al. | 2510.13302 | translate | read | null |
| 2025-10-15 | What “Not” to Detect: Negation-Aware VLMs via Structured Reasoning and Token Merging | Inha Kang et.al. | 2510.13232 | translate | read | null |
| 2025-10-15 | An Analytical Framework to Enhance Autonomous Vehicle Perception for Smart Cities | Jalal Khan et.al. | 2510.13230 | translate | read | null |
| 2025-10-14 | Detect Anything via Next Point Prediction | Qing Jiang et.al. | 2510.12798 | translate | read | link |
| 2025-10-14 | StyleDecipher: Robust and Explainable Detection of LLM-Generated Texts with Stylistic Analysis | Siyuan Li et.al. | 2510.12608 | translate | read | null |
| 2025-10-14 | WaterFlow: Explicit Physics-Prior Rectified Flow for Underwater Saliency Mask Generation | Runting Li et.al. | 2510.12605 | translate | read | null |
| 2025-10-14 | When Personalization Tricks Detectors: The Feature-Inversion Trap in Machine-Generated Text Detection | Lang Gao et.al. | 2510.12476 | translate | read | null |
| 2025-10-14 | The Impact of Synthetic Data on Object Detection Model Performance: A Comparative Analysis with Real-World Data | Muammer Bay et.al. | 2510.12208 | translate | read | null |
| 2025-10-14 | SpikePool: Event-driven Spiking Transformer with Pooling Attention | Donghyun Lee et.al. | 2510.12102 | translate | read | null |
| 2025-10-14 | APGNet: Adaptive Prior-Guided for Underwater Camouflaged Object Detection | Xinxin Huang et.al. | 2510.12056 | translate | read | null |
| 2025-10-13 | NV3D: Leveraging Spatial Shape Through Normal Vector-based 3D Object Detection | Krittin Chaowakarn et.al. | 2510.11632 | translate | read | null |
| 2025-10-13 | Enhancing Maritime Domain Awareness on Inland Waterways: A YOLO-Based Fusion of Satellite and AIS for Vessel Characterization | Geoffery Agorku et.al. | 2510.11449 | translate | read | null |
| 2025-10-13 | A Modular AIoT Framework for Low-Latency Real-Time Robotic Teleoperation in Smart Cities | Shih-Chieh Sun et.al. | 2510.11421 | translate | read | null |
| 2025-10-13 | REACT3D: Recovering Articulations for Interactive Physical 3D Scenes | Zhao Huang et.al. | 2510.11340 | translate | read | null |
| 2025-10-13 | When Does Supervised Training Pay Off? The Hidden Economics of Object Detection in the Era of Vision-Language Models | Samer Al-Hamadani et.al. | 2510.11302 | translate | read | null |
| 2025-10-13 | A Large-Language-Model Assisted Automated Scale Bar Detection and Extraction Framework for Scanning Electron Microscopic Images | Yuxuan Chen et.al. | 2510.11260 | translate | read | null |
| 2025-10-13 | Source-Free Object Detection with Detection Transformer | Huizai Yao et.al. | 2510.11090 | translate | read | null |
| 2025-10-13 | Slitless Spectroscopy Source Detection Using YOLO Deep Neural Network | Xiaohan Chen et.al. | 2510.10922 | translate | read | null |
| 2025-10-12 | EGD-YOLO: A Lightweight Multimodal Framework for Robust Drone-Bird Discrimination via Ghost-Enhanced YOLOv8n and EMA Attention under Adverse Condition | Sudipto Sarkar et.al. | 2510.10765 | translate | read | null |
| 2025-10-12 | MRS-YOLO Railroad Transmission Line Foreign Object Detection Based on Improved YOLO11 and Channel Pruning | Siyuan Liu et.al. | 2510.10553 | translate | read | null |
| 2025-10-12 | Risk-Budgeted Control Framework for Balanced Performance and Safety in Autonomous Vehicles | Pei Yu Chang et.al. | 2510.10442 | translate | read | null |
| 2025-10-11 | Ordinal Scale Traffic Congestion Classification with Multi-Modal Vision-Language and Motion Analysis | Yu-Hsuan Lin et.al. | 2510.10342 | translate | read | null |
| 2025-10-11 | Bridging Perspectives: Foundation Model Guided BEV Maps for 3D Object Detection and Tracking | Markus Käppeler et.al. | 2510.10287 | translate | read | null |
| 2025-10-11 | MRI Brain Tumor Detection with Computer Vision | Jack Krolik et.al. | 2510.10250 | translate | read | null |
| 2025-10-11 | BurstDeflicker: A Benchmark Dataset for Flicker Removal in Dynamic Scenes | Lishen Qu et.al. | 2510.09996 | translate | read | null |
| 2025-10-10 | SpectralCA: Bi-Directional Cross-Attention for Next-Generation UAV Hyperspectral Vision | D. V. Brovko et.al. | 2510.09912 | translate | read | null |
| 2025-10-06 | Ultralytics YOLO Evolution: An Overview of YOLO26, YOLO11, YOLOv8 and YOLOv5 Object Detectors for Computer Vision and Pattern Recognition | Ranjan Sapkota et.al. | 2510.09653 | translate | read | null |
| 2025-10-10 | FSP-DETR: Few-Shot Prototypical Parasitic Ova Detection | Shubham Trehan et.al. | 2510.09583 | translate | read | null |
| 2025-10-10 | PRNet: Original Information Is All You Have | PeiHuang Zheng et.al. | 2510.09531 | translate | read | null |
| 2025-10-10 | Utilizing dynamic sparsity on pretrained DETR | Reza Sedghi et.al. | 2510.09380 | translate | read | null |
| 2025-10-10 | TARO: Toward Semantically Rich Open-World Object Detection | Yuchen Zhang et.al. | 2510.09173 | translate | read | null |
| 2025-10-10 | SOS: Synthetic Object Segments Improve Detection, Segmentation, and Grounding | Weikai Huang et.al. | 2510.09110 | translate | read | null |
| 2025-10-09 | Re-Identifying Kākā with AI-Automated Video Key Frame Extraction | Paula Maddigan et.al. | 2510.08775 | translate | read | null |
| 2025-10-03 | Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes | Nirmal Elamon et.al. | 2510.08589 | translate | read | null |
| 2025-10-09 | A Multimodal Depth-Aware Method For Embodied Reference Understanding | Fevziye Irem Eyiokur et.al. | 2510.08278 | translate | read | null |
| 2025-10-09 | RayFusion: Ray Fusion Enhanced Collaborative Visual Perception | Shaohong Wang et.al. | 2510.08017 | translate | read | null |
| 2025-10-09 | A Large-scale Dataset for Robust Complex Anime Scene Text Detection | Ziyi Dong et.al. | 2510.07951 | translate | read | null |
| 2025-10-08 | Robust Measurement of Stellar Streams Around the Milky Way: Correcting Spatially Variable Observational Selection Effects in Optical Imaging Surveys | K. Boone et.al. | 2510.07511 | translate | read | null |
| 2025-10-08 | A million-solar-mass object detected at cosmological distance using gravitational imaging | D. M. Powell et.al. | 2510.07382 | translate | read | null |
| 2025-10-08 | Inconsistent Affective Reaction: Sentiment of Perception and Opinion in Urban Environments | Jingfei Huang et.al. | 2510.07359 | translate | read | null |
| 2025-10-07 | Enhancing Maritime Object Detection in Real-Time with RT-DETR and Data Augmentation | Nader Nemati et.al. | 2510.07346 | translate | read | null |
| 2025-10-08 | Explaining raw data complexity to improve satellite onboard processing | Adrien Dorise et.al. | 2510.06858 | translate | read | null |
| 2025-10-08 | Extreme Amodal Face Detection | Changlin Song et.al. | 2510.06791 | translate | read | null |
| 2025-10-08 | SDQM: Synthetic Data Quality Metric for Object Detection Dataset Evaluation | Ayush Zenith et.al. | 2510.06596 | translate | read | link |
| 2025-10-08 | Adaptive Stain Normalization for Cross-Domain Medical Histology | Tianyue Xu et.al. | 2510.06592 | translate | read | null |
| 2025-10-06 | General and Efficient Visual Goal-Conditioned Reinforcement Learning using Object-Agnostic Masks | Fahim Shahriar et.al. | 2510.06277 | translate | read | null |
| 2025-10-06 | Comparative Analysis of YOLOv5, Faster R-CNN, SSD, and RetinaNet for Motorbike Detection in Kigali Autonomous Driving Context | Ngeyen Yinkfu et.al. | 2510.04912 | translate | read | null |
| 2025-10-06 | CLEAR-IR: Clarity-Enhanced Active Reconstruction of Infrared Imagery | Nathan Shankar et.al. | 2510.04883 | translate | read | null |
| 2025-10-06 | SPEGNet: Synergistic Perception-Guided Network for Camouflaged Object Detection | Baber Jan et.al. | 2510.04472 | translate | read | link |
| 2025-10-04 | From Filters to VLMs: Benchmarking Defogging Methods through Object Detection and Segmentation Performance | Ardalan Aryashad et.al. | 2510.03906 | translate | read | null |
| 2025-10-04 | Cross-View Open-Vocabulary Object Detection in Aerial Imagery | Jyoti Kini et.al. | 2510.03858 | translate | read | null |
| 2025-10-04 | Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models | Leander Girrbach et.al. | 2510.03721 | translate | read | null |
| 2025-10-04 | SAMSOD: Rethinking SAM Optimization for RGB-T Salient Object Detection | Zhengyi Liu et.al. | 2510.03689 | translate | read | null |
| 2025-10-03 | ALHD: A Large-Scale and Multigenre Benchmark Dataset for Arabic LLM-Generated Text Detection | Ali Khairallah et.al. | 2510.03502 | translate | read | null |
| 2025-10-03 | Visual Language Model as a Judge for Object Detection in Industrial Diagrams | Sanjukta Ghosh et.al. | 2510.03376 | translate | read | null |
| 2025-10-03 | Neural Posterior Estimation with Autoregressive Tiling for Detecting Objects in Astronomical Images | Jeffrey Regier et.al. | 2510.03074 | translate | read | null |
| 2025-10-03 | Align Your Query: Representation Alignment for Multimodality Medical Object Detection | Ara Seo et.al. | 2510.02789 | translate | read | null |
| 2025-10-02 | Multimodal Large Language Model Framework for Safe and Interpretable Grid-Integrated EVs | Jean Douglas Carvalho et.al. | 2510.02592 | translate | read | null |
| 2025-10-02 | Clink! Chop! Thud! – Learning Object Sounds from Real-World Interactions | Mengyu Yang et.al. | 2510.02313 | translate | read | null |
| 2025-10-02 | kabr-tools: Automated Framework for Multi-Species Behavioral Monitoring | Jenna Kline et.al. | 2510.02030 | translate | read | link |
| 2025-10-02 | Automated Defect Detection for Mass-Produced Electronic Components Based on YOLO Object Detection Models | Wei-Lung Mao et.al. | 2510.01914 | translate | read | null |
| 2025-10-02 | Calibrating the Full Predictive Class Distribution of 3D Object Detectors for Autonomous Driving | Cornelius Schröder et.al. | 2510.01829 | translate | read | null |
| 2025-10-01 | Breaking the Code: Security Assessment of AI Code Agents Through Systematic Jailbreaking Attacks | Shoumik Saha et.al. | 2510.01359 | translate | read | null |
| 2025-10-01 | Span-level Detection of AI-generated Scientific Text via Contrastive Learning and Structural Calibration | Zhen Yin et.al. | 2510.00890 | translate | read | null |
| 2025-10-01 | Adaptive Event Stream Slicing for Open-Vocabulary Event-Based Object Detection via Vision-Language Knowledge Distillation | Jinchang Zhang et.al. | 2510.00681 | translate | read | null |
| 2025-10-01 | Forestpest-YOLO: A High-Performance Detection Framework for Small Forestry Pests | Aoduo Li et.al. | 2510.00547 | translate | read | null |