Object Detection - 2026-01

| Publish Date | Title | Authors | PDF | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2026-01-31 | Enhancing Open-Vocabulary Object Detection through Multi-Level Fine-Grained Visual-Language Alignment | Tianyi Zhang et.al. | 2602.00531 | translate | read | null |
| 2026-01-30 | Deep Learning-Based Object Detection for Autonomous Vehicles: A Comparative Study of One-Stage and Two-Stage Detectors on Basic Traffic Objects | Bsher Karbouj et.al. | 2602.00385 | translate | read | null |
| 2026-01-30 | Leveraging Textual-Cues for Enhancing Multimodal Sentiment Analysis by Object Recognition | Sumana Biswas et.al. | 2602.00360 | translate | read | null |
| 2026-01-29 | SDCM: Simulated Densifying and Compensatory Modeling Fusion for Radar-Vision 3-D Object Detection in Internet of Vehicles | Shucong Li et.al. | 2602.00149 | translate | read | null |
| 2026-01-26 | Observing Health Outcomes Using Remote Sensing Imagery and Geo-Context Guided Visual Transformer | Yu Li et.al. | 2602.00110 | translate | read | null |
| 2026-01-30 | User Prompting Strategies and Prompt Enhancement Methods for Open-Set Object Detection in XR Environments | Junfeng Lin et.al. | 2601.23281 | translate | read | null |
| 2026-01-30 | A Comparative Evaluation of Large Vision-Language Models for 2D Object Detection under SOTIF Conditions | Ji Zhou et.al. | 2601.22830 | translate | read | null |
| 2026-01-30 | Active Learning-Driven Lightweight YOLOv9: Enhancing Efficiency in Smart Agriculture | Hung-Chih Tu et.al. | 2601.22732 | translate | read | null |
| 2026-01-30 | OOVDet: Low-Density Prior Learning for Zero-Shot Out-of-Vocabulary Object Detection | Binyi Su et.al. | 2601.22685 | translate | read | null |
| 2026-01-30 | UniGeo: A Unified 3D Indoor Object Detection Framework Integrating Geometry-Aware Learning and Dynamic Channel Gating | Xing Yi et.al. | 2601.22616 | translate | read | null |
| 2026-01-29 | CORDS: Continuous Representations of Discrete Structures | Tin Hadži Veljković et.al. | 2601.21583 | translate | read | null |
| 2026-01-29 | Don’t double it: Efficient Agent Prediction in Occlusions | Anna Rothenhäusler et.al. | 2601.21504 | translate | read | null |
| 2026-01-28 | BadDet+: Robust Backdoor Attacks for Object Detection | Kealan Dunnett et.al. | 2601.21066 | translate | read | null |
| 2026-01-27 | On the Effectiveness of LLM-Specific Fine-Tuning for Detecting AI-Generated Text | Michał Gromadzki et.al. | 2601.20006 | translate | read | null |
| 2026-01-27 | VGGT-SLAM 2.0: Real-time Dense Feed-forward Scene Reconstruction | Dominic Maggio et.al. | 2601.19887 | translate | read | null |
| 2026-01-27 | Learned split-spectrum metalens for obstruction-free broadband imaging in the visible | Seungwoo Yoon et.al. | 2601.19403 | translate | read | null |
| 2026-01-27 | MIRAGE: Enabling Real-Time Automotive Mediated Reality | Pascal Jansen et.al. | 2601.19385 | translate | read | null |
| 2026-01-27 | Instance-Guided Radar Depth Estimation for 3D Object Detection | Chen-Chou Lo et.al. | 2601.19314 | translate | read | null |
| 2026-01-27 | Implicit Non-Causal Factors are Out via Dataset Splitting for Domain Generalization Object Detection | Zhilong Zhang et.al. | 2601.19127 | translate | read | null |
| 2026-01-26 | On the Role of Depth in Surgical Vision Foundation Models: An Empirical Study of RGB-D Pre-training | John J. Han et.al. | 2601.18929 | translate | read | null |
| 2026-01-26 | Dynamic Mask-Based Backdoor Attack Against Vision AI Models: A Case Study on Mushroom Detection | Zeineb Dridi et.al. | 2601.18845 | translate | read | null |
| 2026-01-26 | EFSI-DETR: Efficient Frequency-Semantic Integration for Real-Time Small Object Detection in UAV Imagery | Yu Xia et.al. | 2601.18597 | translate | read | null |
| 2026-01-26 | YOLO-DS: Fine-Grained Feature Decoupling via Dual-Statistic Synergy Operator for Object Detection | Lin Huang et.al. | 2601.18172 | translate | read | null |
| 2026-01-26 | Text-Pass Filter: An Efficient Scene Text Detector | Chuang Yang et.al. | 2601.18098 | translate | read | null |
| 2026-01-23 | Boundary and Position Information Mining for Aerial Small Object Detection | Rongxin Huang et.al. | 2601.16617 | translate | read | null |
| 2026-01-23 | Emotion-LLaMAv2 and MMEVerse: A New Framework and Benchmark for Multimodal Emotion Understanding | Xiaojiang Peng et.al. | 2601.16449 | translate | read | null |
| 2026-01-22 | The Latency Wall: Benchmarking Off-the-Shelf Emotion Recognition for Real-Time Virtual Avatars | Yarin Benyamin et.al. | 2601.15914 | translate | read | null |
| 2026-01-22 | Performance-guided Reinforced Active Learning for Object Detection | Zhixuan Liang et.al. | 2601.15688 | translate | read | null |
| 2026-01-21 | ZENITH: Automated Gradient Norm Informed Stochastic Optimization | Dhrubo Saha et.al. | 2601.15212 | translate | read | null |
| 2026-01-21 | Graph Recognition via Subgraph Prediction | André Eberhard et.al. | 2601.15133 | translate | read | null |
| 2026-01-21 | M2I2HA: A Multi-modal Object Detection Method Based on Intra- and Inter-Modal Hypergraph Attention | Xiaofan Yang et.al. | 2601.14776 | translate | read | null |
| 2026-01-21 | A comprehensive overview of deep learning models for object detection from videos/images | Sukana Zulfqar et.al. | 2601.14677 | translate | read | null |
| 2026-01-20 | GutenOCR: A Grounded Vision-Language Front-End for Documents | Hunter Heidenreich et.al. | 2601.14490 | translate | read | link |
| 2026-01-20 | Gaussian Based Adaptive Multi-Modal 3D Semantic Occupancy Prediction | A. Enes Doruk et.al. | 2601.14448 | translate | read | null |
| 2026-01-20 | DExTeR: Weakly Semi-Supervised Object Detection with Class and Instance Experts for Medical Imaging | Adrien Meyer et.al. | 2601.13954 | translate | read | null |
| 2026-01-19 | Leveraging Transformer Decoder for Automotive Radar Object Detection | Changxu Zhang et.al. | 2601.13386 | translate | read | null |
| 2026-01-19 | Practical Insights into Semi-Supervised Object Detection Approaches | Chaoxin Wang et.al. | 2601.13380 | translate | read | null |
| 2026-01-19 | Real-Time 4D Radar Perception for Robust Human Detection in Harsh Enclosed Environments | Zhenan Liu et.al. | 2601.13364 | translate | read | null |
| 2026-01-19 | AsyncBEV: Cross-modal Flow Alignment in Asynchronous 3D Object Detection | Shiming Wang et.al. | 2601.12994 | translate | read | null |
| 2026-01-19 | Membership Inference Test: Auditing Training Data in Object Classification Models | Gonzalo Mancera et.al. | 2601.12929 | translate | read | null |
| 2026-01-19 | YOLO26: An Analysis of NMS-Free End to End Framework for Real-Time Object Detection | Sudip Chakrabarty et.al. | 2601.12882 | translate | read | null |
| 2026-01-19 | Towards Unbiased Source-Free Object Detection via Vision Foundation Models | Zhi Cai et.al. | 2601.12765 | translate | read | null |
| 2026-01-19 | RSOD: Reliability-Guided Sonar Image Object Detection with Extremely Limited Labels | Chengzhou Li et.al. | 2601.12715 | translate | read | null |
| 2026-01-19 | BlocksecRT-DETR: Decentralized Privacy-Preserving and Token-Efficient Federated Transformer Learning for Secure Real-Time Object Detection in ITS | Mohoshin Ara Tahera et.al. | 2601.12693 | translate | read | null |
| 2026-01-19 | Mixed Precision PointPillars for Efficient 3D Object Detection with TensorRT | Ninnart Fuengfusin et.al. | 2601.12638 | translate | read | null |
| 2026-01-15 | SecMLOps: A Comprehensive Framework for Integrating Security Throughout the MLOps Lifecycle | Xinrui Zhang et.al. | 2601.10848 | translate | read | null |
| 2026-01-15 | Beyond Single Prompts: Synergistic Fusion and Arrangement for VICL | Wenwen Liao et.al. | 2601.10117 | translate | read | null |
| 2026-01-15 | Enhancing Visual In-Context Learning by Multi-Faceted Fusion | Wenwen Liao et.al. | 2601.10107 | translate | read | null |
| 2026-01-14 | LCF3D: A Robust and Real-Time Late-Cascade Fusion Framework for 3D Object Detection in Autonomous Driving | Carlo Sgaravatti et.al. | 2601.09812 | translate | read | link |
| 2026-01-14 | AquaFeat+: an Underwater Vision Learning-based Enhancement Method for Object Detection, Classification, and Tracking | Emanuel da Costa Silva et.al. | 2601.09652 | translate | read | null |
| 2026-01-14 | Towards Robust Cross-Dataset Object Detection Generalization under Domain Specificity | Ritabrata Chakraborty et.al. | 2601.09497 | translate | read | link |
| 2026-01-14 | DeTracker: Motion-decoupled Vehicle Detection and Tracking in Unstabilized Satellite Videos | Jiajun Chen et.al. | 2601.09240 | translate | read | null |
| 2026-01-14 | Disentangle Object and Non-object Infrared Features via Language Guidance | Fan Liu et.al. | 2601.09228 | translate | read | null |
| 2026-01-13 | DentalX: Context-Aware Dental Disease Detection with Radiographs | Zhi Qin Tan et.al. | 2601.08797 | translate | read | link |
| 2026-01-13 | WaveFormer: Frequency-Time Decoupled Vision Modeling with Wave Equation | Zishan Shu et.al. | 2601.08602 | translate | read | null |
| 2026-01-13 | Edge-Optimized Multimodal Learning for UAV Video Understanding via BLIP-2 | Yizhan Feng et.al. | 2601.08408 | translate | read | null |
| 2026-01-13 | Human-inspired Global-to-Parallel Multi-scale Encoding for Lightweight Vision Models | Wei Xu et.al. | 2601.08190 | translate | read | null |
| 2026-01-13 | Towards Cross-Platform Generalization: Domain Adaptive 3D Detection with Augmentation and Pseudo-Labeling | Xiyan Feng et.al. | 2601.08174 | translate | read | null |
| 2026-01-13 | Representation Learning with Semantic-aware Instance and Sparse Token Alignments | Phuoc-Nguyen Bui et.al. | 2601.08165 | translate | read | null |
| 2026-01-13 | From Prompts to Deployment: Auto-Curated Domain-Specific Dataset Generation via Diffusion Models | Dongsik Yoon et.al. | 2601.08095 | translate | read | null |
| 2026-01-12 | Integrating Attendance Tracking and Emotion Detection for Enhanced Student Engagement in Smart Classrooms | Keith Ainebyona et.al. | 2601.08049 | translate | read | null |
| 2026-01-06 | Edge-AI Perception Node for Cooperative Road-Safety Enforcement and Connected-Vehicle Integration | Shree Charran R et.al. | 2601.07845 | translate | read | null |
| 2026-01-12 | GenDet: Painting Colored Bounding Boxes on Images via Diffusion Model for Object Detection | Chen Min et.al. | 2601.07273 | translate | read | null |
| 2026-01-12 | SC-MII: Infrastructure LiDAR-based 3D Object Detection on Edge Devices for Split Computing with Multiple Intermediate Outputs Integration | Taisuke Noguchi et.al. | 2601.07119 | translate | read | null |
| 2026-01-11 | Billboard in Focus: Estimating Driver Gaze Duration from a Single Image | Carlos Pizarroso et.al. | 2601.07073 | translate | read | null |
| 2026-01-08 | STResNet & STYOLO : A New Family of Compact Classification and Object Detection Models for MCUs | Sudhakar Sah et.al. | 2601.05364 | translate | read | null |
| 2026-01-08 | UniLiPs: Unified LiDAR Pseudo-Labeling with Geometry-Grounded Dynamic Scene Decomposition | Filippo Ghilotti et.al. | 2601.05105 | translate | read | null |
| 2026-01-08 | Character Detection using YOLO for Writer Identification in multiple Medieval books | Alessandra Scotto di Freca et.al. | 2601.04834 | translate | read | null |
| 2026-01-08 | When AI Settles Down: Late-Stage Stability as a Signature of AI-Generated Text Detection | Ke Sun et.al. | 2601.04833 | translate | read | null |
| 2026-01-08 | Optimization of Deep Learning Models for Radio Galaxy Classification | Philipp Denzel et.al. | 2601.04773 | translate | read | null |
| 2026-01-08 | DP-MGTD: Privacy-Preserving Machine-Generated Text Detection via Adaptive Differentially Private Entity Sanitization | Lionel Z. Wang et.al. | 2601.04641 | translate | read | null |
| 2026-01-07 | Few-Shot LoRA Adaptation of a Flow-Matching Foundation Model for Cross-Spectral Object Detection | Maxim Clouser et.al. | 2601.04381 | translate | read | null |
| 2026-01-07 | Correcting Autonomous Driving Object Detection Misclassifications with Automated Commonsense Reasoning | Keegan Kimbrell et.al. | 2601.04271 | translate | read | null |
| 2026-01-07 | AI Generated Text Detection | Adilkhan Alikhanov et.al. | 2601.03812 | translate | read | null |
| 2026-01-07 | A Comparative Study of 3D Model Acquisition Methods for Synthetic Data Generation of Agricultural Products | Steven Moonen et.al. | 2601.03784 | translate | read | null |
| 2026-01-07 | HyperCOD: The First Challenging Benchmark and Baseline for Hyperspectral Camouflaged Object Detection | Shuyan Bai et.al. | 2601.03736 | translate | read | null |
| 2026-01-07 | Systematic Evaluation of Depth Backbones and Semantic Cues for Monocular Pseudo-LiDAR 3D Detection | Samson Oseiwe Ajadalu et.al. | 2601.03617 | translate | read | null |
| 2026-01-07 | Physics-Constrained Cross-Resolution Enhancement Network for Optics-Guided Thermal UAV Image Super-Resolution | Zhicheng Zhao et.al. | 2601.03526 | translate | read | null |
| 2026-01-06 | CageDroneRF: A Large-Scale RF Benchmark and Toolkit for Drone Perception | Mohammad Rostami et.al. | 2601.03302 | translate | read | null |
| 2026-01-06 | Motion Blur Robust Wheat Pest Damage Detection with Dynamic Fuzzy Feature Fusion | Han Zhang et.al. | 2601.03046 | translate | read | null |
| 2026-01-06 | Towards Efficient 3D Object Detection for Vehicle-Infrastructure Collaboration via Risk-Intent Selection | Li Wang et.al. | 2601.03001 | translate | read | null |
| 2026-01-06 | DGA-Net: Enhancing SAM with Depth Prompting and Graph-Anchor Guidance for Camouflaged Object Detection | Yuetong Li et.al. | 2601.02831 | translate | read | null |
| 2026-01-06 | D$^3$R-DETR: DETR with Dual-Domain Density Refinement for Tiny Object Detection in Aerial Images | Zixiao Wen et.al. | 2601.02747 | translate | read | null |
| 2026-01-05 | SortWaste: A Densely Annotated Dataset for Object Detection in Industrial Waste Sorting | Sara Inácio et.al. | 2601.02299 | translate | read | null |
| 2026-01-05 | SLGNet: Synergizing Structural Priors and Language-Guided Modulation for Multimodal Object Detection | Xiantai Xiang et.al. | 2601.02249 | translate | read | null |
| 2026-01-05 | Enhancing Object Detection with Privileged Information: A Model-Agnostic Teacher-Student Approach | Matthias Bartolo et.al. | 2601.02016 | translate | read | link |
| 2026-01-05 | Point-SRA: Self-Representation Alignment for 3D Representation Learning | Lintong Wei et.al. | 2601.01746 | translate | read | null |
| 2026-01-05 | An AI-guided mechanotyping instrument for fully automated oocyte quality assessment | Yining Guo et.al. | 2601.01728 | translate | read | null |
| 2026-01-04 | Learnability-Driven Submodular Optimization for Active Roadside 3D Detection | Ruiyu Mao et.al. | 2601.01695 | translate | read | null |
| 2026-01-04 | Optically Transparent Meta-Grating Embedded in Rear Windshields for Automotive Radar Detection | Sergey Geyman et.al. | 2601.01551 | translate | read | null |
| 2026-01-04 | Robust Ship Detection and Tracking Using Modified ViBe and Backwash Cancellation Algorithm | Mohammad Hassan Saghafi et.al. | 2601.01481 | translate | read | null |
| 2026-01-04 | Evaluation of Convolutional Neural Network For Image Classification with Agricultural and Urban Datasets | Shamik Shafkat Avro et.al. | 2601.01393 | translate | read | null |
| 2026-01-03 | RFAssigner: A Generic Label Assignment Strategy for Dense Object Detection | Ziqian Guan et.al. | 2601.01240 | translate | read | null |
| 2026-01-03 | GenCAMO: Scene-Graph Contextual Decoupling for Environment-aware and Mask-free Camouflage Image-Dense Annotation Generation | Chenglizhao Chen et.al. | 2601.01181 | translate | read | null |
| 2026-01-03 | Evolving CNN Architectures: From Custom Designs to Deep Residual Models for Diverse Image Classification and Detection Tasks | Mahmudul Hasan et.al. | 2601.01099 | translate | read | null |
| 2026-01-03 | Mono3DV: Monocular 3D Object Detection with 3D-Aware Bipartite Matching and Variational Query DeNoising | Kiet Dang Vu et.al. | 2601.01036 | translate | read | null |
| 2026-01-02 | Noise-Robust Tiny Object Localization with Flows | Huixin Sun et.al. | 2601.00617 | translate | read | null |
| 2026-01-01 | RoLID-11K: A Dashcam Dataset for Small-Object Roadside Litter Detection | Tao Wu et.al. | 2601.00398 | translate | read | null |
| 2026-01-01 | Intelligent Traffic Surveillance for Real-Time Vehicle Detection, License Plate Recognition, and Speed Estimation | Bruce Mugizi et.al. | 2601.00344 | translate | read | null |

(<a href=../Object_Detection.md>back to Object Detection</a>)