Object Detection - 2026-01
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2026-01-31 | Enhancing Open-Vocabulary Object Detection through Multi-Level Fine-Grained Visual-Language Alignment | Tianyi Zhang et.al. | 2602.00531 | translate | read | null |
| 2026-01-30 | Deep Learning-Based Object Detection for Autonomous Vehicles: A Comparative Study of One-Stage and Two-Stage Detectors on Basic Traffic Objects | Bsher Karbouj et.al. | 2602.00385 | translate | read | null |
| 2026-01-30 | Leveraging Textual-Cues for Enhancing Multimodal Sentiment Analysis by Object Recognition | Sumana Biswas et.al. | 2602.00360 | translate | read | null |
| 2026-01-29 | SDCM: Simulated Densifying and Compensatory Modeling Fusion for Radar-Vision 3-D Object Detection in Internet of Vehicles | Shucong Li et.al. | 2602.00149 | translate | read | null |
| 2026-01-26 | Observing Health Outcomes Using Remote Sensing Imagery and Geo-Context Guided Visual Transformer | Yu Li et.al. | 2602.00110 | translate | read | null |
| 2026-01-30 | User Prompting Strategies and Prompt Enhancement Methods for Open-Set Object Detection in XR Environments | Junfeng Lin et.al. | 2601.23281 | translate | read | null |
| 2026-01-30 | A Comparative Evaluation of Large Vision-Language Models for 2D Object Detection under SOTIF Conditions | Ji Zhou et.al. | 2601.22830 | translate | read | null |
| 2026-01-30 | Active Learning-Driven Lightweight YOLOv9: Enhancing Efficiency in Smart Agriculture | Hung-Chih Tu et.al. | 2601.22732 | translate | read | null |
| 2026-01-30 | OOVDet: Low-Density Prior Learning for Zero-Shot Out-of-Vocabulary Object Detection | Binyi Su et.al. | 2601.22685 | translate | read | null |
| 2026-01-30 | UniGeo: A Unified 3D Indoor Object Detection Framework Integrating Geometry-Aware Learning and Dynamic Channel Gating | Xing Yi et.al. | 2601.22616 | translate | read | null |
| 2026-01-29 | CORDS: Continuous Representations of Discrete Structures | Tin Hadži Veljković et.al. | 2601.21583 | translate | read | null |
| 2026-01-29 | Don’t double it: Efficient Agent Prediction in Occlusions | Anna Rothenhäusler et.al. | 2601.21504 | translate | read | null |
| 2026-01-28 | BadDet+: Robust Backdoor Attacks for Object Detection | Kealan Dunnett et.al. | 2601.21066 | translate | read | null |
| 2026-01-27 | On the Effectiveness of LLM-Specific Fine-Tuning for Detecting AI-Generated Text | Michał Gromadzki et.al. | 2601.20006 | translate | read | null |
| 2026-01-27 | VGGT-SLAM 2.0: Real-time Dense Feed-forward Scene Reconstruction | Dominic Maggio et.al. | 2601.19887 | translate | read | null |
| 2026-01-27 | Learned split-spectrum metalens for obstruction-free broadband imaging in the visible | Seungwoo Yoon et.al. | 2601.19403 | translate | read | null |
| 2026-01-27 | MIRAGE: Enabling Real-Time Automotive Mediated Reality | Pascal Jansen et.al. | 2601.19385 | translate | read | null |
| 2026-01-27 | Instance-Guided Radar Depth Estimation for 3D Object Detection | Chen-Chou Lo et.al. | 2601.19314 | translate | read | null |
| 2026-01-27 | Implicit Non-Causal Factors are Out via Dataset Splitting for Domain Generalization Object Detection | Zhilong Zhang et.al. | 2601.19127 | translate | read | null |
| 2026-01-26 | On the Role of Depth in Surgical Vision Foundation Models: An Empirical Study of RGB-D Pre-training | John J. Han et.al. | 2601.18929 | translate | read | null |
| 2026-01-26 | Dynamic Mask-Based Backdoor Attack Against Vision AI Models: A Case Study on Mushroom Detection | Zeineb Dridi et.al. | 2601.18845 | translate | read | null |
| 2026-01-26 | EFSI-DETR: Efficient Frequency-Semantic Integration for Real-Time Small Object Detection in UAV Imagery | Yu Xia et.al. | 2601.18597 | translate | read | null |
| 2026-01-26 | YOLO-DS: Fine-Grained Feature Decoupling via Dual-Statistic Synergy Operator for Object Detection | Lin Huang et.al. | 2601.18172 | translate | read | null |
| 2026-01-26 | Text-Pass Filter: An Efficient Scene Text Detector | Chuang Yang et.al. | 2601.18098 | translate | read | null |
| 2026-01-23 | Boundary and Position Information Mining for Aerial Small Object Detection | Rongxin Huang et.al. | 2601.16617 | translate | read | null |
| 2026-01-23 | Emotion-LLaMAv2 and MMEVerse: A New Framework and Benchmark for Multimodal Emotion Understanding | Xiaojiang Peng et.al. | 2601.16449 | translate | read | null |
| 2026-01-22 | The Latency Wall: Benchmarking Off-the-Shelf Emotion Recognition for Real-Time Virtual Avatars | Yarin Benyamin et.al. | 2601.15914 | translate | read | null |
| 2026-01-22 | Performance-guided Reinforced Active Learning for Object Detection | Zhixuan Liang et.al. | 2601.15688 | translate | read | null |
| 2026-01-21 | ZENITH: Automated Gradient Norm Informed Stochastic Optimization | Dhrubo Saha et.al. | 2601.15212 | translate | read | null |
| 2026-01-21 | Graph Recognition via Subgraph Prediction | André Eberhard et.al. | 2601.15133 | translate | read | null |
| 2026-01-21 | M2I2HA: A Multi-modal Object Detection Method Based on Intra- and Inter-Modal Hypergraph Attention | Xiaofan Yang et.al. | 2601.14776 | translate | read | null |
| 2026-01-21 | A comprehensive overview of deep learning models for object detection from videos/images | Sukana Zulfqar et.al. | 2601.14677 | translate | read | null |
| 2026-01-20 | GutenOCR: A Grounded Vision-Language Front-End for Documents | Hunter Heidenreich et.al. | 2601.14490 | translate | read | link |
| 2026-01-20 | Gaussian Based Adaptive Multi-Modal 3D Semantic Occupancy Prediction | A. Enes Doruk et.al. | 2601.14448 | translate | read | null |
| 2026-01-20 | DExTeR: Weakly Semi-Supervised Object Detection with Class and Instance Experts for Medical Imaging | Adrien Meyer et.al. | 2601.13954 | translate | read | null |
| 2026-01-19 | Leveraging Transformer Decoder for Automotive Radar Object Detection | Changxu Zhang et.al. | 2601.13386 | translate | read | null |
| 2026-01-19 | Practical Insights into Semi-Supervised Object Detection Approaches | Chaoxin Wang et.al. | 2601.13380 | translate | read | null |
| 2026-01-19 | Real-Time 4D Radar Perception for Robust Human Detection in Harsh Enclosed Environments | Zhenan Liu et.al. | 2601.13364 | translate | read | null |
| 2026-01-19 | AsyncBEV: Cross-modal Flow Alignment in Asynchronous 3D Object Detection | Shiming Wang et.al. | 2601.12994 | translate | read | null |
| 2026-01-19 | Membership Inference Test: Auditing Training Data in Object Classification Models | Gonzalo Mancera et.al. | 2601.12929 | translate | read | null |
| 2026-01-19 | YOLO26: An Analysis of NMS-Free End to End Framework for Real-Time Object Detection | Sudip Chakrabarty et.al. | 2601.12882 | translate | read | null |
| 2026-01-19 | Towards Unbiased Source-Free Object Detection via Vision Foundation Models | Zhi Cai et.al. | 2601.12765 | translate | read | null |
| 2026-01-19 | RSOD: Reliability-Guided Sonar Image Object Detection with Extremely Limited Labels | Chengzhou Li et.al. | 2601.12715 | translate | read | null |
| 2026-01-19 | BlocksecRT-DETR: Decentralized Privacy-Preserving and Token-Efficient Federated Transformer Learning for Secure Real-Time Object Detection in ITS | Mohoshin Ara Tahera et.al. | 2601.12693 | translate | read | null |
| 2026-01-19 | Mixed Precision PointPillars for Efficient 3D Object Detection with TensorRT | Ninnart Fuengfusin et.al. | 2601.12638 | translate | read | null |
| 2026-01-15 | SecMLOps: A Comprehensive Framework for Integrating Security Throughout the MLOps Lifecycle | Xinrui Zhang et.al. | 2601.10848 | translate | read | null |
| 2026-01-15 | Beyond Single Prompts: Synergistic Fusion and Arrangement for VICL | Wenwen Liao et.al. | 2601.10117 | translate | read | null |
| 2026-01-15 | Enhancing Visual In-Context Learning by Multi-Faceted Fusion | Wenwen Liao et.al. | 2601.10107 | translate | read | null |
| 2026-01-14 | LCF3D: A Robust and Real-Time Late-Cascade Fusion Framework for 3D Object Detection in Autonomous Driving | Carlo Sgaravatti et.al. | 2601.09812 | translate | read | link |
| 2026-01-14 | AquaFeat+: an Underwater Vision Learning-based Enhancement Method for Object Detection, Classification, and Tracking | Emanuel da Costa Silva et.al. | 2601.09652 | translate | read | null |
| 2026-01-14 | Towards Robust Cross-Dataset Object Detection Generalization under Domain Specificity | Ritabrata Chakraborty et.al. | 2601.09497 | translate | read | link |
| 2026-01-14 | DeTracker: Motion-decoupled Vehicle Detection and Tracking in Unstabilized Satellite Videos | Jiajun Chen et.al. | 2601.09240 | translate | read | null |
| 2026-01-14 | Disentangle Object and Non-object Infrared Features via Language Guidance | Fan Liu et.al. | 2601.09228 | translate | read | null |
| 2026-01-13 | DentalX: Context-Aware Dental Disease Detection with Radiographs | Zhi Qin Tan et.al. | 2601.08797 | translate | read | link |
| 2026-01-13 | WaveFormer: Frequency-Time Decoupled Vision Modeling with Wave Equation | Zishan Shu et.al. | 2601.08602 | translate | read | null |
| 2026-01-13 | Edge-Optimized Multimodal Learning for UAV Video Understanding via BLIP-2 | Yizhan Feng et.al. | 2601.08408 | translate | read | null |
| 2026-01-13 | Human-inspired Global-to-Parallel Multi-scale Encoding for Lightweight Vision Models | Wei Xu et.al. | 2601.08190 | translate | read | null |
| 2026-01-13 | Towards Cross-Platform Generalization: Domain Adaptive 3D Detection with Augmentation and Pseudo-Labeling | Xiyan Feng et.al. | 2601.08174 | translate | read | null |
| 2026-01-13 | Representation Learning with Semantic-aware Instance and Sparse Token Alignments | Phuoc-Nguyen Bui et.al. | 2601.08165 | translate | read | null |
| 2026-01-13 | From Prompts to Deployment: Auto-Curated Domain-Specific Dataset Generation via Diffusion Models | Dongsik Yoon et.al. | 2601.08095 | translate | read | null |
| 2026-01-12 | Integrating Attendance Tracking and Emotion Detection for Enhanced Student Engagement in Smart Classrooms | Keith Ainebyona et.al. | 2601.08049 | translate | read | null |
| 2026-01-06 | Edge-AI Perception Node for Cooperative Road-Safety Enforcement and Connected-Vehicle Integration | Shree Charran R et.al. | 2601.07845 | translate | read | null |
| 2026-01-12 | GenDet: Painting Colored Bounding Boxes on Images via Diffusion Model for Object Detection | Chen Min et.al. | 2601.07273 | translate | read | null |
| 2026-01-12 | SC-MII: Infrastructure LiDAR-based 3D Object Detection on Edge Devices for Split Computing with Multiple Intermediate Outputs Integration | Taisuke Noguchi et.al. | 2601.07119 | translate | read | null |
| 2026-01-11 | Billboard in Focus: Estimating Driver Gaze Duration from a Single Image | Carlos Pizarroso et.al. | 2601.07073 | translate | read | null |
| 2026-01-08 | STResNet & STYOLO : A New Family of Compact Classification and Object Detection Models for MCUs | Sudhakar Sah et.al. | 2601.05364 | translate | read | null |
| 2026-01-08 | UniLiPs: Unified LiDAR Pseudo-Labeling with Geometry-Grounded Dynamic Scene Decomposition | Filippo Ghilotti et.al. | 2601.05105 | translate | read | null |
| 2026-01-08 | Character Detection using YOLO for Writer Identification in multiple Medieval books | Alessandra Scotto di Freca et.al. | 2601.04834 | translate | read | null |
| 2026-01-08 | When AI Settles Down: Late-Stage Stability as a Signature of AI-Generated Text Detection | Ke Sun et.al. | 2601.04833 | translate | read | null |
| 2026-01-08 | Optimization of Deep Learning Models for Radio Galaxy Classification | Philipp Denzel et.al. | 2601.04773 | translate | read | null |
| 2026-01-08 | DP-MGTD: Privacy-Preserving Machine-Generated Text Detection via Adaptive Differentially Private Entity Sanitization | Lionel Z. Wang et.al. | 2601.04641 | translate | read | null |
| 2026-01-07 | Few-Shot LoRA Adaptation of a Flow-Matching Foundation Model for Cross-Spectral Object Detection | Maxim Clouser et.al. | 2601.04381 | translate | read | null |
| 2026-01-07 | Correcting Autonomous Driving Object Detection Misclassifications with Automated Commonsense Reasoning | Keegan Kimbrell et.al. | 2601.04271 | translate | read | null |
| 2026-01-07 | AI Generated Text Detection | Adilkhan Alikhanov et.al. | 2601.03812 | translate | read | null |
| 2026-01-07 | A Comparative Study of 3D Model Acquisition Methods for Synthetic Data Generation of Agricultural Products | Steven Moonen et.al. | 2601.03784 | translate | read | null |
| 2026-01-07 | HyperCOD: The First Challenging Benchmark and Baseline for Hyperspectral Camouflaged Object Detection | Shuyan Bai et.al. | 2601.03736 | translate | read | null |
| 2026-01-07 | Systematic Evaluation of Depth Backbones and Semantic Cues for Monocular Pseudo-LiDAR 3D Detection | Samson Oseiwe Ajadalu et.al. | 2601.03617 | translate | read | null |
| 2026-01-07 | Physics-Constrained Cross-Resolution Enhancement Network for Optics-Guided Thermal UAV Image Super-Resolution | Zhicheng Zhao et.al. | 2601.03526 | translate | read | null |
| 2026-01-06 | CageDroneRF: A Large-Scale RF Benchmark and Toolkit for Drone Perception | Mohammad Rostami et.al. | 2601.03302 | translate | read | null |
| 2026-01-06 | Motion Blur Robust Wheat Pest Damage Detection with Dynamic Fuzzy Feature Fusion | Han Zhang et.al. | 2601.03046 | translate | read | null |
| 2026-01-06 | Towards Efficient 3D Object Detection for Vehicle-Infrastructure Collaboration via Risk-Intent Selection | Li Wang et.al. | 2601.03001 | translate | read | null |
| 2026-01-06 | DGA-Net: Enhancing SAM with Depth Prompting and Graph-Anchor Guidance for Camouflaged Object Detection | Yuetong Li et.al. | 2601.02831 | translate | read | null |
| 2026-01-06 | D$^3$R-DETR: DETR with Dual-Domain Density Refinement for Tiny Object Detection in Aerial Images | Zixiao Wen et.al. | 2601.02747 | translate | read | null |
| 2026-01-05 | SortWaste: A Densely Annotated Dataset for Object Detection in Industrial Waste Sorting | Sara Inácio et.al. | 2601.02299 | translate | read | null |
| 2026-01-05 | SLGNet: Synergizing Structural Priors and Language-Guided Modulation for Multimodal Object Detection | Xiantai Xiang et.al. | 2601.02249 | translate | read | null |
| 2026-01-05 | Enhancing Object Detection with Privileged Information: A Model-Agnostic Teacher-Student Approach | Matthias Bartolo et.al. | 2601.02016 | translate | read | link |
| 2026-01-05 | Point-SRA: Self-Representation Alignment for 3D Representation Learning | Lintong Wei et.al. | 2601.01746 | translate | read | null |
| 2026-01-05 | An AI-guided mechanotyping instrument for fully automated oocyte quality assessment | Yining Guo et.al. | 2601.01728 | translate | read | null |
| 2026-01-04 | Learnability-Driven Submodular Optimization for Active Roadside 3D Detection | Ruiyu Mao et.al. | 2601.01695 | translate | read | null |
| 2026-01-04 | Optically Transparent Meta-Grating Embedded in Rear Windshields for Automotive Radar Detection | Sergey Geyman et.al. | 2601.01551 | translate | read | null |
| 2026-01-04 | Robust Ship Detection and Tracking Using Modified ViBe and Backwash Cancellation Algorithm | Mohammad Hassan Saghafi et.al. | 2601.01481 | translate | read | null |
| 2026-01-04 | Evaluation of Convolutional Neural Network For Image Classification with Agricultural and Urban Datasets | Shamik Shafkat Avro et.al. | 2601.01393 | translate | read | null |
| 2026-01-03 | RFAssigner: A Generic Label Assignment Strategy for Dense Object Detection | Ziqian Guan et.al. | 2601.01240 | translate | read | null |
| 2026-01-03 | GenCAMO: Scene-Graph Contextual Decoupling for Environment-aware and Mask-free Camouflage Image-Dense Annotation Generation | Chenglizhao Chen et.al. | 2601.01181 | translate | read | null |
| 2026-01-03 | Evolving CNN Architectures: From Custom Designs to Deep Residual Models for Diverse Image Classification and Detection Tasks | Mahmudul Hasan et.al. | 2601.01099 | translate | read | null |
| 2026-01-03 | Mono3DV: Monocular 3D Object Detection with 3D-Aware Bipartite Matching and Variational Query DeNoising | Kiet Dang Vu et.al. | 2601.01036 | translate | read | null |
| 2026-01-02 | Noise-Robust Tiny Object Localization with Flows | Huixin Sun et.al. | 2601.00617 | translate | read | null |
| 2026-01-01 | RoLID-11K: A Dashcam Dataset for Small-Object Roadside Litter Detection | Tao Wu et.al. | 2601.00398 | translate | read | null |
| 2026-01-01 | Intelligent Traffic Surveillance for Real-Time Vehicle Detection, License Plate Recognition, and Speed Estimation | Bruce Mugizi et.al. | 2601.00344 | translate | read | null |