Object Detection - 2026-01 | Paper Arxiv Daily

Object Detection - 2026-01

Publish Date	Title	Authors	PDF	Translate	Read	Code
2026-01-31	Enhancing Open-Vocabulary Object Detection through Multi-Level Fine-Grained Visual-Language Alignment	Tianyi Zhang et.al.	2602.00531	translate	read	null
2026-01-30	Deep Learning-Based Object Detection for Autonomous Vehicles: A Comparative Study of One-Stage and Two-Stage Detectors on Basic Traffic Objects	Bsher Karbouj et.al.	2602.00385	translate	read	null
2026-01-30	Leveraging Textual-Cues for Enhancing Multimodal Sentiment Analysis by Object Recognition	Sumana Biswas et.al.	2602.00360	translate	read	null
2026-01-29	SDCM: Simulated Densifying and Compensatory Modeling Fusion for Radar-Vision 3-D Object Detection in Internet of Vehicles	Shucong Li et.al.	2602.00149	translate	read	null
2026-01-26	Observing Health Outcomes Using Remote Sensing Imagery and Geo-Context Guided Visual Transformer	Yu Li et.al.	2602.00110	translate	read	null
2026-01-30	User Prompting Strategies and Prompt Enhancement Methods for Open-Set Object Detection in XR Environments	Junfeng Lin et.al.	2601.23281	translate	read	null
2026-01-30	A Comparative Evaluation of Large Vision-Language Models for 2D Object Detection under SOTIF Conditions	Ji Zhou et.al.	2601.22830	translate	read	null
2026-01-30	Active Learning-Driven Lightweight YOLOv9: Enhancing Efficiency in Smart Agriculture	Hung-Chih Tu et.al.	2601.22732	translate	read	null
2026-01-30	OOVDet: Low-Density Prior Learning for Zero-Shot Out-of-Vocabulary Object Detection	Binyi Su et.al.	2601.22685	translate	read	null
2026-01-30	UniGeo: A Unified 3D Indoor Object Detection Framework Integrating Geometry-Aware Learning and Dynamic Channel Gating	Xing Yi et.al.	2601.22616	translate	read	null
2026-01-29	CORDS: Continuous Representations of Discrete Structures	Tin Hadži Veljković et.al.	2601.21583	translate	read	null
2026-01-29	Don’t double it: Efficient Agent Prediction in Occlusions	Anna Rothenhäusler et.al.	2601.21504	translate	read	null
2026-01-28	BadDet+: Robust Backdoor Attacks for Object Detection	Kealan Dunnett et.al.	2601.21066	translate	read	null
2026-01-27	On the Effectiveness of LLM-Specific Fine-Tuning for Detecting AI-Generated Text	Michał Gromadzki et.al.	2601.20006	translate	read	null
2026-01-27	VGGT-SLAM 2.0: Real-time Dense Feed-forward Scene Reconstruction	Dominic Maggio et.al.	2601.19887	translate	read	null
2026-01-27	Learned split-spectrum metalens for obstruction-free broadband imaging in the visible	Seungwoo Yoon et.al.	2601.19403	translate	read	null
2026-01-27	MIRAGE: Enabling Real-Time Automotive Mediated Reality	Pascal Jansen et.al.	2601.19385	translate	read	null
2026-01-27	Instance-Guided Radar Depth Estimation for 3D Object Detection	Chen-Chou Lo et.al.	2601.19314	translate	read	null
2026-01-27	Implicit Non-Causal Factors are Out via Dataset Splitting for Domain Generalization Object Detection	Zhilong Zhang et.al.	2601.19127	translate	read	null
2026-01-26	On the Role of Depth in Surgical Vision Foundation Models: An Empirical Study of RGB-D Pre-training	John J. Han et.al.	2601.18929	translate	read	null
2026-01-26	Dynamic Mask-Based Backdoor Attack Against Vision AI Models: A Case Study on Mushroom Detection	Zeineb Dridi et.al.	2601.18845	translate	read	null
2026-01-26	EFSI-DETR: Efficient Frequency-Semantic Integration for Real-Time Small Object Detection in UAV Imagery	Yu Xia et.al.	2601.18597	translate	read	null
2026-01-26	YOLO-DS: Fine-Grained Feature Decoupling via Dual-Statistic Synergy Operator for Object Detection	Lin Huang et.al.	2601.18172	translate	read	null
2026-01-26	Text-Pass Filter: An Efficient Scene Text Detector	Chuang Yang et.al.	2601.18098	translate	read	null
2026-01-23	Boundary and Position Information Mining for Aerial Small Object Detection	Rongxin Huang et.al.	2601.16617	translate	read	null
2026-01-23	Emotion-LLaMAv2 and MMEVerse: A New Framework and Benchmark for Multimodal Emotion Understanding	Xiaojiang Peng et.al.	2601.16449	translate	read	null
2026-01-22	The Latency Wall: Benchmarking Off-the-Shelf Emotion Recognition for Real-Time Virtual Avatars	Yarin Benyamin et.al.	2601.15914	translate	read	null
2026-01-22	Performance-guided Reinforced Active Learning for Object Detection	Zhixuan Liang et.al.	2601.15688	translate	read	null
2026-01-21	ZENITH: Automated Gradient Norm Informed Stochastic Optimization	Dhrubo Saha et.al.	2601.15212	translate	read	null
2026-01-21	Graph Recognition via Subgraph Prediction	André Eberhard et.al.	2601.15133	translate	read	null
2026-01-21	M2I2HA: A Multi-modal Object Detection Method Based on Intra- and Inter-Modal Hypergraph Attention	Xiaofan Yang et.al.	2601.14776	translate	read	null
2026-01-21	A comprehensive overview of deep learning models for object detection from videos/images	Sukana Zulfqar et.al.	2601.14677	translate	read	null
2026-01-20	GutenOCR: A Grounded Vision-Language Front-End for Documents	Hunter Heidenreich et.al.	2601.14490	translate	read	link
2026-01-20	Gaussian Based Adaptive Multi-Modal 3D Semantic Occupancy Prediction	A. Enes Doruk et.al.	2601.14448	translate	read	null
2026-01-20	DExTeR: Weakly Semi-Supervised Object Detection with Class and Instance Experts for Medical Imaging	Adrien Meyer et.al.	2601.13954	translate	read	null
2026-01-19	Leveraging Transformer Decoder for Automotive Radar Object Detection	Changxu Zhang et.al.	2601.13386	translate	read	null
2026-01-19	Practical Insights into Semi-Supervised Object Detection Approaches	Chaoxin Wang et.al.	2601.13380	translate	read	null
2026-01-19	Real-Time 4D Radar Perception for Robust Human Detection in Harsh Enclosed Environments	Zhenan Liu et.al.	2601.13364	translate	read	null
2026-01-19	AsyncBEV: Cross-modal Flow Alignment in Asynchronous 3D Object Detection	Shiming Wang et.al.	2601.12994	translate	read	null
2026-01-19	Membership Inference Test: Auditing Training Data in Object Classification Models	Gonzalo Mancera et.al.	2601.12929	translate	read	null
2026-01-19	YOLO26: An Analysis of NMS-Free End to End Framework for Real-Time Object Detection	Sudip Chakrabarty et.al.	2601.12882	translate	read	null
2026-01-19	Towards Unbiased Source-Free Object Detection via Vision Foundation Models	Zhi Cai et.al.	2601.12765	translate	read	null
2026-01-19	RSOD: Reliability-Guided Sonar Image Object Detection with Extremely Limited Labels	Chengzhou Li et.al.	2601.12715	translate	read	null
2026-01-19	BlocksecRT-DETR: Decentralized Privacy-Preserving and Token-Efficient Federated Transformer Learning for Secure Real-Time Object Detection in ITS	Mohoshin Ara Tahera et.al.	2601.12693	translate	read	null
2026-01-19	Mixed Precision PointPillars for Efficient 3D Object Detection with TensorRT	Ninnart Fuengfusin et.al.	2601.12638	translate	read	null
2026-01-15	SecMLOps: A Comprehensive Framework for Integrating Security Throughout the MLOps Lifecycle	Xinrui Zhang et.al.	2601.10848	translate	read	null
2026-01-15	Beyond Single Prompts: Synergistic Fusion and Arrangement for VICL	Wenwen Liao et.al.	2601.10117	translate	read	null
2026-01-15	Enhancing Visual In-Context Learning by Multi-Faceted Fusion	Wenwen Liao et.al.	2601.10107	translate	read	null
2026-01-14	LCF3D: A Robust and Real-Time Late-Cascade Fusion Framework for 3D Object Detection in Autonomous Driving	Carlo Sgaravatti et.al.	2601.09812	translate	read	link
2026-01-14	AquaFeat+: an Underwater Vision Learning-based Enhancement Method for Object Detection, Classification, and Tracking	Emanuel da Costa Silva et.al.	2601.09652	translate	read	null
2026-01-14	Towards Robust Cross-Dataset Object Detection Generalization under Domain Specificity	Ritabrata Chakraborty et.al.	2601.09497	translate	read	link
2026-01-14	DeTracker: Motion-decoupled Vehicle Detection and Tracking in Unstabilized Satellite Videos	Jiajun Chen et.al.	2601.09240	translate	read	null
2026-01-14	Disentangle Object and Non-object Infrared Features via Language Guidance	Fan Liu et.al.	2601.09228	translate	read	null
2026-01-13	DentalX: Context-Aware Dental Disease Detection with Radiographs	Zhi Qin Tan et.al.	2601.08797	translate	read	link
2026-01-13	WaveFormer: Frequency-Time Decoupled Vision Modeling with Wave Equation	Zishan Shu et.al.	2601.08602	translate	read	null
2026-01-13	Edge-Optimized Multimodal Learning for UAV Video Understanding via BLIP-2	Yizhan Feng et.al.	2601.08408	translate	read	null
2026-01-13	Human-inspired Global-to-Parallel Multi-scale Encoding for Lightweight Vision Models	Wei Xu et.al.	2601.08190	translate	read	null
2026-01-13	Towards Cross-Platform Generalization: Domain Adaptive 3D Detection with Augmentation and Pseudo-Labeling	Xiyan Feng et.al.	2601.08174	translate	read	null
2026-01-13	Representation Learning with Semantic-aware Instance and Sparse Token Alignments	Phuoc-Nguyen Bui et.al.	2601.08165	translate	read	null
2026-01-13	From Prompts to Deployment: Auto-Curated Domain-Specific Dataset Generation via Diffusion Models	Dongsik Yoon et.al.	2601.08095	translate	read	null
2026-01-12	Integrating Attendance Tracking and Emotion Detection for Enhanced Student Engagement in Smart Classrooms	Keith Ainebyona et.al.	2601.08049	translate	read	null
2026-01-06	Edge-AI Perception Node for Cooperative Road-Safety Enforcement and Connected-Vehicle Integration	Shree Charran R et.al.	2601.07845	translate	read	null
2026-01-12	GenDet: Painting Colored Bounding Boxes on Images via Diffusion Model for Object Detection	Chen Min et.al.	2601.07273	translate	read	null
2026-01-12	SC-MII: Infrastructure LiDAR-based 3D Object Detection on Edge Devices for Split Computing with Multiple Intermediate Outputs Integration	Taisuke Noguchi et.al.	2601.07119	translate	read	null
2026-01-11	Billboard in Focus: Estimating Driver Gaze Duration from a Single Image	Carlos Pizarroso et.al.	2601.07073	translate	read	null
2026-01-08	STResNet & STYOLO : A New Family of Compact Classification and Object Detection Models for MCUs	Sudhakar Sah et.al.	2601.05364	translate	read	null
2026-01-08	UniLiPs: Unified LiDAR Pseudo-Labeling with Geometry-Grounded Dynamic Scene Decomposition	Filippo Ghilotti et.al.	2601.05105	translate	read	null
2026-01-08	Character Detection using YOLO for Writer Identification in multiple Medieval books	Alessandra Scotto di Freca et.al.	2601.04834	translate	read	null
2026-01-08	When AI Settles Down: Late-Stage Stability as a Signature of AI-Generated Text Detection	Ke Sun et.al.	2601.04833	translate	read	null
2026-01-08	Optimization of Deep Learning Models for Radio Galaxy Classification	Philipp Denzel et.al.	2601.04773	translate	read	null
2026-01-08	DP-MGTD: Privacy-Preserving Machine-Generated Text Detection via Adaptive Differentially Private Entity Sanitization	Lionel Z. Wang et.al.	2601.04641	translate	read	null
2026-01-07	Few-Shot LoRA Adaptation of a Flow-Matching Foundation Model for Cross-Spectral Object Detection	Maxim Clouser et.al.	2601.04381	translate	read	null
2026-01-07	Correcting Autonomous Driving Object Detection Misclassifications with Automated Commonsense Reasoning	Keegan Kimbrell et.al.	2601.04271	translate	read	null
2026-01-07	AI Generated Text Detection	Adilkhan Alikhanov et.al.	2601.03812	translate	read	null
2026-01-07	A Comparative Study of 3D Model Acquisition Methods for Synthetic Data Generation of Agricultural Products	Steven Moonen et.al.	2601.03784	translate	read	null
2026-01-07	HyperCOD: The First Challenging Benchmark and Baseline for Hyperspectral Camouflaged Object Detection	Shuyan Bai et.al.	2601.03736	translate	read	null
2026-01-07	Systematic Evaluation of Depth Backbones and Semantic Cues for Monocular Pseudo-LiDAR 3D Detection	Samson Oseiwe Ajadalu et.al.	2601.03617	translate	read	null
2026-01-07	Physics-Constrained Cross-Resolution Enhancement Network for Optics-Guided Thermal UAV Image Super-Resolution	Zhicheng Zhao et.al.	2601.03526	translate	read	null
2026-01-06	CageDroneRF: A Large-Scale RF Benchmark and Toolkit for Drone Perception	Mohammad Rostami et.al.	2601.03302	translate	read	null
2026-01-06	Motion Blur Robust Wheat Pest Damage Detection with Dynamic Fuzzy Feature Fusion	Han Zhang et.al.	2601.03046	translate	read	null
2026-01-06	Towards Efficient 3D Object Detection for Vehicle-Infrastructure Collaboration via Risk-Intent Selection	Li Wang et.al.	2601.03001	translate	read	null
2026-01-06	DGA-Net: Enhancing SAM with Depth Prompting and Graph-Anchor Guidance for Camouflaged Object Detection	Yuetong Li et.al.	2601.02831	translate	read	null
2026-01-06	D $^3$ R-DETR: DETR with Dual-Domain Density Refinement for Tiny Object Detection in Aerial Images	Zixiao Wen et.al.	2601.02747	translate	read	null
2026-01-05	SortWaste: A Densely Annotated Dataset for Object Detection in Industrial Waste Sorting	Sara Inácio et.al.	2601.02299	translate	read	null
2026-01-05	SLGNet: Synergizing Structural Priors and Language-Guided Modulation for Multimodal Object Detection	Xiantai Xiang et.al.	2601.02249	translate	read	null
2026-01-05	Enhancing Object Detection with Privileged Information: A Model-Agnostic Teacher-Student Approach	Matthias Bartolo et.al.	2601.02016	translate	read	link
2026-01-05	Point-SRA: Self-Representation Alignment for 3D Representation Learning	Lintong Wei et.al.	2601.01746	translate	read	null
2026-01-05	An AI-guided mechanotyping instrument for fully automated oocyte quality assessment	Yining Guo et.al.	2601.01728	translate	read	null
2026-01-04	Learnability-Driven Submodular Optimization for Active Roadside 3D Detection	Ruiyu Mao et.al.	2601.01695	translate	read	null
2026-01-04	Optically Transparent Meta-Grating Embedded in Rear Windshields for Automotive Radar Detection	Sergey Geyman et.al.	2601.01551	translate	read	null
2026-01-04	Robust Ship Detection and Tracking Using Modified ViBe and Backwash Cancellation Algorithm	Mohammad Hassan Saghafi et.al.	2601.01481	translate	read	null
2026-01-04	Evaluation of Convolutional Neural Network For Image Classification with Agricultural and Urban Datasets	Shamik Shafkat Avro et.al.	2601.01393	translate	read	null
2026-01-03	RFAssigner: A Generic Label Assignment Strategy for Dense Object Detection	Ziqian Guan et.al.	2601.01240	translate	read	null
2026-01-03	GenCAMO: Scene-Graph Contextual Decoupling for Environment-aware and Mask-free Camouflage Image-Dense Annotation Generation	Chenglizhao Chen et.al.	2601.01181	translate	read	null
2026-01-03	Evolving CNN Architectures: From Custom Designs to Deep Residual Models for Diverse Image Classification and Detection Tasks	Mahmudul Hasan et.al.	2601.01099	translate	read	null
2026-01-03	Mono3DV: Monocular 3D Object Detection with 3D-Aware Bipartite Matching and Variational Query DeNoising	Kiet Dang Vu et.al.	2601.01036	translate	read	null
2026-01-02	Noise-Robust Tiny Object Localization with Flows	Huixin Sun et.al.	2601.00617	translate	read	null
2026-01-01	RoLID-11K: A Dashcam Dataset for Small-Object Roadside Litter Detection	Tao Wu et.al.	2601.00398	translate	read	null
2026-01-01	Intelligent Traffic Surveillance for Real-Time Vehicle Detection, License Plate Recognition, and Speed Estimation	Bruce Mugizi et.al.	2601.00344	translate	read	null

(<a href=../Object_Detection.md>back to Object Detection</a>)