Object Detection - 2025-11
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2025-11-27 | Semi-Supervised Contrastive Learning with Orthonormal Prototypes | Huanran Li et.al. | 2512.07880 | translate | read | null |
| 2025-11-30 | Autonomous Grasping On Quadruped Robot With Task Level Interaction | Muhtadin et.al. | 2512.01052 | translate | read | null |
| 2025-11-30 | Med-CMR: A Fine-Grained Benchmark Integrating Visual Evidence and Clinical Logic for Medical Complex Multimodal Reasoning | Haozhen Gong et.al. | 2512.00818 | translate | read | null |
| 2025-11-30 | DEJIMA: A Novel Large-scale Japanese Dataset for Image Captioning and Visual Question Answering | Toshiki Katsube et.al. | 2512.00773 | translate | read | null |
| 2025-11-29 | MM-DETR: An Efficient Multimodal Detection Transformer with Mamba-Driven Dual-Granularity Fusion and Frequency-Aware Modality Adapters | Jianhong Han et.al. | 2512.00363 | translate | read | null |
| 2025-11-28 | Hybrid Synthetic Data Generation with Domain Randomization Enables Zero-Shot Vision-Based Part Inspection Under Extreme Class Imbalance | Ruo-Syuan Mei et.al. | 2512.00125 | translate | read | null |
| 2025-11-25 | Diffusion-Based Synthetic Brightfield Microscopy Images for Enhanced Single Cell Detection | Mario de Jesus da Graca et.al. | 2512.00078 | translate | read | null |
| 2025-11-24 | ProvRain: Rain-Adaptive Denoising and Vehicle Detection via MobileNet-UNet and Faster R-CNN | Aswinkumar Varathakumaran et.al. | 2512.00073 | translate | read | null |
| 2025-11-23 | PEFT-DML: Parameter-Efficient Fine-Tuning Deep Metric Learning for Robust Multi-Modal 3D Object Detection in Autonomous Driving | Abdolazim Rezaei et.al. | 2512.00060 | translate | read | null |
| 2025-11-28 | Object-Centric Data Synthesis for Category-level Object Detection | Vikhyat Agarwal et.al. | 2511.23450 | translate | read | null |
| 2025-11-28 | Toward Automatic Safe Driving Instruction: A Large-Scale Vision Language Model Approach | Haruki Sakajo et.al. | 2511.23311 | translate | read | null |
| 2025-11-28 | Synthetic Industrial Object Detection: GenAI vs. Feature-Based Methods | Jose Moises Araya-Martinez et.al. | 2511.23241 | translate | read | null |
| 2025-11-28 | Zero-Shot Multi-Criteria Visual Quality Inspection for Semi-Controlled Industrial Environments via Real-Time 3D Digital Twin Simulation | Jose Moises Araya-Martinez et.al. | 2511.23214 | translate | read | null |
| 2025-11-28 | Bharat Scene Text: A Novel Comprehensive Dataset and Benchmark for Indian Language Scene Text Understanding | Anik De et.al. | 2511.23071 | translate | read | null |
| 2025-11-28 | Barcode and QR Code Object Detection: An Experimental Study on YOLOv8 Models | Kushagra Pandya et.al. | 2511.22937 | translate | read | null |
| 2025-11-28 | DM$^3$T: Harmonizing Modalities via Diffusion for Multi-Object Tracking | Weiran Li et.al. | 2511.22896 | translate | read | null |
| 2025-11-27 | DocVAL: Validated Chain-of-Thought Distillation for Grounded Document VQA | Ahmad Mohammadshirazi et.al. | 2511.22521 | translate | read | null |
| 2025-11-27 | Small Object Detection for Birds with Swin Transformer | Da Huo et.al. | 2511.22310 | translate | read | null |
| 2025-11-27 | Simplex-Optimized Hybrid Ensemble for Large Language Model Text Detection Under Generative Distribution Drift | Sepyan Purnama Kristanto et.al. | 2511.22153 | translate | read | null |
| 2025-11-27 | Bistatic Passive Tracking via CSI Power | Zhongqin Wang et.al. | 2511.22144 | translate | read | null |
| 2025-11-27 | SemOD: Semantic Enabled Object Detection Network under Various Weather Conditions | Aiyinsi Zuo et.al. | 2511.22142 | translate | read | null |
| 2025-11-27 | PAGen: Phase-guided Amplitude Generation for Domain-adaptive Object Detection | Shuchen Du et.al. | 2511.22029 | translate | read | null |
| 2025-11-22 | A Lightweight Approach to Detection of AI-Generated Texts Using Stylometric Features | Sergey K. Aityan et.al. | 2511.21744 | translate | read | null |
| 2025-11-26 | Continual Error Correction on Low-Resource Devices | Kirill Paramonov et.al. | 2511.21652 | translate | read | null |
| 2025-11-26 | CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation | Shizhe Sun et.al. | 2511.21503 | translate | read | null |
| 2025-11-26 | Co-Training Vision Language Models for Remote Sensing Multi-task Learning | Qingyun Li et.al. | 2511.21272 | translate | read | null |
| 2025-11-26 | OVOD-Agent: A Markov-Bandit Framework for Proactive Visual Reasoning and Self-Evolving Detection | Chujie Wang et.al. | 2511.21064 | translate | read | null |
| 2025-11-26 | AerialMind: Towards Referring Multi-Object Tracking in UAV Scenarios | Chenglizhao Chen et.al. | 2511.21053 | translate | read | null |
| 2025-11-26 | Wavefront-Constrained Passive Obscured Object Detection | Zhiwen Zheng et.al. | 2511.20991 | translate | read | null |
| 2025-11-26 | RefOnce: Distilling References into a Prototype Memory for Referring Camouflaged Object Detection | Yu-Huan Wu et.al. | 2511.20989 | translate | read | null |
| 2025-11-25 | Video Object Recognition in Mobile Edge Networks: Local Tracking or Edge Detection? | Kun Guo et.al. | 2511.20716 | translate | read | null |
| 2025-11-25 | MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities | Tooba Tehreem Sheikh et.al. | 2511.20650 | translate | read | null |
| 2025-11-25 | Zoo3D: Zero-Shot 3D Object Detection at Scene Level | Andrey Lemeshko et.al. | 2511.20253 | translate | read | null |
| 2025-11-25 | Intelligent Image Search Algorithms Fusing Visual Large Models | Kehan Wang et.al. | 2511.19920 | translate | read | null |
| 2025-11-24 | Maritime Small Object Detection from UAVs using Deep Learning with Altitude-Aware Dynamic Tiling | Sakib Ahmed et.al. | 2511.19728 | translate | read | null |
| 2025-11-24 | Studying Maps at Scale: A Digital Investigation of Cartography and the Evolution of Figuration | Remi Petitpierre et.al. | 2511.19538 | translate | read | null |
| 2025-11-24 | SAM3-Adapter: Efficient Adaptation of Segment Anything 3 for Camouflage Object Segmentation, Shadow Detection, and Medical Image Segmentation | Tianrun Chen et.al. | 2511.19425 | translate | read | null |
| 2025-11-24 | IDEAL-M3D: Instance Diversity-Enriched Active Learning for Monocular 3D Detection | Johannes Meier et.al. | 2511.19301 | translate | read | null |
| 2025-11-24 | SpectraNet: FFT-assisted Deep Learning Classifier for Deepfake Face Detection | Nithira Jayarathne et.al. | 2511.19187 | translate | read | null |
| 2025-11-24 | MambaRefine-YOLO: A Dual-Modality Small Object Detector for UAV Imagery | Shuyu Cao et.al. | 2511.19134 | translate | read | null |
| 2025-11-24 | 3M-TI: High-Quality Mobile Thermal Imaging via Calibration-free Multi-Camera Cross-Modal Diffusion | Minchong Chen et.al. | 2511.19117 | translate | read | null |
| 2025-11-24 | LLMAID: Identifying AI Capabilities in Android Apps with LLMs | Pei Liu et.al. | 2511.19059 | translate | read | null |
| 2025-11-24 | LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space | Hai Wu et.al. | 2511.19057 | translate | read | null |
| 2025-11-24 | Enhancing Fast Radio Transient Detection with Mask R-CNN Image Segmentation | Sergio Belmonte Diaz et.al. | 2511.19014 | translate | read | null |
| 2025-11-24 | Peregrine: One-Shot Fine-Tuning for FHE Inference of General Deep CNNs | Huaming Ling et.al. | 2511.18976 | translate | read | null |
| 2025-11-24 | DualGazeNet: A Biologically Inspired Dual-Gaze Query Network for Salient Object Detection | Yu Zhang et.al. | 2511.18865 | translate | read | null |
| 2025-11-24 | DetAny4D: Detect Anything 4D Temporally in a Streaming RGB Video | Jiawei Hou et.al. | 2511.18814 | translate | read | null |
| 2025-11-24 | StereoDETR: Stereo-based Transformer for 3D Object Detection | Shiyi Mu et.al. | 2511.18788 | translate | read | null |
| 2025-11-24 | DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous Driving | Hongbin Lin et.al. | 2511.18713 | translate | read | null |
| 2025-11-24 | Dendritic Convolution for Noise Image Recognition | Jiarui Xue et.al. | 2511.18699 | translate | read | null |
| 2025-11-24 | Multimodal Real-Time Anomaly Detection and Industrial Applications | Aman Verma et.al. | 2511.18698 | translate | read | null |
| 2025-11-24 | Exploring Surround-View Fisheye Camera 3D Object Detection | Changcai Li et.al. | 2511.18695 | translate | read | null |
| 2025-11-23 | UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization | Siyi Li et.al. | 2511.18254 | translate | read | null |
| 2025-11-22 | VK-Det: Visual Knowledge Guided Prototype Learning for Open-Vocabulary Aerial Object Detection | Jianhang Yao et.al. | 2511.18075 | translate | read | null |
| 2025-11-22 | Diverse Instance Generation via Diffusion Models for Enhanced Few-Shot Object Detection in Remote Sensing Images | Yanxing Liu et.al. | 2511.18031 | translate | read | null |
| 2025-11-22 | State and Scene Enhanced Prototypes for Weakly Supervised Open-Vocabulary Object Detection | Jiaying Zhou et.al. | 2511.18012 | translate | read | null |
| 2025-11-21 | REXO: Indoor Multi-View Radar Object Detection via 3D Bounding Box Diffusion | Ryoma Yataka et.al. | 2511.17806 | translate | read | null |
| 2025-11-21 | PUCP-Metrix: An Open-source and Comprehensive Toolkit for Linguistic Analysis of Spanish Texts | Javier Alonso Villegas Luis et.al. | 2511.17402 | translate | read | null |
| 2025-11-21 | A lightweight detector for real-time detection of remote sensing images | Qianyi Wang et.al. | 2511.17147 | translate | read | null |
| 2025-11-21 | OmniPT: Unleashing the Potential of Large Vision Language Models for Pedestrian Tracking and Understanding | Teng Fu et.al. | 2511.17053 | translate | read | null |
| 2025-11-20 | Integrating Deep Learning and Spatial Statistics in Marine Ecosystem Monitoring | Gian Mario Sangiovanni et.al. | 2511.16447 | translate | read | null |
| 2025-11-20 | StreetView-Waste: A Multi-Task Dataset for Urban Waste Management | Diogo J. Paulo et.al. | 2511.16440 | translate | read | null |
| 2025-11-04 | In-Context Adaptation of VLMs for Few-Shot Cell Detection in Optical Microscopy | Shreyan Ganguly et.al. | 2511.05565 | translate | read | null |
| 2025-11-03 | Compressing Multi-Task Model for Autonomous Driving via Pruning and Knowledge Distillation | Jiayuan Wang et.al. | 2511.05557 | translate | read | null |
| 2025-11-06 | NovisVQ: A Streaming Convolutional Neural Network for No-Reference Opinion-Unaware Frame Quality Assessment | Kylie Cancilla et.al. | 2511.04628 | translate | read | null |
| 2025-11-06 | Evaluating the Impact of Weather-Induced Sensor Occlusion on BEVFusion for 3D Object Detection | Sanjay Kumar et.al. | 2511.04347 | translate | read | null |
| 2025-11-06 | Comparative Study of CNN Architectures for Binary Classification of Horses and Motorcycles in the VOC 2008 Dataset | Muhammad Annas Shaikh et.al. | 2511.04344 | translate | read | null |
| 2025-11-06 | Deep learning-based object detection of offshore platforms on Sentinel-1 Imagery and the impact of synthetic training data | Robin Spanier et.al. | 2511.04304 | translate | read | null |
| 2025-11-06 | DMSORT: An efficient parallel maritime multi-object tracking architecture for unmanned vessel platforms | Shengyu Tang et.al. | 2511.04128 | translate | read | null |
| 2025-11-05 | Desert Waste Detection and Classification Using Data-Based and Model-Based Enhanced YOLOv12 DL Model | Abdulmumin Sa’ad et.al. | 2511.03888 | translate | read | null |
| 2025-11-05 | ISC-Perception: A Hybrid Computer Vision Dataset for Object Detection in Novel Steel Assembly | Miftahur Rahman et.al. | 2511.03098 | translate | read | null |
| 2025-11-05 | A Computer Vision Based Proxy for Political Polarization in Religious Countries: A Turkiye Case Study | Liangze Ke et.al. | 2511.03088 | translate | read | null |
| 2025-11-04 | Diffusion Models are Robust Pretrainers | Mika Yagoda et.al. | 2511.02793 | translate | read | null |
| 2025-11-04 | DetectiumFire: A Comprehensive Multi-modal Dataset Bridging Vision and Language for Fire Understanding | Zixuan Liu et.al. | 2511.02495 | translate | read | null |
| 2025-11-04 | Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization | Tao Liu et.al. | 2511.02489 | translate | read | null |
| 2025-11-04 | Facial Expression Recognition System Using DNN Accelerator with Multi-threading on FPGA | Takuto Ando et.al. | 2511.02408 | translate | read | null |
| 2025-11-04 | 3D Point Cloud Object Detection on Edge Devices for Split Computing | Taisuke Noguchi et.al. | 2511.02293 | translate | read | null |
| 2025-11-04 | Autobiasing Event Cameras for Flickering Mitigation | Mehdi Sefidgar Dilmaghani et.al. | 2511.02180 | translate | read | null |
| 2025-11-03 | UniLION: Towards Unified Autonomous Driving Model with Linear Group RNNs | Zhe Liu et.al. | 2511.01768 | translate | read | null |
| 2025-11-03 | CGF-DETR: Cross-Gated Fusion DETR for Enhanced Pneumonia Detection in Chest X-rays | Yefeng Wu et.al. | 2511.01730 | translate | read | null |
| 2025-11-03 | Contrast-Guided Cross-Modal Distillation for Thermal Object Detection | SiWoo Kim et.al. | 2511.01435 | translate | read | null |
| 2025-11-03 | Eyes on Target: Gaze-Aware Object Detection in Egocentric Video | Vishakha Lall et.al. | 2511.01237 | translate | read | null |
| 2025-11-03 | DEER: Disentangled Mixture of Experts with Instance-Adaptive Routing for Generalizable Machine-Generated Text Detection | Guoxin Ma et.al. | 2511.01192 | translate | read | null |
| 2025-11-02 | Advancing Machine-Generated Text Detection from an Easy to Hard Supervision Perspective | Chenwang Wu et.al. | 2511.00988 | translate | read | null |
| 2025-11-02 | A Hybrid YOLOv5-SSD IoT-Based Animal Detection System for Durian Plantation Protection | Anis Suttan Shahrir et.al. | 2511.00777 | translate | read | null |