Object Detection - 2026-02
Object Detection - 2026-02
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2026-02-28 | TP-Spikformer: Token Pruned Spiking Transformer | Wenjie Wei et.al. | 2603.00527 | translate | read | null |
| 2026-02-28 | Random Wins All: Rethinking Grouping Strategies for Vision Tokens | Qihang Fan et.al. | 2603.00486 | translate | read | null |
| 2026-02-26 | Sensor Generalization for Adaptive Sensing in Event-based Object Detection via Joint Distribution Training | Aheli Saha et.al. | 2602.23357 | translate | read | null |
| 2026-02-26 | Through BrokenEyes: How Eye Disorders Impact Face Detection? | Prottay Kumar Adhikary et.al. | 2602.23212 | translate | read | null |
| 2026-02-26 | Locally Adaptive Decay Surfaces for High-Speed Face and Landmark Detection with Event Cameras | Paul Kielty et.al. | 2602.23101 | translate | read | null |
| 2026-02-26 | D-FINE-seg: Object Detection and Instance Segmentation Framework with multi-backend deployment | Argo Saakyan et.al. | 2602.23043 | translate | read | null |
| 2026-02-26 | Small Object Detection Model with Spatial Laplacian Pyramid Attention and Multi-Scale Features Enhancement in Aerial Images | Zhangjian Ji et.al. | 2602.23031 | translate | read | null |
| 2026-02-26 | WaterVideoQA: ASV-Centric Perception and Rule-Compliant Reasoning via Multi-Modal Agents | Runwei Guan et.al. | 2602.22923 | translate | read | null |
| 2026-02-26 | UFO-DETR: Frequency-Guided End-to-End Detector for UAV Tiny Objects | Yuankai Chen et.al. | 2602.22712 | translate | read | null |
| 2026-02-26 | SUPERGLASSES: Benchmarking Vision Language Models as Intelligent Agents for AI Smart Glasses | Zhuohang Jiang et.al. | 2602.22683 | translate | read | null |
| 2026-02-26 | SPMamba-YOLO: An Underwater Object Detection Network Based on Multi-Scale Feature Enhancement and Global Context Modeling | Guanghao Liao et.al. | 2602.22674 | translate | read | null |
| 2026-02-26 | CGSA: Class-Guided Slot-Aware Adaptation for Source-Free Object Detection | Boyang Dai et.al. | 2602.22621 | translate | read | null |
| 2026-02-26 | Don’t let the information slip away | Taozhe Li et.al. | 2602.22595 | translate | read | null |
| 2026-02-25 | Unified Unsupervised and Sparsely-Supervised 3D Object Detection by Semantic Pseudo-Labeling and Prototype Learning | Yushen He et.al. | 2602.21484 | translate | read | null |
| 2026-02-24 | Le-DETR: Revisiting Real-Time Detection Transformer with Efficient Encoder Design | Jiannan Huang et.al. | 2602.21010 | translate | read | null |
| 2026-02-24 | EW-DETR: Evolving World Object Detection via Incremental Low-Rank DEtection TRansformer | Munish Monga et.al. | 2602.20985 | translate | read | null |
| 2026-02-24 | FLIM Networks with Bag of Feature Points | João Deltregia Martinelli et.al. | 2602.20845 | translate | read | null |
| 2026-02-24 | SD4R: Sparse-to-Dense Learning for 3D Object Detection with 4D Radar | Xiaokai Bai et.al. | 2602.20653 | translate | read | null |
| 2026-02-24 | Boosting Instance Awareness via Cross-View Correlation with 4D Radar and Camera for 3D Object Detection | Xiaokai Bai et.al. | 2602.20632 | translate | read | null |
| 2026-02-24 | Object-Scene-Camera Decomposition and Recomposition for Data-Efficient Monocular 3D Object Detection | Zhaonian Kuang et.al. | 2602.20627 | translate | read | null |
| 2026-02-24 | Knowing the Unknown: Interpretable Open-World Object Detection via Concept Decomposition Model | Xueqiang Lv et.al. | 2602.20616 | translate | read | null |
| 2026-02-23 | An Approach to Combining Video and Speech with Large Language Models in Human-Robot Interaction | Guanting Shen et.al. | 2602.20219 | translate | read | null |
| 2026-02-23 | RADE-Net: Robust Attention Network for Radar-Only Object Detection in Adverse Weather | Christof Leitgeb et.al. | 2602.19994 | translate | read | null |
| 2026-02-23 | TextShield-R1: Reinforced Reasoning for Tampered Text Detection | Chenfan Qu et.al. | 2602.19828 | translate | read | null |
| 2026-02-23 | Iconographic Classification and Content-Based Recommendation for Digitized Artworks | Krzysztof Kutt et.al. | 2602.19698 | translate | read | null |
| 2026-02-23 | Fore-Mamba3D: Mamba-based Foreground-Enhanced Encoding for 3D Object Detection | Zhiwei Ning et.al. | 2602.19536 | translate | read | null |
| 2026-02-23 | A Text-Guided Vision Model for Enhanced Recognition of Small Instances | Hyun-Ki Jung et.al. | 2602.19503 | translate | read | null |
| 2026-02-23 | Hiding in Plain Text: Detecting Concealed Jailbreaks via Activation Disentanglement | Amirhossein Farzam et.al. | 2602.19396 | translate | read | null |
| 2026-02-22 | CORVET: A CORDIC-Powered, Resource-Frugal Mixed-Precision Vector Processing Engine for High-Throughput AIoT applications | Sonu Kumar et.al. | 2602.19268 | translate | read | null |
| 2026-02-21 | Learning Multi-Modal Prototypes for Cross-Domain Few-Shot Object Detection | Wanqi Wang et.al. | 2602.18811 | translate | read | null |
| 2026-02-20 | BloomNet: Exploring Single vs. Multiple Object Annotation for Flower Recognition Using YOLO Variants | Safwat Nusrat et.al. | 2602.18585 | translate | read | null |
| 2026-02-20 | Do Generative Metrics Predict YOLO Performance? An Evaluation Across Models, Augmentation Ratios, and Dataset Complexity | Vasile Marian et.al. | 2602.18525 | translate | read | null |
| 2026-02-20 | Self-Aware Object Detection via Degradation Manifolds | Stefan Becker et.al. | 2602.18394 | translate | read | null |
| 2026-02-19 | Detection and Classification of Cetacean Echolocation Clicks using Image-based Object Detection Methods applied to Advanced Wavelet-based Transformations | Christopher Hauer et.al. | 2602.17749 | translate | read | null |
| 2026-02-19 | A Cost-Effective and Climate-Resilient Air Pressure System for Rain Effect Reduction on Automated Vehicle Cameras | Mohamed Sabry et.al. | 2602.17472 | translate | read | null |
| 2026-02-19 | A Multi-modal Detection System for Infrastructure-based Freight Signal Priority | Ziyan Zhang et.al. | 2602.17252 | translate | read | null |
| 2026-02-18 | Benchmarking Adversarial Robustness and Adversarial Training Strategies for Object Detection | Alexis Winter et.al. | 2602.16494 | translate | read | null |
| 2026-02-18 | How Reliable is Your Service at the Extreme Edge? Analytical Modeling of Computational Reliability | MHD Saria Allahham et.al. | 2602.16362 | translate | read | null |
| 2026-02-18 | A Self-Supervised Approach for Enhanced Feature Representations in Object Detection Tasks | Santiago C. Vilabella et.al. | 2602.16322 | translate | read | null |
| 2026-02-17 | A Study on Real-time Object Detection using Deep Learning | Ankita Bose et.al. | 2602.15926 | translate | read | null |
| 2026-02-17 | ToaSt: Token Channel Selection and Structured Pruning for Efficient ViT | Hyunchan Moon et.al. | 2602.15720 | translate | read | null |
| 2026-02-17 | DependencyAI: Detecting AI Generated Text through Dependency Parsing | Sara Ahmed et.al. | 2602.15514 | translate | read | null |
| 2026-02-16 | Synthesizing Trajectory Queries from Examples | Stephen Mell et.al. | 2602.15164 | translate | read | null |
| 2026-02-16 | Zero-shot HOI Detection with MLLM-based Detector-agnostic Interaction Recognition | Shiyu Xuan et.al. | 2602.15124 | translate | read | null |
| 2026-02-15 | Explainability-Inspired Layer-Wise Pruning of Deep Neural Networks for Efficient Object Detection | Abhinav Shukla et.al. | 2602.14040 | translate | read | null |
| 2026-02-15 | RoboAug: One Annotation to Hundreds of Scenes via Region-Contrastive Data Augmentation for Robotic Manipulation | Xinhua Wang et.al. | 2602.14032 | translate | read | null |
| 2026-02-14 | Explore Intrinsic Geometry for Query-based Tiny and Oriented Object Detector with Momentum-based Bipartite Matching | Junpeng Zhang et.al. | 2602.13728 | translate | read | null |
| 2026-02-14 | Fine-tuned Vision Language Model for Localization of Parasitic Eggs in Microscopic Images | Chan Hao Sien et.al. | 2602.13712 | translate | read | null |
| 2026-02-13 | LAF-YOLOv10 with Partial Convolution Backbone, Attention-Guided Feature Pyramid, Auxiliary P2 Head, and Wise-IoU Loss for Small Object Detection in Drone Aerial Imagery | Sohail Ali Farooqui et.al. | 2602.13378 | translate | read | null |
| 2026-02-13 | Robustness of Object Detection of Autonomous Vehicles in Adverse Weather Conditions | Fox Pettersen et.al. | 2602.12902 | translate | read | null |
| 2026-02-13 | PISHYAR: A Socially Intelligent Smart Cane for Indoor Social Navigation and Multimodal Human-Robot Interaction for Visually Impaired People | Mahdi Haghighat Joo et.al. | 2602.12597 | translate | read | null |
| 2026-02-12 | Learning to Manipulate Anything: Revealing Data Scaling Laws in Bounding-Box Guided Policies | Yihao Wu et.al. | 2602.11885 | translate | read | null |
| 2026-02-12 | DMAP: A Distribution Map for Text | Tom Kempton et.al. | 2602.11871 | translate | read | null |
| 2026-02-12 | HyperDet: 3D Object Detection with Hyper 4D Radar Point Clouds | Yichun Xiao et.al. | 2602.11554 | translate | read | null |
| 2026-02-11 | Chain-of-Look Spatial Reasoning for Dense Surgical Instrument Counting | Rishikesh Bhyri et.al. | 2602.11024 | translate | read | null |
| 2026-02-11 | FGAA-FPN: Foreground-Guided Angle-Aware Feature Pyramid Network for Oriented Object Detection | Jialin Ma et.al. | 2602.10710 | translate | read | null |
| 2026-02-11 | AurigaNet: A Real-Time Multi-Task Network for Enhanced Urban Driving Perception | Kiarash Ghasemzadeh et.al. | 2602.10660 | translate | read | null |
| 2026-02-11 | Fast Person Detection Using YOLOX With AI Accelerator For Train Station Safety | Mas Nurul Achmadiah et.al. | 2602.10593 | translate | read | null |
| 2026-02-11 | 1%>100%: High-Efficiency Visual Adapter with Complex Linear Projection Optimization | Dongshuo Yin et.al. | 2602.10513 | translate | read | null |
| 2026-02-10 | Conformal Prediction Sets for Instance Segmentation | Kerri Lu et.al. | 2602.10045 | translate | read | null |
| 2026-02-10 | Learning to Detect Baked Goods with Limited Supervision | Thomas H. Schmitt et.al. | 2602.09979 | translate | read | null |
| 2026-02-10 | Energy-Efficient Fast Object Detection on Edge Devices for IoT Systems | Mas Nurul Achmadiah et.al. | 2602.09515 | translate | read | null |
| 2026-02-09 | Long distance quantum illumination and ranging using polarization entangled photon pairs in a lossy environment | Sujai Matta et.al. | 2602.08947 | translate | read | null |
| 2026-02-09 | StealthRL: Reinforcement Learning Paraphrase Attacks for Multi-Detector Evasion of AI-Text Detectors | Suraj Ranganath et.al. | 2602.08934 | translate | read | link |
| 2026-02-09 | UAV-Supported Maritime Search System: Experience from Valun Bay Field Trials | Stefan Ivić et.al. | 2602.08450 | translate | read | null |
| 2026-02-08 | MambaFusion: Adaptive State-Space Fusion for Multimodal 3D Object Detection | Venkatraman Narayanan et.al. | 2602.08126 | translate | read | null |
| 2026-02-08 | Beyond Raw Detection Scores: Markov-Informed Calibration for Boosting Machine-Generated Text Detection | Chenwang Wu et.al. | 2602.08031 | translate | read | null |
| 2026-02-08 | Open-Text Aerial Detection: A Unified Framework For Aerial Visual Grounding And Detection | Guoting Wei et.al. | 2602.07827 | translate | read | null |
| 2026-02-07 | Vision and language: Novel Representations and Artificial intelligence for Driving Scene Safety Assessment and Autonomous Vehicle Planning | Ross Greer et.al. | 2602.07680 | translate | read | null |
| 2026-02-07 | Adaptive Image Zoom-in with Bounding Box Transformation for UAV Object Detection | Tao Wang et.al. | 2602.07512 | translate | read | null |
| 2026-02-06 | MosaicThinker: On-Device Visual Spatial Reasoning for Embodied AI via Iterative Construction of Space Representation | Haoming Wang et.al. | 2602.07082 | translate | read | null |
| 2026-02-04 | Neural Sentinel: Unified Vision Language Model (VLM) for License Plate Recognition with Human-in-the-Loop Continual Learning | Karthik Sivakoti et.al. | 2602.07051 | translate | read | null |
| 2026-02-04 | PipeMFL-240K: A Large-scale Dataset and Benchmark for Object Detection in Pipeline Magnetic Flux Leakage Imaging | Tianyi Qu et.al. | 2602.07044 | translate | read | null |
| 2026-02-06 | Parameters as Experts: Adapting Vision Models with Dynamic Parameter Routing | Meng Lou et.al. | 2602.06862 | translate | read | null |
| 2026-02-06 | Machine Learning for Detection and Severity Estimation of Sweetpotato Weevil Damage in Field and Lab Conditions | Doreen M. Chelangat et.al. | 2602.06786 | translate | read | null |
| 2026-02-06 | CytoCrowd: A Multi-Annotator Benchmark Dataset for Cytology Image Analysis | Yonghao Si et.al. | 2602.06674 | translate | read | null |
| 2026-02-06 | Instance-Free Domain Adaptive Object Detection | Hengfu Yu et.al. | 2602.06484 | translate | read | null |
| 2026-02-06 | LAB-Det: Language as a Domain-Invariant Bridge for Training-Free One-Shot Domain Generalization in Object Detection | Xu Zhang et.al. | 2602.06474 | translate | read | null |
| 2026-02-04 | Point Virtual Transformer | Veerain Sood et.al. | 2602.06406 | translate | read | null |
| 2026-02-06 | A neuromorphic model of the insect visual system for natural image processing | Adam D. Hines et.al. | 2602.06405 | translate | read | null |
| 2026-02-06 | Revisiting Salient Object Detection from an Observer-Centric Perspective | Fuxi Zhang et.al. | 2602.06369 | translate | read | null |
| 2026-02-06 | Robust Pedestrian Detection with Uncertain Modality | Qian Bie et.al. | 2602.06363 | translate | read | null |
| 2026-02-05 | LSA: Localized Semantic Alignment for Enhancing Temporal Consistency in Traffic Video Generation | Mirlan Karimov et.al. | 2602.05966 | translate | read | null |
| 2026-02-05 | Depth as Prior Knowledge for Object Detection | Moussa Kassem Sbeyti et.al. | 2602.05730 | translate | read | null |
| 2026-02-05 | PIRATR: Parametric Object Inference for Robotic Applications with Transformers in 3D Point Clouds | Michael Schwingshackl et.al. | 2602.05557 | translate | read | null |
| 2026-02-05 | IndustryShapes: An RGB-D Benchmark dataset for 6D object pose estimation of industrial assembly components and tools | Panagiotis Sapoutzoglou et.al. | 2602.05555 | translate | read | null |
| 2026-02-05 | TSBOW: Traffic Surveillance Benchmark for Occluded Vehicles Under Various Weather Conditions | Ngoc Doan-Minh Huynh et.al. | 2602.05414 | translate | read | link |
| 2026-02-05 | ReGLA: Efficient Receptive-Field Modeling with Gated Linear Attention Network | Junzhou Li et.al. | 2602.05262 | translate | read | null |
| 2026-02-04 | A labeled dataset of simulated phlebotomy procedures for medical AI: polygon annotations for object detection and human-object interaction | Raúl Jiménez Cruz et.al. | 2602.04624 | translate | read | null |
| 2026-02-04 | PEPR: Privileged Event-based Predictive Regularization for Domain Generalization | Gabriele Magrini et.al. | 2602.04583 | translate | read | null |
| 2026-02-03 | RAWDet-7: A Multi-Scenario Benchmark for Object Detection and Description on Quantized RAW Images | Mishal Fatima et.al. | 2602.03760 | translate | read | null |
| 2026-02-03 | SPWOOD: Sparse Partial Weakly-Supervised Oriented Object Detection | Wei Zhang et.al. | 2602.03634 | translate | read | null |
| 2026-02-03 | High-Resolution Underwater Camouflaged Object Detection: GBU-UCOD Dataset and Topology-Aware and Frequency-Decoupled Networks | Wenji Wu et.al. | 2602.03591 | translate | read | null |
| 2026-02-03 | Inlier-Centric Post-Training Quantization for Object Detection Models | Minsu Kim et.al. | 2602.03472 | translate | read | null |
| 2026-02-03 | FSOD-VFM: Few-Shot Object Detection with Vision Foundation Models and Graph Diffusion | Chen-Bin Feng et.al. | 2602.03137 | translate | read | null |
| 2026-02-02 | Real-Time 2D LiDAR Object Detection Using Three-Frame RGB Scan Encoding | Soheil Behnam Roudsari et.al. | 2602.02167 | translate | read | null |
| 2026-02-02 | Deep learning enables urban change profiling through alignment of historical maps | Sidi Wu et.al. | 2602.02154 | translate | read | null |
| 2026-02-02 | Beyond Open Vocabulary: Multimodal Prompting for Object Detection in Remote Sensing Images | Shuai Yang et.al. | 2602.01954 | translate | read | null |
| 2026-02-02 | Samba+: General and Accurate Salient Object Detection via A More Unified Mamba-based Framework | Wenzhuo Zhao et.al. | 2602.01593 | translate | read | null |
| 2026-02-01 | Cross-Paradigm Evaluation of Gaze-Based Semantic Object Identification for Intelligent Vehicles | Penghao Deng et.al. | 2602.01452 | translate | read | null |
| 2026-02-01 | Unified ROI-based Image Compression Paradigm with Generalized Gaussian Model | Kai Hu et.al. | 2602.01325 | translate | read | link |
| 2026-02-01 | Minimizing Mismatch Risk: A Prototype-Based Routing Framework for Zero-shot LLM-generated Text Detection | Ke Sun et.al. | 2602.01240 | translate | read | null |
| 2026-02-01 | Refining Context-Entangled Content Segmentation via Curriculum Selection and Anti-Curriculum Promotion | Chunming He et.al. | 2602.01183 | translate | read | null |
(<a href=../Object_Detection.md>back to Object Detection</a>)