Object Detection - 2025-12
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2025-12-31 | Compressed Map Priors for 3D Perception | Brady Zhou et.al. | 2601.00139 | translate | read | null |
| 2025-12-31 | Automated electrostatic characterization of quantum dot devices in single- and bilayer heterostructures | Merritt P. R. Losert et.al. | 2601.00067 | translate | read | null |
| 2025-12-31 | Semi-Supervised Diversity-Aware Domain Adaptation for 3D Object detection | Bartłomiej Olber et.al. | 2512.24922 | translate | read | null |
| 2025-12-31 | Semi-Automated Data Annotation in Multisensor Datasets for Autonomous Vehicle Testing | Andrii Gamalii et.al. | 2512.24896 | translate | read | null |
| 2025-12-31 | FireRescue: A UAV-Based Dataset and Enhanced YOLO Model for Object Detection in Fire Rescue Scenes | Qingyu Xu et.al. | 2512.24622 | translate | read | null |
| 2025-12-30 | AI-Driven Evaluation of Surgical Skill via Action Recognition | Yan Meng et.al. | 2512.24411 | translate | read | null |
| 2025-12-30 | Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems | Song Wang et.al. | 2512.24385 | translate | read | null |
| 2025-12-30 | Balanced Hierarchical Contrastive Learning with Decoupled Queries for Fine-grained Object Detection in Remote Sensing Images | Jingzhou Chen et.al. | 2512.24074 | translate | read | null |
| 2025-12-29 | Automated river gauge plate reading using a hybrid object detection and generative AI framework in the Limpopo River Basin | Kayathri Vigneswaran et.al. | 2512.23454 | translate | read | null |
| 2025-12-29 | YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection | Xu Lin et.al. | 2512.23273 | translate | read | null |
| 2025-12-29 | LIMO: Low-Power In-Memory-Annealer and Matrix-Multiplication Primitive for Edge Computing | Amod Holla et.al. | 2512.23212 | translate | read | null |
| 2025-12-29 | Exploring Syn-to-Real Domain Adaptation for Military Target Detection | Jongoh Jeong et.al. | 2512.23208 | translate | read | null |
| 2025-12-29 | GVSynergy-Det: Synergistic Gaussian-Voxel Representations for Multi-View 3D Object Detection | Yi Zhang et.al. | 2512.23176 | translate | read | null |
| 2025-12-29 | GeoTeacher: Geometry-Guided Semi-Supervised 3D Object Detection | Jingyu Li et.al. | 2512.23147 | translate | read | null |
| 2025-12-28 | RealCamo: Boosting Real Camouflage Synthesis with Layout Controls and Textual-Visual Guidance | Chunyuan Chen et.al. | 2512.22974 | translate | read | null |
| 2025-12-28 | YOLO-IOD: Towards Real Time Incremental Object Detection | Shizhou Zhang et.al. | 2512.22973 | translate | read | null |
| 2025-12-28 | Wavelet-based Multi-View Fusion of 4D Radar Tensor and Camera for Robust 3D Object Detection | Runwei Guan et.al. | 2512.22972 | translate | read | null |
| 2025-12-28 | Evaluating the Performance of Open-Vocabulary Object Detection in Low-quality Image | Po-Chih Wu et.al. | 2512.22801 | translate | read | null |
| 2025-12-27 | SCAFusion: A Multimodal 3D Detection Framework for Small Object Detection in Lunar Surface Exploration | Xin Chen et.al. | 2512.22503 | translate | read | null |
| 2025-12-27 | Scalpel-SAM: A Semi-Supervised Paradigm for Adapting SAM to Infrared Small Object Detection | Zihan Liu et.al. | 2512.22483 | translate | read | null |
| 2025-12-27 | Comparing Object Detection Models for Electrical Substation Component Mapping | Haley Mody et.al. | 2512.22454 | translate | read | null |
| 2025-12-27 | SonoVision: A Computer Vision Approach for Helping Visually Challenged Individuals Locate Objects with the Help of Sound Cues | Md Abu Obaida Zishan et.al. | 2512.22449 | translate | read | null |
| 2025-12-27 | Towards Robust Optical-SAR Object Detection under Missing Modalities: A Dynamic Quality-Aware Fusion Framework | Zhicheng Zhao et.al. | 2512.22447 | translate | read | null |
| 2025-12-26 | DeFloMat: Detection with Flow Matching for Stable and Efficient Generative Object Localization | Hansang Lee et.al. | 2512.22406 | translate | read | null |
| 2025-12-23 | Failure Analysis of Safety Controllers in Autonomous Vehicles Under Object-Based LiDAR Attacks | Daniyal Ganiuly et.al. | 2512.22244 | translate | read | null |
| 2025-12-26 | Breaking Alignment Barriers: TPS-Driven Semantic Correlation Learning for Alignment-Free RGB-T Salient Object Detection | Lupiao Hu et.al. | 2512.21856 | translate | read | null |
| 2025-12-25 | Detecting AI-Generated Paraphrases in Bengali: A Comparative Study of Zero-Shot and Fine-Tuned Transformers | Md. Rakibul Islam et.al. | 2512.21709 | translate | read | null |
| 2025-12-25 | Comparative Analysis of Deep Learning Models for Perception in Autonomous Vehicles | Jalal Khan et.al. | 2512.21673 | translate | read | null |
| 2025-12-24 | ORCA: Object Recognition and Comprehension for Archiving Marine Species | Yuk-Kwan Wong et.al. | 2512.21150 | translate | read | null |
| 2025-12-24 | Self-supervised Multiplex Consensus Mamba for General Image Fusion | Yingying Wang et.al. | 2512.20921 | translate | read | null |
| 2025-12-23 | Real-World Adversarial Attacks on RF-Based Drone Detectors | Omer Gazit et.al. | 2512.20712 | translate | read | null |
| 2025-12-23 | Bridging Modalities and Transferring Knowledge: Enhanced Multimodal Understanding and Recognition | Gorjan Radevski et.al. | 2512.20501 | translate | read | null |
| 2025-12-23 | D$^{3}$ETOR: Debate-Enhanced Pseudo Labeling and Frequency-Aware Progressive Debiasing for Weakly-Supervised Camouflaged Object Detection with Scribble Annotations | Jiawei Ge et.al. | 2512.20260 | translate | read | null |
| 2025-12-23 | LiteFusion: Taming 3D Object Detectors from Vision-Based to Multi-Modal with Minimal Adaptation | Xiangxuan Ren et.al. | 2512.20217 | translate | read | null |
| 2025-12-23 | Gaussian Process Assisted Meta-learning for Image Classification and Object Detection Models | Anna R. Flowers et.al. | 2512.20021 | translate | read | null |
| 2025-12-23 | PaveSync: A Unified and Comprehensive Dataset for Pavement Distress Analysis and Classification | Blessing Agyei Kyem et.al. | 2512.20011 | translate | read | null |
| 2025-12-22 | Photonic Spiking Graph Neural Network for Energy-Efficient Structured Data Processing | Wanting Yu et.al. | 2512.19182 | translate | read | null |
| 2025-12-20 | The size of 3I/ATLAS from non-gravitational acceleration | John C. Forbes et.al. | 2512.18341 | translate | read | null |
| 2025-12-20 | Pyramidal Adaptive Cross-Gating for Multimodal Detection | Zidong Gu et.al. | 2512.18291 | translate | read | null |
| 2025-12-20 | Building UI/UX Dataset for Dark Pattern Detection and YOLOv12x-based Real-Time Object Recognition Detection System | Se-Young Jang et.al. | 2512.18269 | translate | read | null |
| 2025-12-20 | Spectral Discrepancy and Cross-modal Semantic Consistency Learning for Object Detection in Hyperspectral Image | Xiao He et.al. | 2512.18245 | translate | read | null |
| 2025-12-20 | ALIGN: Advanced Query Initialization with LiDAR-Image Guidance for Occlusion-Robust 3D Object Detection | Janghyun Baek et.al. | 2512.18187 | translate | read | null |
| 2025-12-19 | YolovN-CBi: A Lightweight and Efficient Architecture for Real-Time Detection of Small UAVs | Ami Pandat et.al. | 2512.18046 | translate | read | null |
| 2025-12-19 | StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection | Di Wu et.al. | 2512.17620 | translate | read | null |
| 2025-12-19 | Foundation Model Priors Enhance Object Focus in Feature Space for Source-Free Object Detection | Sairam VCR et.al. | 2512.17514 | translate | read | null |
| 2025-12-19 | PILAR: Personalizing Augmented Reality Interactions with LLM-based Human-Centric and Trustworthy Explanations for Daily Use Cases | Ripan Kumar Kundu et.al. | 2512.17172 | translate | read | null |
| 2025-12-18 | DenseBEV: Transforming BEV Grid Cells into 3D Objects | Marius Dähling et.al. | 2512.16818 | translate | read | null |
| 2025-12-18 | FlowDet: Unifying Object Detection and Generative Transport Flows | Enis Baty et.al. | 2512.16771 | translate | read | null |
| 2025-12-18 | YOLO11-4K: An Efficient Architecture for Real-Time Small Object Detection in 4K Panoramic Images | Huma Hafeez et.al. | 2512.16493 | translate | read | null |
| 2025-12-18 | Autoencoder-based Denoising Defense against Adversarial Attacks on Object Detection | Min Geun Song et.al. | 2512.16123 | translate | read | null |
| 2025-12-18 | Auto-Vocabulary 3D Object Detection | Haomeng Zhang et.al. | 2512.16077 | translate | read | null |
| 2025-12-17 | From Words to Wavelengths: VLMs for Few-Shot Multispectral Object Detection | Manuel Nkegoum et.al. | 2512.15971 | translate | read | null |
| 2025-12-13 | Two-Step Data Augmentation for Masked Face Detection and Recognition: Turning Fake Masks to Real | Yan Yang et.al. | 2512.15774 | translate | read | null |
| 2025-12-17 | IMKD: Intensity-Aware Multi-Level Knowledge Distillation for Camera-Radar Fusion | Shashank Mishra et.al. | 2512.15581 | translate | read | null |
| 2025-12-17 | Evaluation of deep learning architectures for wildlife object detection: A comparative study of ResNet and Inception | Malach Obisa Amonga et.al. | 2512.15480 | translate | read | null |
| 2025-12-17 | Vision-based module for accurately reading linear scales in a laboratory | Parvesh Saini et.al. | 2512.15327 | translate | read | null |
| 2025-12-17 | EPSM: A Novel Metric to Evaluate the Safety of Environmental Perception in Autonomous Driving | Jörg Gamerdinger et.al. | 2512.15195 | translate | read | null |
| 2025-12-17 | Criticality Metrics for Relevance Classification in Safety Evaluation of Object Detection in Automated Driving | Jörg Gamerdinger et.al. | 2512.15181 | translate | read | null |
| 2025-12-17 | Beyond Proximity: A Keypoint-Trajectory Framework for Classifying Affiliative and Agonistic Social Networks in Dairy Cattle | Sibi Parivendan et.al. | 2512.14998 | translate | read | null |
| 2025-12-16 | TUMTraf EMOT: Event-Based Multi-Object Tracking Dataset and Baseline for Traffic Scenarios | Mengyu Li et.al. | 2512.14595 | translate | read | null |
| 2025-12-16 | 4D-RaDiff: Latent Diffusion for 4D Radar Point Cloud Generation | Jimmie Kwok et.al. | 2512.14235 | translate | read | null |
| 2025-12-16 | CIS-BA: Continuous Interaction Space Based Backdoor Attack for Object Detection in the Real-World | Shuxin Zhao et.al. | 2512.14158 | translate | read | null |
| 2025-12-16 | Neurosymbolic Inference On Foundation Models For Remote Sensing Text-to-image Retrieval With Complex Queries | Emanuele Mezzi et.al. | 2512.14102 | translate | read | null |
| 2025-12-16 | Deep Learning Perspective of Scene Understanding in Autonomous Robots | Afia Maham et.al. | 2512.14020 | translate | read | null |
| 2025-12-16 | Real-Time Service Subscription and Adaptive Offloading Control in Vehicular Edge Computing | Chuanchao Gao et.al. | 2512.14002 | translate | read | null |
| 2025-12-16 | FocalComm: Hard Instance-Aware Multi-Agent Perception | Dereje Shenkut et.al. | 2512.13982 | translate | read | null |
| 2025-12-15 | Route-DETR: Pairwise Query Routing in Transformers for Object Detection | Ye Zhang et.al. | 2512.13876 | translate | read | null |
| 2025-12-15 | VajraV1 – The most accurate Real Time Object Detector of the YOLO family | Naman Balbir Singh Makkar et.al. | 2512.13834 | translate | read | null |
| 2025-12-15 | Near-Field Perception for Safety Enhancement of Autonomous Mobile Robots in Manufacturing Environments | Li-Wei Shih et.al. | 2512.13561 | translate | read | null |
| 2025-12-15 | On the Ability of Deep Learning to Detect Signals with Unknown Parameters | Tom Anders et.al. | 2512.13542 | translate | read | null |
| 2025-12-15 | Computer vision training dataset generation for robotic environments using Gaussian splatting | Patryk Niżeniec et.al. | 2512.13411 | translate | read | null |
| 2025-12-15 | Diffusion-Based Restoration for Multi-Modal 3D Object Detection in Adverse Weather | Zhijian He et.al. | 2512.13107 | translate | read | null |
| 2025-12-14 | Cross-Level Sensor Fusion with Object Lists via Transformer for 3D Object Detection | Xiangzhong Liu et.al. | 2512.12884 | translate | read | null |
| 2025-12-13 | INDOOR-LiDAR: Bridging Simulation and Reality for Robot-Centric 360 degree Indoor LiDAR Perception – A Robot-Centric Hybrid Dataset | Haichuan Li et.al. | 2512.12377 | translate | read | null |
| 2025-12-13 | WeDetect: Fast Open-Vocabulary Object Detection as Retrieval | Shenghao Fu et.al. | 2512.12309 | translate | read | null |
| 2025-12-13 | Cognitive-YOLO: LLM-Driven Architecture Synthesis from First Principles of Data for Object Detection | Jiahao Zhao et.al. | 2512.12281 | translate | read | null |
| 2025-12-13 | AI-Augmented Pollen Recognition in Optical and Holographic Microscopy for Veterinary Imaging | Swarn S. Warshaneyan et.al. | 2512.12101 | translate | read | null |
| 2025-12-12 | TransBridge: Boost 3D Object Detection by Scene-Level Completion with Transformer Decoder | Qinghao Meng et.al. | 2512.11926 | translate | read | null |
| 2025-12-12 | Depth-Copy-Paste: Multimodal and Depth-Aware Compositing for Robust Face Detection | Qiushi Guo et.al. | 2512.11683 | translate | read | null |
| 2025-12-12 | DOS: Distilling Observable Softmaps of Zipfian Prototypes for Self-Supervised Point Representation | Mohamed Abdelsamad et.al. | 2512.11465 | translate | read | null |
| 2025-12-12 | Assisted Refinement Network Based on Channel Information Interaction for Camouflaged and Salient Object Detection | Kuan Wang et.al. | 2512.11369 | translate | read | null |
| 2025-12-12 | Reliable Detection of Minute Targets in High-Resolution Aerial Imagery across Temporal Shifts | Mohammad Sadegh Gholizadeh et.al. | 2512.11360 | translate | read | null |
| 2025-12-11 | VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction | Weitai Kang et.al. | 2512.11099 | translate | read | null |
| 2025-12-11 | Salient Object Detection in Complex Weather Conditions via Noise Indicators | Quan Chen et.al. | 2512.10592 | translate | read | null |
| 2025-12-11 | Adaptive Dual-Weighted Gravitational Point Cloud Denoising Method | Ge Zhang et.al. | 2512.10386 | translate | read | null |
| 2025-12-10 | ABBSPO: Adaptive Bounding Box Scaling and Symmetric Prior based Orientation Prediction for Detecting Aerial Image Objects | Woojin Lee et.al. | 2512.10031 | translate | read | null |
| 2025-12-10 | NordFKB: a fine-grained benchmark dataset for geospatial AI in Norway | Sander Riisøen Jyhne et.al. | 2512.09913 | translate | read | null |
| 2025-12-10 | Hands-on Evaluation of Visual Transformers for Object Recognition and Detection | Dimitrios N. Vlachogiannis et.al. | 2512.09579 | translate | read | null |
| 2025-12-10 | MODA: The First Challenging Benchmark for Multispectral Object Detection in Aerial Images | Shuaihao Han et.al. | 2512.09489 | translate | read | null |
| 2025-12-10 | A Hierarchical, Model-Based System for High-Performance Humanoid Soccer | Quanyou Wang et.al. | 2512.09431 | translate | read | null |
| 2025-12-10 | Identifying Bias in Machine-generated Text Detection | Kevin Stowe et.al. | 2512.09292 | translate | read | null |
| 2025-12-10 | ROI-Packing: Efficient Region-Based Compression for Machine Vision | Md Eimran Hossain Eimon et.al. | 2512.09258 | translate | read | null |
| 2025-12-09 | Automated Pollen Recognition in Optical and Holographic Microscopy Images | Swarn Singh Warshaneyan et.al. | 2512.08589 | translate | read | null |
| 2025-12-09 | SSCATeR: Sparse Scatter-Based Convolution Algorithm with Temporal Data Recycling for Real-Time 3D Object Detection in LiDAR Point Clouds | Alexander Dow et.al. | 2512.08557 | translate | read | null |
| 2025-12-09 | Distilling Future Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection | Haowen Zheng et.al. | 2512.08247 | translate | read | null |
| 2025-12-09 | SOP^2: Transfer Learning with Scene-Oriented Prompt Pool on 3D Object Detection | Ching-Hung Cheng et.al. | 2512.08223 | translate | read | null |
| 2025-12-09 | Metasurfaces Enable Active-Like Passive Radar | Mingyi Li et.al. | 2512.08208 | translate | read | null |
| 2025-12-08 | An AI-Powered Autonomous Underwater System for Sea Exploration and Scientific Research | Hamad Almazrouei et.al. | 2512.07652 | translate | read | null |
| 2025-12-08 | Towards Robust DeepFake Detection under Unstable Face Sequences: Adaptive Sparse Graph Embedding with Order-Free Representation and Explicit Laplacian Spectral Prior | Chih-Chung Hsu et.al. | 2512.07498 | translate | read | null |
| 2025-12-08 | Enhancing Small Object Detection with YOLO: A Novel Framework for Improved Accuracy and Efficiency | Mahila Moghadami et.al. | 2512.07379 | translate | read | null |
| 2025-12-08 | A graph generation pipeline for critical infrastructures based on heuristics, images and depth data | Mike Diessner et.al. | 2512.07269 | translate | read | null |
| 2025-12-08 | DART: Leveraging Multi-Agent Disagreement for Tool Recruitment in Multimodal Reasoning | Nithin Sivakumaran et.al. | 2512.07132 | translate | read | null |
| 2025-12-08 | DFIR-DETR: Frequency Domain Enhancement and Dynamic Feature Aggregation for Cross-Scene Small Object Detection | Bo Gao et.al. | 2512.07078 | translate | read | null |
| 2025-12-07 | Large Language Models and Forensic Linguistics: Navigating Opportunities and Threats in the Age of Generative AI | George Mikros et.al. | 2512.06922 | translate | read | null |
| 2025-12-07 | Spatial Retrieval Augmented Autonomous Driving | Xiaosong Jia et.al. | 2512.06865 | translate | read | null |
| 2025-12-07 | CoT4Det: A Chain-of-Thought Framework for Perception-Oriented Vision-Language Tasks | Yu Qi et.al. | 2512.06663 | translate | read | null |
| 2025-12-07 | TextMamba: Scene Text Detector with Mamba | Qiyan Zhao et.al. | 2512.06657 | translate | read | null |
| 2025-12-06 | Neural expressiveness for beyond importance model compression | Angelos-Christos Maroudis et.al. | 2512.06440 | translate | read | null |
| 2025-12-06 | Are AI-Generated Driving Videos Ready for Autonomous Driving? A Diagnostic Evaluation Framework | Xinhao Xiang et.al. | 2512.06376 | translate | read | null |
| 2025-12-05 | OWL: Unsupervised 3D Object Detection by Occupancy Guided Warm-up and Large Model Priors Reasoning | Xusheng Guo et.al. | 2512.05698 | translate | read | null |
| 2025-12-05 | LeAD-M3D: Leveraging Asymmetric Distillation for Real-time Monocular 3D Detection | Johannes Meier et.al. | 2512.05663 | translate | read | null |
| 2025-12-05 | An Integrated System for WEEE Sorting Employing X-ray Imaging, AI-based Object Detection and Segmentation, and Delta Robot Manipulation | Panagiotis Giannikos et.al. | 2512.05599 | translate | read | null |
| 2025-12-05 | Concept-based Explainable Data Mining with VLM for 3D Detection | Mai Tsujimoto et.al. | 2512.05482 | translate | read | null |
| 2025-12-05 | Moving object detection from multi-depth images with an attention-enhanced CNN | Masato Shibukawa et.al. | 2512.05415 | translate | read | null |
| 2025-12-05 | YOLO and SGBM Integration for Autonomous Tree Branch Detection and Depth Estimation in Radiata Pine Pruning Applications | Yida Lin et.al. | 2512.05412 | translate | read | null |
| 2025-12-04 | GeoPE: A Unified Geometric Positional Embedding for Structured Tensors | Yupu Yao et.al. | 2512.04963 | translate | read | null |
| 2025-12-04 | ZeBROD: Zero-Retraining Based Recognition and Object Detection Framework | Priyanto Hidayatullah et.al. | 2512.04888 | translate | read | null |
| 2025-12-04 | DuGI-MAE: Improving Infrared Mask Autoencoders via Dual-Domain Guidance | Yinghui Xing et.al. | 2512.04511 | translate | read | null |
| 2025-12-04 | Dual-Stream Spectral Decoupling Distillation for Remote Sensing Object Detection | Xiangyi Gao et.al. | 2512.04413 | translate | read | null |
| 2025-12-03 | Real-time Cricket Sorting By Sex | Juan Manuel Cantarero Angulo et.al. | 2512.04311 | translate | read | null |
| 2025-12-03 | Fast & Efficient Normalizing Flows and Applications of Image Generative Models | Sandeep Nagar et.al. | 2512.04039 | translate | read | null |
| 2025-12-03 | MKSNet: Advanced Small Object Detection in Remote Sensing Imagery with Multi-Kernel and Dual Attention Mechanisms | Jiahao Zhang et.al. | 2512.03640 | translate | read | null |
| 2025-12-03 | Real-Time Control and Automation Framework for Acousto-Holographic Microscopy | Hasan Berkay Abdioğlu et.al. | 2512.03539 | translate | read | null |
| 2025-12-03 | YOLOA: Real-Time Affordance Detection via LLM Adapter | Yuqi Ji et.al. | 2512.03418 | translate | read | null |
| 2025-12-02 | GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection | Md Sohag Mia et.al. | 2512.02991 | translate | read | null |
| 2025-12-02 | BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection | Guowen Zhang et.al. | 2512.02972 | translate | read | null |
| 2025-12-02 | MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image Understanding | Fan Yang et.al. | 2512.02906 | translate | read | null |
| 2025-12-02 | ALDI-ray: Adapting the ALDI Framework for Security X-ray Object Detection | Omid Reza Heidari et.al. | 2512.02696 | translate | read | null |
| 2025-12-02 | SAM2Grasp: Resolve Multi-modal Grasping via Prompt-conditioned Temporal Action Prediction | Shengkai Wu et.al. | 2512.02609 | translate | read | null |
| 2025-12-02 | GeoDiT: A Diffusion-based Vision-Language Model for Geospatial Understanding | Jiaqi Liu et.al. | 2512.02505 | translate | read | null |
| 2025-12-02 | Temporal Dynamics Enhancer for Directly Trained Spiking Object Detectors | Fan Luo et.al. | 2512.02447 | translate | read | null |
| 2025-12-01 | Physical ID-Transfer Attacks against Multi-Object Tracking via Adversarial Trajectory | Chenyi Wang et.al. | 2512.01934 | translate | read | null |
| 2025-12-01 | SAM3-UNet: Simplified Adaptation of Segment Anything Model 3 | Xinyu Xiong et.al. | 2512.01789 | translate | read | null |
| 2025-12-01 | Bridging the Scale Gap: Balanced Tiny and General Object Detection in Remote Sensing Imagery | Zhicheng Zhao et.al. | 2512.01665 | translate | read | null |
| 2025-12-01 | ViT$^3$: Unlocking Test-Time Training in Vision | Dongchen Han et.al. | 2512.01643 | translate | read | null |
| 2025-12-01 | OpenBox: Annotate Any Bounding Boxes in 3D | In-Jae Lee et.al. | 2512.01352 | translate | read | null |
| 2025-12-01 | FOD-S2R: A FOD Dataset for Sim2Real Transfer Learning based Object Detection | Ashish Vashist et.al. | 2512.01315 | translate | read | null |
| 2025-12-01 | Supervised Contrastive Machine Unlearning of Background Bias in Sonar Image Classification with Fine-Grained Explainable AI | Kamal Basha S et.al. | 2512.01291 | translate | read | null |
| 2025-12-01 | VSRD++: Autolabeling for 3D Object Detection via Instance-Aware Volumetric Silhouette Rendering | Zihua Liu et.al. | 2512.01178 | translate | read | null |
| 2025-12-01 | Real-Time On-the-Go Annotation Framework Using YOLO for Automated Dataset Generation | Mohamed Abdallah Salem et.al. | 2512.01165 | translate | read | null |
(<a href=../Object_Detection.md>back to Object Detection</a>)