Object Detection - 2025-01
Object Detection - 2025-01
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2025-01-31 | Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches | Ying Zang et.al. | 2501.19329 | translate | read | null |
| 2025-01-31 | Beyond checkmate: exploring the creative chokepoints in AI text | Nafis Irtiza Tripto et.al. | 2501.19301 | translate | read | link |
| 2025-01-31 | GO: The Great Outdoors Multimodal Dataset | Peng Jiang et.al. | 2501.19274 | translate | read | null |
| 2025-01-31 | Adversarial Attacks on AI-Generated Text Detection Models: A Token Probability-Based Approach Using Embeddings | Ahmed K. Kadhim et.al. | 2501.18998 | translate | read | null |
| 2025-01-31 | Early Diagnosis and Severity Assessment of Weligama Coconut Leaf Wilt Disease and Coconut Caterpillar Infestation using Deep Learning-based Image Processing Techniques | Samitha Vidhanaarachchi et.al. | 2501.18835 | translate | read | null |
| 2025-01-30 | Tuning Event Camera Biases Heuristic for Object Detection Applications in Staring Scenarios | David El-Chai Ben-Ezra et.al. | 2501.18788 | translate | read | null |
| 2025-01-30 | Adaptive Object Detection for Indoor Navigation Assistance: A Performance Evaluation of Real-Time Algorithms | Abhinav Pratap et.al. | 2501.18444 | translate | read | null |
| 2025-01-29 | Real Time Scheduling Framework for Multi Object Detection via Spiking Neural Networks | Donghwa Kang et.al. | 2501.18412 | translate | read | null |
| 2025-01-30 | IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain | Zhe Wang et.al. | 2501.18162 | translate | read | null |
| 2025-01-29 | TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection | Lei Cheng et.al. | 2501.17977 | translate | read | link |
| 2025-01-28 | Object Detection with Deep Learning for Rare Event Search in the GADGET II TPC | Tyler Wheeler et.al. | 2501.17892 | translate | read | null |
| 2025-01-29 | Detection of Oscillation-like Patterns in Eclipsing Binary Light Curves using Neural Network-based Object Detection Algorithms | Burak Ulaş et.al. | 2501.17538 | translate | read | null |
| 2025-01-30 | Assessing the Capability of YOLO- and Transformer-based Object Detectors for Real-time Weed Detection | Alicia Allmendinger et.al. | 2501.17387 | translate | read | null |
| 2025-01-28 | DINOSTAR: Deep Iterative Neural Object Detector Self-Supervised Training for Roadside LiDAR Applications | Muhammad Shahbaz et.al. | 2501.17076 | translate | read | null |
| 2025-01-28 | Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding | Akash Kumar et.al. | 2501.17053 | translate | read | null |
| 2025-01-28 | Approach Towards Semi-Automated Certification for Low Criticality ML-Enabled Airborne Applications | Chandrasekar Sridhar et.al. | 2501.17028 | translate | read | null |
| 2025-01-28 | Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection | Xiangyu Gao et.al. | 2501.16981 | translate | read | null |
| 2025-01-28 | B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning | Nikolaos Kaparinos et.al. | 2501.16917 | translate | read | null |
| 2025-01-28 | SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios | Yinqi Chen et.al. | 2501.16754 | translate | read | null |
| 2025-01-28 | DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging | Muxi Chen et.al. | 2501.16751 | translate | read | null |
| 2025-01-28 | DFCon: Attention-Driven Supervised Contrastive Learning for Robust Deepfake Detection | MD Sadik Hossain Shanto et.al. | 2501.16704 | translate | read | null |
| 2025-01-27 | Efficient Object Detection of Marine Debris using Pruned YOLO Model | Abi Aryaza et.al. | 2501.16571 | translate | read | null |
| 2025-01-27 | Object Detection for Medical Image Analysis: Insights from the RT-DETR Model | Weijie He et.al. | 2501.16469 | translate | read | null |
| 2025-01-27 | The Linear Attention Resurrection in Vision Transformer | Chuanyang Zheng et.al. | 2501.16182 | translate | read | null |
| 2025-01-27 | Real-Time Brain Tumor Detection in Intraoperative Ultrasound Using YOLO11: From Model Training to Deployment in the Operating Room | Santiago Cepeda et.al. | 2501.15994 | translate | read | null |
| 2025-01-26 | Classifying Deepfakes Using Swin Transformers | Aprille J. Xi et.al. | 2501.15656 | translate | read | null |
| 2025-01-26 | A Privacy Enhancing Technique to Evade Detection by Street Video Cameras Without Using Adversarial Accessories | Jacob Shams et.al. | 2501.15653 | translate | read | null |
| 2025-01-26 | Breaking the SSL-AL Barrier: A Synergistic Semi-Supervised Active Learning Framework for 3D Object Detection | Zengran Wang et.al. | 2501.15449 | translate | read | null |
| 2025-01-26 | FAVbot: An Autonomous Target Tracking Micro-Robot with Frequency Actuation Control | Zhijian Hao et.al. | 2501.15426 | translate | read | null |
| 2025-01-26 | Doracamom: Joint 3D Detection and Occupancy Prediction with Multi-view 4D Radars and Cameras for Omnidirectional Perception | Lianqing Zheng et.al. | 2501.15394 | translate | read | null |
| 2025-01-26 | iFormer: Integrating ConvNet and Transformer for Mobile Application | Chuanyang Zheng et.al. | 2501.15369 | translate | read | link |
| 2025-01-25 | Explainable YOLO-Based Dyslexia Detection in Synthetic Handwriting Data | Nora Fink et.al. | 2501.15263 | translate | read | null |
| 2025-01-25 | SpikSSD: Better Extraction and Fusion for Object Detection with Spiking Neuron Networks | Yimeng Fan et.al. | 2501.15151 | translate | read | link |
| 2025-01-24 | LiDAR-Based Vehicle Detection and Tracking for Autonomous Racing | Marcello Cellina et.al. | 2501.14502 | translate | read | null |
| 2025-01-24 | TD-RD: A Top-Down Benchmark with Real-Time Framework for Road Damage Detection | Xi Xiao et.al. | 2501.14302 | translate | read | null |
| 2025-01-24 | A Comprehensive Framework for Semantic Similarity Detection Using Transformer Architectures and Enhanced Ensemble Techniques | Lifu Gao et.al. | 2501.14288 | translate | read | null |
| 2025-01-23 | Efficient Precision Control in Object Detection Models for Enhanced and Reliable Ovarian Follicle Counting | Vincent Blot et.al. | 2501.14036 | translate | read | null |
| 2025-01-23 | PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection | Peiyuan Zhang et.al. | 2501.13898 | translate | read | link |
| 2025-01-23 | First Lessons Learned of an Artificial Intelligence Robotic System for Autonomous Coarse Waste Recycling Using Multispectral Imaging-Based Methods | Timo Lange et.al. | 2501.13855 | translate | read | null |
| 2025-01-23 | Integrating Causality with Neurochaos Learning: Proposed Approach and Research Agenda | Nanjangud C. Narendra et.al. | 2501.13763 | translate | read | null |
| 2025-01-23 | You Only Crash Once v2: Perceptually Consistent Strong Features for One-Stage Domain Adaptive Detection of Space Terrain | Timothy Chase Jr et.al. | 2501.13725 | translate | read | null |
| 2025-01-23 | YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID | Iñaki Erregue et.al. | 2501.13710 | translate | read | link |
| 2025-01-23 | Emotion estimation from video footage with LSTM | Samer Attrah et.al. | 2501.13432 | translate | read | link |
| 2025-01-23 | Multi-aspect Knowledge Distillation with Large Language Model | Taegyeong Lee et.al. | 2501.13341 | translate | read | link |
| 2025-01-22 | MONA: Moving Object Detection from Videos Shot by Dynamic Camera | Boxun Hu et.al. | 2501.13183 | translate | read | null |
| 2025-01-21 | Large-image Object Detection for Fine-grained Recognition of Punches Patterns in Medieval Panel Painting | Josh Bruegger et.al. | 2501.12489 | translate | read | link |
| 2025-01-21 | TOFFE – Temporally-binned Object Flow from Events for High-speed and Energy-Efficient Object Detection and Tracking | Adarsh Kumar Kosta et.al. | 2501.12482 | translate | read | null |
| 2025-01-21 | Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems | Stefano Carlo Lambertenghi et.al. | 2501.12269 | translate | read | null |
| 2025-01-21 | DLEN: Dual Branch of Transformer for Low-Light Image Enhancement in Dual Domains | Junyu Xia et.al. | 2501.12235 | translate | read | null |
| 2025-01-21 | SVGS-DSGAT: An IoT-Enabled Innovation in Underwater Robotic Object Detection Technology | Dongli Wu et.al. | 2501.12169 | translate | read | null |
| 2025-01-21 | Co-Paced Learning Strategy Based on Confidence for Flying Bird Object Detection Model Training | Zi-Wei Sun et.al. | 2501.12071 | translate | read | null |
| 2025-01-21 | SMamba: Sparse Mamba for Event-based Object Detection | Nan Yang et.al. | 2501.11971 | translate | read | null |
| 2025-01-21 | LuxVeri at GenAI Detection Task 1: Inverse Perplexity Weighted Ensemble for Robust Detection of AI-Generated Text across English and Multilingual Contexts | Md Kamrujjaman Mobin et.al. | 2501.11914 | translate | read | null |
| 2025-01-20 | Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection | Ali Naseh et.al. | 2501.11786 | translate | read | null |
| 2025-01-20 | Everyone’s Privacy Matters! An Analysis of Privacy Leakage from Real-World Facial Images on Twitter and Associated User Behaviors | Yuqi Niu et.al. | 2501.11756 | translate | read | null |
| 2025-01-20 | Automatic Labelling & Semantic Segmentation with 4D Radar Tensors | Botao Sun et.al. | 2501.11351 | translate | read | null |
| 2025-01-20 | Enhancing SAR Object Detection with Self-Supervised Pre-training on Masked Auto-Encoders | Xinyang Pu et.al. | 2501.11249 | translate | read | null |
| 2025-01-17 | MutualForce: Mutual-Aware Enhancement for 4D Radar-LiDAR 3D Object Detection | Xiangyuan Peng et.al. | 2501.10266 | translate | read | null |
| 2025-01-17 | Leveraging Confident Image Regions for Source-Free Domain-Adaptive Object Detection | Mohamed Lamine Mekhalfi et.al. | 2501.10081 | translate | read | null |
| 2025-01-17 | One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression | Keita Miwa et.al. | 2501.10064 | translate | read | null |
| 2025-01-17 | LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks | Wei Lu et.al. | 2501.10040 | translate | read | link |
| 2025-01-17 | FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis | Zhe Chen et.al. | 2501.09887 | translate | read | null |
| 2025-01-16 | Qwen it detect machine-generated text? | Teodor-George Marchitan et.al. | 2501.09813 | translate | read | link |
| 2025-01-16 | A Simple Aerial Detection Baseline of Multimodal Language Models | Qingyun Li et.al. | 2501.09720 | translate | read | link |
| 2025-01-16 | Practical Continual Forgetting for Pre-trained Vision Models | Hongbo Zhao et.al. | 2501.09705 | translate | read | link |
| 2025-01-16 | Exploring AI-based System Design for Pixel-level Protected Health Information Detection in Medical Images | Tuan Truong et.al. | 2501.09552 | translate | read | null |
| 2025-01-16 | Multi-task deep-learning for sleep event detection and stage classification | Adriana Anido-Alonso et.al. | 2501.09519 | translate | read | link |
| 2025-01-16 | The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning | Wonjun Jo et.al. | 2501.09485 | translate | read | null |
| 2025-01-16 | MonoSOWA: Scalable monocular 3D Object detector Without human Annotations | Jan Skvrna et.al. | 2501.09481 | translate | read | link |
| 2025-01-16 | RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection | Jianrui Shi et.al. | 2501.09465 | translate | read | null |
| 2025-01-16 | On the Relation between Optical Aperture and Automotive Object Detection | Ofer Bar-Shalom et.al. | 2501.09456 | translate | read | null |
| 2025-01-16 | SoccerSynth-Detection: A Synthetic Dataset for Soccer Player Detection | Haobin Qin et.al. | 2501.09281 | translate | read | null |
| 2025-01-15 | GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge | Liam Dugan et.al. | 2501.08913 | translate | read | null |
| 2025-01-15 | PACF: Prototype Augmented Compact Features for Improving Domain Adaptive Object Detection | Chenguang Liu et.al. | 2501.08605 | translate | read | null |
| 2025-01-14 | Predicting Performance of Object Detection Models in Electron Microscopy Using Random Forests | Ni Li et.al. | 2501.08465 | translate | read | link |
| 2025-01-14 | Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying | Jonathan Lyhs et.al. | 2501.08142 | translate | read | null |
| 2025-01-14 | Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation | Yunzhi Zhuge et.al. | 2501.07806 | translate | read | link |
| 2025-01-14 | Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding | Zhaokai Wang et.al. | 2501.07783 | translate | read | link |
| 2025-01-13 | SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing | Varun Biyyala et.al. | 2501.07554 | translate | read | link |
| 2025-01-13 | ML Mule: Mobile-Driven Context-Aware Collaborative Learning | Haoxiang Yu et.al. | 2501.07536 | translate | read | null |
| 2025-01-13 | TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations | Daniel Steininger et.al. | 2501.07360 | translate | read | link |
| 2025-01-13 | Toward Realistic Camouflaged Object Detection: Benchmarks and Method | Zhimeng Xin et.al. | 2501.07297 | translate | read | link |
| 2025-01-13 | Dual Scale-aware Adaptive Masked Knowledge Distillation for Object Detection | ZhouRui Zhang et.al. | 2501.07101 | translate | read | null |
| 2025-01-11 | CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection | Yiheng Li et.al. | 2501.06550 | translate | read | link |
| 2025-01-11 | CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder Refinement | Yijie Li et.al. | 2501.06441 | translate | read | null |
| 2025-01-11 | FocusDD: Real-World Scene Infusion for Robust Dataset Distillation | Youbing Hu et.al. | 2501.06405 | translate | read | null |
| 2025-01-10 | A Holistically Point-guided Text Framework for Weakly-Supervised Camouflaged Object Detection | Tsui Qin Mok et.al. | 2501.06038 | translate | read | null |
| 2025-01-10 | Minimizing Occlusion Effect on Multi-View Camera Perception in BEV with Multi-Sensor Fusion | Sanjay Kumar et.al. | 2501.05997 | translate | read | null |
| 2025-01-10 | EDNet: Edge-Optimized Small Target Detection in UAV Imagery – Faster Context Attention, Better Feature Fusion, and Hardware Acceleration | Zhifan Song et.al. | 2501.05885 | translate | read | null |
| 2025-01-10 | Automatic detection of single-electron regime of quantum dots and definition of virtual gates using U-Net and clustering | Yui Muto et.al. | 2501.05878 | translate | read | null |
| 2025-01-10 | Zero-shot Shark Tracking and Biometrics from Aerial Imagery | Chinmay K Lalgudi et.al. | 2501.05717 | translate | read | null |
| 2025-01-10 | Dark Energy Survey Year 6 Results: Synthetic-source Injection Across the Full Survey Using Balrog | D. Anbajagane et.al. | 2501.05683 | translate | read | null |
| 2025-01-09 | Approximate Supervised Object Distance Estimation on Unmanned Surface Vehicles | Benjamin Kiefer et.al. | 2501.05567 | translate | read | null |
| 2025-01-09 | Performance of YOLOv7 in Kitchen Safety While Handling Knife | Athulya Sundaresan Geetha et.al. | 2501.05399 | translate | read | null |
| 2025-01-09 | The global consensus on the risk management of autonomous driving | Sebastian Krügel et.al. | 2501.05391 | translate | read | null |
| 2025-01-09 | A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision | Ali Rohan et.al. | 2501.05147 | translate | read | null |
| 2025-01-09 | CorrDiff: Adaptive Delay-aware Detector with Temporal Cue Inputs for Real-time Object Detection | Xiang Zhang et.al. | 2501.05132 | translate | read | null |
| 2025-01-09 | AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data | Haoran Zhu et.al. | 2501.04969 | translate | read | link |
| 2025-01-09 | Online Continual Learning: A Systematic Literature Review of Approaches, Challenges, and Benchmarks | Seyed Amir Bidaki et.al. | 2501.04897 | translate | read | link |
| 2025-01-08 | Video Summarisation with Incident and Context Information using Generative AI | Ulindu De Silva et.al. | 2501.04764 | translate | read | null |
| 2025-01-08 | Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models | Miaoyang He et.al. | 2501.04582 | translate | read | null |
| 2025-01-08 | Combining YOLO and Visual Rhythm for Vehicle Counting | Victor Nascimento Ribeiro et.al. | 2501.04534 | translate | read | link |
| 2025-01-08 | RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark | Xin Zhang et.al. | 2501.04440 | translate | read | link |
| 2025-01-08 | Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions | Doaa Mahmud et.al. | 2501.04437 | translate | read | null |
| 2025-01-08 | FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection | Guoxin Zhang et.al. | 2501.04373 | translate | read | null |
| 2025-01-08 | H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving | Siran Chen et.al. | 2501.04302 | translate | read | null |
| 2025-01-08 | UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles | Abhishek Balasubramaniam et.al. | 2501.04213 | translate | read | null |
| 2025-01-07 | LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving | Lingdong Kong et.al. | 2501.04005 | translate | read | null |
| 2025-01-07 | Not all tokens are created equal: Perplexity Attention Weighted Networks for AI generated text detection | Pablo Miralles-González et.al. | 2501.03940 | translate | read | null |
| 2025-01-07 | Visual question answering: from early developments to recent advances – a survey | Ngoc Dung Huynh et.al. | 2501.03939 | translate | read | null |
| 2025-01-07 | SCC-YOLO: An Improved Object Detector for Assisting in Brain Tumor Diagnosis | Runci Bai et.al. | 2501.03836 | translate | read | null |
| 2025-01-07 | Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection | Xinbin Yuan et.al. | 2501.03775 | translate | read | link |
| 2025-01-07 | AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features | Ruochen Zhang et.al. | 2501.03700 | translate | read | null |
| 2025-01-07 | Anomaly Triplet-Net: Progress Recognition Model Using Deep Metric Learning Considering Occlusion for Manual Assembly Work | Takumi Kitsukawa et.al. | 2501.03533 | translate | read | null |
| 2025-01-07 | SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild | Jiawei Liu et.al. | 2501.02962 | translate | read | null |
| 2025-01-05 | Multispectral Pedestrian Detection with Sparsely Annotated Label | Chan Lee et.al. | 2501.02640 | translate | read | null |
| 2025-01-05 | Generalization-Enhanced Few-Shot Object Detection in Remote Sensing | Hui Lin et.al. | 2501.02474 | translate | read | link |
| 2025-01-04 | Who Wrote This? Zero-Shot Statistical Tests for LLM-Generated Text Detection using Finite Sample Concentration Inequalities | Tara Radvand et.al. | 2501.02406 | translate | read | link |
| 2025-01-04 | V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection | Sichao Wang et.al. | 2501.02363 | translate | read | link |
| 2025-01-04 | Accurate Crop Yield Estimation of Blueberries using Deep Learning and Smart Drones | Hieu D. Nguyen et.al. | 2501.02344 | translate | read | null |
| 2025-01-04 | On The Causal Network Of Face-selective Regions In Human Brain During Movie Watching | Ali Bavafa et.al. | 2501.02333 | translate | read | null |
| 2025-01-04 | RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar | Liye Jia et.al. | 2501.02314 | translate | read | null |
| 2025-01-03 | A Separable Self-attention Inspired by the State Space Model for Computer Vision | Juntao Zhang et.al. | 2501.02040 | translate | read | link |
| 2025-01-03 | UAV-DETR: Efficient End-to-End Object Detection for Unmanned Aerial Vehicle Imagery | Huaxiang Zhang et.al. | 2501.01855 | translate | read | null |
| 2025-01-03 | Dual Mutual Learning Network with Global-local Awareness for RGB-D Salient Object Detection | Kang Yi et.al. | 2501.01648 | translate | read | link |
| 2025-01-02 | A Multi-task Supervised Compression Model for Split Computing | Yoshitomo Matsubara et.al. | 2501.01420 | translate | read | link |
| 2025-01-02 | MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception | Xiaoshuai Hao et.al. | 2501.01037 | translate | read | null |
| 2025-01-01 | A Novel Approach using CapsNet and Deep Belief Network for Detection and Identification of Oral Leukopenia | Hirthik Mathesh GV et.al. | 2501.00876 | translate | read | null |
| 2025-01-01 | NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model | Yuzhi Lai et.al. | 2501.00785 | translate | read | null |
(<a href=../Object_Detection.md>back to Object Detection</a>)