Object Detection - 2025-01 | Paper Arxiv Daily

Object Detection - 2025-01

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-01-31	Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches	Ying Zang et.al.	2501.19329	translate	read	null
2025-01-31	Beyond checkmate: exploring the creative chokepoints in AI text	Nafis Irtiza Tripto et.al.	2501.19301	translate	read	link
2025-01-31	GO: The Great Outdoors Multimodal Dataset	Peng Jiang et.al.	2501.19274	translate	read	null
2025-01-31	Adversarial Attacks on AI-Generated Text Detection Models: A Token Probability-Based Approach Using Embeddings	Ahmed K. Kadhim et.al.	2501.18998	translate	read	null
2025-01-31	Early Diagnosis and Severity Assessment of Weligama Coconut Leaf Wilt Disease and Coconut Caterpillar Infestation using Deep Learning-based Image Processing Techniques	Samitha Vidhanaarachchi et.al.	2501.18835	translate	read	null
2025-01-30	Tuning Event Camera Biases Heuristic for Object Detection Applications in Staring Scenarios	David El-Chai Ben-Ezra et.al.	2501.18788	translate	read	null
2025-01-30	Adaptive Object Detection for Indoor Navigation Assistance: A Performance Evaluation of Real-Time Algorithms	Abhinav Pratap et.al.	2501.18444	translate	read	null
2025-01-29	Real Time Scheduling Framework for Multi Object Detection via Spiking Neural Networks	Donghwa Kang et.al.	2501.18412	translate	read	null
2025-01-30	IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain	Zhe Wang et.al.	2501.18162	translate	read	null
2025-01-29	TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection	Lei Cheng et.al.	2501.17977	translate	read	link
2025-01-28	Object Detection with Deep Learning for Rare Event Search in the GADGET II TPC	Tyler Wheeler et.al.	2501.17892	translate	read	null
2025-01-29	Detection of Oscillation-like Patterns in Eclipsing Binary Light Curves using Neural Network-based Object Detection Algorithms	Burak Ulaş et.al.	2501.17538	translate	read	null
2025-01-30	Assessing the Capability of YOLO- and Transformer-based Object Detectors for Real-time Weed Detection	Alicia Allmendinger et.al.	2501.17387	translate	read	null
2025-01-28	DINOSTAR: Deep Iterative Neural Object Detector Self-Supervised Training for Roadside LiDAR Applications	Muhammad Shahbaz et.al.	2501.17076	translate	read	null
2025-01-28	Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding	Akash Kumar et.al.	2501.17053	translate	read	null
2025-01-28	Approach Towards Semi-Automated Certification for Low Criticality ML-Enabled Airborne Applications	Chandrasekar Sridhar et.al.	2501.17028	translate	read	null
2025-01-28	Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection	Xiangyu Gao et.al.	2501.16981	translate	read	null
2025-01-28	B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning	Nikolaos Kaparinos et.al.	2501.16917	translate	read	null
2025-01-28	SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios	Yinqi Chen et.al.	2501.16754	translate	read	null
2025-01-28	DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging	Muxi Chen et.al.	2501.16751	translate	read	null
2025-01-28	DFCon: Attention-Driven Supervised Contrastive Learning for Robust Deepfake Detection	MD Sadik Hossain Shanto et.al.	2501.16704	translate	read	null
2025-01-27	Efficient Object Detection of Marine Debris using Pruned YOLO Model	Abi Aryaza et.al.	2501.16571	translate	read	null
2025-01-27	Object Detection for Medical Image Analysis: Insights from the RT-DETR Model	Weijie He et.al.	2501.16469	translate	read	null
2025-01-27	The Linear Attention Resurrection in Vision Transformer	Chuanyang Zheng et.al.	2501.16182	translate	read	null
2025-01-27	Real-Time Brain Tumor Detection in Intraoperative Ultrasound Using YOLO11: From Model Training to Deployment in the Operating Room	Santiago Cepeda et.al.	2501.15994	translate	read	null
2025-01-26	Classifying Deepfakes Using Swin Transformers	Aprille J. Xi et.al.	2501.15656	translate	read	null
2025-01-26	A Privacy Enhancing Technique to Evade Detection by Street Video Cameras Without Using Adversarial Accessories	Jacob Shams et.al.	2501.15653	translate	read	null
2025-01-26	Breaking the SSL-AL Barrier: A Synergistic Semi-Supervised Active Learning Framework for 3D Object Detection	Zengran Wang et.al.	2501.15449	translate	read	null
2025-01-26	FAVbot: An Autonomous Target Tracking Micro-Robot with Frequency Actuation Control	Zhijian Hao et.al.	2501.15426	translate	read	null
2025-01-26	Doracamom: Joint 3D Detection and Occupancy Prediction with Multi-view 4D Radars and Cameras for Omnidirectional Perception	Lianqing Zheng et.al.	2501.15394	translate	read	null
2025-01-26	iFormer: Integrating ConvNet and Transformer for Mobile Application	Chuanyang Zheng et.al.	2501.15369	translate	read	link
2025-01-25	Explainable YOLO-Based Dyslexia Detection in Synthetic Handwriting Data	Nora Fink et.al.	2501.15263	translate	read	null
2025-01-25	SpikSSD: Better Extraction and Fusion for Object Detection with Spiking Neuron Networks	Yimeng Fan et.al.	2501.15151	translate	read	link
2025-01-24	LiDAR-Based Vehicle Detection and Tracking for Autonomous Racing	Marcello Cellina et.al.	2501.14502	translate	read	null
2025-01-24	TD-RD: A Top-Down Benchmark with Real-Time Framework for Road Damage Detection	Xi Xiao et.al.	2501.14302	translate	read	null
2025-01-24	A Comprehensive Framework for Semantic Similarity Detection Using Transformer Architectures and Enhanced Ensemble Techniques	Lifu Gao et.al.	2501.14288	translate	read	null
2025-01-23	Efficient Precision Control in Object Detection Models for Enhanced and Reliable Ovarian Follicle Counting	Vincent Blot et.al.	2501.14036	translate	read	null
2025-01-23	PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection	Peiyuan Zhang et.al.	2501.13898	translate	read	link
2025-01-23	First Lessons Learned of an Artificial Intelligence Robotic System for Autonomous Coarse Waste Recycling Using Multispectral Imaging-Based Methods	Timo Lange et.al.	2501.13855	translate	read	null
2025-01-23	Integrating Causality with Neurochaos Learning: Proposed Approach and Research Agenda	Nanjangud C. Narendra et.al.	2501.13763	translate	read	null
2025-01-23	You Only Crash Once v2: Perceptually Consistent Strong Features for One-Stage Domain Adaptive Detection of Space Terrain	Timothy Chase Jr et.al.	2501.13725	translate	read	null
2025-01-23	YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID	Iñaki Erregue et.al.	2501.13710	translate	read	link
2025-01-23	Emotion estimation from video footage with LSTM	Samer Attrah et.al.	2501.13432	translate	read	link
2025-01-23	Multi-aspect Knowledge Distillation with Large Language Model	Taegyeong Lee et.al.	2501.13341	translate	read	link
2025-01-22	MONA: Moving Object Detection from Videos Shot by Dynamic Camera	Boxun Hu et.al.	2501.13183	translate	read	null
2025-01-21	Large-image Object Detection for Fine-grained Recognition of Punches Patterns in Medieval Panel Painting	Josh Bruegger et.al.	2501.12489	translate	read	link
2025-01-21	TOFFE – Temporally-binned Object Flow from Events for High-speed and Energy-Efficient Object Detection and Tracking	Adarsh Kumar Kosta et.al.	2501.12482	translate	read	null
2025-01-21	Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems	Stefano Carlo Lambertenghi et.al.	2501.12269	translate	read	null
2025-01-21	DLEN: Dual Branch of Transformer for Low-Light Image Enhancement in Dual Domains	Junyu Xia et.al.	2501.12235	translate	read	null
2025-01-21	SVGS-DSGAT: An IoT-Enabled Innovation in Underwater Robotic Object Detection Technology	Dongli Wu et.al.	2501.12169	translate	read	null
2025-01-21	Co-Paced Learning Strategy Based on Confidence for Flying Bird Object Detection Model Training	Zi-Wei Sun et.al.	2501.12071	translate	read	null
2025-01-21	SMamba: Sparse Mamba for Event-based Object Detection	Nan Yang et.al.	2501.11971	translate	read	null
2025-01-21	LuxVeri at GenAI Detection Task 1: Inverse Perplexity Weighted Ensemble for Robust Detection of AI-Generated Text across English and Multilingual Contexts	Md Kamrujjaman Mobin et.al.	2501.11914	translate	read	null
2025-01-20	Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection	Ali Naseh et.al.	2501.11786	translate	read	null
2025-01-20	Everyone’s Privacy Matters! An Analysis of Privacy Leakage from Real-World Facial Images on Twitter and Associated User Behaviors	Yuqi Niu et.al.	2501.11756	translate	read	null
2025-01-20	Automatic Labelling & Semantic Segmentation with 4D Radar Tensors	Botao Sun et.al.	2501.11351	translate	read	null
2025-01-20	Enhancing SAR Object Detection with Self-Supervised Pre-training on Masked Auto-Encoders	Xinyang Pu et.al.	2501.11249	translate	read	null
2025-01-17	MutualForce: Mutual-Aware Enhancement for 4D Radar-LiDAR 3D Object Detection	Xiangyuan Peng et.al.	2501.10266	translate	read	null
2025-01-17	Leveraging Confident Image Regions for Source-Free Domain-Adaptive Object Detection	Mohamed Lamine Mekhalfi et.al.	2501.10081	translate	read	null
2025-01-17	One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression	Keita Miwa et.al.	2501.10064	translate	read	null
2025-01-17	LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks	Wei Lu et.al.	2501.10040	translate	read	link
2025-01-17	FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis	Zhe Chen et.al.	2501.09887	translate	read	null
2025-01-16	Qwen it detect machine-generated text?	Teodor-George Marchitan et.al.	2501.09813	translate	read	link
2025-01-16	A Simple Aerial Detection Baseline of Multimodal Language Models	Qingyun Li et.al.	2501.09720	translate	read	link
2025-01-16	Practical Continual Forgetting for Pre-trained Vision Models	Hongbo Zhao et.al.	2501.09705	translate	read	link
2025-01-16	Exploring AI-based System Design for Pixel-level Protected Health Information Detection in Medical Images	Tuan Truong et.al.	2501.09552	translate	read	null
2025-01-16	Multi-task deep-learning for sleep event detection and stage classification	Adriana Anido-Alonso et.al.	2501.09519	translate	read	link
2025-01-16	The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning	Wonjun Jo et.al.	2501.09485	translate	read	null
2025-01-16	MonoSOWA: Scalable monocular 3D Object detector Without human Annotations	Jan Skvrna et.al.	2501.09481	translate	read	link
2025-01-16	RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection	Jianrui Shi et.al.	2501.09465	translate	read	null
2025-01-16	On the Relation between Optical Aperture and Automotive Object Detection	Ofer Bar-Shalom et.al.	2501.09456	translate	read	null
2025-01-16	SoccerSynth-Detection: A Synthetic Dataset for Soccer Player Detection	Haobin Qin et.al.	2501.09281	translate	read	null
2025-01-15	GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge	Liam Dugan et.al.	2501.08913	translate	read	null
2025-01-15	PACF: Prototype Augmented Compact Features for Improving Domain Adaptive Object Detection	Chenguang Liu et.al.	2501.08605	translate	read	null
2025-01-14	Predicting Performance of Object Detection Models in Electron Microscopy Using Random Forests	Ni Li et.al.	2501.08465	translate	read	link
2025-01-14	Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying	Jonathan Lyhs et.al.	2501.08142	translate	read	null
2025-01-14	Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation	Yunzhi Zhuge et.al.	2501.07806	translate	read	link
2025-01-14	Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding	Zhaokai Wang et.al.	2501.07783	translate	read	link
2025-01-13	SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing	Varun Biyyala et.al.	2501.07554	translate	read	link
2025-01-13	ML Mule: Mobile-Driven Context-Aware Collaborative Learning	Haoxiang Yu et.al.	2501.07536	translate	read	null
2025-01-13	TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations	Daniel Steininger et.al.	2501.07360	translate	read	link
2025-01-13	Toward Realistic Camouflaged Object Detection: Benchmarks and Method	Zhimeng Xin et.al.	2501.07297	translate	read	link
2025-01-13	Dual Scale-aware Adaptive Masked Knowledge Distillation for Object Detection	ZhouRui Zhang et.al.	2501.07101	translate	read	null
2025-01-11	CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection	Yiheng Li et.al.	2501.06550	translate	read	link
2025-01-11	CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder Refinement	Yijie Li et.al.	2501.06441	translate	read	null
2025-01-11	FocusDD: Real-World Scene Infusion for Robust Dataset Distillation	Youbing Hu et.al.	2501.06405	translate	read	null
2025-01-10	A Holistically Point-guided Text Framework for Weakly-Supervised Camouflaged Object Detection	Tsui Qin Mok et.al.	2501.06038	translate	read	null
2025-01-10	Minimizing Occlusion Effect on Multi-View Camera Perception in BEV with Multi-Sensor Fusion	Sanjay Kumar et.al.	2501.05997	translate	read	null
2025-01-10	EDNet: Edge-Optimized Small Target Detection in UAV Imagery – Faster Context Attention, Better Feature Fusion, and Hardware Acceleration	Zhifan Song et.al.	2501.05885	translate	read	null
2025-01-10	Automatic detection of single-electron regime of quantum dots and definition of virtual gates using U-Net and clustering	Yui Muto et.al.	2501.05878	translate	read	null
2025-01-10	Zero-shot Shark Tracking and Biometrics from Aerial Imagery	Chinmay K Lalgudi et.al.	2501.05717	translate	read	null
2025-01-10	Dark Energy Survey Year 6 Results: Synthetic-source Injection Across the Full Survey Using Balrog	D. Anbajagane et.al.	2501.05683	translate	read	null
2025-01-09	Approximate Supervised Object Distance Estimation on Unmanned Surface Vehicles	Benjamin Kiefer et.al.	2501.05567	translate	read	null
2025-01-09	Performance of YOLOv7 in Kitchen Safety While Handling Knife	Athulya Sundaresan Geetha et.al.	2501.05399	translate	read	null
2025-01-09	The global consensus on the risk management of autonomous driving	Sebastian Krügel et.al.	2501.05391	translate	read	null
2025-01-09	A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision	Ali Rohan et.al.	2501.05147	translate	read	null
2025-01-09	CorrDiff: Adaptive Delay-aware Detector with Temporal Cue Inputs for Real-time Object Detection	Xiang Zhang et.al.	2501.05132	translate	read	null
2025-01-09	AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data	Haoran Zhu et.al.	2501.04969	translate	read	link
2025-01-09	Online Continual Learning: A Systematic Literature Review of Approaches, Challenges, and Benchmarks	Seyed Amir Bidaki et.al.	2501.04897	translate	read	link
2025-01-08	Video Summarisation with Incident and Context Information using Generative AI	Ulindu De Silva et.al.	2501.04764	translate	read	null
2025-01-08	Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models	Miaoyang He et.al.	2501.04582	translate	read	null
2025-01-08	Combining YOLO and Visual Rhythm for Vehicle Counting	Victor Nascimento Ribeiro et.al.	2501.04534	translate	read	link
2025-01-08	RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark	Xin Zhang et.al.	2501.04440	translate	read	link
2025-01-08	Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions	Doaa Mahmud et.al.	2501.04437	translate	read	null
2025-01-08	FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection	Guoxin Zhang et.al.	2501.04373	translate	read	null
2025-01-08	H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving	Siran Chen et.al.	2501.04302	translate	read	null
2025-01-08	UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles	Abhishek Balasubramaniam et.al.	2501.04213	translate	read	null
2025-01-07	LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving	Lingdong Kong et.al.	2501.04005	translate	read	null
2025-01-07	Not all tokens are created equal: Perplexity Attention Weighted Networks for AI generated text detection	Pablo Miralles-González et.al.	2501.03940	translate	read	null
2025-01-07	Visual question answering: from early developments to recent advances – a survey	Ngoc Dung Huynh et.al.	2501.03939	translate	read	null
2025-01-07	SCC-YOLO: An Improved Object Detector for Assisting in Brain Tumor Diagnosis	Runci Bai et.al.	2501.03836	translate	read	null
2025-01-07	Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection	Xinbin Yuan et.al.	2501.03775	translate	read	link
2025-01-07	AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features	Ruochen Zhang et.al.	2501.03700	translate	read	null
2025-01-07	Anomaly Triplet-Net: Progress Recognition Model Using Deep Metric Learning Considering Occlusion for Manual Assembly Work	Takumi Kitsukawa et.al.	2501.03533	translate	read	null
2025-01-07	SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild	Jiawei Liu et.al.	2501.02962	translate	read	null
2025-01-05	Multispectral Pedestrian Detection with Sparsely Annotated Label	Chan Lee et.al.	2501.02640	translate	read	null
2025-01-05	Generalization-Enhanced Few-Shot Object Detection in Remote Sensing	Hui Lin et.al.	2501.02474	translate	read	link
2025-01-04	Who Wrote This? Zero-Shot Statistical Tests for LLM-Generated Text Detection using Finite Sample Concentration Inequalities	Tara Radvand et.al.	2501.02406	translate	read	link
2025-01-04	V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection	Sichao Wang et.al.	2501.02363	translate	read	link
2025-01-04	Accurate Crop Yield Estimation of Blueberries using Deep Learning and Smart Drones	Hieu D. Nguyen et.al.	2501.02344	translate	read	null
2025-01-04	On The Causal Network Of Face-selective Regions In Human Brain During Movie Watching	Ali Bavafa et.al.	2501.02333	translate	read	null
2025-01-04	RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar	Liye Jia et.al.	2501.02314	translate	read	null
2025-01-03	A Separable Self-attention Inspired by the State Space Model for Computer Vision	Juntao Zhang et.al.	2501.02040	translate	read	link
2025-01-03	UAV-DETR: Efficient End-to-End Object Detection for Unmanned Aerial Vehicle Imagery	Huaxiang Zhang et.al.	2501.01855	translate	read	null
2025-01-03	Dual Mutual Learning Network with Global-local Awareness for RGB-D Salient Object Detection	Kang Yi et.al.	2501.01648	translate	read	link
2025-01-02	A Multi-task Supervised Compression Model for Split Computing	Yoshitomo Matsubara et.al.	2501.01420	translate	read	link
2025-01-02	MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception	Xiaoshuai Hao et.al.	2501.01037	translate	read	null
2025-01-01	A Novel Approach using CapsNet and Deep Belief Network for Detection and Identification of Oral Leukopenia	Hirthik Mathesh GV et.al.	2501.00876	translate	read	null
2025-01-01	NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model	Yuzhi Lai et.al.	2501.00785	translate	read	null

(<a href=../Object_Detection.md>back to Object Detection</a>)