Object Detection - 2024-05

Publish Date Title Authors PDF Translate Read Code
2024-05-31 Power of Cooperative Supervision: Multiple Teachers Framework for Enhanced 3D Semi-Supervised Object Detection Jin-Hee Lee et.al. 2405.20720 translate read link
2024-05-30 On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines Selim Kuzucu et.al. 2405.20459 translate read link
2024-05-30 RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection Fangyi Chen et.al. 2405.19854 translate read null
2024-05-30 Improving Object Detector Training on Synthetic Data by Starting With a Strong Baseline Methodology Frank A. Ruis et.al. 2405.19822 translate read null
2024-05-30 Towards Unified Multi-granularity Text Detection with Interactive Attention Xingyu Wan et.al. 2405.19765 translate read null
2024-05-30 Fully Test-Time Adaptation for Monocular 3D Object Detection Hongbin Lin et.al. 2405.19682 translate read link
2024-05-30 YotoR-You Only Transform One Representation José Ignacio Díaz Villa et.al. 2405.19629 translate read null
2024-05-29 Enabling Visual Recognition at Radio Frequency Haowen Lai et.al. 2405.19516 translate read null
2024-05-29 Model Agnostic Defense against Adversarial Patch Attacks on Object Detection in Unmanned Aerial Vehicles Saurabh Pathak et.al. 2405.19179 translate read null
2024-05-29 RGB-T Object Detection via Group Shuffled Multi-receptive Attention and Multi-modal Supervision Jinzhong Wang et.al. 2405.18955 translate read null
2024-05-29 SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for Autonomous Driving Yiming Cui et.al. 2405.18857 translate read null
2024-05-29 PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram Sifan Zhou et.al. 2405.18734 translate read null
2024-05-28 A Review and Implementation of Object Detection Models and Optimizations for Real-time Medical Mask Detection during the COVID-19 Pandemic Ioanna Gogou et.al. 2405.18387 translate read link
2024-05-28 Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving? Yifan Bai et.al. 2405.18361 translate read null
2024-05-28 Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention Weitai Kang et.al. 2405.18295 translate read link
2024-05-28 DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture Shentong Mo et.al. 2405.17995 translate read link
2024-05-28 Transformer and Hybrid Deep Learning Based Models for Machine-Generated Text Detection Teodor-George Marchitan et.al. 2405.17964 translate read null
2024-05-28 Self-supervised Pre-training for Transferable Multi-modal Perception Xiaohao Xu et.al. 2405.17942 translate read null
2024-05-28 Boosting General Trimap-free Matting in the Real-World Image Leo Shan Wenzhang Zhou Grace Zhao et.al. 2405.17916 translate read null
2024-05-28 The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention Xingyu Ding et.al. 2405.17776 translate read null
2024-05-27 Understanding differences in applying DETR to natural and medical images Yanqi Xu et.al. 2405.17677 translate read null
2024-05-27 Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection Shuai Zeng et.al. 2405.17422 translate read link
2024-05-27 Tracking Small Birds by Detection Candidate Region Filtering and Detection History-aware Association Tingwei Liu et.al. 2405.17323 translate read null
2024-05-27 Enhanced Automotive Radar Collaborative Sensing By Exploiting Constructive Interference Lifan Xu et.al. 2405.17297 translate read null
2024-05-27 SCaRL- A Synthetic Multi-Modal Dataset for Autonomous Driving Avinash Nittur Ramesh et.al. 2405.17030 translate read null
2024-05-27 Collective Perception Datasets for Autonomous Driving: A Comprehensive Review Sven Teufel et.al. 2405.16973 translate read null
2024-05-27 OED: Towards One-stage End-to-End Dynamic Scene Graph Generation Guan Wang et.al. 2405.16925 translate read link
2024-05-27 ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection Ziying Song et.al. 2405.16873 translate read null
2024-05-27 A re-calibration method for object detection with multi-modal alignment bias in autonomous driving Zhihang Song et.al. 2405.16848 translate read null
2024-05-26 A Study on Unsupervised Anomaly Detection and Defect Localization using Generative Model in Ultrasonic Non-Destructive Testing Yusaku Ando et.al. 2405.16580 translate read null
2024-05-26 AI-Generated Text Detection and Classification Based on BERT Deep Learning Algorithm Hao Wang et.al. 2405.16422 translate read null
2024-05-24 UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes Ted Lentsch et.al. 2405.15688 translate read link
2024-05-24 Multimodal Object Detection via Probabilistic a priori Information Integration Hafsa El Hafyani et.al. 2405.15596 translate read null
2024-05-24 Scale-Invariant Feature Disentanglement via Adversarial Learning for UAV-based Object Detection Fan Liu et.al. 2405.15465 translate read null
2024-05-24 Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets Hoàng-Ân Lê et.al. 2405.15394 translate read null
2024-05-24 Towards Global Optimal Visual In-Context Learning Prompt Selection Chengming Xu et.al. 2405.15279 translate read null
2024-05-24 Unbiased Faster R-CNN for Single-source Domain Generalized Object Detection Yajing Liu et.al. 2405.15225 translate read null
2024-05-24 ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models Jingyuan Zhu et.al. 2405.15199 translate read null
2024-05-24 MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method Pan Liao et.al. 2405.15176 translate read null
2024-05-23 Learning to Detect and Segment Mobile Objects from Unlabeled Videos Yihong Sun et.al. 2405.14841 translate read null
2024-05-23 Designing A Sustainable Marine Debris Clean-up Framework without Human Labels Raymond Wang et.al. 2405.14815 translate read null
2024-05-23 Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond Zhechao Wang et.al. 2405.14674 translate read null
2024-05-23 Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment Muhammad Sohail Danish et.al. 2405.14497 translate read null
2024-05-23 YOLOv10: Real-Time End-to-End Object Detection Ao Wang et.al. 2405.14458 translate read link
2024-05-23 Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations Mohammed Baharoon et.al. 2405.14239 translate read null
2024-05-22 Two Heads are Better Than One: Neural Networks Quantization with 2D Hilbert Curve-based Output Representation Mykhailo Uss et.al. 2405.14024 translate read null
2024-05-22 TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System Diogo Lavado et.al. 2405.13989 translate read null
2024-05-22 Class-Conditional self-reward mechanism for improved Text-to-Image models Safouane El Ghazouali et.al. 2405.13473 translate read link
2024-05-22 Adaptive Wireless Image Semantic Transmission and Over-The-Air Testing Jiarun Ding et.al. 2405.13403 translate read null
2024-05-21 BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once Theodore Zhao et.al. 2405.12971 translate read null
2024-05-21 AMFD: Distillation via Adaptive Multimodal Fusion for Multispectral Pedestrian Detection Zizhao Chen et.al. 2405.12944 translate read link
2024-05-21 Predicting the Influence of Adverse Weather on Pedestrian Detection with Automotive Radar and Lidar Sensors Daniel Weihmayr et.al. 2405.12736 translate read null
2024-05-21 Spotting AI’s Touch: Identifying LLM-Paraphrased Spans in Text Yafu Li et.al. 2405.12689 translate read null
2024-05-21 Automating Attendance Management in Human Resources: A Design Science Approach Using Computer Vision and Facial Recognition Bao-Thien Nguyen-Tat et.al. 2405.12633 translate read null
2024-05-21 FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors Shuai Liu et.al. 2405.12601 translate read link
2024-05-21 Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering Hiba Maryam et.al. 2405.12533 translate read null
2024-05-21 Active Object Detection with Knowledge Aggregation and Distillation from Large Models Dejie Yang et.al. 2405.12509 translate read null
2024-05-21 Mutual Information Analysis in Multimodal Learning Systems Hadi Hadizadeh et.al. 2405.12456 translate read null
2024-05-20 Multi-View Attentive Contextualization for Multi-View 3D Object Detection Xianpeng Liu et.al. 2405.12200 translate read null
2024-05-20 Bangladeshi Native Vehicle Detection in Wild Bipin Saha et.al. 2405.12150 translate read link
2024-05-20 Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments Jooyong Park et.al. 2405.11855 translate read null
2024-05-20 DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment Jianhong Han et.al. 2405.11765 translate read link
2024-05-20 Versatile Teacher: A Class-aware Teacher-student Framework for Cross-domain Adaptation Runou Yang et.al. 2405.11754 translate read link
2024-05-19 FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention Ziang Guo et.al. 2405.11682 translate read link
2024-05-19 SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization Jialong Guo et.al. 2405.11582 translate read link
2024-05-19 The First Swahili Language Scene Text Detection and Recognition Dataset Fadila Wendigoundi Douamba et.al. 2405.11437 translate read link
2024-05-18 InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images Wuzhou Li et.al. 2405.11293 translate read null
2024-05-18 Visible and Clear: Finding Tiny Objects in Difference Map Bing Cao et.al. 2405.11276 translate read null
2024-05-17 A Versatile Framework for Analyzing Galaxy Image Data by Implanting Human-in-the-loop on a Large Vision Model Mingxiang Fu et.al. 2405.10890 translate read null
2024-05-17 DeepPavlov at SemEval-2024 Task 8: Leveraging Transfer Learning for Detecting Boundaries of Machine-Generated Texts Anastasia Voznyuk et.al. 2405.10629 translate read link
2024-05-17 DuoSpaceNet: Leveraging Both Bird’s-Eye-View and Perspective View Representations for 3D Object Detection Zhe Huang et.al. 2405.10577 translate read null
2024-05-16 Drone-type-Set: Drone types detection benchmark for drone detection and tracking Kholoud AlDosari et.al. 2405.10398 translate read null
2024-05-16 Grounded 3D-LLM with Referent Tokens Yilun Chen et.al. 2405.10370 translate read link
2024-05-16 Grounding DINO 1.5: Advance the “Edge” of Open-Set Object Detection Tianhe Ren et.al. 2405.10300 translate read link
2024-05-16 Towards Task-Compatible Compressible Representations Anderson de Andrade et.al. 2405.10244 translate read link
2024-05-16 SpecDETR: A Transformer-based Hyperspectral Point Object Detection Network Zhaoxu Li et.al. 2405.10148 translate read link
2024-05-16 SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection Mingxuan Liu et.al. 2405.10053 translate read link
2024-05-16 FPDIoU Loss: A Loss Function for Efficient Bounding Box Regression of Rotated Object Detection Siliang Ma et.al. 2405.09942 translate read null
2024-05-16 Infrared Adversarial Car Stickers Xiaopei Zhu et.al. 2405.09924 translate read null
2024-05-16 PillarNeXt: Improving the 3D detector by introducing Voxel2Pillar feature encoding and extracting multi-scale features Xusheng Li et.al. 2405.09828 translate read null
2024-05-16 Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection Feiran Li et.al. 2405.09782 translate read link
2024-05-15 Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation Guo Yachan et.al. 2405.09682 translate read null
2024-05-15 Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels Guozhang Liu et.al. 2405.09024 translate read null
2024-05-14 CLIP with Quality Captions: A Strong Pretraining for Vision Tasks Pavan Kumar Anasosalu Vasu et.al. 2405.08911 translate read null
2024-05-14 Open-Vocabulary Object Detection via Neighboring Region Attention Alignment Sunyuan Qiang et.al. 2405.08593 translate read null
2024-05-14 Semantic Contextualization of Face Forgery: A New Definition, Dataset, and Detection Method Mian Zou et.al. 2405.08487 translate read link
2024-05-14 RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images Zong-Wei Hong et.al. 2405.08483 translate read link
2024-05-14 Multimodal Collaboration Networks for Geospatial Vehicle Detection in Dense, Occluded, and Large-Scale Events Xin Wu et.al. 2405.08251 translate read link
2024-05-13 RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors Liam Dugan et.al. 2405.07940 translate read null
2024-05-13 oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving Abdul Hannan Khan et.al. 2405.07698 translate read null
2024-05-13 MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders Xueying Jiang et.al. 2405.07696 translate read null
2024-05-13 Quality-aware Selective Fusion Network for V-D-T Salient Object Detection Liuxin Bao et.al. 2405.07655 translate read link
2024-05-13 Fast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keying Thomas Pöllabauer et.al. 2405.07653 translate read null
2024-05-13 Integrity Monitoring of 3D Object Detection in Automated Driving Systems using Raw Activation Patterns and Spatial Filtering Hakan Yekta Yatbaz et.al. 2405.07600 translate read null
2024-05-13 Environmental Matching Attack Against Unmanned Aerial Vehicles Object Detection Dehong Kong et.al. 2405.07595 translate read null
2024-05-13 Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis Tianci Bi et.al. 2405.07481 translate read null
2024-05-13 Enhancing 3D Object Detection by Using Neural Network with Self-adaptive Thresholding Houze Liu et.al. 2405.07479 translate read null
2024-05-12 MAML MOT: Multiple Object Tracking based on Meta-Learning Jiayi Chen et.al. 2405.07272 translate read null
2024-05-10 How to Augment for Atmospheric Turbulence Effects on Thermal Adapted Object Detection Models? Engin Uzun et.al. 2405.06383 translate read null
2024-05-10 Precise Apple Detection and Localization in Orchards using YOLOv5 for Robotic Harvesting Systems Jiang Ziyue et.al. 2405.06260 translate read null
2024-05-09 CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks Nick et.al. 2405.05755 translate read null
2024-05-09 Depth Awakens: A Depth-perceptual Attention Fusion Network for RGB-D Camouflaged Object Detection Xinran Liu et.al. 2405.05614 translate read null
2024-05-09 The object detection model uses combined extraction with KNN and RF classification Florentina Tatrin Kurniati et.al. 2405.05551 translate read null
2024-05-08 Reviewing Intelligent Cinematography: AI research for camera-based video production Adrian Azzarelli et.al. 2405.05039 translate read null
2024-05-07 A Novel Wide-Area Multiobject Detection System with High-Probability Region Searching Xianlei Long et.al. 2405.04589 translate read null
2024-05-07 DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving Chen Min et.al. 2405.04390 translate read null
2024-05-07 A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields Raiyan Rahman et.al. 2405.04305 translate read null
2024-05-07 ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers Jinke Li et.al. 2405.04299 translate read link
2024-05-07 Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore Junchao Wu et.al. 2405.04286 translate read link
2024-05-07 Deep Event-based Object Detection in Autonomous Driving: A Survey Bingquan Zhou et.al. 2405.03995 translate read null
2024-05-06 BadFusion: 2D-Oriented Backdoor Attacks against 3D Object Detection Saket S. Chaturvedi et.al. 2405.03884 translate read null
2024-05-06 RepVGG-GELAN: Enhanced GELAN with VGG-STYLE ConvNets for Brain Tumour Detection Thennarasi Balakrishnan et.al. 2405.03541 translate read link
2024-05-06 Low-light Object Detection Pengpeng Li et.al. 2405.03519 translate read null
2024-05-06 Salient Object Detection From Arbitrary Modalities Nianchang Huang et.al. 2405.03352 translate read null
2024-05-06 Modality Prompts for Arbitrary Modality Salient Object Detection Nianchang Huang et.al. 2405.03351 translate read null
2024-05-06 Vietnamese AI Generated Text Detection Quang-Dan Tran et.al. 2405.03206 translate read null
2024-05-06 PTQ4SAM: Post-Training Quantization for Segment Anything Chengtao Lv et.al. 2405.03144 translate read link
2024-05-05 Performance Evaluation of Real-Time Object Detection for Electric Scooters Dong Chen et.al. 2405.03039 translate read link
2024-05-05 SalFAU-Net: Saliency Fusion Attention U-Net for Salient Object Detection Kassaw Abraham Mulat et.al. 2405.02906 translate read null
2024-05-07 Adaptive Guidance Learning for Camouflaged Object Detection Zhennan Chen et.al. 2405.02824 translate read null
2024-05-05 PVTransformer: Point-to-Voxel Transformer for Scalable 3D Object Detection Zhaoqi Leng et.al. 2405.02811 translate read null
2024-05-02 Segmentation-Free Outcome Prediction in Head and Neck Cancer: Deep Learning-based Feature Extraction from Multi-Angle Maximum Intensity Projections (MA-MIPs) of PET Images Amirhosein Toosi et.al. 2405.01756 translate read null
2024-05-02 PointCompress3D – A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems Walter Zimmer et.al. 2405.01750 translate read null
2024-05-02 Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey Guoping Xu et.al. 2405.01725 translate read link
2024-05-02 SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients Tushar Verma et.al. 2405.01699 translate read null
2024-05-02 Imagine the Unseen: Occluded Pedestrian Detection via Adversarial Feature Completion Shanshan Zhang et.al. 2405.01311 translate read null
2024-05-02 Overcoming LLM Challenges using RAG-Driven Precision in Coffee Leaf Disease Remediation Dr. Selva Kumar S et.al. 2405.01310 translate read null
2024-05-02 Towards Consistent Object Detection via LiDAR-Camera Synergy Kai Luo et.al. 2405.01258 translate read link
2024-05-02 Federated Learning with Heterogeneous Data Handling for Robust Vehicular Object Detection Ahmad Khalil et.al. 2405.01108 translate read null
2024-05-01 Grains of Saliency: Optimizing Saliency-based Training of Biometric Attack Detection Models Colton R. Crum et.al. 2405.00650 translate read null
2024-05-01 Object detection under the linear subspace model with application to cryo-EM images Amitay Eldar et.al. 2405.00364 translate read null
