Object Detection - 2024-05
Object Detection - 2024-05
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-05-31 | Power of Cooperative Supervision: Multiple Teachers Framework for Enhanced 3D Semi-Supervised Object Detection | Jin-Hee Lee et.al. | 2405.20720 | translate | read | link |
| 2024-05-30 | On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines | Selim Kuzucu et.al. | 2405.20459 | translate | read | link |
| 2024-05-30 | RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection | Fangyi Chen et.al. | 2405.19854 | translate | read | null |
| 2024-05-30 | Improving Object Detector Training on Synthetic Data by Starting With a Strong Baseline Methodology | Frank A. Ruis et.al. | 2405.19822 | translate | read | null |
| 2024-05-30 | Towards Unified Multi-granularity Text Detection with Interactive Attention | Xingyu Wan et.al. | 2405.19765 | translate | read | null |
| 2024-05-30 | Fully Test-Time Adaptation for Monocular 3D Object Detection | Hongbin Lin et.al. | 2405.19682 | translate | read | link |
| 2024-05-30 | YotoR-You Only Transform One Representation | José Ignacio Díaz Villa et.al. | 2405.19629 | translate | read | null |
| 2024-05-29 | Enabling Visual Recognition at Radio Frequency | Haowen Lai et.al. | 2405.19516 | translate | read | null |
| 2024-05-29 | Model Agnostic Defense against Adversarial Patch Attacks on Object Detection in Unmanned Aerial Vehicles | Saurabh Pathak et.al. | 2405.19179 | translate | read | null |
| 2024-05-29 | RGB-T Object Detection via Group Shuffled Multi-receptive Attention and Multi-modal Supervision | Jinzhong Wang et.al. | 2405.18955 | translate | read | null |
| 2024-05-29 | SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for for Autonomous Driving | Yiming Cui et.al. | 2405.18857 | translate | read | null |
| 2024-05-29 | PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram | Sifan Zhou et.al. | 2405.18734 | translate | read | null |
| 2024-05-28 | A Review and Implementation of Object Detection Models and Optimizations for Real-time Medical Mask Detection during the COVID-19 Pandemic | Ioanna Gogou et.al. | 2405.18387 | translate | read | link |
| 2024-05-28 | Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving? | Yifan Bai et.al. | 2405.18361 | translate | read | null |
| 2024-05-28 | Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention | Weitai Kang et.al. | 2405.18295 | translate | read | link |
| 2024-05-28 | DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture | Shentong Mo et.al. | 2405.17995 | translate | read | link |
| 2024-05-28 | Transformer and Hybrid Deep Learning Based Models for Machine-Generated Text Detection | Teodor-George Marchitan et.al. | 2405.17964 | translate | read | null |
| 2024-05-28 | Self-supervised Pre-training for Transferable Multi-modal Perception | Xiaohao Xu et.al. | 2405.17942 | translate | read | null |
| 2024-05-28 | Boosting General Trimap-free Matting in the Real-World Image | Leo Shan Wenzhang Zhou Grace Zhao et.al. | 2405.17916 | translate | read | null |
| 2024-05-28 | The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention | Xingyu Ding et.al. | 2405.17776 | translate | read | null |
| 2024-05-27 | Understanding differences in applying DETR to natural and medical images | Yanqi Xu et.al. | 2405.17677 | translate | read | null |
| 2024-05-27 | Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection | Shuai Zeng et.al. | 2405.17422 | translate | read | link |
| 2024-05-27 | Tracking Small Birds by Detection Candidate Region Filtering and Detection History-aware Association | Tingwei Liu et.al. | 2405.17323 | translate | read | null |
| 2024-05-27 | Enhanced Automotive Radar Collaborative Sensing By Exploiting Constructive Interference | Lifan Xu et.al. | 2405.17297 | translate | read | null |
| 2024-05-27 | SCaRL- A Synthetic Multi-Modal Dataset for Autonomous Driving | Avinash Nittur Ramesh et.al. | 2405.17030 | translate | read | null |
| 2024-05-27 | Collective Perception Datasets for Autonomous Driving: A Comprehensive Review | Sven Teufel et.al. | 2405.16973 | translate | read | null |
| 2024-05-27 | OED: Towards One-stage End-to-End Dynamic Scene Graph Generation | Guan Wang et.al. | 2405.16925 | translate | read | link |
| 2024-05-27 | ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection | Ziying Song et.al. | 2405.16873 | translate | read | null |
| 2024-05-27 | A re-calibration method for object detection with multi-modal alignment bias in autonomous driving | Zhihang Song et.al. | 2405.16848 | translate | read | null |
| 2024-05-26 | A Study on Unsupervised Anomaly Detection and Defect Localization using Generative Model in Ultrasonic Non-Destructive Testing | Yusaku Ando et.al. | 2405.16580 | translate | read | null |
| 2024-05-26 | AI-Generated Text Detection and Classification Based on BERT Deep Learning Algorithm | Hao Wang et.al. | 2405.16422 | translate | read | null |
| 2024-05-24 | UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes | Ted Lentsch et.al. | 2405.15688 | translate | read | link |
| 2024-05-24 | Multimodal Object Detection via Probabilistic a priori Information Integration | Hafsa El Hafyani et.al. | 2405.15596 | translate | read | null |
| 2024-05-24 | Scale-Invariant Feature Disentanglement via Adversarial Learning for UAV-based Object Detection | Fan Liu et.al. | 2405.15465 | translate | read | null |
| 2024-05-24 | Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets | Hoàng-Ân Lê et.al. | 2405.15394 | translate | read | null |
| 2024-05-24 | Towards Global Optimal Visual In-Context Learning Prompt Selection | Chengming Xu et.al. | 2405.15279 | translate | read | null |
| 2024-05-24 | Unbiased Faster R-CNN for Single-source Domain Generalized Object Detection | Yajing Liu et.al. | 2405.15225 | translate | read | null |
| 2024-05-24 | ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models | Jingyuan Zhu et.al. | 2405.15199 | translate | read | null |
| 2024-05-24 | MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method | Pan Liao et.al. | 2405.15176 | translate | read | null |
| 2024-05-23 | Learning to Detect and Segment Mobile Objects from Unlabeled Videos | Yihong Sun et.al. | 2405.14841 | translate | read | null |
| 2024-05-23 | Designing A Sustainable Marine Debris Clean-up Framework without Human Labels | Raymond Wang et.al. | 2405.14815 | translate | read | null |
| 2024-05-23 | Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond | Zhechao Wang et.al. | 2405.14674 | translate | read | null |
| 2024-05-23 | Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment | Muhammad Sohail Danish et.al. | 2405.14497 | translate | read | null |
| 2024-05-23 | YOLOv10: Real-Time End-to-End Object Detection | Ao Wang et.al. | 2405.14458 | translate | read | link |
| 2024-05-23 | Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations | Mohammed Baharoon et.al. | 2405.14239 | translate | read | null |
| 2024-05-22 | Two Heads are Better Than One: Neural Networks Quantization with 2D Hilbert Curve-based Output Representation | Mykhailo Uss et.al. | 2405.14024 | translate | read | null |
| 2024-05-22 | TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System | Diogo Lavado et.al. | 2405.13989 | translate | read | null |
| 2024-05-22 | Class-Conditional self-reward mechanism for improved Text-to-Image models | Safouane El Ghazouali et.al. | 2405.13473 | translate | read | link |
| 2024-05-22 | Adaptive Wireless Image Semantic Transmission and Over-The-Air Testing | Jiarun Ding et.al. | 2405.13403 | translate | read | null |
| 2024-05-21 | BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once | Theodore Zhao et.al. | 2405.12971 | translate | read | null |
| 2024-05-21 | AMFD: Distillation via Adaptive Multimodal Fusion for Multispectral Pedestrian Detection | Zizhao Chen et.al. | 2405.12944 | translate | read | link |
| 2024-05-21 | Predicting the Influence of Adverse Weather on Pedestrian Detection with Automotive Radar and Lidar Sensors | Daniel Weihmayr et.al. | 2405.12736 | translate | read | null |
| 2024-05-21 | Spotting AI’s Touch: Identifying LLM-Paraphrased Spans in Text | Yafu Li et.al. | 2405.12689 | translate | read | null |
| 2024-05-21 | Automating Attendance Management in Human Resources: A Design Science Approach Using Computer Vision and Facial Recognition | Bao-Thien Nguyen-Tat et.al. | 2405.12633 | translate | read | null |
| 2024-05-21 | FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors | Shuai Liu et.al. | 2405.12601 | translate | read | link |
| 2024-05-21 | Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering | Hiba Maryam et.al. | 2405.12533 | translate | read | null |
| 2024-05-21 | Active Object Detection with Knowledge Aggregation and Distillation from Large Models | Dejie Yang et.al. | 2405.12509 | translate | read | null |
| 2024-05-21 | Mutual Information Analysis in Multimodal Learning Systems | Hadi Hadizadeh et.al. | 2405.12456 | translate | read | null |
| 2024-05-20 | Multi-View Attentive Contextualization for Multi-View 3D Object Detection | Xianpeng Liu et.al. | 2405.12200 | translate | read | null |
| 2024-05-20 | Bangladeshi Native Vehicle Detection in Wild | Bipin Saha et.al. | 2405.12150 | translate | read | link |
| 2024-05-20 | Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments | Jooyong Park et.al. | 2405.11855 | translate | read | null |
| 2024-05-20 | DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment | Jianhong Han et.al. | 2405.11765 | translate | read | link |
| 2024-05-20 | Versatile Teacher: A Class-aware Teacher-student Framework for Cross-domain Adaptation | Runou Yang et.al. | 2405.11754 | translate | read | link |
| 2024-05-19 | FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention | Ziang Guo et.al. | 2405.11682 | translate | read | link |
| 2024-05-19 | SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization | Jialong Guo et.al. | 2405.11582 | translate | read | link |
| 2024-05-19 | The First Swahili Language Scene Text Detection and Recognition Dataset | Fadila Wendigoundi Douamba et.al. | 2405.11437 | translate | read | link |
| 2024-05-18 | InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images | Wuzhou Li et.al. | 2405.11293 | translate | read | null |
| 2024-05-18 | Visible and Clear: Finding Tiny Objects in Difference Map | Bing Cao et.al. | 2405.11276 | translate | read | null |
| 2024-05-17 | A Versatile Framework for Analyzing Galaxy Image Data by Implanting Human-in-the-loop on a Large Vision Model | Mingxiang Fu et.al. | 2405.10890 | translate | read | null |
| 2024-05-17 | DeepPavlov at SemEval-2024 Task 8: Leveraging Transfer Learning for Detecting Boundaries of Machine-Generated Texts | Anastasia Voznyuk et.al. | 2405.10629 | translate | read | link |
| 2024-05-17 | DuoSpaceNet: Leveraging Both Bird’s-Eye-View and Perspective View Representations for 3D Object Detection | Zhe Huang et.al. | 2405.10577 | translate | read | null |
| 2024-05-16 | Drone-type-Set: Drone types detection benchmark for drone detection and tracking | Kholoud AlDosari et.al. | 2405.10398 | translate | read | null |
| 2024-05-16 | Grounded 3D-LLM with Referent Tokens | Yilun Chen et.al. | 2405.10370 | translate | read | link |
| 2024-05-16 | Grounding DINO 1.5: Advance the “Edge” of Open-Set Object Detection | Tianhe Ren et.al. | 2405.10300 | translate | read | link |
| 2024-05-16 | Towards Task-Compatible Compressible Representations | Anderson de Andrade et.al. | 2405.10244 | translate | read | link |
| 2024-05-16 | SpecDETR: A Transformer-based Hyperspectral Point Object Detection Network | Zhaoxu Li et.al. | 2405.10148 | translate | read | link |
| 2024-05-16 | SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection | Mingxuan Liu et.al. | 2405.10053 | translate | read | link |
| 2024-05-16 | FPDIoU Loss: A Loss Function for Efficient Bounding Box Regression of Rotated Object Detection | Siliang Ma et.al. | 2405.09942 | translate | read | null |
| 2024-05-16 | Infrared Adversarial Car Stickers | Xiaopei Zhu et.al. | 2405.09924 | translate | read | null |
| 2024-05-16 | PillarNeXt: Improving the 3D detector by introducing Voxel2Pillar feature encoding and extracting multi-scale features | Xusheng Li et.al. | 2405.09828 | translate | read | null |
| 2024-05-16 | Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection | Feiran Li et.al. | 2405.09782 | translate | read | link |
| 2024-05-15 | Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation | Guo Yachan et.al. | 2405.09682 | translate | read | null |
| 2024-05-15 | Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels | Guozhang Liu et.al. | 2405.09024 | translate | read | null |
| 2024-05-14 | CLIP with Quality Captions: A Strong Pretraining for Vision Tasks | Pavan Kumar Anasosalu Vasu et.al. | 2405.08911 | translate | read | null |
| 2024-05-14 | Open-Vocabulary Object Detection via Neighboring Region Attention Alignment | Sunyuan Qiang et.al. | 2405.08593 | translate | read | null |
| 2024-05-14 | Semantic Contextualization of Face Forgery: A New Definition, Dataset, and Detection Method | Mian Zou et.al. | 2405.08487 | translate | read | link |
| 2024-05-14 | RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images | Zong-Wei Hong et.al. | 2405.08483 | translate | read | link |
| 2024-05-14 | Multimodal Collaboration Networks for Geospatial Vehicle Detection in Dense, Occluded, and Large-Scale Events | Xin Wu et.al. | 2405.08251 | translate | read | link |
| 2024-05-13 | RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors | Liam Dugan et.al. | 2405.07940 | translate | read | null |
| 2024-05-13 | oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving | Abdul Hannan Khan et.al. | 2405.07698 | translate | read | null |
| 2024-05-13 | MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders | Xueying Jiang et.al. | 2405.07696 | translate | read | null |
| 2024-05-13 | Quality-aware Selective Fusion Network for V-D-T Salient Object Detection | Liuxin Bao et.al. | 2405.07655 | translate | read | link |
| 2024-05-13 | Fast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keying | Thomas Pöllabauer et.al. | 2405.07653 | translate | read | null |
| 2024-05-13 | Integrity Monitoring of 3D Object Detection in Automated Driving Systems using Raw Activation Patterns and Spatial Filtering | Hakan Yekta Yatbaz et.al. | 2405.07600 | translate | read | null |
| 2024-05-13 | Environmental Matching Attack Against Unmanned Aerial Vehicles Object Detection | Dehong Kong et.al. | 2405.07595 | translate | read | null |
| 2024-05-13 | Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis | Tianci Bi et.al. | 2405.07481 | translate | read | null |
| 2024-05-13 | Enhancing 3D Object Detection by Using Neural Network with Self-adaptive Thresholding | Houze Liu et.al. | 2405.07479 | translate | read | null |
| 2024-05-12 | MAML MOT: Multiple Object Tracking based on Meta-Learning | Jiayi Chen et.al. | 2405.07272 | translate | read | null |
| 2024-05-10 | How to Augment for Atmospheric Turbulence Effects on Thermal Adapted Object Detection Models? | Engin Uzun et.al. | 2405.06383 | translate | read | null |
| 2024-05-10 | Precise Apple Detection and Localization in Orchards using YOLOv5 for Robotic Harvesting Systems | Jiang Ziyue et.al. | 2405.06260 | translate | read | null |
| 2024-05-09 | CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks | Nick et.al. | 2405.05755 | translate | read | null |
| 2024-05-09 | Depth Awakens: A Depth-perceptual Attention Fusion Network for RGB-D Camouflaged Object Detection | Xinran Liua et.al. | 2405.05614 | translate | read | null |
| 2024-05-09 | The object detection model uses combined extraction with KNN and RF classification | Florentina Tatrin Kurniati et.al. | 2405.05551 | translate | read | null |
| 2024-05-08 | Reviewing Intelligent Cinematography: AI research for camera-based video production | Adrian Azzarelli et.al. | 2405.05039 | translate | read | null |
| 2024-05-07 | A Novel Wide-Area Multiobject Detection System with High-Probability Region Searching | Xianlei Long et.al. | 2405.04589 | translate | read | null |
| 2024-05-07 | DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving | Chen Min et.al. | 2405.04390 | translate | read | null |
| 2024-05-07 | A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields | Raiyan Rahman et.al. | 2405.04305 | translate | read | null |
| 2024-05-07 | ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers | Jinke Li et.al. | 2405.04299 | translate | read | link |
| 2024-05-07 | Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore | Junchao Wu et.al. | 2405.04286 | translate | read | link |
| 2024-05-07 | Deep Event-based Object Detection in Autonomous Driving: A Survey | Bingquan Zhou et.al. | 2405.03995 | translate | read | null |
| 2024-05-06 | BadFusion: 2D-Oriented Backdoor Attacks against 3D Object Detection | Saket S. Chaturvedi et.al. | 2405.03884 | translate | read | null |
| 2024-05-06 | RepVGG-GELAN: Enhanced GELAN with VGG-STYLE ConvNets for Brain Tumour Detection | Thennarasi Balakrishnan et.al. | 2405.03541 | translate | read | link |
| 2024-05-06 | Low-light Object Detection | Pengpeng Li et.al. | 2405.03519 | translate | read | null |
| 2024-05-06 | Salient Object Detection From Arbitrary Modalities | Nianchang Huang et.al. | 2405.03352 | translate | read | null |
| 2024-05-06 | Modality Prompts for Arbitrary Modality Salient Object Detection | Nianchang Huang et.al. | 2405.03351 | translate | read | null |
| 2024-05-06 | Vietnamese AI Generated Text Detection | Quang-Dan Tran et.al. | 2405.03206 | translate | read | null |
| 2024-05-06 | PTQ4SAM: Post-Training Quantization for Segment Anything | Chengtao Lv et.al. | 2405.03144 | translate | read | link |
| 2024-05-05 | Performance Evaluation of Real-Time Object Detection for Electric Scooters | Dong Chen et.al. | 2405.03039 | translate | read | link |
| 2024-05-05 | SalFAU-Net: Saliency Fusion Attention U-Net for Salient Object Detection | Kassaw Abraham Mulat et.al. | 2405.02906 | translate | read | null |
| 2024-05-07 | Adaptive Guidance Learning for Camouflaged Object Detection | Zhennan Chen et.al. | 2405.02824 | translate | read | null |
| 2024-05-05 | PVTransformer: Point-to-Voxel Transformer for Scalable 3D Object Detection | Zhaoqi Leng et.al. | 2405.02811 | translate | read | null |
| 2024-05-02 | Segmentation-Free Outcome Prediction in Head and Neck Cancer: Deep Learning-based Feature Extraction from Multi-Angle Maximum Intensity Projections (MA-MIPs) of PET Images | Amirhosein Toosi et.al. | 2405.01756 | translate | read | null |
| 2024-05-02 | PointCompress3D – A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems | Walter Zimmer et.al. | 2405.01750 | translate | read | null |
| 2024-05-02 | Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey | Guoping Xu et.al. | 2405.01725 | translate | read | link |
| 2024-05-02 | SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients | Tushar Verma et.al. | 2405.01699 | translate | read | null |
| 2024-05-02 | Imagine the Unseen: Occluded Pedestrian Detection via Adversarial Feature Completion | Shanshan Zhang et.al. | 2405.01311 | translate | read | null |
| 2024-05-02 | Overcoming LLM Challenges using RAG-Driven Precision in Coffee Leaf Disease Remediation | Dr. Selva Kumar S et.al. | 2405.01310 | translate | read | null |
| 2024-05-02 | Towards Consistent Object Detection via LiDAR-Camera Synergy | Kai Luo et.al. | 2405.01258 | translate | read | link |
| 2024-05-02 | Federated Learning with Heterogeneous Data Handling for Robust Vehicular Object Detection | Ahmad Khalil et.al. | 2405.01108 | translate | read | null |
| 2024-05-01 | Grains of Saliency: Optimizing Saliency-based Training of Biometric Attack Detection Models | Colton R. Crum et.al. | 2405.00650 | translate | read | null |
| 2024-05-01 | Object detection under the linear subspace model with application to cryo-EM images | Amitay Eldar et.al. | 2405.00364 | translate | read | null |
(<a href=../Object_Detection.md>back to Object Detection</a>)