Object Detection - 2024-07
Object Detection - 2024-07
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-07-31 | Dynamic Object Queries for Transformer-based Incremental Object Detection | Jichuan Zhang et.al. | 2407.21687 | translate | read | null |
| 2024-07-31 | Spatial Transformer Network YOLO Model for Agricultural Object Detection | Yash Zambre et.al. | 2407.21652 | translate | read | null |
| 2024-07-31 | Evaluating SAM2’s Role in Camouflaged Object Detection: From SAM to SAM2 | Lv Tang et.al. | 2407.21596 | translate | read | null |
| 2024-07-31 | InScope: A New Real-world 3D Infrastructure-side Collaborative Perception Dataset for Open Traffic Scenarios | Xiaofei Zhang et.al. | 2407.21581 | translate | read | null |
| 2024-07-31 | Voxel Scene Graph for Intracranial Hemorrhage | Antoine P. Sanner et.al. | 2407.21580 | translate | read | null |
| 2024-07-31 | MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection | Kuo Wang et.al. | 2407.21465 | translate | read | null |
| 2024-07-31 | Generalized Tampered Scene Text Detection in the era of Generative AI | Chenfan Qu et.al. | 2407.21422 | translate | read | null |
| 2024-07-30 | Candidate Distant Trans-Neptunian Objects Detected by the New Horizons Subaru TNO Survey | Wesley C. Fraser et.al. | 2407.21142 | translate | read | null |
| 2024-07-30 | What is YOLOv5: A deep look into the internal features of the popular object detector | Rahima Khanam et.al. | 2407.20892 | translate | read | null |
| 2024-07-30 | WARM-3D: A Weakly-Supervised Sim2Real Domain Adaptation Framework for Roadside Monocular 3D Object Detection | Xingcheng Zhou et.al. | 2407.20818 | translate | read | null |
| 2024-07-31 | Integer-Valued Training and Spike-Driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection | Xinhao Luo et.al. | 2407.20708 | translate | read | link |
| 2024-07-29 | Uncertainty-Rectified YOLO-SAM for Weakly Supervised ICH Segmentation | Pascal Spiegler et.al. | 2407.20461 | translate | read | null |
| 2024-07-29 | MEVDT: Multi-Modal Event-Based Vehicle Detection and Tracking Dataset | Zaid A. El Shair et.al. | 2407.20446 | translate | read | null |
| 2024-07-30 | AxiomVision: Accuracy-Guaranteed Adaptive Visual Model Selection for Perspective-Aware Video Analytics | Xiangxiang Dai et.al. | 2407.20124 | translate | read | link |
| 2024-07-29 | Octave-YOLO: Cross frequency detection network with octave convolution | Sangjune Shin et.al. | 2407.19746 | translate | read | null |
| 2024-07-29 | Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images | Zewen Du et.al. | 2407.19696 | translate | read | null |
| 2024-07-29 | Practical Video Object Detection via Feature Selection and Aggregation | Yuheng Shi et.al. | 2407.19650 | translate | read | link |
| 2024-07-28 | Solving Short-Term Relocalization Problems In Monocular Keyframe Visual SLAM Using Spatial And Semantic Data | Azmyin Md. Kamal et.al. | 2407.19518 | translate | read | link |
| 2024-07-28 | Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets | Tianxiao Zhang et.al. | 2407.19394 | translate | read | link |
| 2024-07-27 | Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | Gang Pan et.al. | 2407.19271 | translate | read | null |
| 2024-07-27 | Enhancing Tree Type Detection in Forest Fire Risk Assessment: Multi-Stage Approach and Color Encoding with Forest Fire Risk Evaluation Framework for UAV Imagery | Jinda Zhang et.al. | 2407.19184 | translate | read | null |
| 2024-07-27 | Reducing Spurious Correlation for Federated Domain Generalization | Shuran Ma et.al. | 2407.19174 | translate | read | null |
| 2024-07-27 | Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble | Juhan Cha et.al. | 2407.19156 | translate | read | link |
| 2024-07-26 | Local Binary Pattern(LBP) Optimization for Feature Extraction | Zeinab Sedaghatjoo et.al. | 2407.18665 | translate | read | null |
| 2024-07-25 | LION: Linear Group RNN for 3D Object Detection in Point Clouds | Zhe Liu et.al. | 2407.18232 | translate | read | link |
| 2024-07-25 | XS-VID: An Extremely Small Video Object Detection Dataset | Jiahao Guo et.al. | 2407.18137 | translate | read | null |
| 2024-07-25 | SaccadeDet: A Novel Dual-Stage Architecture for Rapid and Accurate Detection in Gigapixel Images | Wenxi Li et.al. | 2407.17956 | translate | read | null |
| 2024-07-25 | A Novel Perception Entropy Metric for Optimizing Vehicle Perception with LiDAR Deployment | Yongjiang He et.al. | 2407.17942 | translate | read | null |
| 2024-07-25 | Hierarchical Object Detection and Recognition Framework for Practical Plant Disease Diagnosis | Kohei Iwano et.al. | 2407.17906 | translate | read | null |
| 2024-07-25 | Advancing 3D Point Cloud Understanding through Deep Transfer Learning: A Comprehensive Survey | Shahab Saquib Sohail et.al. | 2407.17877 | translate | read | null |
| 2024-07-25 | Enhancing Fine-grained Object Detection in Aerial Images via Orthogonal Mapping | Haoran Zhu et.al. | 2407.17738 | translate | read | link |
| 2024-07-26 | Unsqueeze [CLS] Bottleneck to Learn Rich Representations | Qing Su et.al. | 2407.17671 | translate | read | link |
| 2024-07-24 | SDLNet: Statistical Deep Learning Network for Co-Occurring Object Detection and Identification | Binay Kumar Singh et.al. | 2407.17664 | translate | read | null |
| 2024-07-24 | PEEKABOO: Hiding parts of an image for unsupervised object localization | Hasib Zunair et.al. | 2407.17628 | translate | read | link |
| 2024-07-24 | ALPI: Auto-Labeller with Proxy Injection for 3D Object Detection using 2D Labels Only | Saad Lahlali et.al. | 2407.17197 | translate | read | null |
| 2024-07-24 | DVPE: Divided View Position Embedding for Multi-View 3D Object Detection | Jiasen Wang et.al. | 2407.16955 | translate | read | link |
| 2024-07-23 | What Matters in Range View 3D Object Detection | Benjamin Wilson et.al. | 2407.16789 | translate | read | link |
| 2024-07-23 | A Framework for Pupil Tracking with Event Cameras | Khadija Iddrisu et.al. | 2407.16665 | translate | read | null |
| 2024-07-24 | Velocity Driven Vision: Asynchronous Sensor Fusion Birds Eye View Models for Autonomous Vehicles | Seamie Hayes et.al. | 2407.16636 | translate | read | null |
| 2024-07-23 | COALA: A Practical and Vision-Centric Federated Learning Platform | Weiming Zhuang et.al. | 2407.16560 | translate | read | link |
| 2024-07-23 | Dynamic Retraining-Updating Mean Teacher for Source-Free Object Detection | Trinh Le Ba Khanh et.al. | 2407.16497 | translate | read | link |
| 2024-07-23 | MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection | Youngmin Oh et.al. | 2407.16448 | translate | read | link |
| 2024-07-23 | ESOD: Efficient Small Object Detection on High-Resolution Images | Kai Liu et.al. | 2407.16424 | translate | read | null |
| 2024-07-23 | Understanding Impacts of Electromagnetic Signal Injection Attacks on Object Detection | Youqian Zhang et.al. | 2407.16327 | translate | read | null |
| 2024-07-23 | DeepClean: Integrated Distortion Identification and Algorithm Selection for Rectifying Image Corruptions | Aditya Kapoor et.al. | 2407.16302 | translate | read | null |
| 2024-07-23 | FoRA: Low-Rank Adaptation Model beyond Multimodal Siamese Network | Weiying Xie et.al. | 2407.16129 | translate | read | link |
| 2024-07-22 | PLayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight Clips | Håkon Maric Solberg et.al. | 2407.16076 | translate | read | null |
| 2024-07-22 | Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video | Guiqiu Liao et.al. | 2407.15794 | translate | read | null |
| 2024-07-22 | Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis | Brian K. S. Isaac-Medina et.al. | 2407.15763 | translate | read | null |
| 2024-07-22 | Counter Turing Test ( $CT^2$): Investigating AI-Generated Text Detection for Hindi – Ranking LLMs based on Hindi AI Detectability Index ($ADI_{hi}$ ) | Ishan Kavathekar et.al. | 2407.15694 | translate | read | null |
| 2024-07-22 | YOLOv10 for Automated Fracture Detection in Pediatric Wrist Trauma X-rays | Ammar Ahmed et.al. | 2407.15689 | translate | read | link |
| 2024-07-22 | SS-SFR: Synthetic Scenes Spatial Frequency Response on Virtual KITTI and Degraded Automotive Simulations for Object Detection | Daniel Jakab et.al. | 2407.15646 | translate | read | null |
| 2024-07-22 | YOLO-pdd: A Novel Multi-scale PCB Defect Detection Method Using Deep Representations with Sequential Images | Bowen Liu et.al. | 2407.15427 | translate | read | null |
| 2024-07-22 | Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection | Zhili Chen et.al. | 2407.15354 | translate | read | null |
| 2024-07-22 | Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection | Yiran Yang et.al. | 2407.15334 | translate | read | null |
| 2024-07-21 | Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection | Kwanyong Park et.al. | 2407.15296 | translate | read | null |
| 2024-07-21 | Multiple Object Detection and Tracking in Panoramic Videos for Cycling Safety Analysis | Jingwei Guo et.al. | 2407.15199 | translate | read | null |
| 2024-07-19 | Enhancing Layout Hotspot Detection Efficiency with YOLOv8 and PCA-Guided Augmentation | Dongyang Wu et.al. | 2407.14498 | translate | read | null |
| 2024-07-19 | MLMT-CNN for Object Detection and Segmentation in Multi-layer and Multi-spectral Images | Majedaldein Almahasneh et.al. | 2407.14473 | translate | read | null |
| 2024-07-19 | EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition | Youssef Doulfoukar et.al. | 2407.14314 | translate | read | null |
| 2024-07-19 | Bucketed Ranking-based Losses for Efficient Training of Object Detectors | Feyza Yavuz et.al. | 2407.14204 | translate | read | link |
| 2024-07-19 | Visual Text Generation in the Wild | Yuanzhi Zhu et.al. | 2407.14138 | translate | read | link |
| 2024-07-18 | GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model | Abdelrahman Shaker et.al. | 2407.13772 | translate | read | link |
| 2024-07-18 | General Geometry-aware Weakly Supervised 3D Object Detection | Guowen Zhang et.al. | 2407.13748 | translate | read | link |
| 2024-07-18 | Enhancing Source-Free Domain Adaptive Object Detection with Low-confidence Pseudo Label Distillation | Ilhoon Yoon et.al. | 2407.13524 | translate | read | link |
| 2024-07-18 | The use of the symmetric finite difference in the local binary pattern (symmetric LBP) | Zeinab Sedaghatjoo et.al. | 2407.13178 | translate | read | null |
| 2024-07-18 | Learning Camouflaged Object Detection from Noisy Pseudo Label | Jin Zhang et.al. | 2407.13157 | translate | read | null |
| 2024-07-18 | DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection | Zhourui Zhang et.al. | 2407.13147 | translate | read | null |
| 2024-07-18 | FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection | Jianwei Zhao et.al. | 2407.13133 | translate | read | null |
| 2024-07-17 | AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer | Zhuguanyu Wu et.al. | 2407.12951 | translate | read | link |
| 2024-07-17 | Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients | Dohyung Kim et.al. | 2407.12637 | translate | read | null |
| 2024-07-17 | CerberusDet: Unified Multi-Task Object Detection | Irina Tolstykh et.al. | 2407.12632 | translate | read | link |
| 2024-07-17 | Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation | Prantik Howlader et.al. | 2407.12630 | translate | read | link |
| 2024-07-17 | Enhancing Wrist Abnormality Detection with YOLO: Analysis of State-of-the-art Single-stage Detection Models | Ammar Ahmed et.al. | 2407.12597 | translate | read | link |
| 2024-07-17 | Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection | Hu Cao et.al. | 2407.12582 | translate | read | null |
| 2024-07-17 | Close the Sim2real Gap via Physically-based Structured Light Synthetic Data Simulation | Kaixin Bai et.al. | 2407.12449 | translate | read | null |
| 2024-07-17 | GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval | Han Zhou et.al. | 2407.12431 | translate | read | link |
| 2024-07-17 | Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection | Zhenni Yu et.al. | 2407.12339 | translate | read | null |
| 2024-07-16 | AFIDAF: Alternating Fourier and Image Domain Adaptive Filters as an Efficient Alternative to Attention in ViTs | Yunling Zheng et.al. | 2407.12217 | translate | read | null |
| 2024-07-16 | The object detection method aids in image reconstruction evaluation and clinical interpretation of meniscal abnormalities | Natalia Konovalova et.al. | 2407.12184 | translate | read | null |
| 2024-07-16 | A Case for Application-Aware Space Radiation Tolerance in Orbital Computing | Meiqi Wang et.al. | 2407.11853 | translate | read | null |
| 2024-07-16 | Improving Unsupervised Video Object Segmentation via Fake Flow Generation | Suhwan Cho et.al. | 2407.11714 | translate | read | link |
| 2024-07-16 | Relation DETR: Exploring Explicit Position Relation Prior for Object Detection | Xiuquan Hou et.al. | 2407.11699 | translate | read | link |
| 2024-07-16 | Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection | Qijie Mo et.al. | 2407.11499 | translate | read | null |
| 2024-07-16 | Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes | Zhi Cai et.al. | 2407.11464 | translate | read | link |
| 2024-07-16 | Generative AI Driven Task-Oriented Adaptive Semantic Communications | Yuzhou Fu et.al. | 2407.11354 | translate | read | null |
| 2024-07-16 | LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction | Penghui Du et.al. | 2407.11335 | translate | read | link |
| 2024-07-16 | TCFormer: Visual Recognition via Token Clustering Transformer | Wang Zeng et.al. | 2407.11321 | translate | read | link |
| 2024-07-16 | PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer | Pierre-David Letourneau et.al. | 2407.11306 | translate | read | null |
| 2024-07-15 | OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models | Zijian Zhou et.al. | 2407.11213 | translate | read | link |
| 2024-07-15 | Interpreting Hand gestures using Object Detection and Digits Classification | Sangeetha K et.al. | 2407.10902 | translate | read | null |
| 2024-07-15 | RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception | Chunliang Li et.al. | 2407.10876 | translate | read | link |
| 2024-07-15 | OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection | Jinghua Hou et.al. | 2407.10753 | translate | read | link |
| 2024-07-15 | Anticipating Future Object Compositions without Forgetting | Youssef Zahran et.al. | 2407.10723 | translate | read | null |
| 2024-07-15 | OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer | Yu Wang et.al. | 2407.10655 | translate | read | link |
| 2024-07-15 | Backdoor Attacks against Image-to-Image Networks | Wenbo Jiang et.al. | 2407.10445 | translate | read | null |
| 2024-07-14 | Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data | Tuo Feng et.al. | 2407.10200 | translate | read | link |
| 2024-07-14 | LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection | Sanmin Kim et.al. | 2407.10164 | translate | read | link |
| 2024-07-14 | FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection | Zheng Jiang et.al. | 2407.10135 | translate | read | null |
| 2024-07-14 | When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset | Yi Zhang et.al. | 2407.10125 | translate | read | null |
| 2024-07-12 | DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training | Chen Xin et.al. | 2407.09174 | translate | read | link |
| 2024-07-12 | Open Vocabulary Multi-Label Video Classification | Rohit Gupta et.al. | 2407.09073 | translate | read | null |
| 2024-07-12 | DroneMOT: Drone-based Multi-Object Tracking Considering Detection Difficulties and Simultaneous Moving of Drones and Objects | Peng Wang et.al. | 2407.09051 | translate | read | null |
| 2024-07-12 | Task-driven single-image super-resolution reconstruction of document scans | Maciej Zyrek et.al. | 2407.08993 | translate | read | null |
| 2024-07-11 | OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects | Akshay Krishnan et.al. | 2407.08711 | translate | read | null |
| 2024-07-11 | Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene | Ruiyang Zhang et.al. | 2407.08569 | translate | read | link |
| 2024-07-11 | Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation | Zeyang Zhao et.al. | 2407.08489 | translate | read | link |
| 2024-07-11 | Semi-Supervised Object Detection: A Survey on Progress from CNN to Transformer | Tahira Shehzadi et.al. | 2407.08460 | translate | read | null |
| 2024-07-11 | PowerYOLO: Mixed Precision Model for Hardware Efficient Object Detection with Event Data | Dominika Przewlocka-Rus et.al. | 2407.08272 | translate | read | null |
| 2024-07-11 | Knowledge distillation to effectively attain both region-of-interest and global semantics from an image where multiple objects appear | Seonwhee Jin et.al. | 2407.08257 | translate | read | link |
| 2024-07-11 | Enrich the content of the image Using Context-Aware Copy Paste | Qiushi Guo et.al. | 2407.08151 | translate | read | null |
| 2024-07-11 | DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing | Minghang Zhou et.al. | 2407.08132 | translate | read | null |
| 2024-07-10 | MambaVision: A Hybrid Mamba-Transformer Vision Backbone | Ali Hatamizadeh et.al. | 2407.08083 | translate | read | link |
| 2024-07-10 | Bayesian Detector Combination for Object Detection with Crowdsourced Annotations | Zhi Qin Tan et.al. | 2407.07958 | translate | read | link |
| 2024-07-10 | Cross Domain Object Detection via Multi-Granularity Confidence Alignment based Mean Teacher | Jiangming Chen et.al. | 2407.07780 | translate | read | null |
| 2024-07-10 | LSM: A Comprehensive Metric for Assessing the Safety of Lane Detection Systems in Autonomous Driving | Jörg Gamerdinger et.al. | 2407.07740 | translate | read | null |
| 2024-07-10 | Few-Shot Domain Adaptive Object Detection for Microscopic Images | Sumayya Inayat et.al. | 2407.07633 | translate | read | null |
| 2024-07-10 | Simplifying Source-Free Domain Adaptation for Object Detection: Effective Self-Training Strategies and Performance Insights | Yan Hao et.al. | 2407.07586 | translate | read | link |
| 2024-07-09 | Exploring Camera Encoder Designs for Autonomous Driving Perception | Barath Lakshmanan et.al. | 2407.07276 | translate | read | null |
| 2024-07-09 | ConvNLP: Image-based AI Text Detection | Suriya Prakash Jambunathan et.al. | 2407.07225 | translate | read | null |
| 2024-07-09 | Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images | Chuanrui Zhang et.al. | 2407.06984 | translate | read | null |
| 2024-07-09 | Cue Point Estimation using Object Detection | Giulia Argüello et.al. | 2407.06823 | translate | read | link |
| 2024-07-09 | CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection | Shuang Hao et.al. | 2407.06780 | translate | read | link |
| 2024-07-09 | Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions | Yu-Guan Hsieh et.al. | 2407.06723 | translate | read | null |
| 2024-07-08 | Stochastic Traveling Salesperson Problem with Neighborhoods for Object Detection | Cheng Peng et.al. | 2407.06366 | translate | read | null |
| 2024-07-08 | GeoWATCH for Detecting Heavy Construction in Heterogeneous Time Series of Satellite Images | Jon Crall et.al. | 2407.06337 | translate | read | null |
| 2024-07-08 | Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection | Chenxu Wang et.al. | 2407.05909 | translate | read | link |
| 2024-07-08 | Boosting 3D Object Detection with Semantic-Aware Multi-Branch Framework | Hao Jing et.al. | 2407.05769 | translate | read | null |
| 2024-07-08 | Short-term Object Interaction Anticipation with Disentangled Object Detection @ Ego4D Short Term Object Interaction Anticipation Challenge | Hyunjin Cho et.al. | 2407.05713 | translate | read | link |
| 2024-07-08 | Weakly Supervised Test-Time Domain Adaptation for Object Detection | Anh-Dzung Doan et.al. | 2407.05607 | translate | read | null |
| 2024-07-08 | Towards Reflected Object Detection: A Benchmark | Zhongtian Wang et.al. | 2407.05575 | translate | read | null |
| 2024-07-08 | GMC: A General Framework of Multi-stage Context Learning and Utilization for Visual Detection Tasks | Xuan Wang et.al. | 2407.05566 | translate | read | null |
| 2024-07-07 | CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs | Akshat Ramachandran et.al. | 2407.05266 | translate | read | link |
| 2024-07-07 | Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image | Pengkun Jiao et.al. | 2407.05256 | translate | read | null |
| 2024-07-06 | SCSA: Exploring the Synergistic Effects Between Spatial and Channel Attention | Yunzhong Si et.al. | 2407.05128 | translate | read | null |
| 2024-07-06 | Quantizing YOLOv7: A Comprehensive Study | Mohammadamin Baghbanbashi et.al. | 2407.04943 | translate | read | null |
| 2024-07-05 | SH17: A Dataset for Human Safety and Personal Protective Equipment Detection in Manufacturing Industry | Hafiz Mughees Ahmad et.al. | 2407.04590 | translate | read | link |
| 2024-07-05 | Optimizing the image correction pipeline for pedestrian detection in the thermal-infrared domain | Christophe Karam et.al. | 2407.04484 | translate | read | null |
| 2024-07-05 | Multi-Branch Auxiliary Fusion YOLO with Re-parameterization Heterogeneous Convolutional for accurate object detection | Zhiqiang Yang et.al. | 2407.04381 | translate | read | link |
| 2024-07-05 | Towards Stable 3D Object Detection | Jiabao Wang et.al. | 2407.04305 | translate | read | null |
| 2024-07-05 | Research, Applications and Prospects of Event-Based Pedestrian Detection: A Survey | Han Wang et.al. | 2407.04277 | translate | read | null |
| 2024-07-04 | LiDAR-based Real-Time Object Detection and Tracking in Dynamic Environments | Wenqiang Du et.al. | 2407.04115 | translate | read | null |
| 2024-07-04 | FIPGNet:Pyramid grafting network with feature interaction strategies | Ziyi Ding et.al. | 2407.04085 | translate | read | null |
| 2024-07-04 | Detect Closer Surfaces that can be Seen: New Modeling and Evaluation in Cross-domain 3D Object Detection | Ruixiao Zhang et.al. | 2407.04061 | translate | read | null |
| 2024-07-04 | The Solution for the GAIIC2024 RGB-TIR object detection Challenge | Xiangyu Wu et.al. | 2407.03872 | translate | read | null |
| 2024-07-04 | StreamLTS: Query-based Temporal-Spatial LiDAR Fusion for Cooperative Object Detection | Yunshuang Yuan et.al. | 2407.03825 | translate | read | null |
| 2024-07-03 | Visual Grounding with Attention-Driven Constraint Balancing | Weitai Kang et.al. | 2407.03243 | translate | read | null |
| 2024-07-03 | Category-Aware Dynamic Label Assignment with High-Quality Oriented Proposal | Mingkui Feng et.al. | 2407.03205 | translate | read | null |
| 2024-07-03 | SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding | Weitai Kang et.al. | 2407.03200 | translate | read | link |
| 2024-07-03 | Global Context Modeling in YOLOv8 for Pediatric Wrist Fracture Detection | Rui-Yang Ju et.al. | 2407.03163 | translate | read | link |
| 2024-07-03 | YOLOv5, YOLOv8 and YOLOv10: The Go-To Detectors for Real-time Vision | Muhammad Hussain et.al. | 2407.02988 | translate | read | null |
| 2024-07-03 | Mast Kalandar at SemEval-2024 Task 8: On the Trail of Textual Origins: RoBERTa-BiLSTM Approach to Detect AI-Generated Text | Jainit Sushil Bafna et.al. | 2407.02978 | translate | read | null |
| 2024-07-03 | A Pairwise DomMix Attentive Adversarial Network for Unsupervised Domain Adaptive Object Detection | Jie Shao et.al. | 2407.02835 | translate | read | null |
| 2024-07-03 | ADFQ-ViT: Activation-Distribution-Friendly Post-Training Quantization for Vision Transformers | Yanfeng Jiang et.al. | 2407.02763 | translate | read | null |
| 2024-07-02 | SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection | Anay Majee et.al. | 2407.02665 | translate | read | null |
| 2024-07-02 | Robust ADAS: Enhancing Robustness of Machine Learning-based Advanced Driver Assistance Systems for Adverse Weather | Muhammad Zaeem Shahzad et.al. | 2407.02581 | translate | read | null |
| 2024-07-02 | Similarity Distance-Based Label Assignment for Tiny Object Detection | Shuohao Shi et.al. | 2407.02394 | translate | read | link |
| 2024-07-02 | OpenSlot: Mixed Open-set Recognition with Object-centric Learning | Xu Yin et.al. | 2407.02386 | translate | read | null |
| 2024-07-02 | DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection | Kaixin Xu et.al. | 2407.02098 | translate | read | null |
| 2024-07-02 | Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning | Chengchao Shen et.al. | 2407.02014 | translate | read | link |
| 2024-07-02 | Adaptive Modality Balanced Online Knowledge Distillation for Brain-Eye-Computer based Dim Object Detection | Zixing Li et.al. | 2407.01894 | translate | read | link |
| 2024-07-01 | Scarecrow monitoring system:employing mobilenet ssd for enhanced animal supervision | Balaji VS et.al. | 2407.01435 | translate | read | null |
| 2024-07-01 | Formal Verification of Object Detection | Avraham Raviv et.al. | 2407.01295 | translate | read | null |
| 2024-07-01 | Cross-Architecture Auxiliary Feature Space Translation for Efficient Few-Shot Personalized Object Detection | Francesco Barbato et.al. | 2407.01193 | translate | read | null |
| 2024-07-01 | Eliminating Position Bias of Language Models: A Mechanistic Approach | Ziqi Wang et.al. | 2407.01100 | translate | read | link |
| 2024-07-01 | No More Potentially Dynamic Objects: Static Point Cloud Map Generation based on 3D Object Detection and Ground Projection | Soojin Woo et.al. | 2407.01073 | translate | read | null |
| 2024-07-01 | Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding | Yifan Tang et.al. | 2406.19791 | translate | read | null |
(<a href=../Object_Detection.md>back to Object Detection</a>)