Object Detection - 2024-04

Publish Date Title Authors PDF Translate Read Code
2024-04-30 Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation Yunhao Ge et.al. 2404.19752 translate read null
2024-04-30 Quantifying Nematodes through Images: Datasets, Models, and Baselines of Deep Learning Zhipeng Yuan et.al. 2404.19748 translate read null
2024-04-30 Masked Multi-Query Slot Attention for Unsupervised Object Discovery Rishav Pramanik et.al. 2404.19654 translate read link
2024-04-30 Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World Wen Yin et.al. 2404.19417 translate read null
2024-04-30 UniFS: Universal Few-shot Instance Perception with Point Representations Sheng Jin et.al. 2404.19401 translate read null
2024-04-30 Pseudo Label Refinery for Unsupervised Domain Adaptation on Cross-dataset 3D Object Detection Zhanwei Zhang et.al. 2404.19384 translate read null
2024-04-30 Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank Sungjune Park et.al. 2404.19299 translate read null
2024-04-29 MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection Heitor R. Medeiros et.al. 2404.18849 translate read null
2024-04-29 Leveraging PointNet and PointNet++ for Lyft Point Cloud Classification Challenge Rajat K. Doshi et.al. 2404.18665 translate read null
2024-04-29 CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception Yunshuang Yuan et.al. 2404.18617 translate read null
2024-04-29 Assessing Quality Metrics for Neural Reality Gap Input Mitigation in Autonomous Driving Testing Stefano Carlo Lambertenghi et.al. 2404.18577 translate read null
2024-04-29 Efficient Meta-Learning Enabled Lightweight Multiscale Few-Shot Object Detection in Remote Sensing Images Wenbin Guan et.al. 2404.18426 translate read null
2024-04-29 Multi-modal Perception Dataset of In-water Objects for Autonomous Surface Vehicles Mingi Jeong et.al. 2404.18411 translate read null
2024-04-28 FAD-SAR: A Novel Fishing Activity Detection System via Synthetic Aperture Radar Images Based on Deep Learning Method Yanbing Bai et.al. 2404.18245 translate read null
2024-04-28 RadSimReal: Bridging the Gap Between Synthetic and Real Data in Radar Object Detection With Simulation Oded Bialer et.al. 2404.18150 translate read null
2024-04-27 Reliable Student: Addressing Noise in Semi-Supervised 3D Object Detection Farzad Nozarian et.al. 2404.17910 translate read link
2024-04-27 A Hybrid Approach for Document Layout Analysis in Document images Tahira Shehzadi et.al. 2404.17888 translate read null
2024-04-26 Inhomogeneous illuminated image enhancement under extremely low visibility condition Libang Chen et.al. 2404.17503 translate read null
2024-04-26 Cost-Sensitive Uncertainty-Based Failure Recognition for Object Detection Moussa Kassem Sbeyti et.al. 2404.17427 translate read null
2024-04-26 Enhancing mmWave Radar Point Cloud via Visual-inertial Supervision Cong Fan et.al. 2404.17229 translate read null
2024-04-26 MorphText: Deep Morphology Regularized Arbitrary-shape Scene Text Detection Chengpei Xu et.al. 2404.17151 translate read null
2024-04-25 Generating Minimalist Adversarial Perturbations to Test Object-Detection Models: An Adaptive Multi-Metric Evolutionary Search Approach Cristopher McIntyre-Garcia et.al. 2404.17020 translate read link
2024-04-25 Constellation Dataset: Benchmarking High-Altitude Object Detection for an Urban Intersection Mehmet Kerem Turkcan et.al. 2404.16944 translate read link
2024-04-25 Self-Balanced R-CNN for Instance Segmentation Leonardo Rossi et.al. 2404.16633 translate read link
2024-04-25 Cross-Domain Spatial Matching for Camera and Radar Sensor Data Fusion in Autonomous Vehicle Perception System Daniel Dworak et.al. 2404.16548 translate read null
2024-04-25 Commonsense Prototype for Outdoor Unsupervised 3D Object Detection Hai Wu et.al. 2404.16493 translate read link
2024-04-25 IMWA: Iterative Model Weight Averaging Benefits Class-Imbalanced Learning Tasks Zitong Huang et.al. 2404.16331 translate read null
2024-04-25 CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions Haoyuan Li et.al. 2404.16302 translate read link
2024-04-24 AutoGluon-Multimodal (AutoMM): Supercharging Multimodal AutoML with Foundation Models Zhiqiang Tang et.al. 2404.16233 translate read null
2024-04-24 Observational parameters of Blue Large-Amplitude Pulsators P. Pietrukowicz et.al. 2404.16089 translate read null
2024-04-24 A Survey on Visual Mamba Hanwei Zhang et.al. 2404.15956 translate read null
2024-04-24 Steal Now and Attack Later: Evaluating Robustness of Object Detection against Black-box Adversarial Attacks Erh-Chung Chen et.al. 2404.15881 translate read null
2024-04-24 Revisiting Out-of-Distribution Detection in LiDAR-based 3D Object Detection Michael Kösel et.al. 2404.15879 translate read link
2024-04-23 CFPFormer: Feature-pyramid like Transformer Decoder for Segmentation and Detection Hongyi Cai et.al. 2404.15451 translate read null
2024-04-23 ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning Weifeng Chen et.al. 2404.15449 translate read null
2024-04-23 Source-free Domain Adaptation for Video Object Detection Under Adverse Image Conditions Xingguang Zhang et.al. 2404.15252 translate read null
2024-04-23 Efficient Transformer Encoders for Mask2Former-style models Manyi Yao et.al. 2404.15244 translate read null
2024-04-23 Gallbladder Cancer Detection in Ultrasound Images based on YOLO and Faster R-CNN Sara Dadjouy et.al. 2404.15129 translate read null
2024-04-23 External Prompt Features Enhanced Parameter-efficient Fine-tuning for Salient Object Detection Wen Liang et.al. 2404.15008 translate read null
2024-04-23 ContextualFusion: Context-Based Multi-Sensor Fusion for 3D Object Detection in Adverse Operating Conditions Shounak Sural et.al. 2404.14780 translate read null
2024-04-23 Unified Unsupervised Salient Object Detection via Knowledge Transfer Yao Yuan et.al. 2404.14759 translate read link
2024-04-22 SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection Yuxia Wang et.al. 2404.14183 translate read null
2024-04-22 Text in the Dark: Extremely Low-Light Text Image Enhancement Che-Tsung Lin et.al. 2404.14135 translate read null
2024-04-22 CKD: Contrastive Knowledge Distillation from A Sample-wise Perspective Wencheng Zhu et.al. 2404.14109 translate read null
2024-04-22 Benchmarking Multi-Modal LLMs for Testing Visual Deep Learning Systems Through the Lens of Image Mutation Liwen Wang et.al. 2404.13945 translate read null
2024-04-22 NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation Chi Huang et.al. 2404.13921 translate read null
2024-04-22 TeamTrack: A Dataset for Multi-Sport Multi-Object Tracking in Full-pitch Videos Atom Scott et.al. 2404.13868 translate read null
2024-04-22 Toward Robust LiDAR based 3D Object Detection via Density-Aware Adaptive Thresholding Eunho Lee et.al. 2404.13852 translate read null
2024-04-21 A Nasal Cytology Dataset for Object Detection and Deep Learning Mauro Camporeale et.al. 2404.13745 translate read null
2024-04-23 Clio: Real-time Task-Driven Open-Set 3D Scene Graphs Dominic Maggio et.al. 2404.13696 translate read null
2024-04-20 FisheyeDetNet: Object Detection on Fisheye Surround View Camera Systems for Automated Driving Ganesh Sistu et.al. 2404.13443 translate read null
2024-04-19 A comparison between single-stage and two-stage 3D tracking algorithms for greenhouse robotics David Rapado-Rincon et.al. 2404.12963 translate read null
2024-04-19 Language-Driven Active Learning for Diverse Open-Set 3D Object Detection Ross Greer et.al. 2404.12856 translate read null
2024-04-19 ECOR: Explainable CLIP for Object Recognition Ali Rasekh et.al. 2404.12839 translate read null
2024-04-19 A Point-Based Approach to Efficient LiDAR Multi-Task Perception Christopher Lang et.al. 2404.12798 translate read null
2024-04-19 ELEV-VISION-SAM: Integrated Vision Language and Foundation Model for Automated Estimation of Building Lowest Floor Elevation Yu-Hsuan Ho et.al. 2404.12606 translate read null
2024-04-18 The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models Cheng Shi et.al. 2404.11957 translate read link
2024-04-18 Simultaneous Detection and Interaction Reasoning for Object-Centric Action Recognition Xunsong Li et.al. 2404.11903 translate read null
2024-04-17 TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation Thomas Monninger et.al. 2404.11803 translate read null
2024-04-17 Multimodal 3D Object Detection on Unseen Domains Deepti Hegde et.al. 2404.11764 translate read null
2024-04-17 Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection Deepti Hegde et.al. 2404.11737 translate read null
2024-04-17 Multi-resolution Rescored ByteTrack for Video Object Detection on Ultra-low-power Embedded Systems Luca Bompani et.al. 2404.11488 translate read link
2024-04-17 EcoMLS: A Self-Adaptation Approach for Architecting Green ML-Enabled Systems Meghana Tedla et.al. 2404.11411 translate read null
2024-04-17 Detector Collapse: Backdooring Object Detection to Catastrophic Overload or Blindness Hangtao Zhang et.al. 2404.11357 translate read null
2024-04-17 Simple In-place Data Augmentation for Surveillance Object Detection Munkh-Erdene Otgonbold et.al. 2404.11226 translate read null
2024-04-17 Feature Corrective Transfer Learning: End-to-End Solutions to Object Detection in Non-Ideal Visual Conditions Chuheng Wei et.al. 2404.11214 translate read null
2024-04-17 GhostNetV3: Exploring the Training Strategies for Compact Models Zhenhua Liu et.al. 2404.11202 translate read link
2024-04-17 How to deal with glare for improved perception of Autonomous Vehicles Muhammad Z. Alam et.al. 2404.10992 translate read null
2024-04-17 Leveraging 3D LiDAR Sensors to Enable Enhanced Urban Safety and Public Health: Pedestrian Monitoring and Abnormal Activity Detection Nawfal Guefrachi et.al. 2404.10978 translate read null
2024-04-16 OSR-ViT: A Simple and Modular Framework for Open-Set Object Detection and Discovery Matthew Inkawhich et.al. 2404.10865 translate read null
2024-04-16 Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark Jiangning Zhang et.al. 2404.10760 translate read null
2024-04-16 Watch Your Step: Optimal Retrieval for Continual Learning at Scale Truman Hickok et.al. 2404.10758 translate read null
2024-04-16 Efficient optimal dispersed Haar-like filters for face detection Zeinab Sedaghatjoo et.al. 2404.10476 translate read null
2024-04-16 Camera clustering for scalable stream-based active distillation Dani Manjah et.al. 2404.10411 translate read null
2024-04-15 Low-Light Image Enhancement Framework for Improved Object Detection in Fisheye Lens Datasets Dai Quoc Tran et.al. 2404.10078 translate read link
2024-04-15 Explainable Light-Weight Deep Learning Pipeline for Improved Drought Stres Aswini Kumar Patra et.al. 2404.10073 translate read null
2024-04-15 VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection Bonan Ding et.al. 2404.09431 translate read null
2024-04-14 TEXT2TASTE: A Versatile Egocentric Vision System for Intelligent Reading Assistance Using Large Language Model Wiktor Mucha et.al. 2404.09254 translate read null
2024-04-14 DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection Lewei Yao et.al. 2404.09216 translate read null
2024-04-14 Coreset Selection for Object Detection Hojun Lee et.al. 2404.09161 translate read null
2024-04-14 Fusion-Mamba for Cross-modality Object Detection Wenhao Dong et.al. 2404.09146 translate read null
2024-04-13 The Snake’s Beating Heart? A Millisecond Pulsar Binary in the Galactic Center Radio Filament G359.1 $-$ 0.2 Marcus E. Lower et.al. 2404.09098 translate read null
2024-04-13 BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection Jian Zhang et.al. 2404.08979 translate read null
2024-04-13 Shifting Spotlight for Co-supervision: A Simple yet Efficient Single-branch Network to See Through Camouflage Yang Hu et.al. 2404.08936 translate read null
2024-04-12 Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation Yanhao Zheng et.al. 2404.08603 translate read link
2024-04-12 FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation Riza Velioglu et.al. 2404.08582 translate read link
2024-04-12 Analyzing Decades-Long Environmental Changes in Namibia Using Archival Aerial Photography and Deep Learning Girmaw Abebe Tadesse et.al. 2404.08544 translate read null
2024-04-12 MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion Zhe Li et.al. 2404.08406 translate read null
2024-04-12 Overcoming Scene Context Constraints for Object Detection in wild using Defilters Vamshi Krishna Kancharla et.al. 2404.08293 translate read null
2024-04-11 ConsistencyDet: Robust Object Detector with Denoising Paradigm of Consistency Model Lifan Jiang et.al. 2404.07773 translate read link
2024-04-11 Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification Ricardo Pereira et.al. 2404.07739 translate read null
2024-04-11 Run-time Monitoring of 3D Object Detection in Automated Driving Systems Using Early Layer Neural Activation Patterns Hakan Yekta Yatbaz et.al. 2404.07685 translate read null
2024-04-11 Finding Dino: A plug-and-play framework for unsupervised detection of out-of-distribution objects using prototypes Poulami Sinhamahapatra et.al. 2404.07664 translate read null
2024-04-11 Separated Attention: An Improved Cycle GAN Based Under Water Image Enhancement Method Tashmoy Ghosh et.al. 2404.07649 translate read null
2024-04-11 GLID: Pre-training a Generalist Encoder-Decoder Vision Model Jihao Liu et.al. 2404.07603 translate read null
2024-04-11 SFSORT: Scene Features-based Simple Online Real-Time Tracker M. M. Morsali et.al. 2404.07553 translate read link
2024-04-11 The Sydney Radio Star Catalogue: properties of radio stars at megahertz to gigahertz frequencies Laura N. Driessen et.al. 2404.07418 translate read null
2024-04-11 Simplifying Two-Stage Detectors for On-Device Inference in Remote Sensing Jaemin Kang et.al. 2404.07405 translate read null
2024-04-11 A fine-tuning workflow for automatic first-break picking with deep learning Amir Mardan et.al. 2404.07400 translate read link
2024-04-10 Identification of Fine-grained Systematic Errors via Controlled Scene Generation Valentyn Boreiko et.al. 2404.07045 translate read null
2024-04-10 Accurate Tennis Court Line Detection on Amateur Recorded Matches Sameer Agrawal et.al. 2404.06977 translate read null
2024-04-10 SARA: Smart AI Reading Assistant for Reading Comprehension Enkeleda Thaqi et.al. 2404.06906 translate read null
2024-04-10 Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data Aakash Kumar et.al. 2404.06715 translate read null
2024-04-10 Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting Hao Lu et.al. 2404.06700 translate read link
2024-04-09 Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping Anas Gouda et.al. 2404.06277 translate read link
2024-04-09 Label-Efficient 3D Object Detection For Road-Side Units Minh-Quan Dao et.al. 2404.06256 translate read null
2024-04-09 Automatic Defect Detection in Sewer Network Using Deep Learning Based Object Detector Bach Ha et.al. 2404.06219 translate read null
2024-04-09 YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images Chenguang Liu et.al. 2404.06180 translate read null
2024-04-09 Enhanced Radar Perception via Multi-Task Learning: Towards Refined Data for Sensor Fusion Applications Huawei Sun et.al. 2404.06165 translate read null
2024-04-09 Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation Zong-Wei Hong et.al. 2404.06029 translate read null
2024-04-08 Retrieval-Augmented Open-Vocabulary Object Detection Jooyeon Kim et.al. 2404.05687 translate read link
2024-04-08 3D-COCO: extension of MS-COCO dataset for image detection and 3D reconstruction modules Maxence Bideaux et.al. 2404.05641 translate read null
2024-04-08 PetKaz at SemEval-2024 Task 8: Can Linguistics Capture the Specifics of LLM-generated Text? Kseniia Petukhova et.al. 2404.05483 translate read null
2024-04-08 Detecting Every Object from Events Haitian Zhang et.al. 2404.05285 translate read link
2024-04-08 MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues Xiahan Chen et.al. 2404.05280 translate read null
2024-04-08 Rendering-Enhanced Automatic Image-to-Point Cloud Registration for Roadside Scenes Yu Sheng et.al. 2404.05164 translate read null
2024-04-08 Better Monocular 3D Detectors with LiDAR from the Past Yurong You et.al. 2404.05139 translate read link
2024-04-07 AirShot: Efficient Few-Shot Detection for Autonomous Exploration Zihan Wang et.al. 2404.05069 translate read link
2024-04-07 PlateSegFL: A Privacy-Preserving License Plate Detection Using Federated Segmentation Learning Md. Shahriar Rahman Anuvab et.al. 2404.05049 translate read null
2024-04-07 PathFinder: Attention-Driven Dynamic Non-Line-of-Sight Tracking with a Mobile Robot Shenbagaraj Kannapiran et.al. 2404.05024 translate read null
2024-04-05 SCAResNet: A ResNet Variant Optimized for Tiny Object Detection in Transmission and Distribution Towers Weile Li et.al. 2404.04179 translate read link
2024-04-05 Designing Robots to Help Women Martin Cooney et.al. 2404.04123 translate read null
2024-04-04 Is CLIP the main roadblock for fine-grained open-world perception? Lorenzo Bianchi et.al. 2404.03539 translate read link
2024-04-04 DQ-DETR: DETR with Dynamic Query for Tiny Object Detection Yi-Xin Huang et.al. 2404.03507 translate read link
2024-04-05 A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data Iqra Bano et.al. 2404.03493 translate read null
2024-04-04 MonoCD: Monocular 3D Object Detection with Complementary Depths Longfei Yan et.al. 2404.03181 translate read link
2024-04-03 DPFT: Dual Perspective Fusion Transformer for Camera-Radar-based Object Detection Felix Fent et.al. 2404.03015 translate read null
2024-04-03 ALOHa: A New Measure for Hallucination in Captioning Models Suzanne Petryk et.al. 2404.02904 translate read null
2024-04-03 FlightScope: A Deep Comprehensive Assessment of Aircraft Detection Algorithms in Satellite Imagery Safouane El Ghazouali et.al. 2404.02877 translate read link
2024-04-03 HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras Zhongyu Xia et.al. 2404.02517 translate read link
2024-04-04 TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression Ho-Joong Kim et.al. 2404.02405 translate read null
2024-04-04 EGTR: Extracting Graph from Transformer for Scene Graph Generation Jinbae Im et.al. 2404.02072 translate read link
2024-04-03 Cooperative Students: Navigating Unsupervised Domain Adaptation in Nighttime Object Detection Jicheng Yuan et.al. 2404.01988 translate read link
2024-04-02 Towards Enhanced Analysis of Lung Cancer Lesions in EBUS-TBNA – A Semi-Supervised Video Object Detection Method Jyun-An Lin et.al. 2404.01929 translate read null
2024-04-02 Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack Ying Zhou et.al. 2404.01907 translate read link
2024-04-02 Scene Adaptive Sparse Transformer for Event-based Object Detection Yansong Peng et.al. 2404.01882 translate read link
2024-04-02 Semi-Supervised Domain Adaptation for Wildfire Detection JooYoung Jang et.al. 2404.01842 translate read null
2024-04-02 Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection Tahira Shehzadi et.al. 2404.01819 translate read null
2024-04-02 Analyzing the Single Event Upset Vulnerability of Binarized Neural Networks on SRAM FPGAs Ioanna Souvatzoglou et.al. 2404.01757 translate read null
2024-04-02 Disentangled Pre-training for Human-Object Interaction Detection Zhuolong Li et.al. 2404.01725 translate read null
2024-04-02 Task Integration Distillation for Object Detectors Hai Su et.al. 2404.01699 translate read null

(<a href=../Object_Detection.md>back to Object Detection</a>)