Object Detection - 2024-04 | Paper Arxiv Daily

Object Detection - 2024-04

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-04-30	Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation	Yunhao Ge et.al.	2404.19752	translate	read	null
2024-04-30	Quantifying Nematodes through Images: Datasets, Models, and Baselines of Deep Learning	Zhipeng Yuan et.al.	2404.19748	translate	read	null
2024-04-30	Masked Multi-Query Slot Attention for Unsupervised Object Discovery	Rishav Pramanik et.al.	2404.19654	translate	read	link
2024-04-30	Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World	Wen Yin et.al.	2404.19417	translate	read	null
2024-04-30	UniFS: Universal Few-shot Instance Perception with Point Representations	Sheng Jin et.al.	2404.19401	translate	read	null
2024-04-30	Pseudo Label Refinery for Unsupervised Domain Adaptation on Cross-dataset 3D Object Detection	Zhanwei Zhang et.al.	2404.19384	translate	read	null
2024-04-30	Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank	Sungjune Park et.al.	2404.19299	translate	read	null
2024-04-29	MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection	Heitor R. Medeiros et.al.	2404.18849	translate	read	null
2024-04-29	Leveraging PointNet and PointNet++ for Lyft Point Cloud Classification Challenge	Rajat K. Doshi et.al.	2404.18665	translate	read	null
2024-04-29	CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception	Yunshuang Yuan et.al.	2404.18617	translate	read	null
2024-04-29	Assessing Quality Metrics for Neural Reality Gap Input Mitigation in Autonomous Driving Testing	Stefano Carlo Lambertenghi et.al.	2404.18577	translate	read	null
2024-04-29	Efficient Meta-Learning Enabled Lightweight Multiscale Few-Shot Object Detection in Remote Sensing Images	Wenbin Guan et.al.	2404.18426	translate	read	null
2024-04-29	Multi-modal Perception Dataset of In-water Objects for Autonomous Surface Vehicles	Mingi Jeong et.al.	2404.18411	translate	read	null
2024-04-28	FAD-SAR: A Novel Fishing Activity Detection System via Synthetic Aperture Radar Images Based on Deep Learning Method	Yanbing Bai et.al.	2404.18245	translate	read	null
2024-04-28	RadSimReal: Bridging the Gap Between Synthetic and Real Data in Radar Object Detection With Simulation	Oded Bialer et.al.	2404.18150	translate	read	null
2024-04-27	Reliable Student: Addressing Noise in Semi-Supervised 3D Object Detection	Farzad Nozarian et.al.	2404.17910	translate	read	link
2024-04-27	A Hybrid Approach for Document Layout Analysis in Document images	Tahira Shehzadi et.al.	2404.17888	translate	read	null
2024-04-26	Inhomogeneous illuminated image enhancement under extremely low visibility condition	Libang Chen et.al.	2404.17503	translate	read	null
2024-04-26	Cost-Sensitive Uncertainty-Based Failure Recognition for Object Detection	Moussa Kassem Sbeyti et.al.	2404.17427	translate	read	null
2024-04-26	Enhancing mmWave Radar Point Cloud via Visual-inertial Supervision	Cong Fan et.al.	2404.17229	translate	read	null
2024-04-26	MorphText: Deep Morphology Regularized Arbitrary-shape Scene Text Detection	Chengpei Xu et.al.	2404.17151	translate	read	null
2024-04-25	Generating Minimalist Adversarial Perturbations to Test Object-Detection Models: An Adaptive Multi-Metric Evolutionary Search Approach	Cristopher McIntyre-Garcia et.al.	2404.17020	translate	read	link
2024-04-25	Constellation Dataset: Benchmarking High-Altitude Object Detection for an Urban Intersection	Mehmet Kerem Turkcan et.al.	2404.16944	translate	read	link
2024-04-25	Self-Balanced R-CNN for Instance Segmentation	Leonardo Rossi et.al.	2404.16633	translate	read	link
2024-04-25	Cross-Domain Spatial Matching for Camera and Radar Sensor Data Fusion in Autonomous Vehicle Perception System	Daniel Dworak et.al.	2404.16548	translate	read	null
2024-04-25	Commonsense Prototype for Outdoor Unsupervised 3D Object Detection	Hai Wu et.al.	2404.16493	translate	read	link
2024-04-25	IMWA: Iterative Model Weight Averaging Benefits Class-Imbalanced Learning Tasks	Zitong Huang et.al.	2404.16331	translate	read	null
2024-04-25	CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions	Haoyuan Li et.al.	2404.16302	translate	read	link
2024-04-24	AutoGluon-Multimodal (AutoMM): Supercharging Multimodal AutoML with Foundation Models	Zhiqiang Tang et.al.	2404.16233	translate	read	null
2024-04-24	Observational parameters of Blue Large-Amplitude Pulsators	P. Pietrukowicz et.al.	2404.16089	translate	read	null
2024-04-24	A Survey on Visual Mamba	Hanwei Zhang et.al.	2404.15956	translate	read	null
2024-04-24	Steal Now and Attack Later: Evaluating Robustness of Object Detection against Black-box Adversarial Attacks	Erh-Chung Chen et.al.	2404.15881	translate	read	null
2024-04-24	Revisiting Out-of-Distribution Detection in LiDAR-based 3D Object Detection	Michael Kösel et.al.	2404.15879	translate	read	link
2024-04-23	CFPFormer: Feature-pyramid like Transformer Decoder for Segmentation and Detection	Hongyi Cai et.al.	2404.15451	translate	read	null
2024-04-23	ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning	Weifeng Chen et.al.	2404.15449	translate	read	null
2024-04-23	Source-free Domain Adaptation for Video Object Detection Under Adverse Image Conditions	Xingguang Zhang et.al.	2404.15252	translate	read	null
2024-04-23	Efficient Transformer Encoders for Mask2Former-style models	Manyi Yao et.al.	2404.15244	translate	read	null
2024-04-23	Gallbladder Cancer Detection in Ultrasound Images based on YOLO and Faster R-CNN	Sara Dadjouy et.al.	2404.15129	translate	read	null
2024-04-23	External Prompt Features Enhanced Parameter-efficient Fine-tuning for Salient Object Detection	Wen Liang et.al.	2404.15008	translate	read	null
2024-04-23	ContextualFusion: Context-Based Multi-Sensor Fusion for 3D Object Detection in Adverse Operating Conditions	Shounak Sural et.al.	2404.14780	translate	read	null
2024-04-23	Unified Unsupervised Salient Object Detection via Knowledge Transfer	Yao Yuan et.al.	2404.14759	translate	read	link
2024-04-22	SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection	Yuxia Wang et.al.	2404.14183	translate	read	null
2024-04-22	Text in the Dark: Extremely Low-Light Text Image Enhancement	Che-Tsung Lin et.al.	2404.14135	translate	read	null
2024-04-22	CKD: Contrastive Knowledge Distillation from A Sample-wise Perspective	Wencheng Zhu et.al.	2404.14109	translate	read	null
2024-04-22	Benchmarking Multi-Modal LLMs for Testing Visual Deep Learning Systems Through the Lens of Image Mutation	Liwen Wang et.al.	2404.13945	translate	read	null
2024-04-22	NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation	Chi Huang et.al.	2404.13921	translate	read	null
2024-04-22	TeamTrack: A Dataset for Multi-Sport Multi-Object Tracking in Full-pitch Videos	Atom Scott et.al.	2404.13868	translate	read	null
2024-04-22	Toward Robust LiDAR based 3D Object Detection via Density-Aware Adaptive Thresholding	Eunho Lee et.al.	2404.13852	translate	read	null
2024-04-21	A Nasal Cytology Dataset for Object Detection and Deep Learning	Mauro Camporeale et.al.	2404.13745	translate	read	null
2024-04-23	Clio: Real-time Task-Driven Open-Set 3D Scene Graphs	Dominic Maggio et.al.	2404.13696	translate	read	null
2024-04-20	FisheyeDetNet: Object Detection on Fisheye Surround View Camera Systems for Automated Driving	Ganesh Sistu et.al.	2404.13443	translate	read	null
2024-04-19	A comparison between single-stage and two-stage 3D tracking algorithms for greenhouse robotics	David Rapado-Rincon et.al.	2404.12963	translate	read	null
2024-04-19	Language-Driven Active Learning for Diverse Open-Set 3D Object Detection	Ross Greer et.al.	2404.12856	translate	read	null
2024-04-19	ECOR: Explainable CLIP for Object Recognition	Ali Rasekh et.al.	2404.12839	translate	read	null
2024-04-19	A Point-Based Approach to Efficient LiDAR Multi-Task Perception	Christopher Lang et.al.	2404.12798	translate	read	null
2024-04-19	ELEV-VISION-SAM: Integrated Vision Language and Foundation Model for Automated Estimation of Building Lowest Floor Elevation	Yu-Hsuan Ho et.al.	2404.12606	translate	read	null
2024-04-18	The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models	Cheng Shi et.al.	2404.11957	translate	read	link
2024-04-18	Simultaneous Detection and Interaction Reasoning for Object-Centric Action Recognition	Xunsong Li et.al.	2404.11903	translate	read	null
2024-04-17	TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation	Thomas Monninger et.al.	2404.11803	translate	read	null
2024-04-17	Multimodal 3D Object Detection on Unseen Domains	Deepti Hegde et.al.	2404.11764	translate	read	null
2024-04-17	Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection	Deepti Hegde et.al.	2404.11737	translate	read	null
2024-04-17	Multi-resolution Rescored ByteTrack for Video Object Detection on Ultra-low-power Embedded Systems	Luca Bompani et.al.	2404.11488	translate	read	link
2024-04-17	EcoMLS: A Self-Adaptation Approach for Architecting Green ML-Enabled Systems	Meghana Tedla et.al.	2404.11411	translate	read	null
2024-04-17	Detector Collapse: Backdooring Object Detection to Catastrophic Overload or Blindness	Hangtao Zhang et.al.	2404.11357	translate	read	null
2024-04-17	Simple In-place Data Augmentation for Surveillance Object Detection	Munkh-Erdene Otgonbold et.al.	2404.11226	translate	read	null
2024-04-17	Feature Corrective Transfer Learning: End-to-End Solutions to Object Detection in Non-Ideal Visual Conditions	Chuheng Wei et.al.	2404.11214	translate	read	null
2024-04-17	GhostNetV3: Exploring the Training Strategies for Compact Models	Zhenhua Liu et.al.	2404.11202	translate	read	link
2024-04-17	How to deal with glare for improved perception of Autonomous Vehicles	Muhammad Z. Alam et.al.	2404.10992	translate	read	null
2024-04-17	Leveraging 3D LiDAR Sensors to Enable Enhanced Urban Safety and Public Health: Pedestrian Monitoring and Abnormal Activity Detection	Nawfal Guefrachi et.al.	2404.10978	translate	read	null
2024-04-16	OSR-ViT: A Simple and Modular Framework for Open-Set Object Detection and Discovery	Matthew Inkawhich et.al.	2404.10865	translate	read	null
2024-04-16	Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark	Jiangning Zhang et.al.	2404.10760	translate	read	null
2024-04-16	Watch Your Step: Optimal Retrieval for Continual Learning at Scale	Truman Hickok et.al.	2404.10758	translate	read	null
2024-04-16	Efficient optimal dispersed Haar-like filters for face detection	Zeinab Sedaghatjoo et.al.	2404.10476	translate	read	null
2024-04-16	Camera clustering for scalable stream-based active distillation	Dani Manjah et.al.	2404.10411	translate	read	null
2024-04-15	Low-Light Image Enhancement Framework for Improved Object Detection in Fisheye Lens Datasets	Dai Quoc Tran et.al.	2404.10078	translate	read	link
2024-04-15	Explainable Light-Weight Deep Learning Pipeline for Improved Drought Stres	Aswini Kumar Patra et.al.	2404.10073	translate	read	null
2024-04-15	VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection	Bonan Ding et.al.	2404.09431	translate	read	null
2024-04-14	TEXT2TASTE: A Versatile Egocentric Vision System for Intelligent Reading Assistance Using Large Language Model	Wiktor Mucha et.al.	2404.09254	translate	read	null
2024-04-14	DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection	Lewei Yao et.al.	2404.09216	translate	read	null
2024-04-14	Coreset Selection for Object Detection	Hojun Lee et.al.	2404.09161	translate	read	null
2024-04-14	Fusion-Mamba for Cross-modality Object Detection	Wenhao Dong et.al.	2404.09146	translate	read	null
2024-04-13	The Snake’s Beating Heart? A Millisecond Pulsar Binary in the Galactic Center Radio Filament G359.1 $-$ 0.2	Marcus E. Lower et.al.	2404.09098	translate	read	null
2024-04-13	BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection	Jian Zhang et.al.	2404.08979	translate	read	null
2024-04-13	Shifting Spotlight for Co-supervision: A Simple yet Efficient Single-branch Network to See Through Camouflage	Yang Hu et.al.	2404.08936	translate	read	null
2024-04-12	Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation	Yanhao Zheng et.al.	2404.08603	translate	read	link
2024-04-12	FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation	Riza Velioglu et.al.	2404.08582	translate	read	link
2024-04-12	Analyzing Decades-Long Environmental Changes in Namibia Using Archival Aerial Photography and Deep Learning	Girmaw Abebe Tadesse et.al.	2404.08544	translate	read	null
2024-04-12	MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion	Zhe Li et.al.	2404.08406	translate	read	null
2024-04-12	Overcoming Scene Context Constraints for Object Detection in wild using Defilters	Vamshi Krishna Kancharla et.al.	2404.08293	translate	read	null
2024-04-11	ConsistencyDet: Robust Object Detector with Denoising Paradigm of Consistency Model	Lifan Jiang et.al.	2404.07773	translate	read	link
2024-04-11	Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification	Ricardo Pereira et.al.	2404.07739	translate	read	null
2024-04-11	Run-time Monitoring of 3D Object Detection in Automated Driving Systems Using Early Layer Neural Activation Patterns	Hakan Yekta Yatbaz et.al.	2404.07685	translate	read	null
2024-04-11	Finding Dino: A plug-and-play framework for unsupervised detection of out-of-distribution objects using prototypes	Poulami Sinhamahapatra et.al.	2404.07664	translate	read	null
2024-04-11	Separated Attention: An Improved Cycle GAN Based Under Water Image Enhancement Method	Tashmoy Ghosh et.al.	2404.07649	translate	read	null
2024-04-11	GLID: Pre-training a Generalist Encoder-Decoder Vision Model	Jihao Liu et.al.	2404.07603	translate	read	null
2024-04-11	SFSORT: Scene Features-based Simple Online Real-Time Tracker	M. M. Morsali et.al.	2404.07553	translate	read	link
2024-04-11	The Sydney Radio Star Catalogue: properties of radio stars at megahertz to gigahertz frequencies	Laura N. Driessen et.al.	2404.07418	translate	read	null
2024-04-11	Simplifying Two-Stage Detectors for On-Device Inference in Remote Sensing	Jaemin Kang et.al.	2404.07405	translate	read	null
2024-04-11	A fine-tuning workflow for automatic first-break picking with deep learning	Amir Mardan et.al.	2404.07400	translate	read	link
2024-04-10	Identification of Fine-grained Systematic Errors via Controlled Scene Generation	Valentyn Boreiko et.al.	2404.07045	translate	read	null
2024-04-10	Accurate Tennis Court Line Detection on Amateur Recorded Matches	Sameer Agrawal et.al.	2404.06977	translate	read	null
2024-04-10	SARA: Smart AI Reading Assistant for Reading Comprehension	Enkeleda Thaqi et.al.	2404.06906	translate	read	null
2024-04-10	Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data	Aakash Kumar et.al.	2404.06715	translate	read	null
2024-04-10	Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting	Hao Lu et.al.	2404.06700	translate	read	link
2024-04-09	Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping	Anas Gouda et.al.	2404.06277	translate	read	link
2024-04-09	Label-Efficient 3D Object Detection For Road-Side Units	Minh-Quan Dao et.al.	2404.06256	translate	read	null
2024-04-09	Automatic Defect Detection in Sewer Network Using Deep Learning Based Object Detector	Bach Ha et.al.	2404.06219	translate	read	null
2024-04-09	YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images	Chenguang Liu et.al.	2404.06180	translate	read	null
2024-04-09	Enhanced Radar Perception via Multi-Task Learning: Towards Refined Data for Sensor Fusion Applications	Huawei Sun et.al.	2404.06165	translate	read	null
2024-04-09	Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation	Zong-Wei Hong et.al.	2404.06029	translate	read	null
2024-04-08	Retrieval-Augmented Open-Vocabulary Object Detection	Jooyeon Kim et.al.	2404.05687	translate	read	link
2024-04-08	3D-COCO: extension of MS-COCO dataset for image detection and 3D reconstruction modules	Maxence Bideaux et.al.	2404.05641	translate	read	null
2024-04-08	PetKaz at SemEval-2024 Task 8: Can Linguistics Capture the Specifics of LLM-generated Text?	Kseniia Petukhova et.al.	2404.05483	translate	read	null
2024-04-08	Detecting Every Object from Events	Haitian Zhang et.al.	2404.05285	translate	read	link
2024-04-08	MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues	Xiahan Chen et.al.	2404.05280	translate	read	null
2024-04-08	Rendering-Enhanced Automatic Image-to-Point Cloud Registration for Roadside Scenes	Yu Sheng et.al.	2404.05164	translate	read	null
2024-04-08	Better Monocular 3D Detectors with LiDAR from the Past	Yurong You et.al.	2404.05139	translate	read	link
2024-04-07	AirShot: Efficient Few-Shot Detection for Autonomous Exploration	Zihan Wang et.al.	2404.05069	translate	read	link
2024-04-07	PlateSegFL: A Privacy-Preserving License Plate Detection Using Federated Segmentation Learning	Md. Shahriar Rahman Anuvab et.al.	2404.05049	translate	read	null
2024-04-07	PathFinder: Attention-Driven Dynamic Non-Line-of-Sight Tracking with a Mobile Robot	Shenbagaraj Kannapiran et.al.	2404.05024	translate	read	null
2024-04-05	SCAResNet: A ResNet Variant Optimized for Tiny Object Detection in Transmission and Distribution Towers	Weile Li et.al.	2404.04179	translate	read	link
2024-04-05	Designing Robots to Help Women	Martin Cooney et.al.	2404.04123	translate	read	null
2024-04-04	Is CLIP the main roadblock for fine-grained open-world perception?	Lorenzo Bianchi et.al.	2404.03539	translate	read	link
2024-04-04	DQ-DETR: DETR with Dynamic Query for Tiny Object Detection	Yi-Xin Huang et.al.	2404.03507	translate	read	link
2024-04-05	A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data	Iqra Bano et.al.	2404.03493	translate	read	null
2024-04-04	MonoCD: Monocular 3D Object Detection with Complementary Depths	Longfei Yan et.al.	2404.03181	translate	read	link
2024-04-03	DPFT: Dual Perspective Fusion Transformer for Camera-Radar-based Object Detection	Felix Fent et.al.	2404.03015	translate	read	null
2024-04-03	ALOHa: A New Measure for Hallucination in Captioning Models	Suzanne Petryk et.al.	2404.02904	translate	read	null
2024-04-03	FlightScope: A Deep Comprehensive Assessment of Aircraft Detection Algorithms in Satellite Imagery	Safouane El Ghazouali et.al.	2404.02877	translate	read	link
2024-04-03	HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras	Zhongyu Xia et.al.	2404.02517	translate	read	link
2024-04-04	TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression	Ho-Joong Kim et.al.	2404.02405	translate	read	null
2024-04-04	EGTR: Extracting Graph from Transformer for Scene Graph Generation	Jinbae Im et.al.	2404.02072	translate	read	link
2024-04-03	Cooperative Students: Navigating Unsupervised Domain Adaptation in Nighttime Object Detection	Jicheng Yuan et.al.	2404.01988	translate	read	link
2024-04-02	Towards Enhanced Analysis of Lung Cancer Lesions in EBUS-TBNA – A Semi-Supervised Video Object Detection Method	Jyun-An Lin et.al.	2404.01929	translate	read	null
2024-04-02	Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack	Ying Zhou et.al.	2404.01907	translate	read	link
2024-04-02	Scene Adaptive Sparse Transformer for Event-based Object Detection	Yansong Peng et.al.	2404.01882	translate	read	link
2024-04-02	Semi-Supervised Domain Adaptation for Wildfire Detection	JooYoung Jang et.al.	2404.01842	translate	read	null
2024-04-02	Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection	Tahira Shehzadi et.al.	2404.01819	translate	read	null
2024-04-02	Analyzing the Single Event Upset Vulnerability of Binarized Neural Networks on SRAM FPGAs	Ioanna Souvatzoglou et.al.	2404.01757	translate	read	null
2024-04-02	Disentangled Pre-training for Human-Object Interaction Detection	Zhuolong Li et.al.	2404.01725	translate	read	null
2024-04-02	Task Integration Distillation for Object Detectors	Hai Su et.al.	2404.01699	translate	read	null

(<a href=../Object_Detection.md>back to Object Detection</a>)