Object Detection - 2024-05 | Paper Arxiv Daily

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-05-31	Power of Cooperative Supervision: Multiple Teachers Framework for Enhanced 3D Semi-Supervised Object Detection	Jin-Hee Lee et.al.	2405.20720	translate	read	link
2024-05-30	On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines	Selim Kuzucu et.al.	2405.20459	translate	read	link
2024-05-30	RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection	Fangyi Chen et.al.	2405.19854	translate	read	null
2024-05-30	Improving Object Detector Training on Synthetic Data by Starting With a Strong Baseline Methodology	Frank A. Ruis et.al.	2405.19822	translate	read	null
2024-05-30	Towards Unified Multi-granularity Text Detection with Interactive Attention	Xingyu Wan et.al.	2405.19765	translate	read	null
2024-05-30	Fully Test-Time Adaptation for Monocular 3D Object Detection	Hongbin Lin et.al.	2405.19682	translate	read	link
2024-05-30	YotoR-You Only Transform One Representation	José Ignacio Díaz Villa et.al.	2405.19629	translate	read	null
2024-05-29	Enabling Visual Recognition at Radio Frequency	Haowen Lai et.al.	2405.19516	translate	read	null
2024-05-29	Model Agnostic Defense against Adversarial Patch Attacks on Object Detection in Unmanned Aerial Vehicles	Saurabh Pathak et.al.	2405.19179	translate	read	null
2024-05-29	RGB-T Object Detection via Group Shuffled Multi-receptive Attention and Multi-modal Supervision	Jinzhong Wang et.al.	2405.18955	translate	read	null
2024-05-29	SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for Autonomous Driving	Yiming Cui et.al.	2405.18857	translate	read	null
2024-05-29	PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram	Sifan Zhou et.al.	2405.18734	translate	read	null
2024-05-28	A Review and Implementation of Object Detection Models and Optimizations for Real-time Medical Mask Detection during the COVID-19 Pandemic	Ioanna Gogou et.al.	2405.18387	translate	read	link
2024-05-28	Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?	Yifan Bai et.al.	2405.18361	translate	read	null
2024-05-28	Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention	Weitai Kang et.al.	2405.18295	translate	read	link
2024-05-28	DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture	Shentong Mo et.al.	2405.17995	translate	read	link
2024-05-28	Transformer and Hybrid Deep Learning Based Models for Machine-Generated Text Detection	Teodor-George Marchitan et.al.	2405.17964	translate	read	null
2024-05-28	Self-supervised Pre-training for Transferable Multi-modal Perception	Xiaohao Xu et.al.	2405.17942	translate	read	null
2024-05-28	Boosting General Trimap-free Matting in the Real-World Image	Leo Shan et.al.	2405.17916	translate	read	null
2024-05-28	The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention	Xingyu Ding et.al.	2405.17776	translate	read	null
2024-05-27	Understanding differences in applying DETR to natural and medical images	Yanqi Xu et.al.	2405.17677	translate	read	null
2024-05-27	Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection	Shuai Zeng et.al.	2405.17422	translate	read	link
2024-05-27	Tracking Small Birds by Detection Candidate Region Filtering and Detection History-aware Association	Tingwei Liu et.al.	2405.17323	translate	read	null
2024-05-27	Enhanced Automotive Radar Collaborative Sensing By Exploiting Constructive Interference	Lifan Xu et.al.	2405.17297	translate	read	null
2024-05-27	SCaRL- A Synthetic Multi-Modal Dataset for Autonomous Driving	Avinash Nittur Ramesh et.al.	2405.17030	translate	read	null
2024-05-27	Collective Perception Datasets for Autonomous Driving: A Comprehensive Review	Sven Teufel et.al.	2405.16973	translate	read	null
2024-05-27	OED: Towards One-stage End-to-End Dynamic Scene Graph Generation	Guan Wang et.al.	2405.16925	translate	read	link
2024-05-27	ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection	Ziying Song et.al.	2405.16873	translate	read	null
2024-05-27	A re-calibration method for object detection with multi-modal alignment bias in autonomous driving	Zhihang Song et.al.	2405.16848	translate	read	null
2024-05-26	A Study on Unsupervised Anomaly Detection and Defect Localization using Generative Model in Ultrasonic Non-Destructive Testing	Yusaku Ando et.al.	2405.16580	translate	read	null
2024-05-26	AI-Generated Text Detection and Classification Based on BERT Deep Learning Algorithm	Hao Wang et.al.	2405.16422	translate	read	null
2024-05-24	UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes	Ted Lentsch et.al.	2405.15688	translate	read	link
2024-05-24	Multimodal Object Detection via Probabilistic a priori Information Integration	Hafsa El Hafyani et.al.	2405.15596	translate	read	null
2024-05-24	Scale-Invariant Feature Disentanglement via Adversarial Learning for UAV-based Object Detection	Fan Liu et.al.	2405.15465	translate	read	null
2024-05-24	Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets	Hoàng-Ân Lê et.al.	2405.15394	translate	read	null
2024-05-24	Towards Global Optimal Visual In-Context Learning Prompt Selection	Chengming Xu et.al.	2405.15279	translate	read	null
2024-05-24	Unbiased Faster R-CNN for Single-source Domain Generalized Object Detection	Yajing Liu et.al.	2405.15225	translate	read	null
2024-05-24	ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models	Jingyuan Zhu et.al.	2405.15199	translate	read	null
2024-05-24	MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method	Pan Liao et.al.	2405.15176	translate	read	null
2024-05-23	Learning to Detect and Segment Mobile Objects from Unlabeled Videos	Yihong Sun et.al.	2405.14841	translate	read	null
2024-05-23	Designing A Sustainable Marine Debris Clean-up Framework without Human Labels	Raymond Wang et.al.	2405.14815	translate	read	null
2024-05-23	Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond	Zhechao Wang et.al.	2405.14674	translate	read	null
2024-05-23	Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment	Muhammad Sohail Danish et.al.	2405.14497	translate	read	null
2024-05-23	YOLOv10: Real-Time End-to-End Object Detection	Ao Wang et.al.	2405.14458	translate	read	link
2024-05-23	Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations	Mohammed Baharoon et.al.	2405.14239	translate	read	null
2024-05-22	Two Heads are Better Than One: Neural Networks Quantization with 2D Hilbert Curve-based Output Representation	Mykhailo Uss et.al.	2405.14024	translate	read	null
2024-05-22	TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System	Diogo Lavado et.al.	2405.13989	translate	read	null
2024-05-22	Class-Conditional self-reward mechanism for improved Text-to-Image models	Safouane El Ghazouali et.al.	2405.13473	translate	read	link
2024-05-22	Adaptive Wireless Image Semantic Transmission and Over-The-Air Testing	Jiarun Ding et.al.	2405.13403	translate	read	null
2024-05-21	BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once	Theodore Zhao et.al.	2405.12971	translate	read	null
2024-05-21	AMFD: Distillation via Adaptive Multimodal Fusion for Multispectral Pedestrian Detection	Zizhao Chen et.al.	2405.12944	translate	read	link
2024-05-21	Predicting the Influence of Adverse Weather on Pedestrian Detection with Automotive Radar and Lidar Sensors	Daniel Weihmayr et.al.	2405.12736	translate	read	null
2024-05-21	Spotting AI’s Touch: Identifying LLM-Paraphrased Spans in Text	Yafu Li et.al.	2405.12689	translate	read	null
2024-05-21	Automating Attendance Management in Human Resources: A Design Science Approach Using Computer Vision and Facial Recognition	Bao-Thien Nguyen-Tat et.al.	2405.12633	translate	read	null
2024-05-21	FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors	Shuai Liu et.al.	2405.12601	translate	read	link
2024-05-21	Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering	Hiba Maryam et.al.	2405.12533	translate	read	null
2024-05-21	Active Object Detection with Knowledge Aggregation and Distillation from Large Models	Dejie Yang et.al.	2405.12509	translate	read	null
2024-05-21	Mutual Information Analysis in Multimodal Learning Systems	Hadi Hadizadeh et.al.	2405.12456	translate	read	null
2024-05-20	Multi-View Attentive Contextualization for Multi-View 3D Object Detection	Xianpeng Liu et.al.	2405.12200	translate	read	null
2024-05-20	Bangladeshi Native Vehicle Detection in Wild	Bipin Saha et.al.	2405.12150	translate	read	link
2024-05-20	Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments	Jooyong Park et.al.	2405.11855	translate	read	null
2024-05-20	DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment	Jianhong Han et.al.	2405.11765	translate	read	link
2024-05-20	Versatile Teacher: A Class-aware Teacher-student Framework for Cross-domain Adaptation	Runou Yang et.al.	2405.11754	translate	read	link
2024-05-19	FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention	Ziang Guo et.al.	2405.11682	translate	read	link
2024-05-19	SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization	Jialong Guo et.al.	2405.11582	translate	read	link
2024-05-19	The First Swahili Language Scene Text Detection and Recognition Dataset	Fadila Wendigoundi Douamba et.al.	2405.11437	translate	read	link
2024-05-18	InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images	Wuzhou Li et.al.	2405.11293	translate	read	null
2024-05-18	Visible and Clear: Finding Tiny Objects in Difference Map	Bing Cao et.al.	2405.11276	translate	read	null
2024-05-17	A Versatile Framework for Analyzing Galaxy Image Data by Implanting Human-in-the-loop on a Large Vision Model	Mingxiang Fu et.al.	2405.10890	translate	read	null
2024-05-17	DeepPavlov at SemEval-2024 Task 8: Leveraging Transfer Learning for Detecting Boundaries of Machine-Generated Texts	Anastasia Voznyuk et.al.	2405.10629	translate	read	link
2024-05-17	DuoSpaceNet: Leveraging Both Bird’s-Eye-View and Perspective View Representations for 3D Object Detection	Zhe Huang et.al.	2405.10577	translate	read	null
2024-05-16	Drone-type-Set: Drone types detection benchmark for drone detection and tracking	Kholoud AlDosari et.al.	2405.10398	translate	read	null
2024-05-16	Grounded 3D-LLM with Referent Tokens	Yilun Chen et.al.	2405.10370	translate	read	link
2024-05-16	Grounding DINO 1.5: Advance the “Edge” of Open-Set Object Detection	Tianhe Ren et.al.	2405.10300	translate	read	link
2024-05-16	Towards Task-Compatible Compressible Representations	Anderson de Andrade et.al.	2405.10244	translate	read	link
2024-05-16	SpecDETR: A Transformer-based Hyperspectral Point Object Detection Network	Zhaoxu Li et.al.	2405.10148	translate	read	link
2024-05-16	SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection	Mingxuan Liu et.al.	2405.10053	translate	read	link
2024-05-16	FPDIoU Loss: A Loss Function for Efficient Bounding Box Regression of Rotated Object Detection	Siliang Ma et.al.	2405.09942	translate	read	null
2024-05-16	Infrared Adversarial Car Stickers	Xiaopei Zhu et.al.	2405.09924	translate	read	null
2024-05-16	PillarNeXt: Improving the 3D detector by introducing Voxel2Pillar feature encoding and extracting multi-scale features	Xusheng Li et.al.	2405.09828	translate	read	null
2024-05-16	Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection	Feiran Li et.al.	2405.09782	translate	read	link
2024-05-15	Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation	Guo Yachan et.al.	2405.09682	translate	read	null
2024-05-15	Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels	Guozhang Liu et.al.	2405.09024	translate	read	null
2024-05-14	CLIP with Quality Captions: A Strong Pretraining for Vision Tasks	Pavan Kumar Anasosalu Vasu et.al.	2405.08911	translate	read	null
2024-05-14	Open-Vocabulary Object Detection via Neighboring Region Attention Alignment	Sunyuan Qiang et.al.	2405.08593	translate	read	null
2024-05-14	Semantic Contextualization of Face Forgery: A New Definition, Dataset, and Detection Method	Mian Zou et.al.	2405.08487	translate	read	link
2024-05-14	RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images	Zong-Wei Hong et.al.	2405.08483	translate	read	link
2024-05-14	Multimodal Collaboration Networks for Geospatial Vehicle Detection in Dense, Occluded, and Large-Scale Events	Xin Wu et.al.	2405.08251	translate	read	link
2024-05-13	RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors	Liam Dugan et.al.	2405.07940	translate	read	null
2024-05-13	oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving	Abdul Hannan Khan et.al.	2405.07698	translate	read	null
2024-05-13	MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders	Xueying Jiang et.al.	2405.07696	translate	read	null
2024-05-13	Quality-aware Selective Fusion Network for V-D-T Salient Object Detection	Liuxin Bao et.al.	2405.07655	translate	read	link
2024-05-13	Fast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keying	Thomas Pöllabauer et.al.	2405.07653	translate	read	null
2024-05-13	Integrity Monitoring of 3D Object Detection in Automated Driving Systems using Raw Activation Patterns and Spatial Filtering	Hakan Yekta Yatbaz et.al.	2405.07600	translate	read	null
2024-05-13	Environmental Matching Attack Against Unmanned Aerial Vehicles Object Detection	Dehong Kong et.al.	2405.07595	translate	read	null
2024-05-13	Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis	Tianci Bi et.al.	2405.07481	translate	read	null
2024-05-13	Enhancing 3D Object Detection by Using Neural Network with Self-adaptive Thresholding	Houze Liu et.al.	2405.07479	translate	read	null
2024-05-12	MAML MOT: Multiple Object Tracking based on Meta-Learning	Jiayi Chen et.al.	2405.07272	translate	read	null
2024-05-10	How to Augment for Atmospheric Turbulence Effects on Thermal Adapted Object Detection Models?	Engin Uzun et.al.	2405.06383	translate	read	null
2024-05-10	Precise Apple Detection and Localization in Orchards using YOLOv5 for Robotic Harvesting Systems	Jiang Ziyue et.al.	2405.06260	translate	read	null
2024-05-09	CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks	Nick et.al.	2405.05755	translate	read	null
2024-05-09	Depth Awakens: A Depth-perceptual Attention Fusion Network for RGB-D Camouflaged Object Detection	Xinran Liu et.al.	2405.05614	translate	read	null
2024-05-09	The object detection model uses combined extraction with KNN and RF classification	Florentina Tatrin Kurniati et.al.	2405.05551	translate	read	null
2024-05-08	Reviewing Intelligent Cinematography: AI research for camera-based video production	Adrian Azzarelli et.al.	2405.05039	translate	read	null
2024-05-07	A Novel Wide-Area Multiobject Detection System with High-Probability Region Searching	Xianlei Long et.al.	2405.04589	translate	read	null
2024-05-07	DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving	Chen Min et.al.	2405.04390	translate	read	null
2024-05-07	A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields	Raiyan Rahman et.al.	2405.04305	translate	read	null
2024-05-07	ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers	Jinke Li et.al.	2405.04299	translate	read	link
2024-05-07	Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore	Junchao Wu et.al.	2405.04286	translate	read	link
2024-05-07	Deep Event-based Object Detection in Autonomous Driving: A Survey	Bingquan Zhou et.al.	2405.03995	translate	read	null
2024-05-06	BadFusion: 2D-Oriented Backdoor Attacks against 3D Object Detection	Saket S. Chaturvedi et.al.	2405.03884	translate	read	null
2024-05-06	RepVGG-GELAN: Enhanced GELAN with VGG-STYLE ConvNets for Brain Tumour Detection	Thennarasi Balakrishnan et.al.	2405.03541	translate	read	link
2024-05-06	Low-light Object Detection	Pengpeng Li et.al.	2405.03519	translate	read	null
2024-05-06	Salient Object Detection From Arbitrary Modalities	Nianchang Huang et.al.	2405.03352	translate	read	null
2024-05-06	Modality Prompts for Arbitrary Modality Salient Object Detection	Nianchang Huang et.al.	2405.03351	translate	read	null
2024-05-06	Vietnamese AI Generated Text Detection	Quang-Dan Tran et.al.	2405.03206	translate	read	null
2024-05-06	PTQ4SAM: Post-Training Quantization for Segment Anything	Chengtao Lv et.al.	2405.03144	translate	read	link
2024-05-05	Performance Evaluation of Real-Time Object Detection for Electric Scooters	Dong Chen et.al.	2405.03039	translate	read	link
2024-05-05	SalFAU-Net: Saliency Fusion Attention U-Net for Salient Object Detection	Kassaw Abraham Mulat et.al.	2405.02906	translate	read	null
2024-05-07	Adaptive Guidance Learning for Camouflaged Object Detection	Zhennan Chen et.al.	2405.02824	translate	read	null
2024-05-05	PVTransformer: Point-to-Voxel Transformer for Scalable 3D Object Detection	Zhaoqi Leng et.al.	2405.02811	translate	read	null
2024-05-02	Segmentation-Free Outcome Prediction in Head and Neck Cancer: Deep Learning-based Feature Extraction from Multi-Angle Maximum Intensity Projections (MA-MIPs) of PET Images	Amirhosein Toosi et.al.	2405.01756	translate	read	null
2024-05-02	PointCompress3D – A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems	Walter Zimmer et.al.	2405.01750	translate	read	null
2024-05-02	Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey	Guoping Xu et.al.	2405.01725	translate	read	link
2024-05-02	SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients	Tushar Verma et.al.	2405.01699	translate	read	null
2024-05-02	Imagine the Unseen: Occluded Pedestrian Detection via Adversarial Feature Completion	Shanshan Zhang et.al.	2405.01311	translate	read	null
2024-05-02	Overcoming LLM Challenges using RAG-Driven Precision in Coffee Leaf Disease Remediation	Dr. Selva Kumar S et.al.	2405.01310	translate	read	null
2024-05-02	Towards Consistent Object Detection via LiDAR-Camera Synergy	Kai Luo et.al.	2405.01258	translate	read	link
2024-05-02	Federated Learning with Heterogeneous Data Handling for Robust Vehicular Object Detection	Ahmad Khalil et.al.	2405.01108	translate	read	null
2024-05-01	Grains of Saliency: Optimizing Saliency-based Training of Biometric Attack Detection Models	Colton R. Crum et.al.	2405.00650	translate	read	null
2024-05-01	Object detection under the linear subspace model with application to cryo-EM images	Amitay Eldar et.al.	2405.00364	translate	read	null
