Object Detection - 2024-08 | Paper Arxiv Daily

Object Detection - 2024-08

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-08-30	Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations	Ahmed Hammam et.al.	2408.17311	translate	read	null
2024-08-30	Hybrid Classification-Regression Adaptive Loss for Dense Object Detection	Yanquan Huang et.al.	2408.17182	translate	read	null
2024-08-30	UTrack: Multi-Object Tracking with Uncertain Detections	Edgardo Solano-Carrillo et.al.	2408.17098	translate	read	link
2024-08-30	PIB: Prioritized Information Bottleneck Framework for Collaborative Edge Video Analytics	Zhengru Fang et.al.	2408.17047	translate	read	null
2024-08-30	CP-VoteNet: Contrastive Prototypical VoteNet for Few-Shot Point Cloud Object Detection	Xuejing Li et.al.	2408.17036	translate	read	null
2024-08-30	MakeWay: Object-Aware Costmaps for Proactive Indoor Navigation Using LiDAR	Binbin Xu et.al.	2408.17034	translate	read	null
2024-08-29	Analyzing Errors in Controlled Turret System Given Target Location Input from Artificial Intelligence Methods in Automatic Target Recognition	Matthew Karlson et.al.	2408.16923	translate	read	null
2024-08-29	Space3D-Bench: Spatial 3D Question Answering Benchmark	Emilia Szymanska et.al.	2408.16662	translate	read	null
2024-08-29	SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection	Rohit Venkata Sai Dulam et.al.	2408.16645	translate	read	null
2024-08-29	UAV-Based Human Body Detector Selection and Fusion for Geolocated Saliency Map Generation	Piotr Rudol et.al.	2408.16501	translate	read	null
2024-08-29	Weakly Supervised Object Detection for Automatic Tooth-marked Tongue Recognition	Yongcun Zhang et.al.	2408.16451	translate	read	link
2024-08-29	Enhancing Sound Source Localization via False Negative Elimination	Zengjie Song et.al.	2408.16448	translate	read	link
2024-08-29	High-yield large-scale suspended graphene membranes over closed cavities for sensor applications	Sebastian Lukas et.al.	2408.16408	translate	read	null
2024-08-29	FA-YOLO: Research On Efficient Feature Selection YOLO Improved Algorithm Based On FMDS and AGMF Modules	Yukang Huo et.al.	2408.16313	translate	read	null
2024-08-29	Anno-incomplete Multi-dataset Detection	Yiran Xu et.al.	2408.16247	translate	read	null
2024-08-29	PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird’s-Eye-View	Zichen Yu et.al.	2408.16200	translate	read	null
2024-08-28	ChartEye: A Deep Learning Framework for Chart Information Extraction	Osama Mustafa et.al.	2408.16123	translate	read	null
2024-08-28	microYOLO: Towards Single-Shot Object Detection on Microcontrollers	Mark Deutel et.al.	2408.15865	translate	read	null
2024-08-28	What is YOLOv8: An In-Depth Exploration of the Internal Features of the Next-Generation Object Detector	Muhammad Yaseen et.al.	2408.15857	translate	read	null
2024-08-28	Network transferability of adversarial patches in real-time object detection	Jens Bayer et.al.	2408.15833	translate	read	link
2024-08-28	Object Detection for Vehicle Dashcams using Transformers	Osama Mustafa et.al.	2408.15809	translate	read	null
2024-08-29	RIDE: Boosting 3D Object Detection for LiDAR Point Clouds via Rotation-Invariant Analysis	Zhaoxuan Wang et.al.	2408.15643	translate	read	null
2024-08-28	MMDRFuse: Distilled Mini-Model with Dynamic Refresh for Multi-Modality Image Fusion	Yanglin Deng et.al.	2408.15641	translate	read	link
2024-08-28	Semantic and goal-oriented edge computing for satellite Earth Observation	Beatriz Soret et.al.	2408.15639	translate	read	null
2024-08-28	Transfer Learning from Simulated to Real Scenes for Monocular 3D Object Detection	Sondos Mohamed et.al.	2408.15637	translate	read	null
2024-08-28	Can Visual Language Models Replace OCR-Based Visual Question Answering Pipelines in Production? A Case Study in Retail	Bianca Lamm et.al.	2408.15626	translate	read	null
2024-08-28	RoboSense: Large-scale Dataset and Benchmark for Multi-sensor Low-speed Autonomous Driving	Haisheng Su et.al.	2408.15503	translate	read	null
2024-08-27	A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships	Gracile Astlin Pereira et.al.	2408.15178	translate	read	null
2024-08-27	Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion Guidance	Kunpeng Wang et.al.	2408.15063	translate	read	null
2024-08-27	Hierarchical Graph Interaction Transformer with Dynamic Token Clustering for Camouflaged Object Detection	Siyuan Yao et.al.	2408.15020	translate	read	link
2024-08-27	Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation	Elona Shatri et.al.	2408.15002	translate	read	null
2024-08-27	BOX3D: Lightweight Camera-LiDAR Fusion for 3D Object Detection and Localization	Mario A. V. Saucedo et.al.	2408.14941	translate	read	null
2024-08-26	PVAFN: Point-Voxel Attention Fusion Network with Multi-Pooling Enhancing for 3D Object Detection	Yidi Li et.al.	2408.14600	translate	read	null
2024-08-26	A Survey of Camouflaged Object Detection and Beyond	Fengyang Xiao et.al.	2408.14562	translate	read	null
2024-08-26	Beyond Few-shot Object Detection: A Detailed Survey	Vishal Chudasama et.al.	2408.14249	translate	read	null
2024-08-26	TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation	Anh-Dzung Doan et.al.	2408.14227	translate	read	null
2024-08-26	EMDFNet: Efficient Multi-scale and Diverse Feature Network for Traffic Sign Detection	Pengyu Li et.al.	2408.14189	translate	read	null
2024-08-26	More Pictures Say More: Visual Intersection Network for Open Set Object Detection	Bingcheng Dong et.al.	2408.14032	translate	read	null
2024-08-25	Bridging the Gap between Real-world and Synthetic Images for Testing Autonomous Driving Systems	Mohammad Hossein Amini et.al.	2408.13950	translate	read	null
2024-08-25	OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation	Muhammad Rameez ur Rahman et.al.	2408.13936	translate	read	link
2024-08-25	Infrared Domain Adaptation with Zero-Shot Quantization	Burak Sevsay et.al.	2408.13925	translate	read	null
2024-08-25	TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training	Li Li et.al.	2408.13902	translate	read	null
2024-08-25	Selectively Dilated Convolution for Accuracy-Preserving Sparse Pillar-based Embedded 3D Object Detection	Seongmin Park et.al.	2408.13798	translate	read	null
2024-08-24	Mean Height Aided Post-Processing for Pedestrian Detection	Jing Yuan et.al.	2408.13646	translate	read	null
2024-08-23	MCTR: Multi Camera Tracking Transformer	Alexandru Niculescu-Mizil et.al.	2408.13243	translate	read	null
2024-08-23	DeTPP: Leveraging Object Detection for Robust Long-Horizon Event Prediction	Ivan Karpukhin et.al.	2408.13131	translate	read	null
2024-08-23	VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models	Wentao Wu et.al.	2408.13031	translate	read	link
2024-08-23	Can AI Assistance Aid in the Grading of Handwritten Answer Sheets?	Pritam Sil et.al.	2408.12870	translate	read	null
2024-08-23	Symmetric masking strategy enhances the performance of Masked Image Modeling	Khanh-Binh Nguyen et.al.	2408.12772	translate	read	null
2024-08-22	CatFree3D: Category-agnostic 3D Object Detection with Diffusion	Wenjing Bian et.al.	2408.12747	translate	read	null
2024-08-22	Revisiting Cross-Domain Problem for LiDAR-based 3D Object Detection	Ruixiao Zhang et.al.	2408.12708	translate	read	null
2024-08-22	xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations	Can Qin et.al.	2408.12590	translate	read	null
2024-08-22	Enhanced Parking Perception by Multi-Task Fisheye Cross-view Transformers	Antonyo Musabini et.al.	2408.12575	translate	read	null
2024-08-22	Comparing YOLOv5 Variants for Vehicle Detection: A Performance Analysis	Athulya Sundaresan Geetha et.al.	2408.12550	translate	read	null
2024-08-22	UMAD: University of Macau Anomaly Detection Benchmark Dataset	Dong Li et.al.	2408.12527	translate	read	link
2024-08-22	Class-balanced Open-set Semi-supervised Object Detection for Medical Images	Zhanyun Lu et.al.	2408.12355	translate	read	null
2024-08-22	OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and Fusion	Guoting Wei et.al.	2408.12246	translate	read	null
2024-08-22	On the Credibility of Backdoor Attacks Against Object Detectors in the Physical World	Bao Gia Doan et.al.	2408.12122	translate	read	null
2024-08-21	CARLA Drone: Monocular 3D Object Detection from a Different Perspective	Johannes Meier et.al.	2408.11958	translate	read	null
2024-08-21	SBDet: A Symmetry-Breaking Object Detector via Relaxed Rotation-Equivariance	Zhiqiang Wu et.al.	2408.11760	translate	read	null
2024-08-21	Video-to-Text Pedestrian Monitoring (VTPM): Leveraging Computer Vision and Large Language Models for Privacy-Preserve Pedestrian Activity Monitoring at Intersections	Ahmed S. Abdelrahman et.al.	2408.11649	translate	read	null
2024-08-21	Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection	Liang Yao et.al.	2408.11407	translate	read	null
2024-08-20	On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes	Sadia Ilyas et.al.	2408.11221	translate	read	null
2024-08-20	Quantum Inverse Contextual Vision Transformers (Q-ICVT): A New Frontier in 3D Object Detection for AVs	Sanjay Bhargav Dharavath et.al.	2408.11207	translate	read	link
2024-08-20	A Closer Look at Data Augmentation Strategies for Finetuning-Based Low/Few-Shot Object Detection	Vladislav Li et.al.	2408.10940	translate	read	null
2024-08-20	Aligning Object Detector Bounding Boxes with Human Preference	Ombretta Strafforello et.al.	2408.10844	translate	read	null
2024-08-20	LightMDETR: A Lightweight Approach for Low-Cost Open-Vocabulary Object Detection Training	Binta Sow et.al.	2408.10787	translate	read	null
2024-08-20	Just a Hint: Point-Supervised Camouflaged Object Detection	Huafeng Chen et.al.	2408.10777	translate	read	null
2024-08-21	Generative AI in Industrial Machine Vision – A Review	Hans Aoyang Zhou et.al.	2408.10775	translate	read	null
2024-08-20	Detection of Intracranial Hemorrhage for Trauma Patients	Antoine P. Sanner et.al.	2408.10768	translate	read	null
2024-08-20	SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection	Huafeng Chen et.al.	2408.10760	translate	read	null
2024-08-20	Leveraging Temporal Contexts to Enhance Vehicle-Infrastructure Cooperative Perception	Jiaru Zhong et.al.	2408.10531	translate	read	null
2024-08-19	Leveraging Superfluous Information in Contrastive Representation Learning	Xuechu Yu et.al.	2408.10292	translate	read	null
2024-08-19	SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition	Wiktor Mucha et.al.	2408.10037	translate	read	null
2024-08-19	Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving	Jun Yan et.al.	2408.09839	translate	read	link
2024-08-19	Latent Diffusion for Guided Document Table Generation	Syed Jawwad Haider Hamdani et.al.	2408.09800	translate	read	null
2024-08-18	Adversarial Attacked Teacher for Unsupervised Domain Adaptive Object Detection	Kaiwen Wang et.al.	2408.09431	translate	read	null
2024-08-18	Boundary-Recovering Network for Temporal Action Detection	Jihwan Kim et.al.	2408.09354	translate	read	null
2024-08-18	YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems	Chien-Yao Wang et.al.	2408.09332	translate	read	null
2024-08-17	GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System	Shuo Wang et.al.	2408.09191	translate	read	null
2024-08-17	PADetBench: Towards Benchmarking Physical Attacks against Object Detection	Jiawei Lian et.al.	2408.09181	translate	read	link
2024-08-17	MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation	Xiao Zhao et.al.	2408.09122	translate	read	null
2024-08-17	Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community	Jiancheng Pan et.al.	2408.09110	translate	read	null
2024-08-16	SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation	Xinyu Xiong et.al.	2408.08870	translate	read	link
2024-08-16	Multimodal Relational Triple Extraction with Query-based Entity Object Transformer	Lei Hei et.al.	2408.08709	translate	read	null
2024-08-16	Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs	Jinming Liu et.al.	2408.08575	translate	read	null
2024-08-15	5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks	Dongshuo Yin et.al.	2408.08345	translate	read	link
2024-08-15	Learned Multimodal Compression for Autonomous Driving	Hadi Hadizadeh et.al.	2408.08211	translate	read	null
2024-08-16	OC3D: Weakly Supervised Outdoor 3D Object Detection with Only Coarse Click Annotation	Qiming Xia et.al.	2408.08092	translate	read	null
2024-08-15	CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection	Xunfa Lai et.al.	2408.08050	translate	read	null
2024-08-15	Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement	Wenxuan Li et.al.	2408.07999	translate	read	null
2024-08-15	GOReloc: Graph-based Object-Level Relocalization for Visual SLAM	Yutong Wang et.al.	2408.07917	translate	read	link
2024-08-14	See It All: Contextualized Late Aggregation for 3D Dense Captioning	Minjung Kim et.al.	2408.07648	translate	read	null
2024-08-14	Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving	Yuqing Wen et.al.	2408.07605	translate	read	null
2024-08-14	Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection	Zhonglin Chen et.al.	2408.07455	translate	read	null
2024-08-14	Sign language recognition based on deep learning and low-cost handcrafted descriptors	Alvaro Leandro Cavalcante Carneiro et.al.	2408.07244	translate	read	link
2024-08-13	Vision Language Model for Interpretable and Fine-grained Detection of Safety Compliance in Diverse Workplaces	Zhiling Chen et.al.	2408.07146	translate	read	null
2024-08-13	Divide and Conquer: Improving Multi-Camera 3D Perception with 2D Semantic-Depth Priors and Input-Dependent Queries	Qi Song et.al.	2408.06901	translate	read	null
2024-08-13	Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection	Matthias Bartolo et.al.	2408.06803	translate	read	link
2024-08-13	Exploring Domain Shift on Radar-Based 3D Object Detection Amidst Diverse Environmental Conditions	Miao Zhang et.al.	2408.06772	translate	read	null
2024-08-13	Unified-IoU: For High-Quality Object Detection	Xiangjie Luo et.al.	2408.06636	translate	read	link
2024-08-13	A lightweight YOLOv5-FFM model for occlusion pedestrian detection	Xiangjie Luo et.al.	2408.06633	translate	read	null
2024-08-13	MV-DETR: Multi-modality indoor object detection by Multi-View DEtecton TRansformers	Zichao Dong et.al.	2408.06604	translate	read	null
2024-08-12	Latent Disentanglement for Low Light Image Enhancement	Zhihao Zheng et.al.	2408.06245	translate	read	null
2024-08-12	MR3D-Net: Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion for LiDAR-Based Collective Perception	Sven Teufel et.al.	2408.06137	translate	read	link
2024-08-12	DPDETR: Decoupled Position Detection Transformer for Infrared-Visible Object Detection	Junjie Guo et.al.	2408.06123	translate	read	null
2024-08-12	Optimizing Vision Transformers with Data-Free Knowledge Transfer	Gousia Habib et.al.	2408.05952	translate	read	null
2024-08-12	MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection	Zitian Wang et.al.	2408.05945	translate	read	null
2024-08-12	Multi-scale Contrastive Adaptor Learning for Segmenting Anything in Underperformed Scenes	Ke Zhou et.al.	2408.05936	translate	read	null
2024-08-12	Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts	Peng Wu et.al.	2408.05905	translate	read	null
2024-08-12	Toward Pedestrian Head Tracking: A Benchmark Dataset and an Information Fusion Network	Kailai Sun et.al.	2408.05877	translate	read	null
2024-08-11	U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training	Zhuoyan Liu et.al.	2408.05780	translate	read	link
2024-08-11	FADE: A Dataset for Detecting Falling Objects around Buildings in Video	Zhigang Tu et.al.	2408.05750	translate	read	null
2024-08-09	DeepInteraction++: Multi-Modality Interaction for Autonomous Driving	Zeyu Yang et.al.	2408.05075	translate	read	link
2024-08-09	RadarPillars: Efficient Object Detection from 4D Radar Point Clouds	Alexander Musiat et.al.	2408.05020	translate	read	null
2024-08-09	Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation	Yifan Feng et.al.	2408.04804	translate	read	link
2024-08-08	SOD-YOLOv8 – Enhancing YOLOv8 for Small Object Detection in Traffic Scenes	Boshra Khalili et.al.	2408.04786	translate	read	null
2024-08-08	Data-Driven Pixel Control: Challenges and Prospects	Saurabh Farkya et.al.	2408.04767	translate	read	null
2024-08-10	SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More	Tianrun Chen et.al.	2408.04579	translate	read	null
2024-08-07	Impact Analysis of Data Drift Towards The Development of Safety-Critical Automotive System	Md Shahi Amran Hossain et.al.	2408.04476	translate	read	null
2024-08-08	Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework	Subhasis Dasgupta et.al.	2408.04360	translate	read	null
2024-08-08	Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection	Shixuan Gao et.al.	2408.04326	translate	read	null
2024-08-08	LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection	Mervat Abassy et.al.	2408.04284	translate	read	null
2024-08-08	Learning to Rewrite: Generalized LLM-Generated Text Detection	Wei Hao et.al.	2408.04237	translate	read	null
2024-08-07	PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI Estimation	Blessing Agyei Kyem et.al.	2408.04110	translate	read	link
2024-08-07	Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection	Christian Fruhwirth-Reisinger et.al.	2408.03790	translate	read	null
2024-08-07	Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model	Guoqing Zhu et.al.	2408.03748	translate	read	link
2024-08-07	CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications	Tianfang Zhang et.al.	2408.03703	translate	read	link
2024-08-07	L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection	Xun Huang et.al.	2408.03677	translate	read	null
2024-08-07	Designing Extremely Memory-Efficient CNNs for On-device Vision Tasks	Jaewook Lee et.al.	2408.03663	translate	read	null
2024-08-07	Leveraging LLMs for Enhanced Open-Vocabulary 3D Scene Understanding in Autonomous Driving	Amirhosein Chahe et.al.	2408.03516	translate	read	null
2024-08-07	GUI Element Detection Using SOTA YOLO Deep Learning Models	Seyed Shayan Daneshvar et.al.	2408.03507	translate	read	null
2024-08-06	AI Foundation Models in Remote Sensing: A Survey	Siqi Lu et.al.	2408.03464	translate	read	null
2024-08-06	Biomedical Image Segmentation: A Systematic Literature Review of Deep Learning Based Object Detection Methods	Fazli Wahid et.al.	2408.03393	translate	read	null
2024-08-06	Nighttime Pedestrian Detection Based on Fore-Background Contrast Learning	He Yao et.al.	2408.03030	translate	read	null
2024-08-06	Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection	Sen Nie et.al.	2408.02891	translate	read	null
2024-08-05	HQOD: Harmonious Quantization for Object Detection	Long Huang et.al.	2408.02561	translate	read	null
2024-08-05	Tensorial template matching for fast cross-correlation with rotations and its application for tomography	Antonio Martinez-Sanchez et.al.	2408.02398	translate	read	null
2024-08-05	Mixture-of-Noises Enhanced Forgery-Aware Predictor for Multi-Face Manipulation Detection and Localization	Changtao Miao et.al.	2408.02306	translate	read	null
2024-08-05	AssemAI: Interpretable Image-Based Anomaly Detection for Manufacturing Pipelines	Renjith Prasad et.al.	2408.02181	translate	read	null
2024-08-04	KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving	Zhihao Lai et.al.	2408.02088	translate	read	null
2024-08-06	A Survey and Evaluation of Adversarial Attacks for Object Detection	Khoi Nguyen Tiet Nguyen et.al.	2408.01934	translate	read	null
2024-08-04	CAF-YOLO: A Robust Framework for Multi-Scale Lesion Detection in Biomedical Imagery	Zilin Chen et.al.	2408.01897	translate	read	null
2024-08-03	Supervised Image Translation from Visible to Infrared Domain for Object Detection	Prahlad Anand et.al.	2408.01843	translate	read	null
2024-08-03	Domain penalisation for improved Out-of-Distribution Generalisation	Shuvam Jena et.al.	2408.01746	translate	read	null
2024-08-03	LAM3D: Leveraging Attention for Monocular 3D Object Detection	Diana-Alexandra Sas et.al.	2408.01739	translate	read	null
2024-08-02	A Robotics-Inspired Scanpath Model Reveals the Importance of Uncertainty and Semantic Object Cues for Gaze Guidance in Dynamic Scenes	Vito Mengers et.al.	2408.01322	translate	read	null
2024-08-02	Underwater Object Detection Enhancement via Channel Stabilization	Muhammad Ali et.al.	2408.01293	translate	read	null
2024-08-02	PGNeXt: High-Resolution Salient Object Detection via Pyramid Grafting Network	Changqun Xia et.al.	2408.01137	translate	read	null
2024-08-02	Effect of Fog Particle Size Distribution on 3D Object Detection Under Adverse Weather Conditions	Ajinkya Shinde et.al.	2408.01085	translate	read	null
2024-08-02	Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model	Yang Jin et.al.	2408.01044	translate	read	null
2024-08-02	MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection	Xiangbo Gao et.al.	2408.01037	translate	read	null
2024-08-02	Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach	Yabin Zhu et.al.	2408.00969	translate	read	null
2024-08-01	Joint Neural Networks for One-shot Object Recognition and Detection	Camilo J. Vargas et.al.	2408.00701	translate	read	null
2024-08-01	Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection	Ruiyang Zhang et.al.	2408.00619	translate	read	null
2024-08-01	U2UData: A Large-scale Cooperative Perception Dataset for Swarm UAVs Autonomous Flight	Tongtong Feng et.al.	2408.00606	translate	read	null
2024-08-01	MUFASA: Multi-View Fusion and Adaptation Network with Spatial Awareness for Radar Object Detection	Xiangyuan Peng et.al.	2408.00565	translate	read	null
2024-08-01	Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval	Gangyan Zeng et.al.	2408.00441	translate	read	null
2024-08-01	MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection	Youjia Fu et.al.	2408.00438	translate	read	null
2024-08-01	DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training	Yu Xie et.al.	2408.00355	translate	read	null
2024-08-01	A Simple Background Augmentation Method for Object Detection with Diffusion Model	Yuhang Li et.al.	2408.00350	translate	read	null
2024-08-01	Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object Detection	Jiacheng Deng et.al.	2408.00286	translate	read	null
2024-08-01	RoCo:Robust Collaborative Perception By Iterative Object Matching and Pose Adjustment	Zhe Huang et.al.	2408.00257	translate	read	null

(<a href=../Object_Detection.md>back to Object Detection</a>)