Object Detection - 2024-08

Publish Date Title Authors PDF Translate Read Code
2024-08-30 Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations Ahmed Hammam et.al. 2408.17311 translate read null
2024-08-30 Hybrid Classification-Regression Adaptive Loss for Dense Object Detection Yanquan Huang et.al. 2408.17182 translate read null
2024-08-30 UTrack: Multi-Object Tracking with Uncertain Detections Edgardo Solano-Carrillo et.al. 2408.17098 translate read link
2024-08-30 PIB: Prioritized Information Bottleneck Framework for Collaborative Edge Video Analytics Zhengru Fang et.al. 2408.17047 translate read null
2024-08-30 CP-VoteNet: Contrastive Prototypical VoteNet for Few-Shot Point Cloud Object Detection Xuejing Li et.al. 2408.17036 translate read null
2024-08-30 MakeWay: Object-Aware Costmaps for Proactive Indoor Navigation Using LiDAR Binbin Xu et.al. 2408.17034 translate read null
2024-08-29 Analyzing Errors in Controlled Turret System Given Target Location Input from Artificial Intelligence Methods in Automatic Target Recognition Matthew Karlson et.al. 2408.16923 translate read null
2024-08-29 Space3D-Bench: Spatial 3D Question Answering Benchmark Emilia Szymanska et.al. 2408.16662 translate read null
2024-08-29 SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection Rohit Venkata Sai Dulam et.al. 2408.16645 translate read null
2024-08-29 UAV-Based Human Body Detector Selection and Fusion for Geolocated Saliency Map Generation Piotr Rudol et.al. 2408.16501 translate read null
2024-08-29 Weakly Supervised Object Detection for Automatic Tooth-marked Tongue Recognition Yongcun Zhang et.al. 2408.16451 translate read link
2024-08-29 Enhancing Sound Source Localization via False Negative Elimination Zengjie Song et.al. 2408.16448 translate read link
2024-08-29 High-yield large-scale suspended graphene membranes over closed cavities for sensor applications Sebastian Lukas et.al. 2408.16408 translate read null
2024-08-29 FA-YOLO: Research On Efficient Feature Selection YOLO Improved Algorithm Based On FMDS and AGMF Modules Yukang Huo et.al. 2408.16313 translate read null
2024-08-29 Anno-incomplete Multi-dataset Detection Yiran Xu et.al. 2408.16247 translate read null
2024-08-29 PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird’s-Eye-View Zichen Yu et.al. 2408.16200 translate read null
2024-08-28 ChartEye: A Deep Learning Framework for Chart Information Extraction Osama Mustafa et.al. 2408.16123 translate read null
2024-08-28 microYOLO: Towards Single-Shot Object Detection on Microcontrollers Mark Deutel et.al. 2408.15865 translate read null
2024-08-28 What is YOLOv8: An In-Depth Exploration of the Internal Features of the Next-Generation Object Detector Muhammad Yaseen et.al. 2408.15857 translate read null
2024-08-28 Network transferability of adversarial patches in real-time object detection Jens Bayer et.al. 2408.15833 translate read link
2024-08-28 Object Detection for Vehicle Dashcams using Transformers Osama Mustafa et.al. 2408.15809 translate read null
2024-08-29 RIDE: Boosting 3D Object Detection for LiDAR Point Clouds via Rotation-Invariant Analysis Zhaoxuan Wang et.al. 2408.15643 translate read null
2024-08-28 MMDRFuse: Distilled Mini-Model with Dynamic Refresh for Multi-Modality Image Fusion Yanglin Deng et.al. 2408.15641 translate read link
2024-08-28 Semantic and goal-oriented edge computing for satellite Earth Observation Beatriz Soret et.al. 2408.15639 translate read null
2024-08-28 Transfer Learning from Simulated to Real Scenes for Monocular 3D Object Detection Sondos Mohamed et.al. 2408.15637 translate read null
2024-08-28 Can Visual Language Models Replace OCR-Based Visual Question Answering Pipelines in Production? A Case Study in Retail Bianca Lamm et.al. 2408.15626 translate read null
2024-08-28 RoboSense: Large-scale Dataset and Benchmark for Multi-sensor Low-speed Autonomous Driving Haisheng Su et.al. 2408.15503 translate read null
2024-08-27 A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships Gracile Astlin Pereira et.al. 2408.15178 translate read null
2024-08-27 Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion Guidance Kunpeng Wang et.al. 2408.15063 translate read null
2024-08-27 Hierarchical Graph Interaction Transformer with Dynamic Token Clustering for Camouflaged Object Detection Siyuan Yao et.al. 2408.15020 translate read link
2024-08-27 Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation Elona Shatri et.al. 2408.15002 translate read null
2024-08-27 BOX3D: Lightweight Camera-LiDAR Fusion for 3D Object Detection and Localization Mario A. V. Saucedo et.al. 2408.14941 translate read null
2024-08-26 PVAFN: Point-Voxel Attention Fusion Network with Multi-Pooling Enhancing for 3D Object Detection Yidi Li et.al. 2408.14600 translate read null
2024-08-26 A Survey of Camouflaged Object Detection and Beyond Fengyang Xiao et.al. 2408.14562 translate read null
2024-08-26 Beyond Few-shot Object Detection: A Detailed Survey Vishal Chudasama et.al. 2408.14249 translate read null
2024-08-26 TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation Anh-Dzung Doan et.al. 2408.14227 translate read null
2024-08-26 EMDFNet: Efficient Multi-scale and Diverse Feature Network for Traffic Sign Detection Pengyu Li et.al. 2408.14189 translate read null
2024-08-26 More Pictures Say More: Visual Intersection Network for Open Set Object Detection Bingcheng Dong et.al. 2408.14032 translate read null
2024-08-25 Bridging the Gap between Real-world and Synthetic Images for Testing Autonomous Driving Systems Mohammad Hossein Amini et.al. 2408.13950 translate read null
2024-08-25 OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation Muhammad Rameez ur Rahman et.al. 2408.13936 translate read link
2024-08-25 Infrared Domain Adaptation with Zero-Shot Quantization Burak Sevsay et.al. 2408.13925 translate read null
2024-08-25 TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training Li Li et.al. 2408.13902 translate read null
2024-08-25 Selectively Dilated Convolution for Accuracy-Preserving Sparse Pillar-based Embedded 3D Object Detection Seongmin Park et.al. 2408.13798 translate read null
2024-08-24 Mean Height Aided Post-Processing for Pedestrian Detection Jing Yuan et.al. 2408.13646 translate read null
2024-08-23 MCTR: Multi Camera Tracking Transformer Alexandru Niculescu-Mizil et.al. 2408.13243 translate read null
2024-08-23 DeTPP: Leveraging Object Detection for Robust Long-Horizon Event Prediction Ivan Karpukhin et.al. 2408.13131 translate read null
2024-08-23 VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models Wentao Wu et.al. 2408.13031 translate read link
2024-08-23 Can AI Assistance Aid in the Grading of Handwritten Answer Sheets? Pritam Sil et.al. 2408.12870 translate read null
2024-08-23 Symmetric masking strategy enhances the performance of Masked Image Modeling Khanh-Binh Nguyen et.al. 2408.12772 translate read null
2024-08-22 CatFree3D: Category-agnostic 3D Object Detection with Diffusion Wenjing Bian et.al. 2408.12747 translate read null
2024-08-22 Revisiting Cross-Domain Problem for LiDAR-based 3D Object Detection Ruixiao Zhang et.al. 2408.12708 translate read null
2024-08-22 xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations Can Qin et.al. 2408.12590 translate read null
2024-08-22 Enhanced Parking Perception by Multi-Task Fisheye Cross-view Transformers Antonyo Musabini et.al. 2408.12575 translate read null
2024-08-22 Comparing YOLOv5 Variants for Vehicle Detection: A Performance Analysis Athulya Sundaresan Geetha et.al. 2408.12550 translate read null
2024-08-22 UMAD: University of Macau Anomaly Detection Benchmark Dataset Dong Li et.al. 2408.12527 translate read link
2024-08-22 Class-balanced Open-set Semi-supervised Object Detection for Medical Images Zhanyun Lu et.al. 2408.12355 translate read null
2024-08-22 OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and Fusion Guoting Wei et.al. 2408.12246 translate read null
2024-08-22 On the Credibility of Backdoor Attacks Against Object Detectors in the Physical World Bao Gia Doan et.al. 2408.12122 translate read null
2024-08-21 CARLA Drone: Monocular 3D Object Detection from a Different Perspective Johannes Meier et.al. 2408.11958 translate read null
2024-08-21 SBDet: A Symmetry-Breaking Object Detector via Relaxed Rotation-Equivariance Zhiqiang Wu et.al. 2408.11760 translate read null
2024-08-21 Video-to-Text Pedestrian Monitoring (VTPM): Leveraging Computer Vision and Large Language Models for Privacy-Preserve Pedestrian Activity Monitoring at Intersections Ahmed S. Abdelrahman et.al. 2408.11649 translate read null
2024-08-21 Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection Liang Yao et.al. 2408.11407 translate read null
2024-08-20 On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes Sadia Ilyas et.al. 2408.11221 translate read null
2024-08-20 Quantum Inverse Contextual Vision Transformers (Q-ICVT): A New Frontier in 3D Object Detection for AVs Sanjay Bhargav Dharavath et.al. 2408.11207 translate read link
2024-08-20 A Closer Look at Data Augmentation Strategies for Finetuning-Based Low/Few-Shot Object Detection Vladislav Li et.al. 2408.10940 translate read null
2024-08-20 Aligning Object Detector Bounding Boxes with Human Preference Ombretta Strafforello et.al. 2408.10844 translate read null
2024-08-20 LightMDETR: A Lightweight Approach for Low-Cost Open-Vocabulary Object Detection Training Binta Sow et.al. 2408.10787 translate read null
2024-08-20 Just a Hint: Point-Supervised Camouflaged Object Detection Huafeng Chen et.al. 2408.10777 translate read null
2024-08-21 Generative AI in Industrial Machine Vision – A Review Hans Aoyang Zhou et.al. 2408.10775 translate read null
2024-08-20 Detection of Intracranial Hemorrhage for Trauma Patients Antoine P. Sanner et.al. 2408.10768 translate read null
2024-08-20 SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection Huafeng Chen et.al. 2408.10760 translate read null
2024-08-20 Leveraging Temporal Contexts to Enhance Vehicle-Infrastructure Cooperative Perception Jiaru Zhong et.al. 2408.10531 translate read null
2024-08-19 Leveraging Superfluous Information in Contrastive Representation Learning Xuechu Yu et.al. 2408.10292 translate read null
2024-08-19 SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition Wiktor Mucha et.al. 2408.10037 translate read null
2024-08-19 Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving Jun Yan et.al. 2408.09839 translate read link
2024-08-19 Latent Diffusion for Guided Document Table Generation Syed Jawwad Haider Hamdani et.al. 2408.09800 translate read null
2024-08-18 Adversarial Attacked Teacher for Unsupervised Domain Adaptive Object Detection Kaiwen Wang et.al. 2408.09431 translate read null
2024-08-18 Boundary-Recovering Network for Temporal Action Detection Jihwan Kim et.al. 2408.09354 translate read null
2024-08-18 YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems Chien-Yao Wang et.al. 2408.09332 translate read null
2024-08-17 GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System Shuo Wang et.al. 2408.09191 translate read null
2024-08-17 PADetBench: Towards Benchmarking Physical Attacks against Object Detection Jiawei Lian et.al. 2408.09181 translate read link
2024-08-17 MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation Xiao Zhao et.al. 2408.09122 translate read null
2024-08-17 Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community Jiancheng Pan et.al. 2408.09110 translate read null
2024-08-16 SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation Xinyu Xiong et.al. 2408.08870 translate read link
2024-08-16 Multimodal Relational Triple Extraction with Query-based Entity Object Transformer Lei Hei et.al. 2408.08709 translate read null
2024-08-16 Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs Jinming Liu et.al. 2408.08575 translate read null
2024-08-15 5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks Dongshuo Yin et.al. 2408.08345 translate read link
2024-08-15 Learned Multimodal Compression for Autonomous Driving Hadi Hadizadeh et.al. 2408.08211 translate read null
2024-08-16 OC3D: Weakly Supervised Outdoor 3D Object Detection with Only Coarse Click Annotation Qiming Xia et.al. 2408.08092 translate read null
2024-08-15 CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection Xunfa Lai et.al. 2408.08050 translate read null
2024-08-15 Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement Wenxuan Li et.al. 2408.07999 translate read null
2024-08-15 GOReloc: Graph-based Object-Level Relocalization for Visual SLAM Yutong Wang et.al. 2408.07917 translate read link
2024-08-14 See It All: Contextualized Late Aggregation for 3D Dense Captioning Minjung Kim et.al. 2408.07648 translate read null
2024-08-14 Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving Yuqing Wen et.al. 2408.07605 translate read null
2024-08-14 Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection Zhonglin Chen et.al. 2408.07455 translate read null
2024-08-14 Sign language recognition based on deep learning and low-cost handcrafted descriptors Alvaro Leandro Cavalcante Carneiro et.al. 2408.07244 translate read link
2024-08-13 Vision Language Model for Interpretable and Fine-grained Detection of Safety Compliance in Diverse Workplaces Zhiling Chen et.al. 2408.07146 translate read null
2024-08-13 Divide and Conquer: Improving Multi-Camera 3D Perception with 2D Semantic-Depth Priors and Input-Dependent Queries Qi Song et.al. 2408.06901 translate read null
2024-08-13 Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection Matthias Bartolo et.al. 2408.06803 translate read link
2024-08-13 Exploring Domain Shift on Radar-Based 3D Object Detection Amidst Diverse Environmental Conditions Miao Zhang et.al. 2408.06772 translate read null
2024-08-13 Unified-IoU: For High-Quality Object Detection Xiangjie Luo et.al. 2408.06636 translate read link
2024-08-13 A lightweight YOLOv5-FFM model for occlusion pedestrian detection Xiangjie Luo et.al. 2408.06633 translate read null
2024-08-13 MV-DETR: Multi-modality indoor object detection by Multi-View DEtecton TRansformers Zichao Dong et.al. 2408.06604 translate read null
2024-08-12 Latent Disentanglement for Low Light Image Enhancement Zhihao Zheng et.al. 2408.06245 translate read null
2024-08-12 MR3D-Net: Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion for LiDAR-Based Collective Perception Sven Teufel et.al. 2408.06137 translate read link
2024-08-12 DPDETR: Decoupled Position Detection Transformer for Infrared-Visible Object Detection Junjie Guo et.al. 2408.06123 translate read null
2024-08-12 Optimizing Vision Transformers with Data-Free Knowledge Transfer Gousia Habib et.al. 2408.05952 translate read null
2024-08-12 MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection Zitian Wang et.al. 2408.05945 translate read null
2024-08-12 Multi-scale Contrastive Adaptor Learning for Segmenting Anything in Underperformed Scenes Ke Zhou et.al. 2408.05936 translate read null
2024-08-12 Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts Peng Wu et.al. 2408.05905 translate read null
2024-08-12 Toward Pedestrian Head Tracking: A Benchmark Dataset and an Information Fusion Network Kailai Sun et.al. 2408.05877 translate read null
2024-08-11 U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training Zhuoyan Liu et.al. 2408.05780 translate read link
2024-08-11 FADE: A Dataset for Detecting Falling Objects around Buildings in Video Zhigang Tu et.al. 2408.05750 translate read null
2024-08-09 DeepInteraction++: Multi-Modality Interaction for Autonomous Driving Zeyu Yang et.al. 2408.05075 translate read link
2024-08-09 RadarPillars: Efficient Object Detection from 4D Radar Point Clouds Alexander Musiat et.al. 2408.05020 translate read null
2024-08-09 Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation Yifan Feng et.al. 2408.04804 translate read link
2024-08-08 SOD-YOLOv8 – Enhancing YOLOv8 for Small Object Detection in Traffic Scenes Boshra Khalili et.al. 2408.04786 translate read null
2024-08-08 Data-Driven Pixel Control: Challenges and Prospects Saurabh Farkya et.al. 2408.04767 translate read null
2024-08-10 SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More Tianrun Chen et.al. 2408.04579 translate read null
2024-08-07 Impact Analysis of Data Drift Towards The Development of Safety-Critical Automotive System Md Shahi Amran Hossain et.al. 2408.04476 translate read null
2024-08-08 Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework Subhasis Dasgupta et.al. 2408.04360 translate read null
2024-08-08 Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection Shixuan Gao et.al. 2408.04326 translate read null
2024-08-08 LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection Mervat Abassy et.al. 2408.04284 translate read null
2024-08-08 Learning to Rewrite: Generalized LLM-Generated Text Detection Wei Hao et.al. 2408.04237 translate read null
2024-08-07 PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI Estimation Blessing Agyei Kyem et.al. 2408.04110 translate read link
2024-08-07 Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection Christian Fruhwirth-Reisinger et.al. 2408.03790 translate read null
2024-08-07 Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model Guoqing Zhu et.al. 2408.03748 translate read link
2024-08-07 CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications Tianfang Zhang et.al. 2408.03703 translate read link
2024-08-07 L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection Xun Huang et.al. 2408.03677 translate read null
2024-08-07 Designing Extremely Memory-Efficient CNNs for On-device Vision Tasks Jaewook Lee et.al. 2408.03663 translate read null
2024-08-07 Leveraging LLMs for Enhanced Open-Vocabulary 3D Scene Understanding in Autonomous Driving Amirhosein Chahe et.al. 2408.03516 translate read null
2024-08-07 GUI Element Detection Using SOTA YOLO Deep Learning Models Seyed Shayan Daneshvar et.al. 2408.03507 translate read null
2024-08-06 AI Foundation Models in Remote Sensing: A Survey Siqi Lu et.al. 2408.03464 translate read null
2024-08-06 Biomedical Image Segmentation: A Systematic Literature Review of Deep Learning Based Object Detection Methods Fazli Wahid et.al. 2408.03393 translate read null
2024-08-06 Nighttime Pedestrian Detection Based on Fore-Background Contrast Learning He Yao et.al. 2408.03030 translate read null
2024-08-06 Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection Sen Nie et.al. 2408.02891 translate read null
2024-08-05 HQOD: Harmonious Quantization for Object Detection Long Huang et.al. 2408.02561 translate read null
2024-08-05 Tensorial template matching for fast cross-correlation with rotations and its application for tomography Antonio Martinez-Sanchez et.al. 2408.02398 translate read null
2024-08-05 Mixture-of-Noises Enhanced Forgery-Aware Predictor for Multi-Face Manipulation Detection and Localization Changtao Miao et.al. 2408.02306 translate read null
2024-08-05 AssemAI: Interpretable Image-Based Anomaly Detection for Manufacturing Pipelines Renjith Prasad et.al. 2408.02181 translate read null
2024-08-04 KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving Zhihao Lai et.al. 2408.02088 translate read null
2024-08-06 A Survey and Evaluation of Adversarial Attacks for Object Detection Khoi Nguyen Tiet Nguyen et.al. 2408.01934 translate read null
2024-08-04 CAF-YOLO: A Robust Framework for Multi-Scale Lesion Detection in Biomedical Imagery Zilin Chen et.al. 2408.01897 translate read null
2024-08-03 Supervised Image Translation from Visible to Infrared Domain for Object Detection Prahlad Anand et.al. 2408.01843 translate read null
2024-08-03 Domain penalisation for improved Out-of-Distribution Generalisation Shuvam Jena et.al. 2408.01746 translate read null
2024-08-03 LAM3D: Leveraging Attention for Monocular 3D Object Detection Diana-Alexandra Sas et.al. 2408.01739 translate read null
2024-08-02 A Robotics-Inspired Scanpath Model Reveals the Importance of Uncertainty and Semantic Object Cues for Gaze Guidance in Dynamic Scenes Vito Mengers et.al. 2408.01322 translate read null
2024-08-02 Underwater Object Detection Enhancement via Channel Stabilization Muhammad Ali et.al. 2408.01293 translate read null
2024-08-02 PGNeXt: High-Resolution Salient Object Detection via Pyramid Grafting Network Changqun Xia et.al. 2408.01137 translate read null
2024-08-02 Effect of Fog Particle Size Distribution on 3D Object Detection Under Adverse Weather Conditions Ajinkya Shinde et.al. 2408.01085 translate read null
2024-08-02 Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model Yang Jin et.al. 2408.01044 translate read null
2024-08-02 MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection Xiangbo Gao et.al. 2408.01037 translate read null
2024-08-02 Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach Yabin Zhu et.al. 2408.00969 translate read null
2024-08-01 Joint Neural Networks for One-shot Object Recognition and Detection Camilo J. Vargas et.al. 2408.00701 translate read null
2024-08-01 Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection Ruiyang Zhang et.al. 2408.00619 translate read null
2024-08-01 U2UData: A Large-scale Cooperative Perception Dataset for Swarm UAVs Autonomous Flight Tongtong Feng et.al. 2408.00606 translate read null
2024-08-01 MUFASA: Multi-View Fusion and Adaptation Network with Spatial Awareness for Radar Object Detection Xiangyuan Peng et.al. 2408.00565 translate read null
2024-08-01 Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval Gangyan Zeng et.al. 2408.00441 translate read null
2024-08-01 MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection Youjia Fu et.al. 2408.00438 translate read null
2024-08-01 DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training Yu Xie et.al. 2408.00355 translate read null
2024-08-01 A Simple Background Augmentation Method for Object Detection with Diffusion Model Yuhang Li et.al. 2408.00350 translate read null
2024-08-01 Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object Detection Jiacheng Deng et.al. 2408.00286 translate read null
2024-08-01 RoCo:Robust Collaborative Perception By Iterative Object Matching and Pose Adjustment Zhe Huang et.al. 2408.00257 translate read null

(<a href=../Object_Detection.md>back to Object Detection</a>)