Object Detection - 2024-08
Object Detection - 2024-08
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-08-30 | Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations | Ahmed Hammam et.al. | 2408.17311 | translate | read | null |
| 2024-08-30 | Hybrid Classification-Regression Adaptive Loss for Dense Object Detection | Yanquan Huang et.al. | 2408.17182 | translate | read | null |
| 2024-08-30 | UTrack: Multi-Object Tracking with Uncertain Detections | Edgardo Solano-Carrillo et.al. | 2408.17098 | translate | read | link |
| 2024-08-30 | PIB: Prioritized Information Bottleneck Framework for Collaborative Edge Video Analytics | Zhengru Fang et.al. | 2408.17047 | translate | read | null |
| 2024-08-30 | CP-VoteNet: Contrastive Prototypical VoteNet for Few-Shot Point Cloud Object Detection | Xuejing Li et.al. | 2408.17036 | translate | read | null |
| 2024-08-30 | MakeWay: Object-Aware Costmaps for Proactive Indoor Navigation Using LiDAR | Binbin Xu et.al. | 2408.17034 | translate | read | null |
| 2024-08-29 | Analyzing Errors in Controlled Turret System Given Target Location Input from Artificial Intelligence Methods in Automatic Target Recognition | Matthew Karlson et.al. | 2408.16923 | translate | read | null |
| 2024-08-29 | Space3D-Bench: Spatial 3D Question Answering Benchmark | Emilia Szymanska et.al. | 2408.16662 | translate | read | null |
| 2024-08-29 | SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection | Rohit Venkata Sai Dulam et.al. | 2408.16645 | translate | read | null |
| 2024-08-29 | UAV-Based Human Body Detector Selection and Fusion for Geolocated Saliency Map Generation | Piotr Rudol et.al. | 2408.16501 | translate | read | null |
| 2024-08-29 | Weakly Supervised Object Detection for Automatic Tooth-marked Tongue Recognition | Yongcun Zhang et.al. | 2408.16451 | translate | read | link |
| 2024-08-29 | Enhancing Sound Source Localization via False Negative Elimination | Zengjie Song et.al. | 2408.16448 | translate | read | link |
| 2024-08-29 | High-yield large-scale suspended graphene membranes over closed cavities for sensor applications | Sebastian Lukas et.al. | 2408.16408 | translate | read | null |
| 2024-08-29 | FA-YOLO: Research On Efficient Feature Selection YOLO Improved Algorithm Based On FMDS and AGMF Modules | Yukang Huo et.al. | 2408.16313 | translate | read | null |
| 2024-08-29 | Anno-incomplete Multi-dataset Detection | Yiran Xu et.al. | 2408.16247 | translate | read | null |
| 2024-08-29 | PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird’s-Eye-View | Zichen Yu et.al. | 2408.16200 | translate | read | null |
| 2024-08-28 | ChartEye: A Deep Learning Framework for Chart Information Extraction | Osama Mustafa et.al. | 2408.16123 | translate | read | null |
| 2024-08-28 | microYOLO: Towards Single-Shot Object Detection on Microcontrollers | Mark Deutel et.al. | 2408.15865 | translate | read | null |
| 2024-08-28 | What is YOLOv8: An In-Depth Exploration of the Internal Features of the Next-Generation Object Detector | Muhammad Yaseen et.al. | 2408.15857 | translate | read | null |
| 2024-08-28 | Network transferability of adversarial patches in real-time object detection | Jens Bayer et.al. | 2408.15833 | translate | read | link |
| 2024-08-28 | Object Detection for Vehicle Dashcams using Transformers | Osama Mustafa et.al. | 2408.15809 | translate | read | null |
| 2024-08-29 | RIDE: Boosting 3D Object Detection for LiDAR Point Clouds via Rotation-Invariant Analysis | Zhaoxuan Wang et.al. | 2408.15643 | translate | read | null |
| 2024-08-28 | MMDRFuse: Distilled Mini-Model with Dynamic Refresh for Multi-Modality Image Fusion | Yanglin Deng et.al. | 2408.15641 | translate | read | link |
| 2024-08-28 | Semantic and goal-oriented edge computing for satellite Earth Observation | Beatriz Soret et.al. | 2408.15639 | translate | read | null |
| 2024-08-28 | Transfer Learning from Simulated to Real Scenes for Monocular 3D Object Detection | Sondos Mohamed et.al. | 2408.15637 | translate | read | null |
| 2024-08-28 | Can Visual Language Models Replace OCR-Based Visual Question Answering Pipelines in Production? A Case Study in Retail | Bianca Lamm et.al. | 2408.15626 | translate | read | null |
| 2024-08-28 | RoboSense: Large-scale Dataset and Benchmark for Multi-sensor Low-speed Autonomous Driving | Haisheng Su et.al. | 2408.15503 | translate | read | null |
| 2024-08-27 | A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships | Gracile Astlin Pereira et.al. | 2408.15178 | translate | read | null |
| 2024-08-27 | Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion Guidance | Kunpeng Wang et.al. | 2408.15063 | translate | read | null |
| 2024-08-27 | Hierarchical Graph Interaction Transformer with Dynamic Token Clustering for Camouflaged Object Detection | Siyuan Yao et.al. | 2408.15020 | translate | read | link |
| 2024-08-27 | Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation | Elona Shatri et.al. | 2408.15002 | translate | read | null |
| 2024-08-27 | BOX3D: Lightweight Camera-LiDAR Fusion for 3D Object Detection and Localization | Mario A. V. Saucedo et.al. | 2408.14941 | translate | read | null |
| 2024-08-26 | PVAFN: Point-Voxel Attention Fusion Network with Multi-Pooling Enhancing for 3D Object Detection | Yidi Li et.al. | 2408.14600 | translate | read | null |
| 2024-08-26 | A Survey of Camouflaged Object Detection and Beyond | Fengyang Xiao et.al. | 2408.14562 | translate | read | null |
| 2024-08-26 | Beyond Few-shot Object Detection: A Detailed Survey | Vishal Chudasama et.al. | 2408.14249 | translate | read | null |
| 2024-08-26 | TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation | Anh-Dzung Doan et.al. | 2408.14227 | translate | read | null |
| 2024-08-26 | EMDFNet: Efficient Multi-scale and Diverse Feature Network for Traffic Sign Detection | Pengyu Li et.al. | 2408.14189 | translate | read | null |
| 2024-08-26 | More Pictures Say More: Visual Intersection Network for Open Set Object Detection | Bingcheng Dong et.al. | 2408.14032 | translate | read | null |
| 2024-08-25 | Bridging the Gap between Real-world and Synthetic Images for Testing Autonomous Driving Systems | Mohammad Hossein Amini et.al. | 2408.13950 | translate | read | null |
| 2024-08-25 | OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation | Muhammad Rameez ur Rahman et.al. | 2408.13936 | translate | read | link |
| 2024-08-25 | Infrared Domain Adaptation with Zero-Shot Quantization | Burak Sevsay et.al. | 2408.13925 | translate | read | null |
| 2024-08-25 | TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training | Li Li et.al. | 2408.13902 | translate | read | null |
| 2024-08-25 | Selectively Dilated Convolution for Accuracy-Preserving Sparse Pillar-based Embedded 3D Object Detection | Seongmin Park et.al. | 2408.13798 | translate | read | null |
| 2024-08-24 | Mean Height Aided Post-Processing for Pedestrian Detection | Jing Yuan et.al. | 2408.13646 | translate | read | null |
| 2024-08-23 | MCTR: Multi Camera Tracking Transformer | Alexandru Niculescu-Mizil et.al. | 2408.13243 | translate | read | null |
| 2024-08-23 | DeTPP: Leveraging Object Detection for Robust Long-Horizon Event Prediction | Ivan Karpukhin et.al. | 2408.13131 | translate | read | null |
| 2024-08-23 | VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models | Wentao Wu et.al. | 2408.13031 | translate | read | link |
| 2024-08-23 | Can AI Assistance Aid in the Grading of Handwritten Answer Sheets? | Pritam Sil et.al. | 2408.12870 | translate | read | null |
| 2024-08-23 | Symmetric masking strategy enhances the performance of Masked Image Modeling | Khanh-Binh Nguyen et.al. | 2408.12772 | translate | read | null |
| 2024-08-22 | CatFree3D: Category-agnostic 3D Object Detection with Diffusion | Wenjing Bian et.al. | 2408.12747 | translate | read | null |
| 2024-08-22 | Revisiting Cross-Domain Problem for LiDAR-based 3D Object Detection | Ruixiao Zhang et.al. | 2408.12708 | translate | read | null |
| 2024-08-22 | xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations | Can Qin et.al. | 2408.12590 | translate | read | null |
| 2024-08-22 | Enhanced Parking Perception by Multi-Task Fisheye Cross-view Transformers | Antonyo Musabini et.al. | 2408.12575 | translate | read | null |
| 2024-08-22 | Comparing YOLOv5 Variants for Vehicle Detection: A Performance Analysis | Athulya Sundaresan Geetha et.al. | 2408.12550 | translate | read | null |
| 2024-08-22 | UMAD: University of Macau Anomaly Detection Benchmark Dataset | Dong Li et.al. | 2408.12527 | translate | read | link |
| 2024-08-22 | Class-balanced Open-set Semi-supervised Object Detection for Medical Images | Zhanyun Lu et.al. | 2408.12355 | translate | read | null |
| 2024-08-22 | OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and Fusion | Guoting Wei et.al. | 2408.12246 | translate | read | null |
| 2024-08-22 | On the Credibility of Backdoor Attacks Against Object Detectors in the Physical World | Bao Gia Doan et.al. | 2408.12122 | translate | read | null |
| 2024-08-21 | CARLA Drone: Monocular 3D Object Detection from a Different Perspective | Johannes Meier et.al. | 2408.11958 | translate | read | null |
| 2024-08-21 | SBDet: A Symmetry-Breaking Object Detector via Relaxed Rotation-Equivariance | Zhiqiang Wu et.al. | 2408.11760 | translate | read | null |
| 2024-08-21 | Video-to-Text Pedestrian Monitoring (VTPM): Leveraging Computer Vision and Large Language Models for Privacy-Preserve Pedestrian Activity Monitoring at Intersections | Ahmed S. Abdelrahman et.al. | 2408.11649 | translate | read | null |
| 2024-08-21 | Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection | Liang Yao et.al. | 2408.11407 | translate | read | null |
| 2024-08-20 | On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes | Sadia Ilyas et.al. | 2408.11221 | translate | read | null |
| 2024-08-20 | Quantum Inverse Contextual Vision Transformers (Q-ICVT): A New Frontier in 3D Object Detection for AVs | Sanjay Bhargav Dharavath et.al. | 2408.11207 | translate | read | link |
| 2024-08-20 | A Closer Look at Data Augmentation Strategies for Finetuning-Based Low/Few-Shot Object Detection | Vladislav Li et.al. | 2408.10940 | translate | read | null |
| 2024-08-20 | Aligning Object Detector Bounding Boxes with Human Preference | Ombretta Strafforello et.al. | 2408.10844 | translate | read | null |
| 2024-08-20 | LightMDETR: A Lightweight Approach for Low-Cost Open-Vocabulary Object Detection Training | Binta Sow et.al. | 2408.10787 | translate | read | null |
| 2024-08-20 | Just a Hint: Point-Supervised Camouflaged Object Detection | Huafeng Chen et.al. | 2408.10777 | translate | read | null |
| 2024-08-21 | Generative AI in Industrial Machine Vision – A Review | Hans Aoyang Zhou et.al. | 2408.10775 | translate | read | null |
| 2024-08-20 | Detection of Intracranial Hemorrhage for Trauma Patients | Antoine P. Sanner et.al. | 2408.10768 | translate | read | null |
| 2024-08-20 | SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection | Huafeng Chen et.al. | 2408.10760 | translate | read | null |
| 2024-08-20 | Leveraging Temporal Contexts to Enhance Vehicle-Infrastructure Cooperative Perception | Jiaru Zhong et.al. | 2408.10531 | translate | read | null |
| 2024-08-19 | Leveraging Superfluous Information in Contrastive Representation Learning | Xuechu Yu et.al. | 2408.10292 | translate | read | null |
| 2024-08-19 | SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition | Wiktor Mucha et.al. | 2408.10037 | translate | read | null |
| 2024-08-19 | Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving | Jun Yan et.al. | 2408.09839 | translate | read | link |
| 2024-08-19 | Latent Diffusion for Guided Document Table Generation | Syed Jawwad Haider Hamdani et.al. | 2408.09800 | translate | read | null |
| 2024-08-18 | Adversarial Attacked Teacher for Unsupervised Domain Adaptive Object Detection | Kaiwen Wang et.al. | 2408.09431 | translate | read | null |
| 2024-08-18 | Boundary-Recovering Network for Temporal Action Detection | Jihwan Kim et.al. | 2408.09354 | translate | read | null |
| 2024-08-18 | YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems | Chien-Yao Wang et.al. | 2408.09332 | translate | read | null |
| 2024-08-17 | GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System | Shuo Wang et.al. | 2408.09191 | translate | read | null |
| 2024-08-17 | PADetBench: Towards Benchmarking Physical Attacks against Object Detection | Jiawei Lian et.al. | 2408.09181 | translate | read | link |
| 2024-08-17 | MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation | Xiao Zhao et.al. | 2408.09122 | translate | read | null |
| 2024-08-17 | Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community | Jiancheng Pan et.al. | 2408.09110 | translate | read | null |
| 2024-08-16 | SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation | Xinyu Xiong et.al. | 2408.08870 | translate | read | link |
| 2024-08-16 | Multimodal Relational Triple Extraction with Query-based Entity Object Transformer | Lei Hei et.al. | 2408.08709 | translate | read | null |
| 2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et.al. | 2408.08575 | translate | read | null |
| 2024-08-15 | 5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks | Dongshuo Yin et.al. | 2408.08345 | translate | read | link |
| 2024-08-15 | Learned Multimodal Compression for Autonomous Driving | Hadi Hadizadeh et.al. | 2408.08211 | translate | read | null |
| 2024-08-16 | OC3D: Weakly Supervised Outdoor 3D Object Detection with Only Coarse Click Annotation | Qiming Xia et.al. | 2408.08092 | translate | read | null |
| 2024-08-15 | CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection | Xunfa Lai et.al. | 2408.08050 | translate | read | null |
| 2024-08-15 | Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement | Wenxuan Li et.al. | 2408.07999 | translate | read | null |
| 2024-08-15 | GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Yutong Wang et.al. | 2408.07917 | translate | read | link |
| 2024-08-14 | See It All: Contextualized Late Aggregation for 3D Dense Captioning | Minjung Kim et.al. | 2408.07648 | translate | read | null |
| 2024-08-14 | Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Yuqing Wen et.al. | 2408.07605 | translate | read | null |
| 2024-08-14 | Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection | Zhonglin Chen et.al. | 2408.07455 | translate | read | null |
| 2024-08-14 | Sign language recognition based on deep learning and low-cost handcrafted descriptors | Alvaro Leandro Cavalcante Carneiro et.al. | 2408.07244 | translate | read | link |
| 2024-08-13 | Vision Language Model for Interpretable and Fine-grained Detection of Safety Compliance in Diverse Workplaces | Zhiling Chen et.al. | 2408.07146 | translate | read | null |
| 2024-08-13 | Divide and Conquer: Improving Multi-Camera 3D Perception with 2D Semantic-Depth Priors and Input-Dependent Queries | Qi Song et.al. | 2408.06901 | translate | read | null |
| 2024-08-13 | Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection | Matthias Bartolo et.al. | 2408.06803 | translate | read | link |
| 2024-08-13 | Exploring Domain Shift on Radar-Based 3D Object Detection Amidst Diverse Environmental Conditions | Miao Zhang et.al. | 2408.06772 | translate | read | null |
| 2024-08-13 | Unified-IoU: For High-Quality Object Detection | Xiangjie Luo et.al. | 2408.06636 | translate | read | link |
| 2024-08-13 | A lightweight YOLOv5-FFM model for occlusion pedestrian detection | Xiangjie Luo et.al. | 2408.06633 | translate | read | null |
| 2024-08-13 | MV-DETR: Multi-modality indoor object detection by Multi-View DEtecton TRansformers | Zichao Dong et.al. | 2408.06604 | translate | read | null |
| 2024-08-12 | Latent Disentanglement for Low Light Image Enhancement | Zhihao Zheng et.al. | 2408.06245 | translate | read | null |
| 2024-08-12 | MR3D-Net: Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion for LiDAR-Based Collective Perception | Sven Teufel et.al. | 2408.06137 | translate | read | link |
| 2024-08-12 | DPDETR: Decoupled Position Detection Transformer for Infrared-Visible Object Detection | Junjie Guo et.al. | 2408.06123 | translate | read | null |
| 2024-08-12 | Optimizing Vision Transformers with Data-Free Knowledge Transfer | Gousia Habib et.al. | 2408.05952 | translate | read | null |
| 2024-08-12 | MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection | Zitian Wang et.al. | 2408.05945 | translate | read | null |
| 2024-08-12 | Multi-scale Contrastive Adaptor Learning for Segmenting Anything in Underperformed Scenes | Ke Zhou et.al. | 2408.05936 | translate | read | null |
| 2024-08-12 | Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts | Peng Wu et.al. | 2408.05905 | translate | read | null |
| 2024-08-12 | Toward Pedestrian Head Tracking: A Benchmark Dataset and an Information Fusion Network | Kailai Sun et.al. | 2408.05877 | translate | read | null |
| 2024-08-11 | U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training | Zhuoyan Liu et.al. | 2408.05780 | translate | read | link |
| 2024-08-11 | FADE: A Dataset for Detecting Falling Objects around Buildings in Video | Zhigang Tu et.al. | 2408.05750 | translate | read | null |
| 2024-08-09 | DeepInteraction++: Multi-Modality Interaction for Autonomous Driving | Zeyu Yang et.al. | 2408.05075 | translate | read | link |
| 2024-08-09 | RadarPillars: Efficient Object Detection from 4D Radar Point Clouds | Alexander Musiat et.al. | 2408.05020 | translate | read | null |
| 2024-08-09 | Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation | Yifan Feng et.al. | 2408.04804 | translate | read | link |
| 2024-08-08 | SOD-YOLOv8 – Enhancing YOLOv8 for Small Object Detection in Traffic Scenes | Boshra Khalili et.al. | 2408.04786 | translate | read | null |
| 2024-08-08 | Data-Driven Pixel Control: Challenges and Prospects | Saurabh Farkya et.al. | 2408.04767 | translate | read | null |
| 2024-08-10 | SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More | Tianrun Chen et.al. | 2408.04579 | translate | read | null |
| 2024-08-07 | Impact Analysis of Data Drift Towards The Development of Safety-Critical Automotive System | Md Shahi Amran Hossain et.al. | 2408.04476 | translate | read | null |
| 2024-08-08 | Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework | Subhasis Dasgupta et.al. | 2408.04360 | translate | read | null |
| 2024-08-08 | Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection | Shixuan Gao et.al. | 2408.04326 | translate | read | null |
| 2024-08-08 | LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection | Mervat Abassy et.al. | 2408.04284 | translate | read | null |
| 2024-08-08 | Learning to Rewrite: Generalized LLM-Generated Text Detection | Wei Hao et.al. | 2408.04237 | translate | read | null |
| 2024-08-07 | PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI Estimation | Blessing Agyei Kyem et.al. | 2408.04110 | translate | read | link |
| 2024-08-07 | Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection | Christian Fruhwirth-Reisinger et.al. | 2408.03790 | translate | read | null |
| 2024-08-07 | Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model | Guoqing Zhu et.al. | 2408.03748 | translate | read | link |
| 2024-08-07 | CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Tianfang Zhang et.al. | 2408.03703 | translate | read | link |
| 2024-08-07 | L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection | Xun Huang et.al. | 2408.03677 | translate | read | null |
| 2024-08-07 | Designing Extremely Memory-Efficient CNNs for On-device Vision Tasks | Jaewook Lee et.al. | 2408.03663 | translate | read | null |
| 2024-08-07 | Leveraging LLMs for Enhanced Open-Vocabulary 3D Scene Understanding in Autonomous Driving | Amirhosein Chahe et.al. | 2408.03516 | translate | read | null |
| 2024-08-07 | GUI Element Detection Using SOTA YOLO Deep Learning Models | Seyed Shayan Daneshvar et.al. | 2408.03507 | translate | read | null |
| 2024-08-06 | AI Foundation Models in Remote Sensing: A Survey | Siqi Lu et.al. | 2408.03464 | translate | read | null |
| 2024-08-06 | Biomedical Image Segmentation: A Systematic Literature Review of Deep Learning Based Object Detection Methods | Fazli Wahid et.al. | 2408.03393 | translate | read | null |
| 2024-08-06 | Nighttime Pedestrian Detection Based on Fore-Background Contrast Learning | He Yao et.al. | 2408.03030 | translate | read | null |
| 2024-08-06 | Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection | Sen Nie et.al. | 2408.02891 | translate | read | null |
| 2024-08-05 | HQOD: Harmonious Quantization for Object Detection | Long Huang et.al. | 2408.02561 | translate | read | null |
| 2024-08-05 | Tensorial template matching for fast cross-correlation with rotations and its application for tomography | Antonio Martinez-Sanchez et.al. | 2408.02398 | translate | read | null |
| 2024-08-05 | Mixture-of-Noises Enhanced Forgery-Aware Predictor for Multi-Face Manipulation Detection and Localization | Changtao Miao et.al. | 2408.02306 | translate | read | null |
| 2024-08-05 | AssemAI: Interpretable Image-Based Anomaly Detection for Manufacturing Pipelines | Renjith Prasad et.al. | 2408.02181 | translate | read | null |
| 2024-08-04 | KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving | Zhihao Lai et.al. | 2408.02088 | translate | read | null |
| 2024-08-06 | A Survey and Evaluation of Adversarial Attacks for Object Detection | Khoi Nguyen Tiet Nguyen et.al. | 2408.01934 | translate | read | null |
| 2024-08-04 | CAF-YOLO: A Robust Framework for Multi-Scale Lesion Detection in Biomedical Imagery | Zilin Chen et.al. | 2408.01897 | translate | read | null |
| 2024-08-03 | Supervised Image Translation from Visible to Infrared Domain for Object Detection | Prahlad Anand et.al. | 2408.01843 | translate | read | null |
| 2024-08-03 | Domain penalisation for improved Out-of-Distribution Generalisation | Shuvam Jena et.al. | 2408.01746 | translate | read | null |
| 2024-08-03 | LAM3D: Leveraging Attention for Monocular 3D Object Detection | Diana-Alexandra Sas et.al. | 2408.01739 | translate | read | null |
| 2024-08-02 | A Robotics-Inspired Scanpath Model Reveals the Importance of Uncertainty and Semantic Object Cues for Gaze Guidance in Dynamic Scenes | Vito Mengers et.al. | 2408.01322 | translate | read | null |
| 2024-08-02 | Underwater Object Detection Enhancement via Channel Stabilization | Muhammad Ali et.al. | 2408.01293 | translate | read | null |
| 2024-08-02 | PGNeXt: High-Resolution Salient Object Detection via Pyramid Grafting Network | Changqun Xia et.al. | 2408.01137 | translate | read | null |
| 2024-08-02 | Effect of Fog Particle Size Distribution on 3D Object Detection Under Adverse Weather Conditions | Ajinkya Shinde et.al. | 2408.01085 | translate | read | null |
| 2024-08-02 | Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model | Yang Jin et.al. | 2408.01044 | translate | read | null |
| 2024-08-02 | MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection | Xiangbo Gao et.al. | 2408.01037 | translate | read | null |
| 2024-08-02 | Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Yabin Zhu et.al. | 2408.00969 | translate | read | null |
| 2024-08-01 | Joint Neural Networks for One-shot Object Recognition and Detection | Camilo J. Vargas et.al. | 2408.00701 | translate | read | null |
| 2024-08-01 | Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection | Ruiyang Zhang et.al. | 2408.00619 | translate | read | null |
| 2024-08-01 | U2UData: A Large-scale Cooperative Perception Dataset for Swarm UAVs Autonomous Flight | Tongtong Feng et.al. | 2408.00606 | translate | read | null |
| 2024-08-01 | MUFASA: Multi-View Fusion and Adaptation Network with Spatial Awareness for Radar Object Detection | Xiangyuan Peng et.al. | 2408.00565 | translate | read | null |
| 2024-08-01 | Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval | Gangyan Zeng et.al. | 2408.00441 | translate | read | null |
| 2024-08-01 | MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection | Youjia Fu et.al. | 2408.00438 | translate | read | null |
| 2024-08-01 | DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training | Yu Xie et.al. | 2408.00355 | translate | read | null |
| 2024-08-01 | A Simple Background Augmentation Method for Object Detection with Diffusion Model | Yuhang Li et.al. | 2408.00350 | translate | read | null |
| 2024-08-01 | Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object Detection | Jiacheng Deng et.al. | 2408.00286 | translate | read | null |
| 2024-08-01 | RoCo:Robust Collaborative Perception By Iterative Object Matching and Pose Adjustment | Zhe Huang et.al. | 2408.00257 | translate | read | null |
(<a href=../Object_Detection.md>back to Object Detection</a>)