Object Detection - 2024-10

| Publish Date | Title | Authors | PDF | Translate | Read | Code |
| --- | --- | --- | --- | --- | --- | --- |
| 2024-10-31 | ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images | Timing Yang et al. | 2410.24001 | translate | read | link |
| 2024-10-31 | Localization, balance and affinity: a stronger multifaceted collaborative salient object detector in remote sensing images | Yakun Xie et al. | 2410.23991 | translate | read | null |
| 2024-10-31 | Uncertainty Estimation for 3D Object Detection via Evidential Learning | Nikita Durasov et al. | 2410.23910 | translate | read | null |
| 2024-10-31 | From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots | Vasileios Tzouras et al. | 2410.23906 | translate | read | null |
| 2024-10-31 | Open-Set 3D object detection in LiDAR data as an Out-of-Distribution problem | Louis Soum-Fontez et al. | 2410.23767 | translate | read | null |
| 2024-10-31 | DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios | Junchao Wu et al. | 2410.23746 | translate | read | link |
| 2024-10-31 | GigaCheck: Detecting LLM-generated Content | Irina Tolstykh et al. | 2410.23728 | translate | read | null |
| 2024-10-31 | Context-Aware Token Selection and Packing for Enhanced Vision Transformer | Tianyi Zhang et al. | 2410.23608 | translate | read | null |
| 2024-10-30 | EMMA: End-to-End Multimodal Model for Autonomous Driving | Jyh-Jing Hwang et al. | 2410.23262 | translate | read | null |
| 2024-10-30 | S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving | Maciej K. Wozniak et al. | 2410.23085 | translate | read | null |
| 2024-10-30 | First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024 | Tengfei Zhang et al. | 2410.23077 | translate | read | null |
| 2024-10-30 | AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection | Yujin Wang et al. | 2410.22939 | translate | read | null |
| 2024-10-30 | YOLOv11 for Vehicle Detection: Advancements, Performance, and Applications in Intelligent Transportation Systems | Mujadded Al Rabbani Alif et al. | 2410.22898 | translate | read | null |
| 2024-10-29 | Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection | Gyusam Chang et al. | 2410.22461 | translate | read | null |
| 2024-10-29 | Lighten CARAFE: Dynamic Lightweight Upsampling with Guided Reassemble Kernels | Ruigang Fu et al. | 2410.22139 | translate | read | link |
| 2024-10-29 | Data Generation for Hardware-Friendly Post-Training Quantization | Lior Dikstein et al. | 2410.22110 | translate | read | null |
| 2024-10-29 | Cognitive Semantic Augmentation LEO Satellite Networks for Earth Observation | Hong-fu Chou et al. | 2410.21916 | translate | read | null |
| 2024-10-29 | PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices | Ming Kang et al. | 2410.21822 | translate | read | link |
| 2024-10-28 | MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps | Yating Xu et al. | 2410.21566 | translate | read | link |
| 2024-10-28 | TACO: Adversarial Camouflage Optimization on Trucks to Fool Object Detectors | Adonisz Dimitriu et al. | 2410.21443 | translate | read | null |
| 2024-10-28 | Joint Audio-Visual Idling Vehicle Detection with Streamlined Input Dependencies | Xiwen Li et al. | 2410.21170 | translate | read | null |
| 2024-10-28 | Synthetica: Large Scale Synthetic Data for Robot Perception | Ritvik Singh et al. | 2410.21153 | translate | read | null |
| 2024-10-28 | DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning | Xun Guo et al. | 2410.20964 | translate | read | link |
| 2024-10-28 | IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks | Manjunath D et al. | 2410.20953 | translate | read | null |
| 2024-10-28 | SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity | Kunyun Wang et al. | 2410.20790 | translate | read | null |
| 2024-10-27 | Sebica: Lightweight Spatial and Efficient Bidirectional Channel Attention Super Resolution Network | Chongxiao Liu et al. | 2410.20546 | translate | read | null |
| 2024-10-27 | Guidance Disentanglement Network for Optics-Guided Thermal UAV Image Super-Resolution | Zhicheng Zhao et al. | 2410.20466 | translate | read | link |
| 2024-10-27 | Open-Vocabulary Object Detection via Language Hierarchy | Jiaxing Huang et al. | 2410.20371 | translate | read | null |
| 2024-10-27 | Historical Test-time Prompt Tuning for Vision Foundation Models | Jingyi Zhang et al. | 2410.20346 | translate | read | null |
| 2024-10-25 | OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery | Philipe Dias et al. | 2410.19965 | translate | read | null |
| 2024-10-25 | MetaTrading: An Immersion-Aware Model Trading Framework for Vehicular Metaverse Services | Hongjia Wu et al. | 2410.19665 | translate | read | null |
| 2024-10-25 | Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models | Shenghao Fu et al. | 2410.19635 | translate | read | null |
| 2024-10-25 | MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors | Fanqi Pu et al. | 2410.19590 | translate | read | link |
| 2024-10-25 | DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems | Muhammad Zaeem Shahzad et al. | 2410.19336 | translate | read | null |
| 2024-10-25 | In-Simulation Testing of Deep Learning Vision Models in Autonomous Robotic Manipulators | Dmytro Humeniuk et al. | 2410.19277 | translate | read | null |
| 2024-10-24 | HUE Dataset: High-Resolution Event and Frame Sequences for Low-Light Vision | Burak Ercan et al. | 2410.19164 | translate | read | null |
| 2024-10-24 | Optimizing Edge Offloading Decisions for Object Detection | Jiaming Qiu et al. | 2410.18919 | translate | read | link |
| 2024-10-24 | You Only Look Around: Learning Illumination Invariant Feature for Low-light Object Detection | Mingbo Hong et al. | 2410.18398 | translate | read | null |
| 2024-10-24 | Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images | Dong-Guw Lee et al. | 2410.18340 | translate | read | link |
| 2024-10-23 | KhmerST: A Low-Resource Khmer Scene Text Detection and Recognition Benchmark | Vannkinh Nom et al. | 2410.18277 | translate | read | null |
| 2024-10-23 | Automated Defect Detection and Grading of Piarom Dates Using Deep Learning | Nasrin Azimi et al. | 2410.18208 | translate | read | null |
| 2024-10-23 | DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection | Qingpeng Li et al. | 2410.17822 | translate | read | link |
| 2024-10-23 | YOLO-Vehicle-Pro: A Cloud-Edge Collaborative Framework for Object Detection in Autonomous Driving under Adverse Weather Conditions | Xiguang Li et al. | 2410.17734 | translate | read | null |
| 2024-10-23 | YOLOv11: An Overview of the Key Architectural Enhancements | Rahima Khanam et al. | 2410.17725 | translate | read | null |
| 2024-10-23 | PlantCamo: Plant Camouflage Detection | Jinyu Yang et al. | 2410.17598 | translate | read | link |
| 2024-10-23 | OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking | Haiji Liang et al. | 2410.17534 | translate | read | link |
| 2024-10-22 | EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding | Zhiyi Pan et al. | 2410.17207 | translate | read | null |
| 2024-10-22 | YOLO-TS: Real-Time Traffic Sign Detection with Enhanced Accuracy Using Optimized Receptive Fields and Anchor-Free Fusion | Junzhou Chen et al. | 2410.17144 | translate | read | null |
| 2024-10-22 | FlightAR: AR Flight Assistance Interface with Multiple Video Streams and Object Detection Aimed at Immersive Drone Control | Oleg Sautenkov et al. | 2410.16943 | translate | read | null |
| 2024-10-22 | AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models | Yongjian Wu et al. | 2410.16820 | translate | read | link |
| 2024-10-22 | DSORT-MCU: Detecting Small Objects in Real-Time on Microcontroller Units | Liam Boyle et al. | 2410.16769 | translate | read | null |
| 2024-10-22 | DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Zhixiong Nan et al. | 2410.16707 | translate | read | null |
| 2024-10-22 | Fire and Smoke Detection with Burning Intensity Representation | Xiaoyi Han et al. | 2410.16642 | translate | read | link |
| 2024-10-21 | Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models | Yufei Zhan et al. | 2410.16163 | translate | read | link |
| 2024-10-21 | Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data | Nikos Sakellariou et al. | 2410.16089 | translate | read | null |
| 2024-10-21 | Few-shot target-driven instance detection based on open-vocabulary object detection models | Ben Crulis et al. | 2410.16028 | translate | read | null |
| 2024-10-21 | How Important are Data Augmentations to Close the Domain Gap for Object Detection in Orbit? | Maximilian Ulmer et al. | 2410.15766 | translate | read | null |
| 2024-10-21 | P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving | Mohamed R. Elshamy et al. | 2410.15602 | translate | read | null |
| 2024-10-21 | Deep Learning and Machine Learning – Object Detection and Semantic Segmentation: From Theory to Applications | Jintao Ren et al. | 2410.15584 | translate | read | null |
| 2024-10-21 | Online Pseudo-Label Unified Object Detection for Multiple Datasets Training | XiaoJun Tang et al. | 2410.15569 | translate | read | null |
| 2024-10-20 | TrackMe: A Simple and Effective Multiple Object Tracking Annotation Tool | Thinh Phan et al. | 2410.15518 | translate | read | null |
| 2024-10-20 | YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary | Hao-Tang Tsui et al. | 2410.15346 | translate | read | null |
| 2024-10-20 | Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability | Yusuke Hosoya et al. | 2410.15315 | translate | read | null |
| 2024-10-18 | MultiOrg: A Multi-rater Organoid-detection Dataset | Christina Bukas et al. | 2410.14612 | translate | read | null |
| 2024-10-18 | Beyond Binary: Towards Fine-Grained LLM-Generated Text Detection via Role Recognition and Involvement Measurement | Zihao Cheng et al. | 2410.14259 | translate | read | null |
| 2024-10-18 | Multi-Source Spatial Knowledge Understanding for Immersive Visual Text-to-Speech | Shuwei He et al. | 2410.14101 | translate | read | link |
| 2024-10-18 | Enhancing In-vehicle Multiple Object Tracking Systems with Embeddable Ising Machines | Kosuke Tatsumura et al. | 2410.14093 | translate | read | null |
| 2024-10-17 | FaceSaliencyAug: Mitigating Geographic, Gender and Stereotypical Biases via Saliency-Based Data Augmentation | Teerath Kumar et al. | 2410.14070 | translate | read | null |
| 2024-10-17 | Spatiotemporal Object Detection for Improved Aerial Vehicle Detection in Traffic Monitoring | Kristina Telegraph et al. | 2410.13616 | translate | read | null |
| 2024-10-17 | RemoteDet-Mamba: A Hybrid Mamba-CNN Network for Multi-modal Object Detection in Remote Sensing Images | Kejun Ren et al. | 2410.13532 | translate | read | null |
| 2024-10-16 | Syn2Real Domain Generalization for Underwater Mine-like Object Detection Using Side-Scan Sonar | Aayush Agrawal et al. | 2410.12953 | translate | read | null |
| 2024-10-16 | MambaBEV: An efficient 3D detection model with Mamba2 | Zihan You et al. | 2410.12673 | translate | read | null |
| 2024-10-16 | On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs | Herun Wan et al. | 2410.12600 | translate | read | null |
| 2024-10-16 | Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion | Minkyoung Cho et al. | 2410.12592 | translate | read | null |
| 2024-10-16 | Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look | Yong Zhang et al. | 2410.12396 | translate | read | null |
| 2024-10-16 | Real-time Stereo-based 3D Object Detection for Streaming Perception | Changcai Li et al. | 2410.12394 | translate | read | link |
| 2024-10-16 | Context-Infused Visual Grounding for Art | Selina Khan et al. | 2410.12369 | translate | read | link |
| 2024-10-16 | Fusion from Decomposition: A Self-Supervised Approach for Image Fusion and Beyond | Pengwei Liang et al. | 2410.12274 | translate | read | null |
| 2024-10-16 | Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm | Guanming Huang et al. | 2410.12259 | translate | read | null |
| 2024-10-16 | SAM-Guided Masked Token Prediction for 3D Scene Understanding | Zhimin Chen et al. | 2410.12158 | translate | read | null |
| 2024-10-16 | Unveiling the Limits of Alignment: Multi-modal Dynamic Local Fusion Network and A Benchmark for Unaligned RGBT Video Object Detection | Qishun Wang et al. | 2410.12143 | translate | read | null |
| 2024-10-15 | Fractal Calibration for long-tailed object detection | Konstantinos Panagiotis Alexandridis et al. | 2410.11774 | translate | read | link |
| 2024-10-15 | POLO – Point-based, multi-class animal detection | Giacomo May et al. | 2410.11741 | translate | read | null |
| 2024-10-15 | YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection | Olalekan Akindele et al. | 2410.11727 | translate | read | null |
| 2024-10-15 | SeaDATE: Remedy Dual-Attention Transformer with Semantic Alignment via Contrast Learning for Multimodal Object Detection | Shuhan Dong et al. | 2410.11358 | translate | read | null |
| 2024-10-15 | Open World Object Detection: A Survey | Yiming Li et al. | 2410.11301 | translate | read | null |
| 2024-10-15 | Representation Similarity: A Better Guidance of DNN Layer Sharing for Edge Computing without Training | Bryan Bo Cao et al. | 2410.11233 | translate | read | null |
| 2024-10-15 | TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement | Zhiwei Lin et al. | 2410.11228 | translate | read | null |
| 2024-10-15 | CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction | Pranav Gupta et al. | 2410.11211 | translate | read | link |
| 2024-10-15 | Multiview Scene Graph | Juexiao Zhang et al. | 2410.11187 | translate | read | link |
| 2024-10-14 | UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles | Hui Ye et al. | 2410.11125 | translate | read | null |
| 2024-10-14 | ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection | Martin Aubard et al. | 2410.10554 | translate | read | link |
| 2024-10-14 | Learning to Ground VLMs without Forgetting | Aritra Bhowmik et al. | 2410.10491 | translate | read | null |
| 2024-10-14 | SMART-TRACK: A Novel Kalman Filter-Guided Sensor Fusion For Robust UAV Object Tracking in Dynamic Environments | Khaled Gabr et al. | 2410.10409 | translate | read | null |
| 2024-10-14 | V2M: Visual 2-Dimensional Mamba for Image Representation Learning | Chengkun Wang et al. | 2410.10382 | translate | read | link |
| 2024-10-14 | GlobalMamba: Global Image Serialization for Vision Mamba | Chengkun Wang et al. | 2410.10316 | translate | read | link |
| 2024-10-14 | ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object | Jiwei Chen et al. | 2410.10298 | translate | read | null |
| 2024-10-14 | Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object Detectors | Tao Lin et al. | 2410.10091 | translate | read | link |
| 2024-10-15 | Optimizing Waste Management with Advanced Object Detection for Garbage Classification | Everest Z. Kuang et al. | 2410.09975 | translate | read | null |
| 2024-10-13 | EITNet: An IoT-Enhanced Framework for Real-Time Basketball Action Recognition | Jingyu Liu et al. | 2410.09954 | translate | read | null |
| 2024-10-13 | LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond | Md Tanvir Islam et al. | 2410.09831 | translate | read | link |
| 2024-10-11 | DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection | Haochen Li et al. | 2410.09004 | translate | read | null |
| 2024-10-11 | LIME-Eval: Rethinking Low-light Image Enhancement Evaluation via Object Detection | Mingjia Li et al. | 2410.08810 | translate | read | null |
| 2024-10-11 | Hespi: A pipeline for automatically detecting information from herbarium specimen sheets | Robert Turnbull et al. | 2410.08740 | translate | read | null |
| 2024-10-11 | MMLF: Multi-modal Multi-class Late Fusion for Object Detection with Uncertainty Estimation | Qihang Yang et al. | 2410.08739 | translate | read | null |
| 2024-10-11 | Boosting Open-Vocabulary Object Detection by Handling Background Samples | Ruizhe Zeng et al. | 2410.08645 | translate | read | null |
| 2024-10-11 | DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention | Nguyen Huu Bao Long et al. | 2410.08582 | translate | read | link |
| 2024-10-11 | VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking | Zekun Qian et al. | 2410.08529 | translate | read | null |
| 2024-10-10 | Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? | Samir Abou Haidar et al. | 2410.08365 | translate | read | null |
| 2024-10-10 | PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection | Botao Ren et al. | 2410.08210 | translate | read | null |
| 2024-10-10 | Robust AI-Generated Text Detection by Restricted Embeddings | Kristian Kuznetsov et al. | 2410.08113 | translate | read | link |
| 2024-10-10 | Dynamic Object Catching with Quadruped Robot Front Legs | André Schakkal et al. | 2410.08065 | translate | read | null |
| 2024-10-10 | HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective | Pei Liu et al. | 2410.07758 | translate | read | null |
| 2024-10-10 | O1O: Grouping of Known Classes to Identify Unknown Objects as Odd-One-Out | Mısra Yavuz et al. | 2410.07514 | translate | read | null |
| 2024-10-09 | Progressive Multi-Modal Fusion for Robust 3D Object Detection | Rohit Mohan et al. | 2410.07475 | translate | read | null |
| 2024-10-09 | Self-Supervised Learning for Real-World Object Detection: a Survey | Alina Ciocarlan et al. | 2410.07442 | translate | read | null |
| 2024-10-09 | Robust infrared small target detection using self-supervised and a contrario paradigms | Alina Ciocarlan et al. | 2410.07437 | translate | read | null |
| 2024-10-09 | SurANet: Surrounding-Aware Network for Concealed Object Detection via Highly-Efficient Interactive Contrastive Learning Strategy | Yuhan Kang et al. | 2410.06842 | translate | read | link |
| 2024-10-09 | Rethinking the Evaluation of Visible and Infrared Image Fusion | Dayan Guan et al. | 2410.06811 | translate | read | link |
| 2024-10-09 | QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model | Fei Xie et al. | 2410.06806 | translate | read | null |
| 2024-10-09 | QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird’s-Eye-View Representation | Yuxin Li et al. | 2410.06516 | translate | read | null |
| 2024-10-08 | Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions | Mateus Karvat et al. | 2410.06380 | translate | read | link |
| 2024-10-08 | Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach | Sha Guo et al. | 2410.06149 | translate | read | null |
| 2024-10-08 | Training-free LLM-generated Text Detection by Mining Token Probability Sequences | Yihuai Xu et al. | 2410.06072 | translate | read | link |
| 2024-10-08 | Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts | Zhiwei Lin et al. | 2410.05963 | translate | read | null |
| 2024-10-08 | Learning Gaussian Data Augmentation in Feature Space for One-shot Object Detection in Manga | Takara Taniguchi et al. | 2410.05935 | translate | read | null |
| 2024-10-08 | Unobserved Object Detection using Generative Models | Subhransu S. Bhattacharjee et al. | 2410.05869 | translate | read | link |
| 2024-10-07 | Real-Time Truly-Coupled Lidar-Inertial Motion Correction and Spatiotemporal Dynamic Object Detection | Cedric Le Gentil et al. | 2410.05152 | translate | read | null |
| 2024-10-07 | Human-in-the-loop Reasoning For Traffic Sign Detection: Collaborative Approach Yolo With Video-llava | Mehdi Azarafza et al. | 2410.05096 | translate | read | null |
| 2024-10-07 | Improving Object Detection via Local-global Contrastive Learning | Danai Triantafyllidou et al. | 2410.05058 | translate | read | null |
| 2024-10-07 | Windshield Integration of Thermal and Color Fusion for Automatic Emergency Braking in Low Visibility Conditions | Gabriel Jobert et al. | 2410.04928 | translate | read | null |
| 2024-10-07 | Improved detection of discarded fish species through BoxAL active learning | Maria Sokolova et al. | 2410.04880 | translate | read | link |
| 2024-10-06 | Learning De-Biased Representations for Remote-Sensing Imagery | Zichen Tian et al. | 2410.04546 | translate | read | link |
| 2024-10-05 | AI as Humanity’s Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text | Ximing Lu et al. | 2410.04265 | translate | read | link |
| 2024-10-05 | ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments | Lorenzo Terenzi et al. | 2410.04250 | translate | read | null |
| 2024-10-05 | Fast Object Detection with a Machine Learning Edge Device | Richard C. Rodriguez et al. | 2410.04173 | translate | read | null |
| 2024-10-05 | Robust Task-Oriented Communication Framework for Real-Time Collaborative Vision Perception | Zhengru Fang et al. | 2410.04168 | translate | read | link |
| 2024-10-04 | DRAFTS: A Deep Learning-Based Radio Fast Transient Search Pipeline | Yong-Kun Zhang et al. | 2410.03200 | translate | read | null |
| 2024-10-03 | Is Your Paper Being Reviewed by an LLM? Investigating AI Text Detectability in Peer Review | Sungduk Yu et al. | 2410.03019 | translate | read | null |
| 2024-10-04 | Learning 3D Perception from Others’ Predictions | Jinsu Yoo et al. | 2410.02646 | translate | read | null |
| 2024-10-02 | Enhancing Screen Time Identification in Children with a Multi-View Vision Language Model and Screen Time Tracker | Xinlong Hou et al. | 2410.01966 | translate | read | null |
| 2024-10-02 | 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection | Yang Cao et al. | 2410.01647 | translate | read | link |
| 2024-10-02 | Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection | Hongru Yan et al. | 2410.01404 | translate | read | null |
| 2024-10-02 | Finetuning Pre-trained Model with Limited Data for LiDAR-based 3D Object Detection by Bridging Domain Gaps | Jiyun Jang et al. | 2410.01319 | translate | read | null |
| 2024-10-02 | Panopticus: Omnidirectional 3D Object Detection on Resource-constrained Edge Devices | Jeho Lee et al. | 2410.01270 | translate | read | null |
| 2024-10-02 | High and Low Resolution Tradeoffs in Roadside Multimodal Sensing | Shaozu Ding et al. | 2410.01250 | translate | read | link |
| 2024-10-02 | Perceptual Piercing: Human Visual Cue-based Object Detection in Low Visibility Conditions | Ashutosh Kumar et al. | 2410.01225 | translate | read | link |
| 2024-10-02 | A versatile machine learning workflow for high-throughput analysis of supported metal catalyst particles | Arda Genc et al. | 2410.01213 | translate | read | link |
| 2024-10-01 | Synthetic imagery for fuzzy object detection: A comparative study | Siavash H. Khajavi et al. | 2410.01124 | translate | read | null |
| 2024-10-01 | Generating Seamless Virtual Immunohistochemical Whole Slide Images with Content and Color Consistency | Sitong Liu et al. | 2410.01072 | translate | read | null |
| 2024-10-01 | ARPOV: Expanding Visualization of Object Detection in AR with Panoramic Mosaic Stitching | Erin McGowan et al. | 2410.01055 | translate | read | null |
