Object Detection - 2024-09
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2024-09-30 | Accelerating Non-Maximum Suppression: A Graph Theory Perspective | King-Siong Si et.al. | 2409.20520 | translate | read | link |
| 2024-09-30 | NUTRIVISION: A System for Automatic Diet Management in Smart Healthcare | Madhumita Veeramreddy et.al. | 2409.20508 | translate | read | null |
| 2024-09-30 | Navigating Threats: A Survey of Physical Adversarial Attacks on LiDAR Perception Systems in Autonomous Vehicles | Amira Guesmi et.al. | 2409.20426 | translate | read | null |
| 2024-09-30 | Training a Computer Vision Model for Commercial Bakeries with Primarily Synthetic Images | Thomas H. Schmitt et.al. | 2409.20122 | translate | read | null |
| 2024-09-30 | GearTrack: Automating 6D Pose Estimation | Yu Deng et.al. | 2409.19986 | translate | read | null |
| 2024-09-30 | TSdetector: Temporal-Spatial Self-correction Collaborative Learning for Colonoscopy Video Detection | Kaini Wang et.al. | 2409.19983 | translate | read | null |
| 2024-09-30 | DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction | Zhen Yang et.al. | 2409.19972 | translate | read | link |
| 2024-09-30 | HazyDet: Open-source Benchmark for Drone-view Object Detection with Depth-cues in Hazy Scenes | Changfeng Feng et.al. | 2409.19833 | translate | read | link |
| 2024-09-29 | Applying the Lower-Biased Teacher Model in Semi-Suepervised Object Detection | Shuang Wang et.al. | 2409.19703 | translate | read | null |
| 2024-09-29 | OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images | Jiaqi Zhao et.al. | 2409.19648 | translate | read | link |
| 2024-09-27 | Spectral Wavelet Dropout: Regularization in the Wavelet Domain | Rinor Cakaj et.al. | 2409.18951 | translate | read | null |
| 2024-09-27 | MCUBench: A Benchmark of Tiny Object Detectors on MCUs | Sudhakar Sah et.al. | 2409.18866 | translate | read | link |
| 2024-09-27 | A Novel Unified Architecture for Low-Shot Counting by Detection and Segmentation | Jer Pelhan et.al. | 2409.18686 | translate | read | link |
| 2024-09-27 | Query matching for spatio-temporal action detection with query-based object detector | Shimon Hori et.al. | 2409.18408 | translate | read | null |
| 2024-09-26 | Efficient Microscopic Image Instance Segmentation for Food Crystal Quality Control | Xiaoyu Ji et.al. | 2409.18291 | translate | read | null |
| 2024-09-26 | Advancing Object Detection in Transportation with Multimodal Large Language Models (MLLMs): A Comprehensive Review and Empirical Testing | Huthaifa I. Ashqar et.al. | 2409.18286 | translate | read | null |
| 2024-09-26 | GSON: A Group-based Social Navigation Framework with Large Multimodal Model | Shangyi Luo et.al. | 2409.18084 | translate | read | null |
| 2024-09-27 | A New Dataset for Monocular Depth Estimation Under Viewpoint Shifts | Aurel Pjetri et.al. | 2409.17851 | translate | read | null |
| 2024-09-26 | Scene Understanding in Pick-and-Place Tasks: Analyzing Transformations Between Initial and Final Scenes | Seraj Ghasemi et.al. | 2409.17720 | translate | read | null |
| 2024-09-26 | SLO-Aware Task Offloading within Collaborative Vehicle Platoons | Boris Sedlak et.al. | 2409.17667 | translate | read | null |
| 2024-09-26 | CAMOT: Camera Angle-aware Multi-Object Tracking | Felix Limanta et.al. | 2409.17533 | translate | read | null |
| 2024-09-25 | Transient Adversarial 3D Projection Attacks on Object Detection in Autonomous Driving | Ce Zhou et.al. | 2409.17403 | translate | read | null |
| 2024-09-25 | AgRegNet: A Deep Regression Network for Flower and Fruit Density Estimation, Localization, and Counting in Orchards | Uddhav Bhattarai et.al. | 2409.17400 | translate | read | null |
| 2024-09-25 | Energy-Efficient & Real-Time Computer Vision with Intelligent Skipping via Reconfigurable CMOS Image Sensors | Md Abdullah-Al Kaiser et.al. | 2409.17341 | translate | read | null |
| 2024-09-25 | BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices | Yongqi Xu et.al. | 2409.17093 | translate | read | link |
| 2024-09-25 | EventHDR: from Event to High-Speed HDR Videos and Beyond | Yunhao Zou et.al. | 2409.17029 | translate | read | null |
| 2024-09-25 | Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection | Xu Han et.al. | 2409.16827 | translate | read | null |
| 2024-09-25 | XAI-guided Insulator Anomaly Detection for Imbalanced Datasets | Maximilian Andreas Hoefler et.al. | 2409.16821 | translate | read | null |
| 2024-09-25 | Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera | Xu Han et.al. | 2409.16820 | translate | read | null |
| 2024-09-25 | Benchmarking Deep Learning Models for Object Detection on Edge Computing Devices | Daghash K. Alqahtani et.al. | 2409.16808 | translate | read | null |
| 2024-09-25 | Pix2Next: Leveraging Vision Foundation Models for RGB to NIR Image Translation | Youngwan Jin et.al. | 2409.16706 | translate | read | link |
| 2024-09-25 | TSBP: Improving Object Detection in Histology Images via Test-time Self-guided Bounding-box Propagation | Tingting Yang et.al. | 2409.16678 | translate | read | link |
| 2024-09-25 | Source-Free Domain Adaptation for YOLO Object Detection | Simon Varailhon et.al. | 2409.16538 | translate | read | null |
| 2024-09-24 | Real-Time Detection of Electronic Components in Waste Printed Circuit Boards: A Transformer-Based Approach | Muhammad Mohsin et.al. | 2409.16496 | translate | read | null |
| 2024-09-24 | Tiny Robotics Dataset and Benchmark for Continual Object Detection | Francesco Pasti et.al. | 2409.16215 | translate | read | link |
| 2024-09-24 | Seeing Faces in Things: A Model and Dataset for Pareidolia | Mark Hamilton et.al. | 2409.16143 | translate | read | null |
| 2024-09-24 | HA-FGOVD: Highlighting Fine-grained Attributes via Explicit Linear Composition for Open-Vocabulary Object Detection | Yuqi Ma et.al. | 2409.16136 | translate | read | null |
| 2024-09-24 | Neuromorphic Drone Detection: an Event-RGB Multimodal Approach | Gabriele Magrini et.al. | 2409.16099 | translate | read | null |
| 2024-09-24 | Open-World Object Detection with Instance Representation Learning | Sunoh Lee et.al. | 2409.16073 | translate | read | null |
| 2024-09-24 | Towards Robust Object Detection: Identifying and Removing Backdoors via Module Inconsistency Analysis | Xianda Zhang et.al. | 2409.16057 | translate | read | null |
| 2024-09-24 | Zero-Shot Detection of AI-Generated Images | Davide Cozzolino et.al. | 2409.15875 | translate | read | null |
| 2024-09-24 | Automated Assessment of Multimodal Answer Sheets in the STEM domain | Rajlaxmi Patil et.al. | 2409.15749 | translate | read | null |
| 2024-09-24 | Real-Time Pedestrian Detection on IoT Edge Devices: A Lightweight Deep Learning Approach | Muhammad Dany Alfikri et.al. | 2409.15740 | translate | read | null |
| 2024-09-24 | PDT: Uav Target Detection Dataset for Pests and Diseases Tree | Mingle Zhou et.al. | 2409.15679 | translate | read | link |
| 2024-09-18 | Applications of Knowledge Distillation in Remote Sensing: A Survey | Yassine Himeur et.al. | 2409.12111 | translate | read | null |
| 2024-09-18 | Agglomerative Token Clustering | Joakim Bruslund Haurum et.al. | 2409.11923 | translate | read | link |
| 2024-09-18 | RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework | Xiaoyu Li et.al. | 2409.11749 | translate | read | null |
| 2024-09-17 | Open-Set Semantic Uncertainty Aware Metric-Semantic Graph Matching | Kurran Singh et.al. | 2409.11555 | translate | read | null |
| 2024-09-17 | VALO: A Versatile Anytime Framework for LiDAR-based Object Detection Deep Neural Networks | Ahmet Soyyigit et.al. | 2409.11542 | translate | read | link |
| 2024-09-17 | STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking | Jianbo Ma et.al. | 2409.11234 | translate | read | link |
| 2024-09-19 | Vision foundation models: can they be applied to astrophysics data? | E. Lastufka et.al. | 2409.11175 | translate | read | null |
| 2024-09-17 | UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2height | Zichen Yu et.al. | 2409.11160 | translate | read | null |
| 2024-09-17 | Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation | Rui Yu et.al. | 2409.11018 | translate | read | null |
| 2024-09-17 | TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection | Philip Jacobson et.al. | 2409.10901 | translate | read | null |
| 2024-09-18 | Context-Dependent Interactable Graphical User Interface Element Detection for Spatial Computing Applications | Shuqing Li et.al. | 2409.10811 | translate | read | null |
| 2024-09-16 | Online Learning via Memory: Retrieval-Augmented Detector Adaptation | Yanan Jian et.al. | 2409.10716 | translate | read | null |
| 2024-09-16 | CoMamba: Real-time Cooperative Perception Unlocked with State Space Models | Jinlong Li et.al. | 2409.10699 | translate | read | null |
| 2024-09-16 | Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot Navigation | Yifan Xu et.al. | 2409.10350 | translate | read | null |
| 2024-09-16 | Performance of Human Annotators in Object Detection and Segmentation of Remotely Sensed Data | Roni Blushtein-Livnon et.al. | 2409.10272 | translate | read | null |
| 2024-09-16 | Self-Updating Vehicle Monitoring Framework Employing Distributed Acoustic Sensing towards Real-World Settings | Xi Wang et.al. | 2409.10259 | translate | read | null |
| 2024-09-16 | DAE-Fuse: An Adaptive Discriminative Autoencoder for Multi-Modality Image Fusion | Yuchen Guo et.al. | 2409.10080 | translate | read | null |
| 2024-09-16 | Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation | Meng Chen et.al. | 2409.10071 | translate | read | link |
| 2024-09-16 | LithoHoD: A Litho Simulator-Powered Framework for IC Layout Hotspot Detection | Hao-Chiang Shao et.al. | 2409.10021 | translate | read | null |
| 2024-09-16 | Comprehensive Study on Sentiment Analysis: From Rule-based to modern LLM based system | Shailja Gupta et.al. | 2409.09989 | translate | read | null |
| 2024-09-15 | Tracking Virtual Meetings in the Wild: Re-identification in Multi-Participant Virtual Meetings | Oriel Perl et.al. | 2409.09841 | translate | read | null |
| 2024-09-15 | Template-based Multi-Domain Face Recognition | Anirudh Nanduri et.al. | 2409.09832 | translate | read | null |
| 2024-09-15 | PersonaMark: Personalized LLM watermarking for model protection and user attribution | Yuehan Zhang et.al. | 2409.09739 | translate | read | null |
| 2024-09-13 | Interactive Masked Image Modeling for Multimodal Object Detection in Remote Sensing | Minh-Duc Vu et.al. | 2409.08885 | translate | read | null |
| 2024-09-13 | Direct-CP: Directed Collaborative Perception for Connected and Autonomous Vehicles via Proactive Attention | Yihang Tao et.al. | 2409.08840 | translate | read | null |
| 2024-09-13 | RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision | Shuo Wang et.al. | 2409.08475 | translate | read | null |
| 2024-09-12 | X-ray Fluoroscopy Guided Localization and Steering of Medical Microrobots through Virtual Enhancement | Husnu Halid Alabay et.al. | 2409.08337 | translate | read | null |
| 2024-09-12 | What is YOLOv9: An In-Depth Exploration of the Internal Features of the Next-Generation Object Detector | Muhammad Yaseen et.al. | 2409.07813 | translate | read | null |
| 2024-09-11 | Object Depth and Size Estimation using Stereo-vision and Integration with SLAM | Layth Hamad et.al. | 2409.07623 | translate | read | null |
| 2024-09-11 | Zero-Shot Machine-Generated Text Detection Using Mixture of Large Language Models | Matthieu Dubois et.al. | 2409.07615 | translate | read | null |
| 2024-09-11 | ENACT: Entropy-based Clustering of Attention Input for Improving the Computational Performance of Object Detection Transformers | Giorgos Savathrakis et.al. | 2409.07541 | translate | read | link |
| 2024-09-11 | Watchlist Challenge: 3rd Open-set Face Detection and Identification | Furkan Kasım et.al. | 2409.07220 | translate | read | null |
| 2024-09-11 | SCLNet: A Scale-Robust Complementary Learning Network for Object Detection in UAV Images | Xuexue Li et.al. | 2409.07024 | translate | read | null |
| 2024-09-11 | ODYSSEE: Oyster Detection Yielded by Sensor Systems on Edge Electronics | Xiaomin Lin et.al. | 2409.07003 | translate | read | null |
| 2024-09-11 | Brain-Inspired Stepwise Patch Merging for Vision Transformers | Yonghao Yu et.al. | 2409.06963 | translate | read | null |
| 2024-09-10 | Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds | Mu Cai et.al. | 2409.06827 | translate | read | link |
| 2024-09-10 | Technical Report of Mobile Manipulator Robot for Industrial Environments | Erfan Amoozad Khalili et.al. | 2409.06693 | translate | read | null |
| 2024-09-10 | A comprehensive study on Blood Cancer detection and classification using Convolutional Neural Network | Md Taimur Ahad et.al. | 2409.06689 | translate | read | null |
| 2024-09-10 | When to Extract ReID Features: A Selective Approach for Improved Multiple Object Tracking | Emirhan Bayar et.al. | 2409.06617 | translate | read | link |
| 2024-09-10 | Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception | Xiang Zhang et.al. | 2409.06584 | translate | read | null |
| 2024-09-10 | Semi-Supervised 3D Object Detection with Chanel Augmentation using Transformation Equivariance | Minju Kang et.al. | 2409.06583 | translate | read | null |
| 2024-09-10 | Knowledge Distillation via Query Selection for Detection Transformer | Yi Liu et.al. | 2409.06443 | translate | read | null |
| 2024-09-10 | An Attribute-Enriched Dataset and Auto-Annotated Pipeline for Open Detection | Pengfei Qi et.al. | 2409.06300 | translate | read | null |
| 2024-09-09 | Replay Consolidation with Label Propagation for Continual Object Detection | Riccardo De Monte et.al. | 2409.05650 | translate | read | null |
| 2024-09-09 | Renormalized Connection for Scale-preferred Object Detection in Satellite Imagery | Fan Zhang et.al. | 2409.05624 | translate | read | null |
| 2024-09-09 | LEROjD: Lidar Extended Radar-Only Object Detection | Patrick Palmer et.al. | 2409.05564 | translate | read | link |
| 2024-09-09 | Proto-OOD: Enhancing OOD Object Detection with Prototype Feature Similarity | Junkun Chen et.al. | 2409.05466 | translate | read | null |
| 2024-09-09 | Distribution Discrepancy and Feature Heterogeneity for Active 3D Object Detection | Huang-Yu Chen et.al. | 2409.05425 | translate | read | null |
| 2024-09-08 | A Low-Computational Video Synopsis Framework with a Standard Dataset | Ramtin Malekpour et.al. | 2409.05230 | translate | read | link |
| 2024-09-08 | Can OOD Object Detectors Learn from Foundation Models? | Jiahui Liu et.al. | 2409.05162 | translate | read | link |
| 2024-09-08 | WaterSeeker: Efficient Detection of Watermarked Segments in Large Documents | Leyi Pan et.al. | 2409.05112 | translate | read | null |
| 2024-09-08 | Visual Grounding with Multi-modal Conditional Adaptation | Ruilin Yao et.al. | 2409.04999 | translate | read | link |
| 2024-09-08 | Multi-V2X: A Large Scale Multi-modal Multi-penetration-rate Dataset for Cooperative Perception | Rongsong Li et.al. | 2409.04980 | translate | read | null |
| 2024-09-06 | Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences | Rui Yu et.al. | 2409.04390 | translate | read | null |
| 2024-09-06 | UniDet3D: Multi-dataset Indoor 3D Object Detection | Maksim Kolodiazhnyi et.al. | 2409.04234 | translate | read | link |
| 2024-09-06 | Feature Compression for Cloud-Edge Multimodal 3D Object Detection | Chongzhen Tian et.al. | 2409.04123 | translate | read | null |
| 2024-09-06 | D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection | Kentaro Hirahara et.al. | 2409.04060 | translate | read | null |
| 2024-09-06 | BFA-YOLO: Balanced multiscale object detection network for multi-view building facade attachments detection | Yangguang Chen et.al. | 2409.04025 | translate | read | null |
| 2024-09-05 | LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones | Moritz Nottebaum et.al. | 2409.03460 | translate | read | link |
| 2024-09-05 | Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications | Tong Bu et.al. | 2409.03368 | translate | read | null |
| 2024-09-05 | YOLO-PPA based Efficient Traffic Sign Detection for Cruise Control in Autonomous Driving | Jingyu Zhang et.al. | 2409.03320 | translate | read | null |
| 2024-09-05 | Gr-IoU: Ground-Intersection over Union for Robust Multi-Object Tracking with 3D Geometric Constraints | Keisuke Toida et.al. | 2409.03252 | translate | read | null |
| 2024-09-04 | Boundless: Generating Photorealistic Synthetic Data for Object Detection in Urban Streetscapes | Mehmet Kerem Turkcan et.al. | 2409.03022 | translate | read | link |
| 2024-09-04 | Real-Time Dynamic Scale-Aware Fusion Detection Network: Take Road Damage Detection as an example | Weichao Pan et.al. | 2409.02546 | translate | read | null |
| 2024-09-04 | TP-GMOT: Tracking Generic Multiple Object by Textual Prompt with Motion-Appearance Cost (MAC) SORT | Duy Le Dinh Anh et.al. | 2409.02490 | translate | read | link |
| 2024-09-04 | Rapid Automatic Multiple Moving Objects Detection Method Based on Feature Extraction from Images with Non-sidereal Tracking | Lei Wang et.al. | 2409.02405 | translate | read | null |
| 2024-09-04 | Pluralistic Salient Object Detection | Xuelu Feng et.al. | 2409.02368 | translate | read | null |
| 2024-09-03 | Site Selection for the Second Flyeye Telescope: A Simulation Study for Optimizing Near-Earth Object Discovery | D. Föhring et.al. | 2409.02329 | translate | read | null |
| 2024-09-03 | K-Origins: Better Colour Quantification for Neural Networks | Lewis Mason et.al. | 2409.02281 | translate | read | null |
| 2024-09-03 | Evaluation and Comparison of Visual Language Models for Transportation Engineering Problems | Sanjita Prajapati et.al. | 2409.02278 | translate | read | null |
| 2024-09-03 | A Modern Take on Visual Relationship Reasoning for Grasp Planning | Paolo Rabino et.al. | 2409.02035 | translate | read | null |
| 2024-09-03 | Latent Distillation for Continual Object Detection at the Edge | Francesco Pasti et.al. | 2409.01872 | translate | read | link |
| 2024-09-03 | Real-Time Indoor Object Detection based on hybrid CNN-Transformer Approach | Salah Eddine Laidoudi et.al. | 2409.01871 | translate | read | null |
(<a href=../Object_Detection.md>back to Object Detection</a>)