Object Detection - 2024-06
Object Detection - 2024-06
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-06-28 | Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood | Yang Xu et.al. | 2406.19874 | translate | read | link |
| 2024-06-28 | Basketball-SORT: An Association Method for Complex Multi-object Occlusion Problems in Basketball Multi-object Tracking | Qingrui Hu et.al. | 2406.19655 | translate | read | null |
| 2024-06-27 | Robustness Testing of Black-Box Models Against CT Degradation Through Test-Time Augmentation | Jack Highton et.al. | 2406.19557 | translate | read | null |
| 2024-06-27 | BOrg: A Brain Organoid-Based Mitosis Dataset for Automatic Analysis of Brain Diseases | Muhammad Awais et.al. | 2406.19556 | translate | read | link |
| 2024-06-27 | Weighted Circle Fusion: Ensembling Circle Representation from Different Object Detection Results | Jialin Yue et.al. | 2406.19540 | translate | read | null |
| 2024-06-27 | Stereo Vision Based Robot for Remote Monitoring with VR Support | Mohamed Fazil M. S. et.al. | 2406.19498 | translate | read | null |
| 2024-06-27 | HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection | Liujuan Cao et.al. | 2406.19394 | translate | read | link |
| 2024-06-27 | STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning | Yanan Zhang et.al. | 2406.19362 | translate | read | null |
| 2024-06-27 | Towards Reducing Data Acquisition and Labeling for Defect Detection using Simulated Data | Lukas Malte Kemeter et.al. | 2406.19175 | translate | read | null |
| 2024-06-27 | FDLite: A Single Stage Lightweight Face Detector Network | Yogesh Aggarwal et.al. | 2406.19107 | translate | read | null |
| 2024-06-27 | Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO | Fuseini Mumuni et.al. | 2406.19057 | translate | read | null |
| 2024-06-27 | BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection | Yang Song et.al. | 2406.19048 | translate | read | null |
| 2024-06-27 | A Universal Railway Obstacle Detection System based on Semi-supervised Segmentation And Optical Flow | Qiushi Guo et.al. | 2406.18908 | translate | read | null |
| 2024-06-26 | SpY: A Context-Based Approach to Spacecraft Component Detection | Trupti Mahendrakar et.al. | 2406.18709 | translate | read | null |
| 2024-06-26 | Unveiling the Unknown: Conditional Evidence Decoupling for Unknown Rejection | Zhaowei Wu et.al. | 2406.18443 | translate | read | link |
| 2024-06-26 | Detecting Machine-Generated Texts: Not Just “AI vs Humans” and Explainability is Complicated | Jiazhou Ji et.al. | 2406.18259 | translate | read | null |
| 2024-06-26 | CTS: Sim-to-Real Unsupervised Domain Adaptation on 3D Detection | Meiying Zhang et.al. | 2406.18129 | translate | read | null |
| 2024-06-26 | The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval | Meinardus Boris et.al. | 2406.18113 | translate | read | link |
| 2024-06-25 | Unmasking the Imposters: In-Domain Detection of Human vs. Machine-Generated Tweets | Bryan E. Tuck et.al. | 2406.17967 | translate | read | null |
| 2024-06-25 | ET tu, CLIP? Addressing Common Object Errors for Unseen Environments | Ye Won Byun et.al. | 2406.17876 | translate | read | null |
| 2024-06-25 | MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection | Michelle Adeline et.al. | 2406.17654 | translate | read | link |
| 2024-06-25 | Embedded event based object detection with spiking neural network | Jonathan Courtois et.al. | 2406.17617 | translate | read | null |
| 2024-06-27 | Towards Open-set Camera 3D Object Detection | Zhuolin He et.al. | 2406.17297 | translate | read | null |
| 2024-06-25 | Exploring Test-Time Adaptation for Object Detection in Continually Changing Environments | Shilei Cao et.al. | 2406.16439 | translate | read | null |
| 2024-06-24 | Artistic-style text detector and a new Movie-Poster dataset | Aoxiang Ning et.al. | 2406.16307 | translate | read | null |
| 2024-06-24 | Investigating the Influence of Prompt-Specific Shortcuts in AI Generated Text Detection | Choonghyun Park et.al. | 2406.16275 | translate | read | null |
| 2024-06-23 | Review of Zero-Shot and Few-Shot AI Algorithms in The Medical Domain | Maged Badawi et.al. | 2406.16143 | translate | read | null |
| 2024-06-22 | Understanding Student and Academic Staff Perceptions of AI Use in Assessment and Feedback | Jasper Roe et.al. | 2406.15808 | translate | read | null |
| 2024-06-22 | Smart Feature is What You Need | Zhaoxin Hu et.al. | 2406.15805 | translate | read | link |
| 2024-06-22 | MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception | Guanqun Wang et.al. | 2406.15768 | translate | read | null |
| 2024-06-21 | Towards Robust Training Datasets for Machine Learning with Ontologies: A Case Study for Emergency Road Vehicle Detection | Lynn Vonderhaar et.al. | 2406.15268 | translate | read | null |
| 2024-06-21 | DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection | Jia Syuen Lim et.al. | 2406.14924 | translate | read | null |
| 2024-06-21 | MOS: Model Synergy for Test-Time Adaptation on LiDAR-Based 3D Object Detection | Zhuoxiao Chen et.al. | 2406.14878 | translate | read | null |
| 2024-06-20 | Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines | Xinyi Ying et.al. | 2406.14482 | translate | read | link |
| 2024-06-20 | Enhanced Bank Check Security: Introducing a Novel Dataset and Transformer-Based Approach for Detection and Verification | Muhammad Saif Ullah Khan et.al. | 2406.14370 | translate | read | link |
| 2024-06-20 | HoTPP Benchmark: Are We Good at the Long Horizon Events Forecasting? | Ivan Karpukhin et.al. | 2406.14341 | translate | read | link |
| 2024-06-20 | LeYOLO, New Scalable and Efficient CNN Architecture for Object Detection | Lilian Hollard et.al. | 2406.14239 | translate | read | link |
| 2024-06-20 | SSAD: Self-supervised Auxiliary Detection Framework for Panoramic X-ray based Dental Disease Diagnosis | Zijian Cai et.al. | 2406.13963 | translate | read | link |
| 2024-06-20 | Towards the in-situ Trunk Identification and Length Measurement of Sea Cucumbers via Bézier Curve Modelling | Shuaixin Liu et.al. | 2406.13951 | translate | read | link |
| 2024-06-19 | DPO: Dual-Perturbation Optimization for Test-time Adaptation in 3D Object Detection | Zhuoxiao Chen et.al. | 2406.13891 | translate | read | link |
| 2024-06-19 | Semantic Enhanced Few-shot Object Detection | Zheng Wang et.al. | 2406.13498 | translate | read | null |
| 2024-06-19 | Snowy Scenes,Clear Detections: A Robust Model for Traffic Light Detection in Adverse Weather Conditions | Shivank Garg et.al. | 2406.13473 | translate | read | link |
| 2024-06-19 | Strengthening Layer Interaction via Dynamic Layer Attention | Kaishen Wang et.al. | 2406.13392 | translate | read | link |
| 2024-06-18 | Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation | Nikolas Koutsoubis et.al. | 2406.12815 | translate | read | link |
| 2024-06-18 | Online Anchor-based Training for Image Classification Tasks | Maria Tzelepi et.al. | 2406.12662 | translate | read | null |
| 2024-06-18 | Applying Ensemble Methods to Model-Agnostic Machine-Generated Text Detection | Ivan Ong et.al. | 2406.12570 | translate | read | null |
| 2024-06-18 | MultiSocial: Multilingual Benchmark of Machine-Generated Text Detection of Social-Media Texts | Dominik Macko et.al. | 2406.12549 | translate | read | null |
| 2024-06-18 | ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection | Junhao Lin et.al. | 2406.12536 | translate | read | link |
| 2024-06-18 | SDNIA-YOLO: A Robust Object Detection Model for Extreme Weather Conditions | Yuexiong Ding et.al. | 2406.12395 | translate | read | null |
| 2024-06-18 | Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines | Honglei Zhang et.al. | 2406.12367 | translate | read | null |
| 2024-06-18 | Certified ML Object Detection for Surveillance Missions | Mohammed Belcaid et.al. | 2406.12362 | translate | read | null |
| 2024-06-18 | DASSF: Dynamic-Attention Scale-Sequence Fusion for Aerial Object Detection | Haodong Li et.al. | 2406.12285 | translate | read | null |
| 2024-06-18 | The Solution for CVPR2024 Foundational Few-Shot Object Detection Challenge | Hongpeng Pan et.al. | 2406.12225 | translate | read | null |
| 2024-06-17 | V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results | Jiaqi Wang et.al. | 2406.11739 | translate | read | null |
| 2024-06-17 | YOLO-FEDER FusionNet: A Novel Deep Learning Architecture for Drone Detection | Tamara R. Lenhard et.al. | 2406.11641 | translate | read | null |
| 2024-06-17 | Low-power Ship Detection in Satellite Images Using Neuromorphic Hardware | Gregor Lenz et.al. | 2406.11319 | translate | read | null |
| 2024-06-17 | Semi-Supervised Domain Adaptation Using Target-Oriented Domain Augmentation for 3D Object Detection | Yecheol Kim et.al. | 2406.11313 | translate | read | link |
| 2024-06-17 | Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection | Yunsong Wang et.al. | 2406.11311 | translate | read | null |
| 2024-06-17 | Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding | Yunsong Wang et.al. | 2406.11283 | translate | read | null |
| 2024-06-17 | YOLO9tr: A Lightweight Model for Pavement Damage Detection Utilizing a Generalized Efficient Layer Aggregation Network and Attention Mechanism | Sompote Youwai et.al. | 2406.11254 | translate | read | link |
| 2024-06-16 | GANmut: Generating and Modifying Facial Expressions | Maria Surani et.al. | 2406.11079 | translate | read | null |
| 2024-06-16 | Exploring the Limitations of Detecting Machine-Generated Text | Jad Doughman et.al. | 2406.11073 | translate | read | null |
| 2024-06-16 | Open-Vocabulary X-ray Prohibited Item Detection via Fine-tuning CLIP | Shuyang Lin et.al. | 2406.10961 | translate | read | null |
| 2024-06-14 | EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models | Julian Straub et.al. | 2406.10224 | translate | read | link |
| 2024-06-14 | YOLOv1 to YOLOv10: A comprehensive review of YOLO variants and their application in the agricultural domain | Mujadded Al Rabbani Alif et.al. | 2406.10139 | translate | read | null |
| 2024-06-14 | Shelf-Supervised Multi-Modal Pre-Training for 3D Object Detection | Mehar Khurana et.al. | 2406.10115 | translate | read | null |
| 2024-06-14 | Automated GIS-Based Framework for Detecting Crosswalk Changes from Bi-Temporal High-Resolution Aerial Images | Richard Boadu Antwi et.al. | 2406.09731 | translate | read | null |
| 2024-06-14 | An alternate approach for estimating grain-growth kinetics | Manoj Prabakar et.al. | 2406.09653 | translate | read | null |
| 2024-06-13 | Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach | Yansheng Li et.al. | 2406.09410 | translate | read | link |
| 2024-06-13 | Towards Evaluating the Robustness of Visual State Space Models | Hashmat Shadab Malik et.al. | 2406.09407 | translate | read | link |
| 2024-06-13 | Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models | Yushi Hu et.al. | 2406.09403 | translate | read | null |
| 2024-06-13 | Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 2024 | Peixi Wu et.al. | 2406.09201 | translate | read | null |
| 2024-06-13 | Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors | Ying Zhou et.al. | 2406.08922 | translate | read | link |
| 2024-06-13 | Computer vision-based model for detecting turning lane features on Florida’s public roadways | Richard Boadu Antwi et.al. | 2406.08822 | translate | read | null |
| 2024-06-13 | BEVSpread: Spread Voxel Pooling for Bird’s-Eye-View Representation in Vision-based Roadside 3D Object Detection | Wenjie Wang et.al. | 2406.08785 | translate | read | null |
| 2024-06-12 | UnO: Unsupervised Occupancy Fields for Perception and Forecasting | Ben Agro et.al. | 2406.08691 | translate | read | null |
| 2024-06-12 | Transformation-Dependent Adversarial Attacks | Yaoteng Tan et.al. | 2406.08443 | translate | read | null |
| 2024-06-12 | Dataset Enhancement with Instance-Level Augmentations | Orest Kupyn et.al. | 2406.08249 | translate | read | link |
| 2024-06-12 | Chemistry3D: Robotic Interaction Benchmark for Chemistry Experiments | Shoujie Li et.al. | 2406.08160 | translate | read | null |
| 2024-06-12 | CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer | Hualian Sheng et.al. | 2406.08152 | translate | read | null |
| 2024-06-12 | MWIRSTD: A MWIR Small Target Detection Dataset | Nikhil Kumar et.al. | 2406.08063 | translate | read | link |
| 2024-06-12 | Sense Less, Generate More: Pre-training LiDAR Perception with Masked Autoencoders for Ultra-Efficient 3D Sensing | Sina Tayebati et.al. | 2406.07833 | translate | read | link |
| 2024-06-11 | A Deep Learning Approach to Detect Complete Safety Equipment For Construction Workers Based On YOLOv7 | Md. Shariful Islam et.al. | 2406.07707 | translate | read | null |
| 2024-06-11 | Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection | J. Schueler et.al. | 2406.07538 | translate | read | null |
| 2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506 | translate | read | link |
| 2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332 | translate | read | null |
| 2024-06-11 | Unsupervised Object Detection with Theoretical Guarantees | Marian Longa et.al. | 2406.07284 | translate | read | null |
| 2024-06-11 | Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation | Jinyuan Li et.al. | 2406.07268 | translate | read | null |
| 2024-06-11 | EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network | Yining Shi et.al. | 2406.07042 | translate | read | link |
| 2024-06-11 | RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks | Zhechao Wang et.al. | 2406.07032 | translate | read | null |
| 2024-06-12 | LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection | Jiahua Xu et.al. | 2406.07023 | translate | read | null |
| 2024-06-11 | Teaching with Uncertainty: Unleashing the Potential of Knowledge Distillation in Object Detection | Junfei Yi et.al. | 2406.06999 | translate | read | null |
| 2024-06-10 | UnSupDLA: Towards Unsupervised Document Layout Analysis | Talha Uddin Sheikh et.al. | 2406.06236 | translate | read | null |
| 2024-06-10 | UEMM-Air: A Synthetic Multi-modal Dataset for Unmanned Aerial Vehicle Object Detection | Fan Liu et.al. | 2406.06230 | translate | read | link |
| 2024-06-10 | ReCon1M:A Large-scale Benchmark Dataset for Relation Comprehension in Remote Sensing Imagery | Xian Sun et.al. | 2406.06028 | translate | read | null |
| 2024-06-10 | Solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024 | Jinwoo Ahn et.al. | 2406.05963 | translate | read | null |
| 2024-06-10 | Open-Vocabulary Part-Based Grasping | Tjeard van Oort et.al. | 2406.05951 | translate | read | null |
| 2024-06-09 | Stealthy Targeted Backdoor Attacks against Image Captioning | Wenshu Fan et.al. | 2406.05874 | translate | read | null |
| 2024-06-09 | Scaling Graph Convolutions for Mobile Vision | William Avery et.al. | 2406.05850 | translate | read | link |
| 2024-06-09 | Mamba YOLO: SSMs-Based YOLO For Object Detection | Zeyu Wang et.al. | 2406.05835 | translate | read | link |
| 2024-06-09 | ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving | Chen Ma et.al. | 2406.05810 | translate | read | null |
| 2024-06-09 | SAM-PM: Enhancing Video Camouflaged Object Detection using Spatio-Temporal Attention | Muhammad Nawfal Meeran et.al. | 2406.05802 | translate | read | link |
| 2024-06-07 | Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment | Venkanna Babu Guthula et.al. | 2406.04949 | translate | read | null |
| 2024-06-07 | EGOR: Efficient Generated Objects Replay for incremental object detection | Zijia An et.al. | 2406.04829 | translate | read | null |
| 2024-06-07 | UCDNet: Multi-UAV Collaborative 3D Object Detection Network by Reliable Feature Mapping | Pengju Tian et.al. | 2406.04648 | translate | read | null |
| 2024-06-07 | UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection | Yuchao Wang et.al. | 2406.04647 | translate | read | null |
| 2024-06-06 | CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset | Abdelrahman Abdallah et.al. | 2406.04493 | translate | read | link |
| 2024-06-06 | DeTra: A Unified Model for Object Detection and Trajectory Forecasting | Sergio Casas et.al. | 2406.04426 | translate | read | null |
| 2024-06-06 | Parameter-Inverted Image Pyramid Networks | Xizhou Zhu et.al. | 2406.04330 | translate | read | link |
| 2024-06-06 | LenslessFace: An End-to-End Optimized Lensless System for Privacy-Preserving Face Verification | Xin Cai et.al. | 2406.04129 | translate | read | null |
| 2024-06-06 | Semmeldetector: Application of Machine Learning in Commercial Bakeries | Thomas H. Schmitt et.al. | 2406.04050 | translate | read | null |
| 2024-06-06 | Frequency-based Matcher for Long-tailed Semantic Segmentation | Shan Li et.al. | 2406.03917 | translate | read | link |
| 2024-06-06 | Instance Segmentation and Teeth Classification in Panoramic X-rays | Devichand Budagam et.al. | 2406.03747 | translate | read | link |
| 2024-06-05 | FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Cyprien Quéméneur et.al. | 2406.03611 | translate | read | link |
| 2024-06-05 | LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection | Qiang Chen et.al. | 2406.03459 | translate | read | link |
| 2024-06-05 | Global Clipper: Enhancing Safety and Reliability of Transformer-based Object Detection Models | Qutub Syed Sha et.al. | 2406.03229 | translate | read | null |
| 2024-06-05 | Situation Monitor: Diversity-Driven Zero-Shot Out-of-Distribution Detection using Budding Ensemble Architecture for Object Detection | Qutub Syed et.al. | 2406.03188 | translate | read | null |
| 2024-06-05 | Enhanced Automotive Object Detection via RGB-D Fusion in a DiffusionDet Framework | Eliraz Orfaig et.al. | 2406.03129 | translate | read | null |
| 2024-06-04 | Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation | Mohamed El Amine Boudjoghra et.al. | 2406.02548 | translate | read | link |
| 2024-06-04 | SatSplatYOLO: 3D Gaussian Splatting-based Virtual Object Detection Ensembles for Satellite Feature Recognition | Van Minh Nguyen et.al. | 2406.02533 | translate | read | null |
| 2024-06-04 | GrootVL: Tree Topology is All You Need in State Space Model | Yicheng Xiao et.al. | 2406.02395 | translate | read | link |
| 2024-06-04 | Low-Rank Adaption on Transformer-based Oriented Object Detector for Satellite Onboard Processing of Remote Sensing Images | Xinyang Pu et.al. | 2406.02385 | translate | read | link |
| 2024-06-04 | Radar Spectra-Language Model for Automotive Scene Parsing | Mariia Pushkareva et.al. | 2406.02158 | translate | read | null |
| 2024-06-04 | Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning | Heather Doig et.al. | 2406.01932 | translate | read | null |
| 2024-06-04 | GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer | Ding Jia et.al. | 2406.01210 | translate | read | link |
| 2024-06-03 | Learning Adaptive Fusion Bank for Multi-modal Salient Object Detection | Kunpeng Wang et.al. | 2406.01127 | translate | read | link |
| 2024-06-03 | Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline | Jan Lippemeier et.al. | 2406.01071 | translate | read | null |
| 2024-06-03 | Multi-Object Tracking based on Imaging Radar 3D Object Detection | Patrick Palmer et.al. | 2406.01011 | translate | read | null |
(<a href=../Object_Detection.md>back to Object Detection</a>)