Object Detection - 2024-06 | Paper Arxiv Daily


Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-06-28	Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood	Yang Xu et.al.	2406.19874	translate	read	link
2024-06-28	Basketball-SORT: An Association Method for Complex Multi-object Occlusion Problems in Basketball Multi-object Tracking	Qingrui Hu et.al.	2406.19655	translate	read	null
2024-06-27	Robustness Testing of Black-Box Models Against CT Degradation Through Test-Time Augmentation	Jack Highton et.al.	2406.19557	translate	read	null
2024-06-27	BOrg: A Brain Organoid-Based Mitosis Dataset for Automatic Analysis of Brain Diseases	Muhammad Awais et.al.	2406.19556	translate	read	link
2024-06-27	Weighted Circle Fusion: Ensembling Circle Representation from Different Object Detection Results	Jialin Yue et.al.	2406.19540	translate	read	null
2024-06-27	Stereo Vision Based Robot for Remote Monitoring with VR Support	Mohamed Fazil M. S. et.al.	2406.19498	translate	read	null
2024-06-27	HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection	Liujuan Cao et.al.	2406.19394	translate	read	link
2024-06-27	STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning	Yanan Zhang et.al.	2406.19362	translate	read	null
2024-06-27	Towards Reducing Data Acquisition and Labeling for Defect Detection using Simulated Data	Lukas Malte Kemeter et.al.	2406.19175	translate	read	null
2024-06-27	FDLite: A Single Stage Lightweight Face Detector Network	Yogesh Aggarwal et.al.	2406.19107	translate	read	null
2024-06-27	Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO	Fuseini Mumuni et.al.	2406.19057	translate	read	null
2024-06-27	BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection	Yang Song et.al.	2406.19048	translate	read	null
2024-06-27	A Universal Railway Obstacle Detection System based on Semi-supervised Segmentation And Optical Flow	Qiushi Guo et.al.	2406.18908	translate	read	null
2024-06-26	SpY: A Context-Based Approach to Spacecraft Component Detection	Trupti Mahendrakar et.al.	2406.18709	translate	read	null
2024-06-26	Unveiling the Unknown: Conditional Evidence Decoupling for Unknown Rejection	Zhaowei Wu et.al.	2406.18443	translate	read	link
2024-06-26	Detecting Machine-Generated Texts: Not Just “AI vs Humans” and Explainability is Complicated	Jiazhou Ji et.al.	2406.18259	translate	read	null
2024-06-26	CTS: Sim-to-Real Unsupervised Domain Adaptation on 3D Detection	Meiying Zhang et.al.	2406.18129	translate	read	null
2024-06-26	The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval	Meinardus Boris et.al.	2406.18113	translate	read	link
2024-06-25	Unmasking the Imposters: In-Domain Detection of Human vs. Machine-Generated Tweets	Bryan E. Tuck et.al.	2406.17967	translate	read	null
2024-06-25	ET tu, CLIP? Addressing Common Object Errors for Unseen Environments	Ye Won Byun et.al.	2406.17876	translate	read	null
2024-06-25	MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection	Michelle Adeline et.al.	2406.17654	translate	read	link
2024-06-25	Embedded event based object detection with spiking neural network	Jonathan Courtois et.al.	2406.17617	translate	read	null
2024-06-27	Towards Open-set Camera 3D Object Detection	Zhuolin He et.al.	2406.17297	translate	read	null
2024-06-25	Exploring Test-Time Adaptation for Object Detection in Continually Changing Environments	Shilei Cao et.al.	2406.16439	translate	read	null
2024-06-24	Artistic-style text detector and a new Movie-Poster dataset	Aoxiang Ning et.al.	2406.16307	translate	read	null
2024-06-24	Investigating the Influence of Prompt-Specific Shortcuts in AI Generated Text Detection	Choonghyun Park et.al.	2406.16275	translate	read	null
2024-06-23	Review of Zero-Shot and Few-Shot AI Algorithms in The Medical Domain	Maged Badawi et.al.	2406.16143	translate	read	null
2024-06-22	Understanding Student and Academic Staff Perceptions of AI Use in Assessment and Feedback	Jasper Roe et.al.	2406.15808	translate	read	null
2024-06-22	Smart Feature is What You Need	Zhaoxin Hu et.al.	2406.15805	translate	read	link
2024-06-22	MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception	Guanqun Wang et.al.	2406.15768	translate	read	null
2024-06-21	Towards Robust Training Datasets for Machine Learning with Ontologies: A Case Study for Emergency Road Vehicle Detection	Lynn Vonderhaar et.al.	2406.15268	translate	read	null
2024-06-21	DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection	Jia Syuen Lim et.al.	2406.14924	translate	read	null
2024-06-21	MOS: Model Synergy for Test-Time Adaptation on LiDAR-Based 3D Object Detection	Zhuoxiao Chen et.al.	2406.14878	translate	read	null
2024-06-20	Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines	Xinyi Ying et.al.	2406.14482	translate	read	link
2024-06-20	Enhanced Bank Check Security: Introducing a Novel Dataset and Transformer-Based Approach for Detection and Verification	Muhammad Saif Ullah Khan et.al.	2406.14370	translate	read	link
2024-06-20	HoTPP Benchmark: Are We Good at the Long Horizon Events Forecasting?	Ivan Karpukhin et.al.	2406.14341	translate	read	link
2024-06-20	LeYOLO, New Scalable and Efficient CNN Architecture for Object Detection	Lilian Hollard et.al.	2406.14239	translate	read	link
2024-06-20	SSAD: Self-supervised Auxiliary Detection Framework for Panoramic X-ray based Dental Disease Diagnosis	Zijian Cai et.al.	2406.13963	translate	read	link
2024-06-20	Towards the in-situ Trunk Identification and Length Measurement of Sea Cucumbers via Bézier Curve Modelling	Shuaixin Liu et.al.	2406.13951	translate	read	link
2024-06-19	DPO: Dual-Perturbation Optimization for Test-time Adaptation in 3D Object Detection	Zhuoxiao Chen et.al.	2406.13891	translate	read	link
2024-06-19	Semantic Enhanced Few-shot Object Detection	Zheng Wang et.al.	2406.13498	translate	read	null
2024-06-19	Snowy Scenes, Clear Detections: A Robust Model for Traffic Light Detection in Adverse Weather Conditions	Shivank Garg et.al.	2406.13473	translate	read	link
2024-06-19	Strengthening Layer Interaction via Dynamic Layer Attention	Kaishen Wang et.al.	2406.13392	translate	read	link
2024-06-18	Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation	Nikolas Koutsoubis et.al.	2406.12815	translate	read	link
2024-06-18	Online Anchor-based Training for Image Classification Tasks	Maria Tzelepi et.al.	2406.12662	translate	read	null
2024-06-18	Applying Ensemble Methods to Model-Agnostic Machine-Generated Text Detection	Ivan Ong et.al.	2406.12570	translate	read	null
2024-06-18	MultiSocial: Multilingual Benchmark of Machine-Generated Text Detection of Social-Media Texts	Dominik Macko et.al.	2406.12549	translate	read	null
2024-06-18	ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection	Junhao Lin et.al.	2406.12536	translate	read	link
2024-06-18	SDNIA-YOLO: A Robust Object Detection Model for Extreme Weather Conditions	Yuexiong Ding et.al.	2406.12395	translate	read	null
2024-06-18	Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines	Honglei Zhang et.al.	2406.12367	translate	read	null
2024-06-18	Certified ML Object Detection for Surveillance Missions	Mohammed Belcaid et.al.	2406.12362	translate	read	null
2024-06-18	DASSF: Dynamic-Attention Scale-Sequence Fusion for Aerial Object Detection	Haodong Li et.al.	2406.12285	translate	read	null
2024-06-18	The Solution for CVPR2024 Foundational Few-Shot Object Detection Challenge	Hongpeng Pan et.al.	2406.12225	translate	read	null
2024-06-17	V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results	Jiaqi Wang et.al.	2406.11739	translate	read	null
2024-06-17	YOLO-FEDER FusionNet: A Novel Deep Learning Architecture for Drone Detection	Tamara R. Lenhard et.al.	2406.11641	translate	read	null
2024-06-17	Low-power Ship Detection in Satellite Images Using Neuromorphic Hardware	Gregor Lenz et.al.	2406.11319	translate	read	null
2024-06-17	Semi-Supervised Domain Adaptation Using Target-Oriented Domain Augmentation for 3D Object Detection	Yecheol Kim et.al.	2406.11313	translate	read	link
2024-06-17	Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection	Yunsong Wang et.al.	2406.11311	translate	read	null
2024-06-17	Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding	Yunsong Wang et.al.	2406.11283	translate	read	null
2024-06-17	YOLO9tr: A Lightweight Model for Pavement Damage Detection Utilizing a Generalized Efficient Layer Aggregation Network and Attention Mechanism	Sompote Youwai et.al.	2406.11254	translate	read	link
2024-06-16	GANmut: Generating and Modifying Facial Expressions	Maria Surani et.al.	2406.11079	translate	read	null
2024-06-16	Exploring the Limitations of Detecting Machine-Generated Text	Jad Doughman et.al.	2406.11073	translate	read	null
2024-06-16	Open-Vocabulary X-ray Prohibited Item Detection via Fine-tuning CLIP	Shuyang Lin et.al.	2406.10961	translate	read	null
2024-06-14	EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models	Julian Straub et.al.	2406.10224	translate	read	link
2024-06-14	YOLOv1 to YOLOv10: A comprehensive review of YOLO variants and their application in the agricultural domain	Mujadded Al Rabbani Alif et.al.	2406.10139	translate	read	null
2024-06-14	Shelf-Supervised Multi-Modal Pre-Training for 3D Object Detection	Mehar Khurana et.al.	2406.10115	translate	read	null
2024-06-14	Automated GIS-Based Framework for Detecting Crosswalk Changes from Bi-Temporal High-Resolution Aerial Images	Richard Boadu Antwi et.al.	2406.09731	translate	read	null
2024-06-14	An alternate approach for estimating grain-growth kinetics	Manoj Prabakar et.al.	2406.09653	translate	read	null
2024-06-13	Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach	Yansheng Li et.al.	2406.09410	translate	read	link
2024-06-13	Towards Evaluating the Robustness of Visual State Space Models	Hashmat Shadab Malik et.al.	2406.09407	translate	read	link
2024-06-13	Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models	Yushi Hu et.al.	2406.09403	translate	read	null
2024-06-13	Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 2024	Peixi Wu et.al.	2406.09201	translate	read	null
2024-06-13	Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors	Ying Zhou et.al.	2406.08922	translate	read	link
2024-06-13	Computer vision-based model for detecting turning lane features on Florida’s public roadways	Richard Boadu Antwi et.al.	2406.08822	translate	read	null
2024-06-13	BEVSpread: Spread Voxel Pooling for Bird’s-Eye-View Representation in Vision-based Roadside 3D Object Detection	Wenjie Wang et.al.	2406.08785	translate	read	null
2024-06-12	UnO: Unsupervised Occupancy Fields for Perception and Forecasting	Ben Agro et.al.	2406.08691	translate	read	null
2024-06-12	Transformation-Dependent Adversarial Attacks	Yaoteng Tan et.al.	2406.08443	translate	read	null
2024-06-12	Dataset Enhancement with Instance-Level Augmentations	Orest Kupyn et.al.	2406.08249	translate	read	link
2024-06-12	Chemistry3D: Robotic Interaction Benchmark for Chemistry Experiments	Shoujie Li et.al.	2406.08160	translate	read	null
2024-06-12	CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer	Hualian Sheng et.al.	2406.08152	translate	read	null
2024-06-12	MWIRSTD: A MWIR Small Target Detection Dataset	Nikhil Kumar et.al.	2406.08063	translate	read	link
2024-06-12	Sense Less, Generate More: Pre-training LiDAR Perception with Masked Autoencoders for Ultra-Efficient 3D Sensing	Sina Tayebati et.al.	2406.07833	translate	read	link
2024-06-11	A Deep Learning Approach to Detect Complete Safety Equipment For Construction Workers Based On YOLOv7	Md. Shariful Islam et.al.	2406.07707	translate	read	null
2024-06-11	Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection	J. Schueler et.al.	2406.07538	translate	read	null
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506	translate	read	link
2024-06-11	Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach	Challapalli Phanindra Revanth et.al.	2406.07332	translate	read	null
2024-06-11	Unsupervised Object Detection with Theoretical Guarantees	Marian Longa et.al.	2406.07284	translate	read	null
2024-06-11	Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation	Jinyuan Li et.al.	2406.07268	translate	read	null
2024-06-11	EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network	Yining Shi et.al.	2406.07042	translate	read	link
2024-06-11	RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks	Zhechao Wang et.al.	2406.07032	translate	read	null
2024-06-12	LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection	Jiahua Xu et.al.	2406.07023	translate	read	null
2024-06-11	Teaching with Uncertainty: Unleashing the Potential of Knowledge Distillation in Object Detection	Junfei Yi et.al.	2406.06999	translate	read	null
2024-06-10	UnSupDLA: Towards Unsupervised Document Layout Analysis	Talha Uddin Sheikh et.al.	2406.06236	translate	read	null
2024-06-10	UEMM-Air: A Synthetic Multi-modal Dataset for Unmanned Aerial Vehicle Object Detection	Fan Liu et.al.	2406.06230	translate	read	link
2024-06-10	ReCon1M: A Large-scale Benchmark Dataset for Relation Comprehension in Remote Sensing Imagery	Xian Sun et.al.	2406.06028	translate	read	null
2024-06-10	Solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024	Jinwoo Ahn et.al.	2406.05963	translate	read	null
2024-06-10	Open-Vocabulary Part-Based Grasping	Tjeard van Oort et.al.	2406.05951	translate	read	null
2024-06-09	Stealthy Targeted Backdoor Attacks against Image Captioning	Wenshu Fan et.al.	2406.05874	translate	read	null
2024-06-09	Scaling Graph Convolutions for Mobile Vision	William Avery et.al.	2406.05850	translate	read	link
2024-06-09	Mamba YOLO: SSMs-Based YOLO For Object Detection	Zeyu Wang et.al.	2406.05835	translate	read	link
2024-06-09	ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving	Chen Ma et.al.	2406.05810	translate	read	null
2024-06-09	SAM-PM: Enhancing Video Camouflaged Object Detection using Spatio-Temporal Attention	Muhammad Nawfal Meeran et.al.	2406.05802	translate	read	link
2024-06-07	Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment	Venkanna Babu Guthula et.al.	2406.04949	translate	read	null
2024-06-07	EGOR: Efficient Generated Objects Replay for incremental object detection	Zijia An et.al.	2406.04829	translate	read	null
2024-06-07	UCDNet: Multi-UAV Collaborative 3D Object Detection Network by Reliable Feature Mapping	Pengju Tian et.al.	2406.04648	translate	read	null
2024-06-07	UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection	Yuchao Wang et.al.	2406.04647	translate	read	null
2024-06-06	CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset	Abdelrahman Abdallah et.al.	2406.04493	translate	read	link
2024-06-06	DeTra: A Unified Model for Object Detection and Trajectory Forecasting	Sergio Casas et.al.	2406.04426	translate	read	null
2024-06-06	Parameter-Inverted Image Pyramid Networks	Xizhou Zhu et.al.	2406.04330	translate	read	link
2024-06-06	LenslessFace: An End-to-End Optimized Lensless System for Privacy-Preserving Face Verification	Xin Cai et.al.	2406.04129	translate	read	null
2024-06-06	Semmeldetector: Application of Machine Learning in Commercial Bakeries	Thomas H. Schmitt et.al.	2406.04050	translate	read	null
2024-06-06	Frequency-based Matcher for Long-tailed Semantic Segmentation	Shan Li et.al.	2406.03917	translate	read	link
2024-06-06	Instance Segmentation and Teeth Classification in Panoramic X-rays	Devichand Budagam et.al.	2406.03747	translate	read	link
2024-06-05	FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles	Cyprien Quéméneur et.al.	2406.03611	translate	read	link
2024-06-05	LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection	Qiang Chen et.al.	2406.03459	translate	read	link
2024-06-05	Global Clipper: Enhancing Safety and Reliability of Transformer-based Object Detection Models	Qutub Syed Sha et.al.	2406.03229	translate	read	null
2024-06-05	Situation Monitor: Diversity-Driven Zero-Shot Out-of-Distribution Detection using Budding Ensemble Architecture for Object Detection	Qutub Syed et.al.	2406.03188	translate	read	null
2024-06-05	Enhanced Automotive Object Detection via RGB-D Fusion in a DiffusionDet Framework	Eliraz Orfaig et.al.	2406.03129	translate	read	null
2024-06-04	Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation	Mohamed El Amine Boudjoghra et.al.	2406.02548	translate	read	link
2024-06-04	SatSplatYOLO: 3D Gaussian Splatting-based Virtual Object Detection Ensembles for Satellite Feature Recognition	Van Minh Nguyen et.al.	2406.02533	translate	read	null
2024-06-04	GrootVL: Tree Topology is All You Need in State Space Model	Yicheng Xiao et.al.	2406.02395	translate	read	link
2024-06-04	Low-Rank Adaption on Transformer-based Oriented Object Detector for Satellite Onboard Processing of Remote Sensing Images	Xinyang Pu et.al.	2406.02385	translate	read	link
2024-06-04	Radar Spectra-Language Model for Automotive Scene Parsing	Mariia Pushkareva et.al.	2406.02158	translate	read	null
2024-06-04	Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning	Heather Doig et.al.	2406.01932	translate	read	null
2024-06-04	GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer	Ding Jia et.al.	2406.01210	translate	read	link
2024-06-03	Learning Adaptive Fusion Bank for Multi-modal Salient Object Detection	Kunpeng Wang et.al.	2406.01127	translate	read	link
2024-06-03	Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline	Jan Lippemeier et.al.	2406.01071	translate	read	null
2024-06-03	Multi-Object Tracking based on Imaging Radar 3D Object Detection	Patrick Palmer et.al.	2406.01011	translate	read	null
