Object Detection - 2025-07
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2025-07-31 | 3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection | Yung-Hsu Yang et.al. | 2507.23567 | translate | read | link |
| 2025-07-24 | Protecting Vulnerable Voices: Synthetic Dataset Generation for Self-Disclosure Detection | Shalini Jangra et.al. | 2507.22930 | translate | read | null |
| 2025-07-25 | Bias Analysis for Synthetic Face Detection: A Case Study of the Impact of Facial Attributes | Asmae Lamsaf et.al. | 2507.19705 | translate | read | null |
| 2025-07-25 | Co-Win: Joint Object Detection and Instance Segmentation in LiDAR Point Clouds via Collaborative Window Processing | Haichuan Li et.al. | 2507.19691 | translate | read | null |
| 2025-07-25 | An OpenSource CI/CD Pipeline for Variant-Rich Software-Defined Vehicles | Matthias Weiß et.al. | 2507.19446 | translate | read | null |
| 2025-07-25 | EffiComm: Bandwidth Efficient Multi Agent Communication | Melih Yazgan et.al. | 2507.19354 | translate | read | null |
| 2025-07-25 | Multistream Network for LiDAR and Camera-based 3D Object Detection in Outdoor Scenes | Muhammad Ibrahim et.al. | 2507.19304 | translate | read | null |
| 2025-07-25 | Cross Spatial Temporal Fusion Attention for Remote Sensing Object Detection via Image Feature Matching | Abu Sadat Mohammad Salehin Amit et.al. | 2507.19118 | translate | read | null |
| 2025-07-25 | Revisiting DETR for Small Object Detection via Noise-Resilient Query Optimization | Xiaocheng Fang et.al. | 2507.19059 | translate | read | null |
| 2025-07-25 | YOLO for Knowledge Extraction from Vehicle Images: A Baseline Study | Saraa Al-Saddik et.al. | 2507.18966 | translate | read | null |
| 2025-07-25 | WiSE-OD: Benchmarking Robustness in Infrared Object Detection | Heitor R. Medeiros et.al. | 2507.18925 | translate | read | null |
| 2025-07-25 | Synthetic-to-Real Camouflaged Object Detection | Zhihao Luo et.al. | 2507.18911 | translate | read | null |
| 2025-07-24 | Towards Large Scale Geostatistical Methane Monitoring with Part-based Object Detection | Adhemar de Senneville et.al. | 2507.18513 | translate | read | null |
| 2025-07-24 | Human Scanpath Prediction in Target-Present Visual Search with Semantic-Foveal Bayesian Attention | João Luzio et.al. | 2507.18503 | translate | read | null |
| 2025-07-24 | A COCO-Formatted Instance-Level Dataset for Plasmodium Falciparum Detection in Giemsa-Stained Blood Smears | Frauke Wilm et.al. | 2507.18483 | translate | read | null |
| 2025-07-24 | Revisiting Physically Realizable Adversarial Object Attack against LiDAR-based Detection: Clarifying Problem Formulation and Experimental Protocols | Luo Cheng et.al. | 2507.18457 | translate | read | null |
| 2025-07-24 | Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction | Runmin Zhang et.al. | 2507.18331 | translate | read | link |
| 2025-07-24 | LMM-Det: Make Large Multimodal Models Excel in Object Detection | Jincheng Li et.al. | 2507.18300 | translate | read | link |
| 2025-07-24 | Evaluation of facial landmark localization performance in a surgical setting | Ines Frajtag et.al. | 2507.18248 | translate | read | null |
| 2025-07-24 | Real-Time Object Detection and Classification using YOLO for Edge FPGAs | Rashed Al Amin et.al. | 2507.18174 | translate | read | null |
| 2025-07-24 | WaveMamba: Wavelet-Driven Mamba Fusion for RGB-Infrared Object Detection | Haodong Zhu et.al. | 2507.18173 | translate | read | null |
| 2025-07-24 | OpenNav: Open-World Navigation with Multimodal Large Language Models | Mingfeng Yuan et.al. | 2507.18033 | translate | read | null |
| 2025-07-23 | Bearded Dragon Activity Recognition Pipeline: An AI-Based Approach to Behavioural Monitoring | Arsen Yermukan et.al. | 2507.17987 | translate | read | null |
| 2025-07-23 | FishDet-M: A Unified Large-Scale Benchmark for Robust Fish Detection and CLIP-Guided Model Selection in Diverse Aquatic Visual Domains | Muayad Abujabal et.al. | 2507.17859 | translate | read | null |
| 2025-07-23 | Perspective-Invariant 3D Object Detection | Ao Liang et.al. | 2507.17665 | translate | read | null |
| 2025-07-23 | Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning | Xinyao Liu et.al. | 2507.17539 | translate | read | link |
| 2025-07-23 | Illicit object detection in X-ray imaging using deep learning techniques: A comparative evaluation | Jorgen Cani et.al. | 2507.17508 | translate | read | link |
| 2025-07-23 | Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection | Yehao Lu et.al. | 2507.17436 | translate | read | null |
| 2025-07-23 | SFUOD: Source-Free Unknown Object Detection | Keon-Hee Park et.al. | 2507.17373 | translate | read | null |
| 2025-07-23 | Optimizing Delivery Logistics: Enhancing Speed and Safety with Drone Technology | Maharshi Shastri et.al. | 2507.17253 | translate | read | null |
| 2025-07-23 | A Low-Cost Machine Learning Approach for Timber Diameter Estimation | Fatemeh Hasanzadeh Fard et.al. | 2507.17219 | translate | read | null |
| 2025-07-22 | Few-Shot Learning in Video and 3D Object Detection: A Survey | Md Meftahul Ferdaus et.al. | 2507.17079 | translate | read | null |
| 2025-07-22 | Transformer Based Building Boundary Reconstruction using Attraction Field Maps | Muhammad Kamran et.al. | 2507.17038 | translate | read | null |
| 2025-07-22 | Divisive Decisions: Improving Salience-Based Training for Generalization in Binary Classification Tasks | Jacob Piland et.al. | 2507.17000 | translate | read | null |
| 2025-07-22 | Task-Specific Zero-shot Quantization-Aware Training for Object Detection | Changhao Li et.al. | 2507.16782 | translate | read | null |
| 2025-07-22 | Screen2AX: Vision-Based Approach for Automatic macOS Accessibility Generation | Viktor Muryn et.al. | 2507.16704 | translate | read | null |
| 2025-07-22 | QRetinex-Net: Quaternion-Valued Retinex Decomposition for Low-Level Computer Vision Applications | Sos Agaian et.al. | 2507.16683 | translate | read | null |
| 2025-07-22 | Benchmarking pig detection and tracking under diverse and challenging conditions | Jonathan Henrich et.al. | 2507.16639 | translate | read | null |
| 2025-07-22 | A2Mamba: Attention-augmented State Space Models for Visual Recognition | Meng Lou et.al. | 2507.16624 | translate | read | null |
| 2025-07-22 | PlantSAM: An Object Detection-Driven Segmentation Pipeline for Herbarium Specimens | Youcef Sklab et.al. | 2507.16506 | translate | read | null |
| 2025-07-22 | Towards Railway Domain Adaptation for LiDAR-based 3D Detection: Road-to-Rail and Sim-to-Real via SynDRA-BBox | Xavier Diaz et.al. | 2507.16413 | translate | read | null |
| 2025-07-22 | Scene Text Detection and Recognition “in light of” Challenging Environmental Conditions using Aria Glasses Egocentric Vision Cameras | Joseph De Mathia et.al. | 2507.16330 | translate | read | null |
| 2025-07-22 | MAN++: Scaling Momentum Auxiliary Network for Supervised Local Learning in Vision Tasks | Junhao Su et.al. | 2507.16279 | translate | read | null |
| 2025-07-22 | Edge-case Synthesis for Fisheye Object Detection: A Data-centric Perspective | Seunghyeon Kim et.al. | 2507.16254 | translate | read | null |
| 2025-07-21 | Experimenting active and sequential learning in a medieval music manuscript | Sachin Sharma et.al. | 2507.15633 | translate | read | null |
| 2025-07-21 | Few-Shot Object Detection via Spatial-Channel State Space Model | Zhimeng Xin et.al. | 2507.15308 | translate | read | null |
| 2025-07-21 | Beyond Easy Wins: A Text Hardness-Aware Benchmark for LLM-generated Text Detection | Navid Ayoobi et.al. | 2507.15286 | translate | read | null |
| 2025-07-20 | Event-based Graph Representation with Spatial and Motion Vectors for Asynchronous Object Detection | Aayush Atul Verma et.al. | 2507.15150 | translate | read | null |
| 2025-07-20 | BleedOrigin: Dynamic Bleeding Source Localization in Endoscopic Submucosal Dissection via Dual-Stage Detection and Tracking | Mengya Xu et.al. | 2507.15094 | translate | read | null |
| 2025-07-20 | InsightX Agent: An LMM-based Agentic Framework with Integrated Tools for Reliable X-ray NDT Analysis | Jiale Liu et.al. | 2507.14899 | translate | read | null |
| 2025-07-20 | An Uncertainty-aware DETR Enhancement Framework for Object Detection | Xingshu Chen et.al. | 2507.14855 | translate | read | null |
| 2025-07-20 | Seeing Through Deepfakes: A Human-Inspired Framework for Multi-Face Detection | Juan Hu et.al. | 2507.14807 | translate | read | null |
| 2025-07-19 | GCC-Spam: Spam Detection via GAN, Contrastive Learning, and Character Similarity Networks | Zixin Xu et.al. | 2507.14679 | translate | read | null |
| 2025-07-19 | Multispectral State-Space Feature Fusion: Bridging Shared and Cross-Parametric Interactions for Object Detection | Jifeng Shen et.al. | 2507.14643 | translate | read | null |
| 2025-07-18 | C-DOG: Training-Free Multi-View Multi-Object Association in Dense Scenes Without Visual Feature via Connected δ-Overlap Graphs | Yung-Hong Sun et.al. | 2507.14095 | translate | read | null |
| 2025-07-18 | Enhancing LiDAR Point Features with Foundation Model Priors for 3D Object Detection | Yujian Mo et.al. | 2507.13899 | translate | read | null |
| 2025-07-18 | Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation | Masahiro Ogawa et.al. | 2507.13628 | translate | read | null |
| 2025-07-17 | NSF-DOE Vera C. Rubin Observatory Observations of Interstellar Comet 3I/ATLAS (C/2025 N1) | Colin Orion Chandler et.al. | 2507.13409 | translate | read | null |
| 2025-07-17 | A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains | Antonio Finocchiaro et.al. | 2507.13326 | translate | read | null |
| 2025-07-17 | RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images | Xiaozheng Jiang et.al. | 2507.13120 | translate | read | null |
| 2025-07-17 | Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection | Riku Inoue et.al. | 2507.13085 | translate | read | null |
| 2025-07-17 | Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis | Saswat Priyadarshi Nayak et.al. | 2507.13073 | translate | read | null |
| 2025-07-17 | SOD-YOLO: Enhancing YOLO-Based Detection of Small Objects in UAV Imagery | Peijun Wang et.al. | 2507.12727 | translate | read | null |
| 2025-07-16 | Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios | Van-Hoang-Anh Phan et.al. | 2507.12449 | translate | read | null |
| 2025-07-16 | InterpIoU: Rethinking Bounding Box Regression with Interpolation-Based IoU Optimization | Haoyuan Liu et.al. | 2507.12420 | translate | read | null |
| 2025-07-16 | AutoVDC: Automated Vision Data Cleaning Using Vision-Language Models | Santosh Vasa et.al. | 2507.12414 | translate | read | null |
| 2025-07-16 | OD-VIRAT: A Large-Scale Benchmark for Object Detection in Realistic Surveillance Environments | Hayat Ullah et.al. | 2507.12396 | translate | read | null |
| 2025-07-16 | Improving Lightweight Weed Detection via Knowledge Distillation | Ahmet Oğuz Saltık et.al. | 2507.12344 | translate | read | null |
| 2025-07-16 | SS-DC: Spatial-Spectral Decoupling and Coupling Across Visible-Infrared Gap for Domain Adaptive Object Detection | Xiwei Zhang et.al. | 2507.12017 | translate | read | null |
| 2025-07-16 | Frequency-Dynamic Attention Modulation for Dense Prediction | Linwei Chen et.al. | 2507.12006 | translate | read | null |
| 2025-07-15 | Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping | Yujie Zhang et.al. | 2507.11279 | translate | read | null |
| 2025-07-15 | Using Continual Learning for Real-Time Detection of Vulnerable Road Users in Complex Traffic Scenarios | Faryal Aurooj Nasir et.al. | 2507.11046 | translate | read | null |
| 2025-07-15 | Combining Transformers and CNNs for Efficient Object Detection in High-Resolution Satellite Imagery | Nicolas Drapier et.al. | 2507.11040 | translate | read | null |
| 2025-07-14 | A Lightweight and Robust Framework for Real-Time Colorectal Polyp Detection Using LOF-Based Preprocessing and YOLO-v11n | Saadat Behzadi et.al. | 2507.10864 | translate | read | null |
| 2025-07-14 | LLM-Guided Agentic Object Detection for Open-World Understanding | Furkan Mumcu et.al. | 2507.10844 | translate | read | null |
| 2025-07-14 | Versatile and Generalizable Manipulation via Goal-Conditioned Reinforcement Learning with Grounded Object Detection | Huiyi Wang et.al. | 2507.10814 | translate | read | null |
| 2025-07-14 | Fine-Grained Zero-Shot Object Detection | Hongxu Ma et.al. | 2507.10358 | translate | read | null |
| 2025-07-14 | BlueGlass: A Framework for Composite AI Safety | Harshal Nandigramwar et.al. | 2507.10106 | translate | read | null |
| 2025-07-14 | SRG/ART-XC All-Sky X-ray Survey: Sensitivity Assessment Based on Aperture Photometry | N. Y. Tyrin et.al. | 2507.10060 | translate | read | null |
| 2025-07-14 | 3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving | Yixun Zhang et.al. | 2507.09993 | translate | read | null |
| 2025-07-14 | Measuring the Impact of Rotation Equivariance on Aerial Object Detection | Xiuyu Wu et.al. | 2507.09896 | translate | read | null |
| 2025-07-14 | Secure and Efficient UAV-Based Face Detection via Homomorphic Encryption and Edge Computing | Nguyen Van Duc et.al. | 2507.09860 | translate | read | null |
| 2025-07-13 | MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression | Ofir Gordon et.al. | 2507.09616 | translate | read | null |
| 2025-07-12 | Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline | Shiyi Mu et.al. | 2507.09214 | translate | read | null |
| 2025-07-12 | On the Fragility of Multimodal Perception to Temporal Misalignment in Autonomous Driving | Md Hasan Shahriar et.al. | 2507.09095 | translate | read | null |
| 2025-07-11 | VISTA: A Visual Analytics Framework to Enhance Foundation Model-Generated Data Labels | Xiwei Xuan et.al. | 2507.09008 | translate | read | null |
| 2025-07-11 | RoundaboutHD: High-Resolution Real-World Urban Environment Benchmark for Multi-Camera Vehicle Tracking | Yuqiang Lin et.al. | 2507.08729 | translate | read | null |
| 2025-07-11 | DatasetAgent: A Novel Multi-Agent System for Auto-Constructing Datasets from Real-World Images | Haoran Sun et.al. | 2507.08648 | translate | read | null |
| 2025-07-11 | OnlineBEV: Recurrent Temporal Fusion in Bird’s Eye View Representations for Multi-Camera 3D Perception | Junho Koh et.al. | 2507.08644 | translate | read | null |
| 2025-07-11 | Smelly, dense, and spreaded: The Object Detection for Olfactory References (ODOR) dataset | Mathias Zinnen et.al. | 2507.08384 | translate | read | null |
| 2025-07-11 | Spectroscopic Observations of Four Candidates for Blue Large-Amplitude Pulsators. No BLAPs at High Galactic Latitudes | P. Pietrukowicz et.al. | 2507.08372 | translate | read | null |
| 2025-07-11 | Understanding Driving Risks using Large Language Models: Toward Elderly Driver Assessment | Yuki Yoshihara et.al. | 2507.08367 | translate | read | null |
| 2025-07-10 | An Embedded Real-time Object Alert System for Visually Impaired: A Monocular Depth Estimation based Approach through Computer Vision | Jareen Anjom et.al. | 2507.08165 | translate | read | null |
| 2025-07-10 | Rainbow Artifacts from Electromagnetic Signal Injection Attacks on Image Sensors | Youqian Zhang et.al. | 2507.07773 | translate | read | null |
| 2025-07-09 | Automated Video Segmentation Machine Learning Pipeline | Johannes Merz et.al. | 2507.07242 | translate | read | null |
| 2025-07-09 | Aerial Maritime Vessel Detection and Identification | Antonella Barisic Kulas et.al. | 2507.07153 | translate | read | null |
| 2025-07-09 | DenoiseCP-Net: Efficient Collective Perception in Adverse Weather via Joint LiDAR-Based 3D Object Detection and Denoising | Sven Teufel et.al. | 2507.06976 | translate | read | null |
| 2025-07-09 | A multi-modal dataset for insect biodiversity with imagery and DNA at the trap and individual level | Johanna Orsholm et.al. | 2507.06972 | translate | read | null |
| 2025-07-09 | Dataset and Benchmark for Enhancing Critical Retained Foreign Object Detection | Yuli Wang et.al. | 2507.06937 | translate | read | null |
| 2025-07-09 | Unlocking Thermal Aerial Imaging: Synthetic Enhancement of UAV Datasets | Antonella Barisic Kulas et.al. | 2507.06797 | translate | read | null |
| 2025-07-09 | LOVON: Legged Open-Vocabulary Object Navigator | Daojie Peng et.al. | 2507.06747 | translate | read | null |
| 2025-07-09 | EA: An Event Autoencoder for High-Speed Vision Sensing | Riadul Islam et.al. | 2507.06459 | translate | read | null |
| 2025-07-08 | Hierarchical Multi-Stage Transformer Architecture for Context-Aware Temporal Action Localization | Hayat Ullah et.al. | 2507.06411 | translate | read | null |
| 2025-07-08 | ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge | Daghash K. Alqahtani et.al. | 2507.06011 | translate | read | null |
| 2025-07-08 | R-VLM: Region-Aware Vision Language Model for Precise GUI Grounding | Joonhyung Park et.al. | 2507.05673 | translate | read | null |
| 2025-07-07 | YOLO-APD: Enhancing YOLOv8 for Robust Pedestrian Detection on Complex Road Geometries | Aquino Joctum et.al. | 2507.05376 | translate | read | null |
| 2025-07-07 | From a Different Star: 3I/ATLAS in the context of the Ōtautahi-Oxford interstellar object population model | Matthew J. Hopkins et.al. | 2507.05318 | translate | read | null |
| 2025-07-07 | Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations | Xiang Xu et.al. | 2507.05260 | translate | read | null |
| 2025-07-07 | AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models | Chinnappa Guggilla et.al. | 2507.05157 | translate | read | null |
| 2025-07-07 | LERa: Replanning with Visual Feedback in Instruction Following | Svyatoslav Pchelintsev et.al. | 2507.05135 | translate | read | null |
| 2025-07-07 | Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking | Maria Damanaki et.al. | 2507.04762 | translate | read | null |
| 2025-07-07 | CVFusion: Cross-View Fusion of 4D Radar and Camera for 3D Object Detection | Hanzhi Zhong et.al. | 2507.04587 | translate | read | null |
| 2025-07-06 | MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection | Hanshi Wang et.al. | 2507.04369 | translate | read | null |
| 2025-07-06 | DMAT: An End-to-End Framework for Joint Atmospheric Turbulence Mitigation and Object Detection | Paul Hill et.al. | 2507.04323 | translate | read | null |
| 2025-07-06 | ZERO: Multi-modal Prompt-based Visual Grounding | Sangbum Choi et.al. | 2507.04270 | translate | read | null |
| 2025-07-05 | Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge | Linshen Liu et.al. | 2507.04123 | translate | read | null |
| 2025-07-04 | Zero Memory Overhead Approach for Protecting Vision Transformer Parameters | Fereshteh Baradaran et.al. | 2507.03816 | translate | read | null |
| 2025-07-03 | Partial Weakly-Supervised Oriented Object Detection | Mingxin Liu et.al. | 2507.02751 | translate | read | null |
| 2025-07-03 | Automatic Labelling for Low-Light Pedestrian Detection | Dimitrios Bouzoulas et.al. | 2507.02513 | translate | read | null |
| 2025-07-03 | Weakly-supervised Contrastive Learning with Quantity Prompts for Moving Infrared Small Target Detection | Weiwei Duan et.al. | 2507.02454 | translate | read | null |
| 2025-07-03 | A Late Collaborative Perception Framework for 3D Multi-Object and Multi-Source Association and Fusion | Maryem Fadili et.al. | 2507.02430 | translate | read | null |
| 2025-07-03 | PLOT: Pseudo-Labeling via Video Object Tracking for Scalable Monocular 3D Object Detection | Seokyeong Lee et.al. | 2507.02393 | translate | read | null |
| 2025-07-03 | Two-Steps Neural Networks for an Automated Cerebrovascular Landmark Detection | Rafic Nader et.al. | 2507.02349 | translate | read | null |
| 2025-07-03 | Perception Activator: An intuitive and portable framework for brain cognitive exploration | Le Xu et.al. | 2507.02311 | translate | read | null |
| 2025-07-03 | Understanding Trade offs When Conditioning Synthetic Data | Brandon Trabucco et.al. | 2507.02217 | translate | read | null |
| 2025-07-02 | How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks | Rahul Ramachandran et.al. | 2507.01955 | translate | read | link |
| 2025-07-02 | Survivability of Backdoor Attacks on Unconstrained Face Recognition Systems | Quentin Le Roux et.al. | 2507.01607 | translate | read | null |
| 2025-07-02 | Learning from Random Subspace Exploration: Generalized Test-Time Augmentation with Self-supervised Distillation | Andrei Jelea et.al. | 2507.01347 | translate | read | null |
| 2025-07-01 | Rapid Salient Object Detection with Difference Convolutional Neural Networks | Zhuo Su et.al. | 2507.01182 | translate | read | null |
| 2025-07-01 | Robust Component Detection for Flexible Manufacturing: A Deep Learning Approach to Tray-Free Object Recognition under Variable Lighting | Fatemeh Sadat Daneshmand et.al. | 2507.00852 | translate | read | null |
| 2025-07-01 | UAVD-Mamba: Deformable Token Fusion Vision Mamba for Multimodal UAV Detection | Wei Li et.al. | 2507.00849 | translate | read | null |
| 2025-07-01 | High-Frequency Semantics and Geometric Priors for End-to-End Detection Transformers in Challenging UAV Imagery | Hongxing Peng et.al. | 2507.00825 | translate | read | null |
| 2025-07-01 | Multi-Modal Graph Convolutional Network with Sinusoidal Encoding for Robust Human Action Segmentation | Hao Xing et.al. | 2507.00752 | translate | read | null |
| 2025-07-01 | UPRE: Zero-Shot Domain Adaptation for Object Detection via Unified Prompt and Representation Enhancement | Xiao Zhang et.al. | 2507.00721 | translate | read | null |
| 2025-07-01 | Rectifying Magnitude Neglect in Linear Attention | Qihang Fan et.al. | 2507.00698 | translate | read | link |

(<a href="../Object_Detection.md">back to Object Detection</a>)