Pose Estimation
Pose Estimation
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2025-12-18 | PoseMoE: Mixture-of-Experts Network for Monocular 3D Human Pose Estimation | Mengyuan Liu et.al. | 2512.16494 | null |
| 2025-12-18 | Avatar4D: Synthesizing Domain-Specific 4D Humans for Real-World Pose Estimation | Jerrin Bright et.al. | 2512.16199 | null |
| 2025-12-18 | LAPX: Lightweight Hourglass Network with Global Context | Haopeng Zhao et.al. | 2512.16089 | null |
| 2025-12-17 | Robust Multi-view Camera Calibration from Dense Matches | Johannes Hägerlind et.al. | 2512.15608 | null |
| 2025-12-17 | BLANKET: Anonymizing Faces in Infant Video Recordings | Ditmar Hadera et.al. | 2512.15542 | null |
| 2025-12-17 | Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting | Arthur Moreau et.al. | 2512.15508 | null |
| 2025-12-17 | RUMPL: Ray-Based Transformers for Universal Multi-View 2D to 3D Human Pose Lifting | Seyed Abolfazl Ghasemzadeh et.al. | 2512.15488 | null |
| 2025-12-17 | See It Before You Grab It: Deep Learning-based Action Anticipation in Basketball | Arnau Barrera Roy et.al. | 2512.15386 | null |
| 2025-12-17 | NAP3D: NeRF Assisted 3D-3D Pose Alignment for Autonomous Vehicles | Gaurav Bansal et.al. | 2512.15080 | null |
| 2025-12-16 | Isolated Sign Language Recognition with Segmentation and Pose Estimation | Daniel Perkins et.al. | 2512.14876 | null |
| 2025-12-16 | FastDDHPose: Towards Unified, Efficient, and Disentangled 3D Human Pose Estimation | Qingyuan Cai et.al. | 2512.14162 | null |
| 2025-12-15 | LASER: Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction | Tianye Ding et.al. | 2512.13680 | null |
| 2025-12-13 | Audio-Visual Camera Pose Estimation with Passive Scene Sounds and In-the-Wild Video | Daniel Adebi et.al. | 2512.12165 | null |
| 2025-12-10 | mmWEAVER: Environment-Specific mmWave Signal Synthesis from a Photo and Activity Description | Mahathir Monjur et.al. | 2512.11894 | null |
| 2025-12-12 | A Multi-Mode Structured Light 3D Imaging System with Multi-Source Information Fusion for Underwater Pipeline Detection | Qinghan Hu et.al. | 2512.11354 | null |
| 2025-12-11 | SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model | Yukai Shi et.al. | 2512.10957 | null |
| 2025-12-11 | E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training | Qitao Zhao et.al. | 2512.10950 | null |
| 2025-12-11 | PoseGAM: Robust Unseen Object Pose Estimation via Geometry-Aware Multi-View Reasoning | Jianqi Chen et.al. | 2512.10840 | null |
| 2025-12-11 | Geo6DPose: Fast Zero-Shot 6D Object Pose Estimation via Geometry-Filtered Feature Matching | Javier Villena Toro et.al. | 2512.10674 | null |
| 2025-12-11 | Mr. Virgil: Learning Multi-robot Visual-range Relative Localization | Si Wang et.al. | 2512.10540 | null |
| 2025-12-11 | An M-Health Algorithmic Approach to Identify and Assess Physiotherapy Exercises in Real Time | Stylianos Kandylakis et.al. | 2512.10437 | null |
| 2025-12-11 | Point2Pose: A Generative Framework for 3D Human Pose Estimation with Multi-View Point Cloud Dataset | Hyunsoo Lee et.al. | 2512.10321 | null |
| 2025-12-11 | THE-Pose: Topological Prior with Hybrid Graph Fusion for Estimating Category-Level 6D Object Pose | Eunho Lee et.al. | 2512.10251 | null |
| 2025-12-10 | FastPose-ViT: A Vision Transformer for Real-Time Spacecraft Pose Estimation | Pierre Ancey et.al. | 2512.09792 | null |
| 2025-12-10 | Development and Testing for Perception Based Autonomous Landing of a Long-Range QuadPlane | Ashik E Rasul et.al. | 2512.09343 | null |
| 2025-12-09 | ConceptPose: Training-Free Zero-Shot Object Pose Estimation using Concept Vectors | Liming Kuang et.al. | 2512.09056 | null |
| 2025-12-09 | Selfi: Self Improving Reconstruction Engine via 3D Geometric Feature Alignment | Youming Deng et.al. | 2512.08930 | null |
| 2025-12-09 | SDT-6D: Fully Sparse Depth-Transformer for Staged End-to-End 6D Pose Estimation in Industrial Multi-View Bin Picking | Nico Leuze et.al. | 2512.08430 | null |
| 2025-12-09 | Zero-Splat TeleAssist: A Zero-Shot Pose Estimation Framework for Semantic Teleoperation | Srijan Dokania et.al. | 2512.08271 | null |
| 2025-12-08 | UltrasODM: A Dual Stream Optical Flow Mamba Network for 3D Freehand Ultrasound Reconstruction | Mayank Anand et.al. | 2512.07756 | null |
| 2025-12-08 | UnCageNet: Tracking and Pose Estimation of Caged Animal | Sayak Dutta et.al. | 2512.07712 | null |
| 2025-12-08 | VFM-VLM: Vision Foundation Model and Vision Language Model based Visual Comparison for 3D Pose Estimation | Md Selim Sarowar et.al. | 2512.07215 | null |
| 2025-12-08 | Object Pose Distribution Estimation for Determining Revolution and Reflection Uncertainty in Point Clouds | Frederik Hagelskjær et.al. | 2512.07211 | null |
| 2025-12-07 | Dynamic Visual SLAM using a General 3D Prior | Xingguang Zhong et.al. | 2512.06868 | null |
| 2025-12-07 | Physics Informed Human Posture Estimation Based on 3D Landmarks from Monocular RGB-Videos | Tobias Leuthold et.al. | 2512.06783 | null |
| 2025-12-06 | GNC-Pose: Geometry-Aware GNC-PnP for Accurate 6D Pose Estimation | Xiujin Liu et.al. | 2512.06565 | null |
| 2025-12-06 | Exploiting Spatiotemporal Properties for Efficient Event-Driven Human Pose Estimation | Haoxian Zhou et.al. | 2512.06306 | null |
| 2025-12-05 | GuideNav: User-Informed Development of a Vision-Only Robotic Navigation Assistant For Blind Travelers | Hochul Hwang et.al. | 2512.06147 | null |
| 2025-12-03 | Training-Free Robot Pose Estimation using Off-the-Shelf Foundational Models | Laurence Liang et.al. | 2512.06017 | null |
| 2025-12-05 | Deep Learning-Based Real-Time Sequential Facial Expression Analysis Using Geometric Features | Talha Enes Koksal et.al. | 2512.05669 | null |
| 2025-12-04 | Age-Inclusive 3D Human Mesh Recovery for Action-Preserving Data Anonymization | Georgios Chatzichristodoulou et.al. | 2512.05259 | null |
| 2025-12-04 | Equivariant symmetry-aware head pose estimation for fetal MRI | Ramya Muthukrishnan et.al. | 2512.04890 | null |
| 2025-12-04 | Contact-Aware Refinement of Human Pose Pseudo-Ground Truth via Bioimpedance Sensing | Maria-Paola Forte et.al. | 2512.04862 | null |
| 2025-12-03 | SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL | Siyi Chen et.al. | 2512.04069 | null |
| 2025-12-03 | MSG-Loc: Multi-Label Likelihood-based Semantic Graph Matching for Object-Level Global Localization | Gihyeon Lee et.al. | 2512.03522 | null |
| 2025-12-03 | AfroBeats Dance Movement Analysis Using Computer Vision: A Proof-of-Concept Framework Combining YOLO and Segment Anything Model | Kwaku Opoku-Ware et.al. | 2512.03509 | null |
| 2025-12-02 | DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling | Kairun Wen et.al. | 2512.03000 | null |
| 2025-12-02 | DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions | Yifan Zhou et.al. | 2512.02727 | null |
| 2025-12-01 | Is Image-based Object Pose Estimation Ready to Support Grasping? | Eric C. Joyce et.al. | 2512.01856 | null |
| 2025-11-29 | CC-FMO: Camera-Conditioned Zero-Shot Single Image to 3D Scene Generation with Foundation Model Orchestration | Boshi Tang et.al. | 2512.00493 | null |
| 2025-11-03 | Learning from Watching: Scalable Extraction of Manipulation Trajectories from Human Videos | X. Hu et.al. | 2512.00024 | null |
| 2025-11-28 | Zero-Shot Multi-Criteria Visual Quality Inspection for Semi-Controlled Industrial Environments via Real-Time 3D Digital Twin Simulation | Jose Moises Araya-Martinez et.al. | 2511.23214 | null |
| 2025-11-28 | DiskChunGS: Large-Scale 3D Gaussian SLAM Through Chunk-Based Memory Management | Casimir Feldmann et.al. | 2511.23030 | null |
| 2025-11-28 | Threat-Aware UAV Dodging of Human-Thrown Projectiles with an RGB-D Camera | Yuying Zhang et.al. | 2511.22847 | null |
| 2025-11-27 | Emergent Extreme-View Geometry in 3D Foundation Models | Yiwen Zhang et.al. | 2511.22686 | null |
| 2025-11-27 | UAV-MM3D: A Large-Scale Synthetic Benchmark for 3D Perception of Unmanned Aerial Vehicles with Multi-Modal Data | Longkun Zou et.al. | 2511.22404 | null |
| 2025-11-27 | ColonAdapter: Geometry Estimation Through Foundation Model Adaptation for Colonoscopy | Zhiyi Jiang et.al. | 2511.22250 | null |
| 2025-11-26 | Seeing without Pixels: Perception from Camera Trajectories | Zihui Xue et.al. | 2511.21681 | null |
| 2025-11-26 | Uncertainty Quantification for Visual Object Pose Estimation | Lorenzo Shaikewitz et.al. | 2511.21666 | null |
| 2025-11-26 | Enhanced Landmark Detection Model in Pelvic Fluoroscopy using 2D/3D Registration Loss | Chou Mo et.al. | 2511.21575 | null |
| 2025-11-25 | Metric, inertially aligned monocular state estimation via kinetodynamic priors | Jiaxin Liu et.al. | 2511.20496 | null |
| 2025-11-25 | Dance Style Classification using Laban-Inspired and Frequency-Domain Motion Features | Ben Hamscher et.al. | 2511.20469 | null |
| 2025-11-25 | VGGT4D: Mining Motion Cues in Visual Geometry Transformers for 4D Scene Reconstruction | Yu Hu et.al. | 2511.19971 | null |
| 2025-11-24 | The Determinant Ratio Matrix Approach to Solving 3D Matching and 2D Orthographic Projection Alignment Tasks | Andrew J. Hanson et.al. | 2511.19511 | null |
| 2025-11-18 | PuzzlePoles: Cylindrical Fiducial Markers Based on the PuzzleBoard Pattern | Juri Zach et.al. | 2511.19448 | null |
| 2025-11-24 | Graph-based 3D Human Pose Estimation using WiFi Signals | Jichao Chen et.al. | 2511.19105 | null |
| 2025-11-24 | Analysis of Deep-Learning Methods in an ISO/TS 15066-Compliant Human-Robot Safety Framework | David Bricher et.al. | 2511.19094 | null |
| 2025-11-24 | LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space | Hai Wu et.al. | 2511.19057 | null |
| 2025-11-24 | Robust Long-term Test-Time Adaptation for 3D Human Pose Estimation through Motion Discretization | Yilin Wen et.al. | 2511.18851 | null |
| 2025-11-24 | CNN-Based Camera Pose Estimation and Localisation of Scan Images for Aircraft Visual Inspection | Xueyan Oh et.al. | 2511.18702 | null |
| 2025-11-23 | Expanding the Workspace of Electromagnetic Navigation Systems Using Dynamic Feedback for Single- and Multi-agent Control | Jasan Zughaibi et.al. | 2511.18486 | null |
| 2025-11-22 | Muskie: Multi-view Masked Image Modeling for 3D Vision Pre-training | Wenyu Li et.al. | 2511.18115 | null |
| 2025-11-21 | NoPe-NeRF++: Local-to-Global Optimization of NeRF with No Pose Prior | Dongbo Shi et.al. | 2511.17322 | null |
| 2025-11-21 | MuM: Multi-View Masked Image Modeling for 3D Vision | David Nordström et.al. | 2511.17309 | null |
| 2025-11-21 | BiFingerPose: Bimodal Finger Pose Estimation for Touch Devices | Xiongjun Guan et.al. | 2511.17306 | null |
| 2025-11-21 | RacketVision: A Multiple Racket Sports Benchmark for Unified Ball and Racket Analysis | Linfeng Dong et.al. | 2511.17045 | null |
| 2025-11-21 | MobileOcc: A Human-Aware Semantic Occupancy Dataset for Mobile Robots | Junseo Kim et.al. | 2511.16949 | null |
| 2025-11-20 | BOP-ASK: Object-Interaction Reasoning for Vision-Language Models | Vineet Bhat et.al. | 2511.16857 | null |
| 2025-11-20 | NoPo-Avatar: Generalizable and Animatable Avatars from Sparse Inputs without Human Poses | Jing Wen et.al. | 2511.16673 | null |
| 2025-11-20 | EOGS++: Earth Observation Gaussian Splatting with Internal Camera Refinement and Direct Panchromatic Rendering | Pierrick Bournez et.al. | 2511.16542 | null |
| 2025-11-20 | Physics-Informed Machine Learning for Efficient Sim-to-Real Data Augmentation in Micro-Object Pose Estimation | Zongcai Tan et.al. | 2511.16494 | null |
| 2025-11-20 | End-to-End Motion Capture from Rigid Body Markers with Geodesic Loss | Hai Lan et.al. | 2511.16418 | null |
| 2025-11-19 | Box6D : Zero-shot Category-level 6D Pose Estimation of Warehouse Boxes | Yintao Ma et.al. | 2511.15884 | null |
| 2025-11-19 | WALDO: Where Unseen Model-based 6D Pose Estimation Meets Occlusion | Sajjad Pakdamansavoji et.al. | 2511.15874 | null |
| 2025-11-19 | Scriboora: Rethinking Human Pose Forecasting | Daniel Bermuth et.al. | 2511.15565 | null |
| 2025-11-18 | RocSync: Millisecond-Accurate Temporal Synchronization for Heterogeneous Camera Systems | Jaro Meyer et.al. | 2511.14948 | null |
| 2025-11-18 | A Quantitative Method for Shoulder Presentation Evaluation in Biometric Identity Documents | Alfonso Pedro Ridao et.al. | 2511.14376 | null |
| 2025-11-18 | Simultaneous Localization and 3D-Semi Dense Mapping for Micro Drones Using Monocular Camera and Inertial Sensors | Jeryes Danial et.al. | 2511.14335 | null |
| 2025-11-18 | LSP-YOLO: A Lightweight Single-Stage Network for Sitting Posture Recognition on Embedded Devices | Nanjun Li et.al. | 2511.14322 | null |
| 2025-11-18 | iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting Inversion | Hao Wang et.al. | 2511.14149 | null |
| 2025-11-17 | GRLoc: Geometric Representation Regression for Visual Localization | Changyang Li et.al. | 2511.13864 | null |
| 2025-11-17 | RSPose: Ranking Based Losses for Human Pose Estimation | Muhammed Can Keles et.al. | 2511.13857 | null |
| 2025-11-17 | GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models | Yushuo Zheng et.al. | 2511.13259 | null |
| 2025-11-17 | GaRLILEO: Gravity-aligned Radar-Leg-Inertial Enhanced Odometry | Chiyun Noh et.al. | 2511.13216 | null |
| 2025-11-17 | End-to-End Multi-Person Pose Estimation with Pose-Aware Video Transformer | Yonghui Yu et.al. | 2511.13208 | null |
| 2025-11-17 | CapeNext: Rethinking and Refining Dynamic Support Information for Category-Agnostic Pose Estimation | Yu Zhu et.al. | 2511.13102 | null |
| 2025-11-17 | PFAvatar: Pose-Fusion 3D Personalized Avatar Reconstruction from Real-World Outfit-of-the-Day Photos | Dianbing Xi et.al. | 2511.12935 | null |
| 2025-11-17 | CoordAR: One-Reference 6D Pose Estimation of Novel Objects via Autoregressive Coordinate Map Generation | Dexin Zuo et.al. | 2511.12919 | null |
| 2025-11-16 | OPFormer: Object Pose Estimation leveraging foundation model with geometric encoding | Artem Moroz et.al. | 2511.12614 | null |
| 2025-11-16 | Visible Structure Retrieval for Lightweight Image-Based Relocalisation | Fereidoon Zangeneh et.al. | 2511.12503 | null |
| 2025-11-15 | Changes in Real Time: Online Scene Change Detection with Multi-View Fusion | Chamuditha Jayanga Galappaththige et.al. | 2511.12370 | null |
| 2025-11-15 | AURA: Development and Validation of an Augmented Unplanned Removal Alert System using Synthetic ICU Videos | Junhyuk Seo et.al. | 2511.12241 | null |
| 2025-11-15 | VPHO: Joint Visual-Physical Cue Learning and Aggregation for Hand-Object Pose Estimation | Jun Zhou et.al. | 2511.12030 | null |
| 2025-11-12 | Understanding the Representation of Older Adults in Motion Capture Locomotion Datasets | Yunkai Yu et.al. | 2511.11713 | null |
| 2025-11-14 | YCB-Ev SD: Synthetic event-vision dataset for 6DoF object pose estimation | Pavel Rojtberg et.al. | 2511.11344 | null |
| 2025-11-14 | 6D Strawberry Pose Estimation: Real-time and Edge AI Solutions Using Purely Synthetic Training Data | Saptarshi Neil Sinha et.al. | 2511.11307 | null |
| 2025-11-13 | Depth Anything 3: Recovering the Visual Space from Any Views | Haotong Lin et.al. | 2511.10647 | null |
| 2025-11-13 | OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer | Haosong Peng et.al. | 2511.10560 | null |
| 2025-11-12 | STORM: Segment, Track, and Object Re-Localization from a Single Image | Yu Deng et.al. | 2511.09771 | null |
| 2025-11-12 | DreamPose3D: Hallucinative Diffusion with Prompt Learning for 3D Human Pose Estimation | Jerrin Bright et.al. | 2511.09502 | null |
| 2025-11-12 | SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields | Sangheon Yang et.al. | 2511.09072 | null |
| 2025-11-12 | RadHARSimulator V2: Video to Doppler Generator | Weicheng Gao et.al. | 2511.09022 | null |
| 2025-11-12 | SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation | Hu Cui et.al. | 2511.08872 | null |
| 2025-11-11 | Adaptive graph Kolmogorov-Arnold network for 3D human pose estimation | Abu Taib Mohammed Shahjahan et.al. | 2511.08809 | null |
| 2025-11-11 | RAPTR: Radar-based 3D Pose Estimation using Transformer | Sorachi Kato et.al. | 2511.08387 | null |
| 2025-11-11 | SkelSplat: Robust Multi-view 3D Human Pose Estimation with Differentiable Gaussian Rendering | Laura Bragagnolo et.al. | 2511.08294 | null |
| 2025-11-11 | An Image-Based Path Planning Algorithm Using a UAV Equipped with Stereo Vision | Selim Ahmet Iz et.al. | 2511.07928 | null |
| 2025-11-10 | LeCoT: revisiting network architecture for two-view correspondence pruning | Luanyuan Dai et.al. | 2511.07078 | null |
| 2025-11-10 | Robust and High-Fidelity 3D Gaussian Splatting: Fusing Pose Priors and Geometry Constraints for Texture-Deficient Outdoor Scenes | Meijun Guo et.al. | 2511.06765 | null |
| 2025-11-10 | Semi-distributed Cross-modal Air-Ground Relative Localization | Weining Lu et.al. | 2511.06749 | null |
| 2025-11-09 | VDNeRF: Vision-only Dynamic Neural Radiance Field for Urban Scenes | Zhengyu Zou et.al. | 2511.06408 | null |
| 2025-11-07 | Pedicle Screw Pairing and Registration for Screw Pose Estimation from Dual C-arm Images Using CAD Models | Yehyun Suh et.al. | 2511.05702 | null |
| 2025-11-07 | Multi-modal Loop Closure Detection with Foundation Models in Severely Unstructured Environments | Laura Alejandra Encinar Gonzalez et.al. | 2511.05404 | null |
| 2025-11-07 | No Pose Estimation? No Problem: Pose-Agnostic and Instance-Aware Test-Time Adaptation for Monocular Depth Estimation | Mingyu Sung et.al. | 2511.05055 | null |
| 2025-11-06 | Synchronous Observer Design for Landmark-Inertial SLAM with Almost-Global Convergence | Arkadeep Saha et.al. | 2511.04531 | null |
| 2025-11-06 | A Two-stage Adaptive Lifting PINN Framework for Solving Viscous Approximations to Hyperbolic Conservation Laws | Yameng Zhu et.al. | 2511.04490 | null |
| 2025-11-06 | Deep Dictionary-Free Method for Identifying Linear Model of Nonlinear System with Input Delay | Patrik Valábek et.al. | 2511.04451 | null |
| 2025-11-06 | MedSapiens: Taking a Pose to Rethink Medical Imaging Landmark Detection | Marawan Elbatel et.al. | 2511.04255 | null |
| 2025-11-06 | DMSORT: An efficient parallel maritime multi-object tracking architecture for unmanned vessel platforms | Shengyu Tang et.al. | 2511.04128 | null |
| 2025-11-06 | Simple 3D Pose Features Support Human and Machine Social Scene Understanding | Wenshuo Qin et.al. | 2511.03988 | null |
| 2025-11-05 | CORE - A Cell-Level Coarse-to-Fine Image Registration Engine for Multi-stain Image Alignment | Esha Sadia Nasir et.al. | 2511.03826 | null |
| 2025-11-05 | FusionDP: Foundation Model-Assisted Differentially Private Learning for Partially Sensitive Features | Linghui Zeng et.al. | 2511.03806 | null |
| 2025-10-30 | Electric Vehicle Charging Load Modeling: A Survey, Trends, Challenges and Opportunities | Xiachong Lin et.al. | 2511.03741 | null |
| 2025-10-21 | AI-Enhanced Wi-Fi Sensing Through Single Transceiver Pair | Yuxuan Liu et.al. | 2511.02845 | null |
| 2025-11-04 | Densemarks: Learning Canonical Embeddings for Human Heads Images via Point Tracks | Dmitrii Pozdeev et.al. | 2511.02830 | null |
| 2025-11-04 | Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization | Tao Liu et.al. | 2511.02489 | link |
| 2025-11-04 | A New Perspective on Precision and Recall for Generative Models | Benjamin Sykes et.al. | 2511.02414 | null |
| 2025-11-04 | Cycle-Sync: Robust Global Camera Pose Estimation through Enhanced Cycle-Consistent Synchronization | Shaohan Li et.al. | 2511.02329 | null |
| 2025-11-04 | Are Euler angles a useful rotation parameterisation for pose estimation with Normalizing Flows? | Giorgos Sfikas et.al. | 2511.02277 | null |
| 2025-11-04 | A Joint Variational Framework for Multimodal X-ray Ptychography and Fluorescence Reconstruction | Eric Zou et.al. | 2511.02153 | null |
| 2025-11-04 | A new approach for the analysis of evolution partial differential equations on a finite interval | Türker Özsarı et.al. | 2511.02145 | null |
| 2025-11-03 | HGFreNet: Hop-hybrid GraphFomer for 3D Human Pose Estimation with Trajectory Consistency in Frequency Domain | Kai Zhai et.al. | 2511.01756 | null |
| 2025-11-03 | Clutter Suppression in Bistatic ISAC with Joint Angle and Doppler Estimation | M. Ertug Pihtili et.al. | 2511.01599 | null |
| 2025-11-03 | Defining Energy Indicators for Impact Identification on Aerospace Composites: A Physics-Informed Machine Learning Perspective | Natália Ribeiro Marinho et.al. | 2511.01592 | null |
| 2025-11-03 | SE(3)-PoseFlow: Estimating 6D Pose Distributions for Uncertainty-Aware Robotic Manipulation | Yufeng Jin et.al. | 2511.01501 | null |
| 2025-11-03 | Floor Plan-Guided Visual Navigation Incorporating Depth and Directional Cues | Wei Huang et.al. | 2511.01493 | null |
| 2025-11-03 | Tackling the Kidnapped Robot Problem via Sparse Feasible Hypothesis Sampling and Reliable Batched Multi-Stage Inference | Muhua Zhang et.al. | 2511.01219 | null |
| 2025-11-03 | LiDAR-VGGT: Cross-Modal Coarse-to-Fine Fusion for Globally Consistent and Metric-Scale Dense Mapping | Lijie Wang et.al. | 2511.01186 | null |
| 2025-11-03 | Web-Scale Collection of Video Data for 4D Animal Reconstruction | Brian Nlong Zhao et.al. | 2511.01169 | null |
| 2025-11-01 | Active learning-based variance reduction for Monte Carlo simulations: A feasibility study for the nanodosimetry around a gold nanoparticle | Leo Thomas et.al. | 2511.00563 | null |
| 2025-10-31 | Residual Balancing for Non-Linear Outcome Models in High Dimensions | Isaac Meza et.al. | 2511.00324 | null |
| 2025-10-31 | On the well-posedness of the intermediate nonlinear Schrödinger equation on the line | Andreia Chapouto et.al. | 2511.00302 | null |
| 2025-10-31 | VLM6D: VLM based 6Dof Pose Estimation based on RGB-D Images | Md Selim Sarowar et.al. | 2511.00120 | null |
| 2025-10-31 | FedAdamW: A Communication-Efficient Optimizer with Convergence and Generalization Guarantees for Federated Large Models | Junkang Liu et.al. | 2510.27486 | null |
| 2025-10-31 | Improved refined bilinear estimates and well-posedness for generalized KdV type equations on $\mathbb{R}$ | Luc Molinet et.al. | 2510.27461 | null |
| 2025-10-30 | Cooperative Integrated Estimation-Guidance for Simultaneous Interception of Moving Targets | Lohitvel Gopikannan et.al. | 2510.26948 | null |
| 2025-10-30 | Graph Guided Modulo Recovery of EEG Signals | Soujanya Hazra et.al. | 2510.26756 | null |
| 2025-10-30 | Orbital Optimization and Neural-Network-Assisted Configuration Interaction Calculations of Rydberg States | Gianluca Levi et.al. | 2510.26751 | null |
| 2025-10-30 | Tight Differentially Private PCA via Matrix Coherence | Tommaso d’Orsi et.al. | 2510.26679 | null |
| 2025-10-30 | Statistical Inference for Matching Decisions via Matrix Completion under Dependent Missingness | Congyuan Duan et.al. | 2510.26478 | null |
| 2025-10-30 | Transcending Sparse Measurement Limits: Operator-Learning-Driven Data Super-Resolution for Inverse Source Problem | Guanyu Pan et.al. | 2510.26227 | null |
| 2025-10-30 | Sketch2PoseNet: Efficient and Generalized Sketch to 3D Human Pose Prediction | Li Wang et.al. | 2510.26196 | null |
| 2025-10-30 | JOGS: Joint Optimization of Pose Estimation and 3D Gaussian Splatting | Yuxuan Li et.al. | 2510.26117 | null |
| 2025-10-29 | STITCH 2.0: Extending Augmented Suturing with EKF Needle Estimation and Thread Management | Kush Hari et.al. | 2510.25768 | null |
| 2025-10-29 | Inverse-free quantum state estimation with Heisenberg scaling | Kean Chen et.al. | 2510.25750 | null |
| 2025-10-29 | LieSolver: A PDE-constrained solver for IBVPs using Lie symmetries | René P. Klausen et.al. | 2510.25731 | null |
| 2025-10-29 | Seeing Clearly and Deeply: An RGBD Imaging Approach with a Bio-inspired Monocentric Design | Zongxi Yu et.al. | 2510.25314 | null |
| 2025-10-29 | Non-Invasive Calibration Of A Stewart Platform By Photogrammetry | Sourabh Karmakar et.al. | 2510.25072 | null |
| 2025-10-28 | A Black Box Variational Inference Scheme for Inverse Problems with Demanding Physics-Based Models | G. Robalo Rei et.al. | 2510.25038 | null |
| 2025-10-28 | Understanding Multi-View Transformers | Michal Stary et.al. | 2510.24907 | null |
| 2025-10-28 | Greedy Sampling Is Provably Efficient for RLHF | Di Wu et.al. | 2510.24700 | null |
| 2025-10-28 | GeVI-SLAM: Gravity-Enhanced Stereo Visua Inertial SLAM for Underwater Robots | Yuan Shen et.al. | 2510.24533 | null |
| 2025-10-28 | Contributions to Semialgebraic-Set-Based Stability Verification of Dynamical Systems with Neural-Network-Based Controllers | Alvaro Detailleur et.al. | 2510.24391 | null |
| 2025-10-28 | Global-State-Free Obstacle Avoidance for Quadrotor Control in Air-Ground Cooperation | Baozhe Zhang et.al. | 2510.24315 | null |
| 2025-10-26 | Policies over Poses: Reinforcement Learning based Distributed Pose-Graph Optimization for Multi-Robot SLAM | Sai Krishna Ghanta et.al. | 2510.22740 | null |
| 2025-10-26 | Cross-Species Transfer Learning in Agricultural AI: Evaluating ZebraPose Adaptation for Dairy Cattle Pose Estimation | Mackenzie Tapp et.al. | 2510.22618 | null |
| 2025-10-26 | DynaPose4D: High-Quality 4D Dynamic Content Generation via Pose Alignment Loss | Jing Yang et.al. | 2510.22473 | null |
| 2025-10-25 | Breaking the Static Assumption: A Dynamic-Aware LIO Framework Via Spatio-Temporal Normal Analysis | Chen Zhiqiang et.al. | 2510.22313 | null |
| 2025-10-18 | Multi-Agent Pose Uncertainty: A Differentiable Rendering Cramér-Rao Bound | Arun Muthukkumar et.al. | 2510.21785 | null |
| 2025-10-24 | Group Inertial Poser: Multi-Person Pose and Global Translation from Sparse Inertial Sensors and Ultra-Wideband Ranging | Ying Xue et.al. | 2510.21654 | null |
| 2025-10-23 | BioDet: Boosting Industrial Object Detection with Image Preprocessing Strategies | Jiaqi Hu et.al. | 2510.21000 | null |
| 2025-10-23 | ROPES: Robotic Pose Estimation via Score-Based Causal Representation Learning | Pranamya Kulkarni et.al. | 2510.20884 | null |
| 2025-10-23 | Monocular Visual 8D Pose Estimation for Articulated Bicycles and Cyclists | Eduardo R. Corral-Soto et.al. | 2510.20158 | null |
| 2025-10-22 | AI Pose Analysis and Kinematic Profiling of Range-of-Motion Variations in Resistance Training | Adam Diamant et.al. | 2510.20012 | null |
| 2025-10-22 | PoseCrafter: Extreme Pose Estimation with Hybrid Video Synthesis | Qing Mao et.al. | 2510.19527 | null |
| 2025-10-22 | PRGCN: A Graph Memory Network for Cross-Sequence Pattern Reuse in 3D Human Pose Estimation | Zhuoyang Xie et.al. | 2510.19475 | null |
| 2025-10-21 | Kinematic Analysis and Integration of Vision Algorithms for a Mobile Manipulator Employed Inside a Self-Driving Laboratory | Shifa Sulaiman et.al. | 2510.19081 | null |
| 2025-10-21 | UniHPR: Unified Human Pose Representation via Singular Value Contrastive Learning | Zhongyu Jiang et.al. | 2510.19078 | null |
| 2025-10-21 | PLANA3R: Zero-shot Metric Planar 3D Reconstruction via Feed-Forward Planar Splatting | Changkun Liu et.al. | 2510.18714 | null |
| 2025-10-21 | RayPose: Ray Bundling Diffusion for Template Views in Unseen 6D Object Pose Estimation | Junwen Huang et.al. | 2510.18521 | null |
| 2025-10-20 | Adapting Stereo Vision From Objects To 3D Lunar Surface Reconstruction with the StereoLunar Dataset | Clementine Grethen et.al. | 2510.18172 | null |
| 2025-10-20 | Raindrop GS: A Benchmark for 3D Gaussian Splatting under Raindrop Conditions | Zhiqiang Teng et.al. | 2510.17719 | null |
| 2025-10-20 | PAGE-4D: Disentangled Pose and Geometry Estimation for 4D Perception | Kaichen Zhou et.al. | 2510.17568 | null |
| 2025-10-20 | KineDiff3D: Kinematic-Aware Diffusion for Category-Level Articulated Object Shape Reconstruction and Generation | WenBo Xu et.al. | 2510.17137 | null |
| 2025-10-19 | How Universal Are SAM2 Features? | Masoud Khairi Atani et.al. | 2510.17051 | null |
| 2025-10-19 | GS2POSE: Marry Gaussian Splatting to 6D Object Pose Estimation | Junbo Li et.al. | 2510.16777 | null |
| 2025-10-18 | SPLite Hand: Sparsity-Aware Lightweight 3D Hand Pose Estimation | Yeh Keng Hao et.al. | 2510.16396 | null |
| 2025-10-17 | Proactive Scene Decomposition and Reconstruction | Baicheng Li et.al. | 2510.16272 | null |
| 2025-10-17 | Valeo Near-Field: a novel dataset for pedestrian intent detection | Antonyo Musabini et.al. | 2510.15673 | null |
| 2025-10-17 | Freehand 3D Ultrasound Imaging: Sim-in-the-Loop Probe Pose Optimization via Visual Servoing | Yameng Zhang et.al. | 2510.15668 | null |
| 2025-10-17 | MRASfM: Multi-Camera Reconstruction and Aggregation through Structure-from-Motion in Driving Scenes | Lingfeng Xuan et.al. | 2510.15467 | null |
| 2025-10-17 | PFGS: Pose-Fused 3D Gaussian Splatting for Complete Multi-Pose Object Reconstruction | Ting-Yu Yen et.al. | 2510.15386 | null |
| 2025-10-17 | Proto-Former: Unified Facial Landmark Detection by Prototype Transformer | Shengkai Hu et.al. | 2510.15338 | null |
| 2025-10-17 | CuSfM: CUDA-Accelerated Structure-from-Motion | Jingrui Yu et.al. | 2510.15271 | null |
| 2025-10-17 | LVI-Q: Robust LiDAR-Visual-Inertial-Kinematic Odometry for Quadruped Robots Using Tightly-Coupled and Efficient Alternating Optimization | Kevin Christiansen Marsim et.al. | 2510.15220 | null |
| 2025-10-16 | C4D: 4D Made from 3D through Dual Correspondences | Shizun Wang et.al. | 2510.14960 | null |
| 2025-10-16 | Spatially anchored Tactile Awareness for Robust Dexterous Manipulation | Jialei Huang et.al. | 2510.14647 | null |
| 2025-10-15 | DAMM-LOAM: Degeneracy Aware Multi-Metric LiDAR Odometry and Mapping | Nishant Chandna et.al. | 2510.13287 | null |
| 2025-10-15 | Convergence, design and training of continuous-time dropout as a random batch method | Antonio Álvarez-López et.al. | 2510.13134 | null |
| 2025-10-15 | True Self-Supervised Novel View Synthesis is Transferable | Thomas W. Mitchel et.al. | 2510.13063 | null |
| 2025-10-14 | SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding | Zhiliu Yang et.al. | 2510.12749 | null |
| 2025-10-14 | On the Use of Hierarchical Vision Foundation Models for Low-Cost Human Mesh Recovery and Pose Estimation | Shuhei Tarashima et.al. | 2510.12660 | null |
| 2025-10-13 | Lightweight Facial Landmark Detection in Thermal Images via Multi-Level Cross-Modal Knowledge Transfer | Qiyi Tong et.al. | 2510.11128 | null |
| 2025-10-13 | High-Resolution Spatiotemporal Modeling with Global-Local State Space Models for Video-Based Human Pose Estimation | Runyang Feng et.al. | 2510.11017 | null |
| 2025-10-13 | DKPMV: Dense Keypoints Fusion from Multi-View RGB Frames for 6D Pose Estimation of Textureless Objects | Jiahong Chen et.al. | 2510.10933 | null |
| 2025-10-12 | MonoSE(3)-Diffusion: A Monocular SE(3) Diffusion Framework for Robust Camera-to-Robot Pose Estimation | Kangjian Zhu et.al. | 2510.10434 | null |
| 2025-10-11 | HccePose(BF): Predicting Front & Back Surfaces to Construct Ultra-Dense 2D-3D Correspondences for Pose Estimation | Yulin Wang et.al. | 2510.10177 | link |
| 2025-10-11 | Gesplat: Robust Pose-Free 3D Reconstruction via Geometry-Guided Gaussian Splatting | Jiahui Lu et.al. | 2510.10097 | null |
| 2025-10-11 | FORM: Fixed-Lag Odometry with Reparative Mapping utilizing Rotating LiDAR Sensors | Easton R. Potokar et.al. | 2510.09966 | null |
| 2025-10-10 | An uncertainty-aware framework for data-efficient multi-view animal pose estimation | Lenny Aharon et.al. | 2510.09903 | null |
| 2025-10-10 | Cross-Sensor Touch Generation | Samanta Rodriguez et.al. | 2510.09817 | null |
| 2025-10-10 | mmJoints: Expanding Joint Representations Beyond (x,y,z) in mmWave-Based 3D Pose Estimation | Zhenyu Wang et.al. | 2510.08970 | null |
| 2025-10-09 | ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation | Guanghao Li et.al. | 2510.08551 | link |
| 2025-10-09 | DexMan: Learning Bimanual Dexterous Manipulation from Human and Generated Videos | Jhen Hsieh et.al. | 2510.08475 | null |
| 2025-10-09 | GraphEnet: Event-driven Human Pose Estimation with a Graph Neural Network | Gaurvi Goyal et.al. | 2510.07990 | null |
| 2025-10-08 | TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics | Yi Han et.al. | 2510.07181 | null |
| 2025-10-07 | Human3R: Everyone Everywhere All at Once | Yue Chen et.al. | 2510.06219 | link |
| 2025-10-07 | DeLTa: Demonstration and Language-Guided Novel Transparent Object Manipulation | Taeyeop Lee et.al. | 2510.05662 | null |
| 2025-10-07 | Correlation-Aware Dual-View Pose and Velocity Estimation for Dynamic Robotic Manipulation | Mahboubeh Zarei et.al. | 2510.05536 | null |
| 2025-10-05 | Joint Learning of Pose Regression and Denoising Diffusion with Score Scaling Sampling for Category-level 6D Pose Estimation | Seunghyun Lee et.al. | 2510.04125 | null |
| 2025-10-04 | TCB-VIO: Tightly-Coupled Focal-Plane Binary-Enhanced Visual Inertial Odometry | Matthew Lisondra et.al. | 2510.03919 | null |
| 2025-10-04 | Adaptively Sampling-Reusing-Mixing Decomposed Gradients to Speed Up Sharpness Aware Minimization | Jiaxin Deng et.al. | 2510.03763 | null |
| 2025-10-03 | Efficient Surgical Robotic Instrument Pose Reconstruction in Real World Conditions Using Unified Feature Detection | Zekai Liang et.al. | 2510.03532 | null |
| 2025-10-02 | Visual Odometry with Transformers | Vlardimir Yugay et.al. | 2510.03348 | null |
| 2025-10-03 | Geometry Meets Vision: Revisiting Pretrained Semantics in Distilled Fields | Zhiting Mei et.al. | 2510.03104 | null |
| 2025-10-03 | VERNIER: an open-source software pushing marker pose estimation down to the micrometer and nanometer scales | Patrick Sandoz et.al. | 2510.02791 | null |
| 2025-10-02 | PhysHMR: Learning Humanoid Control Policies from Vision for Physically Plausible Human Motion Reconstruction | Qiao Feng et.al. | 2510.02566 | null |
| 2025-10-02 | Paving the Way Towards Kinematic Assessment Using Monocular Video: A Preclinical Benchmark of State-of-the-Art Deep-Learning-Based 3D Human Pose Estimators Against Inertial Sensors in Daily Living Activities | Mario Medrano-Paredes et.al. | 2510.02264 | null |
| 2025-10-02 | Zero-shot Human Pose Estimation using Diffusion-based Inverse solvers | Sahil Bhandary Karnoor et.al. | 2510.02043 | null |
| 2025-10-02 | An Efficient Deep Template Matching and In-Plane Pose Estimation Method via Template-Aware Dynamic Convolution | Ke Jia et.al. | 2510.01678 | null |
| 2025-10-01 | Pose Estimation of a Thruster-Driven Bioinspired Multi-Link Robot | Nicholas B. Andrews et.al. | 2510.01485 | null |
| 2025-10-01 | Temporal Score Rescaling for Temperature Sampling in Diffusion and Flow Models | Yanbo Xu et.al. | 2510.01184 | null |
| 2025-10-01 | Enabling High-Frequency Cross-Modality Visual Positioning Service for Accurate Drone Landing | Haoyang Wang et.al. | 2510.00646 | null |
| 2025-10-01 | Cascaded Diffusion Framework for Probabilistic Coarse-to-Fine Hand Pose Estimation | Taeyun Woo et.al. | 2510.00527 | null |
| 2025-10-01 | Affordance-Guided Diffusion Prior for 3D Hand Reconstruction | Naru Suzuki et.al. | 2510.00506 | null |
| 2025-09-30 | TTT3R: 3D Reconstruction as Test-Time Training | Xingyu Chen et.al. | 2509.26645 | link |
| 2025-09-30 | A Multi-purpose Tracking Framework for Salmon Welfare Monitoring in Challenging Environments | Espen Uri Høgstedt et.al. | 2509.25969 | null |
| 2025-09-30 | Physics-Informed Learning for Human Whole-Body Kinematics Prediction via Sparse IMUs | Cheng Guo et.al. | 2509.25704 | null |
| 2025-09-29 | Robust Visual Localization in Compute-Constrained Environments by Salient Edge Rendering and Weighted Hamming Similarity | Tu-Hoa Pham et.al. | 2509.25520 | null |
| 2025-09-29 | VGGT-X: When VGGT Meets Dense Novel View Synthesis | Yang Liu et.al. | 2509.25191 | link |
| 2025-09-29 | PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos | Ting-Hsuan Liao et.al. | 2509.25183 | null |
| 2025-09-29 | SDPose: Exploiting Diffusion Priors for Out-of-Domain and Robust Pose Estimation | Shuang Liang et.al. | 2509.24980 | link |
| 2025-09-29 | PoseDiff: A Unified Diffusion Model Bridging Robot Pose Estimation and Video-to-Action Control | Haozhuo Zhang et.al. | 2509.24591 | null |
| 2025-09-29 | SCOPE: Semantic Conditioning for Sim2Real Category-Level Object Pose Estimation in Robotics | Peter Hönig et.al. | 2509.24572 | null |
| 2025-09-28 | GRS-SLAM3R: Real-Time Dense SLAM with Gated Recurrent State | Guole Shen et.al. | 2509.23737 | null |
| 2025-09-28 | Color-Pair Guided Robust Zero-Shot 6D Pose Estimation and Tracking of Cluttered Objects on Edge Devices | Xingjian Yang et.al. | 2509.23647 | null |
| 2025-09-27 | 3DPCNet: Pose Canonicalization for Robust Viewpoint-Invariant 3D Kinematic Analysis from Monocular RGB cameras | Tharindu Ekanayake et.al. | 2509.23455 | null |
| 2025-09-27 | Generative Modeling of Shape-Dependent Self-Contact Human Poses | Takehiko Ohkawa et.al. | 2509.23393 | null |
| 2025-09-27 | UniPose: Unified Cross-modality Pose Prior Propagation towards RGB-D data for Weakly Supervised 3D Human Pose Estimation | Jinghong Zheng et.al. | 2509.23376 | null |
| 2025-09-27 | GeLoc3r: Enhancing Relative Camera Pose Regression with Geometric Consistency Regularization | Jingxing Li et.al. | 2509.23038 | null |
| 2025-09-26 | Good Weights: Proactive, Adaptive Dead Reckoning Fusion for Continuous and Robust Visual SLAM | Yanwei Du et.al. | 2509.22910 | null |
| 2025-09-26 | ControlEvents: Controllable Synthesis of Event Camera Datawith Foundational Prior from Image Diffusion Models | Yixuan Hu et.al. | 2509.22864 | null |
| 2025-09-26 | An Adaptive ICP LiDAR Odometry Based on Reliable Initial Pose | Qifeng Wang et.al. | 2509.22058 | null |
| 2025-09-26 | SingRef6D: Monocular Novel Object Pose Estimation with a Single RGB Reference | Jiahui Wang et.al. | 2509.21927 | null |
| 2025-09-24 | mmHSense: Multi-Modal and Distributed mmWave ISAC Datasets for Human Sensing | Nabeel Nisar Bhat et.al. | 2509.21396 | null |
| 2025-09-25 | Finding 3D Positions of Distant Objects from Noisy Camera Movement and Semantic Segmentation Sequences | Julius Pesonen et.al. | 2509.20906 | null |
| 2025-09-25 | AI-Enabled Crater-Based Navigation for Lunar Mapping | Sofia McLeod et.al. | 2509.20748 | null |
| 2025-09-25 | EEG-Driven AR-Robot System for Zero-Touch Grasping Manipulation | Junzhe Wang et.al. | 2509.20656 | null |
| 2025-09-24 | Reflect3r: Single-View 3D Stereo Reconstruction Aided by Mirror Reflections | Jing Wu et.al. | 2509.20607 | null |
| 2025-09-24 | AJAHR: Amputated Joint Aware 3D Human Mesh Recovery | Hyunjin Cho et.al. | 2509.19939 | null |
| 2025-09-23 | Category-Level Object Shape and Pose Estimation in Less Than a Millisecond | Lorenzo Shaikewitz et.al. | 2509.18979 | null |
| 2025-09-23 | Towards Robust LiDAR Localization: Deep Learning-based Uncertainty Estimation | Minoo Dolatabadi et.al. | 2509.18954 | null |
| 2025-09-23 | Human-Interpretable Uncertainty Explanations for Point Cloud Registration | Johannes A. Gaus et.al. | 2509.18786 | null |
| 2025-09-23 | SINGER: An Onboard Generalist Vision-Language Navigation Policy for Drones | Maximilian Adang et.al. | 2509.18610 | null |
| 2025-09-22 | Selecting Optimal Camera Views for Gait Analysis: A Multi-Metric Assessment of 2D Projections | Dong Chen et.al. | 2509.17805 | null |
| 2025-09-22 | Evict3R: Training-Free Token Eviction for Memory-Bounded Streaming Visual Geometry Transformers | Soroush Mahdi et.al. | 2509.17650 | null |
| 2025-09-22 | VideoArtGS: Building Digital Twins of Articulated Objects from Monocular Video | Yu Liu et.al. | 2509.17647 | null |
| 2025-09-22 | Pose Estimation of a Cable-Driven Serpentine Manipulator Utilizing Intrinsic Dynamics via Physical Reservoir Computing | Kazutoshi Tanaka et.al. | 2509.17308 | null |
| 2025-09-21 | SPFSplatV2: Efficient Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views | Ranran Huang et.al. | 2509.17246 | null |
| 2025-09-21 | Leveraging RGB Images for Pre-Training of Event-Based Hand Pose Estimation | Ruicong Liu et.al. | 2509.16949 | null |
| 2025-09-19 | UniTac2Pose: A Unified Approach Learned in Simulation for Category-level Visuotactile In-hand Pose Estimation | Mingdong Wu et.al. | 2509.15934 | null |
| 2025-09-19 | Self-Supervised Cross-Modal Learning for Image-to-Point Cloud Registration | Xingmei Wang et.al. | 2509.15882 | null |
| 2025-09-19 | STARC: See-Through-Wall Augmented Reality Framework for Human-Robot Collaboration in Emergency Response | Shenghai Yuan et.al. | 2509.15507 | null |
| 2025-09-18 | NeRF-based Visualization of 3D Cues Supporting Data-Driven Spacecraft Pose Estimation | Antoine Legrand et.al. | 2509.14890 | null |
| 2025-09-17 | SWA-PF: Semantic-Weighted Adaptive Particle Filter for Memory-Efficient 4-DoF UAV Localization in GNSS-Denied Environments | Jiayu Yuan et.al. | 2509.13795 | null |
| 2025-09-17 | Bridging the Synthetic-Real Gap: Supervised Domain Adaptation for Robust Spacecraft 6-DoF Pose Estimation | Inder Pal Singh et.al. | 2509.13792 | null |
| 2025-09-17 | UM-Depth : Uncertainty Masked Self-Supervised Monocular Depth Estimation with Visual Odometry | Tae-Wook Um et.al. | 2509.13713 | null |
| 2025-09-17 | Gaussian Alignment for Relative Camera Pose Estimation via Single-View Reconstruction | Yumin Li et.al. | 2509.13652 | null |
| 2025-09-16 | Object Pose Estimation through Dexterous Touch | Amir-Hossein Shahidzadeh et.al. | 2509.13591 | null |
| 2025-09-16 | Using Visual Language Models to Control Bionic Hands: Assessment of Object Perception and Grasp Inference | Ozan Karaali et.al. | 2509.13572 | null |
| 2025-09-16 | ROOM: A Physics-Based Continuum Robot Simulator for Photorealistic Medical Datasets Generation | Salvatore Esposito et.al. | 2509.13177 | link |
| 2025-09-15 | 3D Human Pose and Shape Estimation from LiDAR Point Clouds: A Review | Salma Galaaoui et.al. | 2509.12197 | null |
| 2025-09-15 | Robust Fetal Pose Estimation across Gestational Ages via Cross-Population Augmentation | Sebastian Diaz et.al. | 2509.12062 | null |
| 2025-09-15 | Segmentation-Driven Initialization for Sparse-view 3D Gaussian Splatting | Yi-Hsin Li et.al. | 2509.11853 | null |
| 2025-09-15 | IMD: A 6-DoF Pose Estimation Benchmark for Industrial Metallic Objects | Ruimin Ma et.al. | 2509.11680 | null |
| 2025-09-14 | ActivePose: Active 6D Object Pose Estimation and Tracking for Robotic Manipulation | Sheng Liu et.al. | 2509.11364 | null |
| 2025-09-13 | AutoOEP – A Multi-modal Framework for Online Exam Proctoring | Aryan Kashyap Naveen et.al. | 2509.10887 | null |
| 2025-09-09 | HiLWS: A Human-in-the-Loop Weak Supervision Framework for Curating Clinical and Home Video Data for Neurological Assessment | Atefeh Irani et.al. | 2509.10557 | null |
| 2025-09-12 | Self-supervised Learning Of Visual Pose Estimation Without Pose Labels By Classifying LED States | Nicholas Carlotti et.al. | 2509.10405 | null |
| 2025-09-11 | MimicDroid: In-Context Learning for Humanoid Robot Manipulation from Human Play Videos | Rutav Shah et.al. | 2509.09769 | link |
| 2025-09-10 | MultimodalHugs: Enabling Sign Language Processing in Hugging Face | Gerard Sant et.al. | 2509.09729 | null |
| 2025-09-09 | Australian Supermarket Object Set (ASOS): A Benchmark Dataset of Physical Objects and 3D Models for Robotics and Computer Vision | Akansel Cosgun et.al. | 2509.09720 | null |
| 2025-09-10 | iMatcher: Improve matching in point cloud registration via local-to-global geometric consistency learning | Karim Slimani et.al. | 2509.08982 | null |
| 2025-09-10 | PianoVAM: A Multimodal Piano Performance Dataset | Yonghyun Kim et.al. | 2509.08800 | null |
| 2025-09-10 | Deep Visual Odometry for Stereo Event Cameras | Sheng Zhong et.al. | 2509.08235 | null |
| 2025-09-09 | SVN-ICP: Uncertainty Estimation of ICP-based LiDAR Odometry using Stein Variational Newton | Shiping Ma et.al. | 2509.08069 | null |
| 2025-09-09 | One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation | Zheng Geng et.al. | 2509.07978 | link |
| 2025-09-09 | Parse Graph-Based Visual-Language Interaction for Human Pose Estimation | Shibang Liu et.al. | 2509.07385 | null |
| 2025-09-08 | H $_{2}$ OT: Hierarchical Hourglass Tokenizer for Efficient Video Pose Transformers | Wenhao Li et.al. | 2509.06956 | link |
| 2025-09-08 | Musculoskeletal simulation of limb movement biomechanics in Drosophila melanogaster | Pembe Gizem Özdil et.al. | 2509.06426 | null |
| 2025-09-07 | DVLO4D: Deep Visual-Lidar Odometry with Sparse Spatial-temporal Fusion | Mengmeng Liu et.al. | 2509.06023 | null |
| 2025-09-07 | Motion Aware ViT-based Framework for Monocular 6-DoF Spacecraft Pose Estimation | Jose Sosa et.al. | 2509.06000 | null |
| 2025-09-06 | Multi-LVI-SAM: A Robust LiDAR-Visual-Inertial Odometry for Multiple Fisheye Cameras | Xinyu Zhang et.al. | 2509.05740 | null |
| 2025-09-05 | WinT3R: Window-Based Streaming Reconstruction with Camera Token Pool | Zizun Li et.al. | 2509.05296 | link |
| 2025-09-04 | Odometry Calibration and Pose Estimation of a 4WIS4WID Mobile Wall Climbing Robot | Branimir Ćaran et.al. | 2509.04016 | null |
| 2025-09-03 | SmartPoser: Arm Pose Estimation with a Smartphone and Smartwatch Using UWB and IMU Data | Nathan DeVrio et.al. | 2509.03451 | null |
| 2025-09-03 | Towards Realistic Hand-Object Interaction with Gravity-Field Based Diffusion Bridge | Miao Xu et.al. | 2509.03114 | null |
| 2025-09-03 | IL-SLAM: Intelligent Line-assisted SLAM Based on Feature Awareness for Dynamic Environments | Haolan Zhang et.al. | 2509.02972 | null |
| 2025-09-02 | Robotic 3D Flower Pose Estimation for Small-Scale Urban Farms | Harsh Muriki et.al. | 2509.02870 | null |
| 2025-09-02 | Generalizing Unsupervised Lidar Odometry Model from Normal to Snowy Weather Conditions | Beibei Zhou et.al. | 2509.02011 | null |
| 2025-09-02 | Doctoral Thesis: Geometric Deep Learning For Camera Pose Prediction, Registration, Depth Estimation, and 3D Reconstruction | Xueyang Kang et.al. | 2509.01873 | null |
| 2025-09-01 | FGO-SLAM: Enhancing Gaussian SLAM with Globally Consistent Opacity Radiance Field | Fan Zhu et.al. | 2509.01547 | null |
| 2025-09-01 | Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation | Lee Chae-Yeon et.al. | 2509.01242 | null |
| 2025-09-01 | SR-SLAM: Scene-reliability Based RGB-D SLAM in Diverse Environments | Haolan Zhang et.al. | 2509.01111 | null |
| 2025-09-01 | An End-to-End Framework for Video Multi-Person Pose Estimation | Zhihong Wei et.al. | 2509.01095 | null |
| 2025-08-31 | UPGS: Unified Pose-aware Gaussian Splatting for Dynamic Scene Deblurring | Zhijing Wu et.al. | 2509.00831 | null |
| 2025-08-31 | DyPho-SLAM : Real-time Photorealistic SLAM in Dynamic Environments | Yi Liu et.al. | 2509.00741 | null |
| 2025-08-31 | MV-SSM: Multi-View State Space Modeling for 3D Human Pose Estimation | Aviral Chharia et.al. | 2509.00649 | null |
| 2025-08-30 | Generative Visual Foresight Meets Task-Agnostic Pose Estimation in Robotic Table-Top Manipulation | Chuye Zhang et.al. | 2509.00361 | null |
| 2025-08-24 | Performance is not All You Need: Sustainability Considerations for Algorithms | Xiang Li et.al. | 2509.00045 | null |
| 2025-08-29 | Efficient Diffusion-Based 3D Human Pose Estimation with Hierarchical Temporal Pruning | Yuquan Bi et.al. | 2508.21363 | null |
| 2025-08-28 | PHD: Personalized 3D Human Body Fitting with Point Diffusion | Hsuan-I Ho et.al. | 2508.21257 | null |
| 2025-08-27 | ROBUST-MIPS: A Combined Skeletal Pose and Instance Segmentation Dataset for Laparoscopic Surgical Instruments | Zhe Han et.al. | 2508.21096 | null |
| 2025-08-28 | COMETH: Convex Optimization for Multiview Estimation and Tracking of Humans | Enrico Martini et.al. | 2508.20920 | null |
| 2025-08-28 | Estimating 2D Keypoints of Surgical Tools Using Vision-Language Models with Low-Rank Adaptation | Krit Duangprom et.al. | 2508.20830 | null |
| 2025-08-27 | WEBEYETRACK: Scalable Eye-Tracking for the Browser via On-Device Few-Shot Personalization | Eduardo Davalos et.al. | 2508.19544 | null |
| 2025-08-21 | PriorFormer: A Transformer for Real-time Monocular 3D Human Pose Estimation with Versatile Geometric Priors | Mohamed Adjel et.al. | 2508.18238 | null |
| 2025-08-25 | SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization | Junyuan Deng et.al. | 2508.17972 | link |
| 2025-08-25 | Camera Pose Refinement via 3D Gaussian Splatting | Lulu Hao et.al. | 2508.17876 | null |
| 2025-08-25 | DroneKey: Drone 3D Pose Estimation in Image Sequences using Gated Key-representation and Pose-adaptive Learning | Seo-Bin Hwang et.al. | 2508.17746 | null |
| 2025-08-25 | IDU: Incremental Dynamic Update of Existing 3D Virtual Environments with New Imagery Data | Meida Chen et.al. | 2508.17579 | null |
| 2025-08-24 | PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation | Xiaoyang Hao et.al. | 2508.17239 | link |
| 2025-08-23 | Fiducial Marker Splatting for High-Fidelity Robotics Simulations | Diram Tabaa et.al. | 2508.17012 | null |
| 2025-08-22 | An Investigation of Visual Foundation Models Robustness | Sandeep Gupta et.al. | 2508.16225 | null |
| 2025-08-21 | UnPose: Uncertainty-Guided Diffusion Priors for Zero-Shot Pose Estimation | Zhaodong Jiang et.al. | 2508.15972 | null |
| 2025-08-21 | MExECON: Multi-view Extended Explicit Clothed humans Optimized via Normal integration | Fulden Ece Uğur et.al. | 2508.15500 | null |
| 2025-08-21 | Lang2Lift: A Framework for Language-Guided Pallet Detection and Pose Estimation Integrated in Autonomous Outdoor Forklift Operation | Huy Hoang Nguyen et.al. | 2508.15427 | null |
| 2025-08-20 | A Vision-Based Shared-Control Teleoperation Scheme for Controlling the Robotic Arm of a Four-Legged Robot | Murilo Vinicius da Silva et.al. | 2508.14994 | null |
| 2025-08-20 | You Only Pose Once: A Minimalist’s Detection Transformer for Monocular RGB Category-level 9D Multi-Object Pose Estimation | Hakjin Lee et.al. | 2508.14965 | null |
| 2025-08-19 | Heatmap Regression without Soft-Argmax for Facial Landmark Detection | Chiao-An Yang et.al. | 2508.14929 | null |
| 2025-08-20 | 6-DoF Object Tracking with Event-based Optical Flow and Frames | Zhichao Li et.al. | 2508.14776 | null |
| 2025-08-20 | Fusing Monocular RGB Images with AIS Data to Create a 6D Pose Estimation Dataset for Marine Vessels | Fabian Holst et.al. | 2508.14767 | null |
| 2025-08-20 | GeMS: Efficient Gaussian Splatting for Extreme Motion Blur | Gopi Raju Matta et.al. | 2508.14682 | null |
| 2025-08-20 | Consistent Pose Estimation of Unmanned Ground Vehicles through Terrain-Aided Multi-Sensor Fusion on Geometric Manifolds | Alexander Raab et.al. | 2508.14661 | null |
| 2025-08-20 | From Slices to Structures: Unsupervised 3D Reconstruction of Female Pelvic Anatomy from Freehand Transvaginal Ultrasound | Max Krähenmann et.al. | 2508.14552 | null |
| 2025-08-20 | HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation | Bing Han et.al. | 2508.14431 | null |
| 2025-08-20 | Learning Point Cloud Representations with Pose Continuity for Depth-Based Category-Level 6D Object Pose Estimation | Zhujun Li et.al. | 2508.14358 | null |
| 2025-08-20 | D $^2$ -LIO: Enhanced Optimization for LiDAR-IMU Odometry Considering Directional Degeneracy | Guodong Yao et.al. | 2508.14355 | null |
| 2025-08-19 | LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos | Chin-Yang Lin et.al. | 2508.14041 | link |
| 2025-08-19 | MR6D: Benchmarking 6D Pose Estimation for Mobile Robots | Anas Gouda et.al. | 2508.13775 | null |
| 2025-08-19 | RCGNet: RGB-based Category-Level 6D Object Pose Estimation with Geometric Guidance | Sheng Yu et.al. | 2508.13623 | null |
| 2025-08-18 | Physically Plausible Data Augmentations for Wearable IMU-based Human Activity Recognition Using Physics Simulation | Nobuyuki Oishi et.al. | 2508.13284 | null |
| 2025-08-18 | Stable Diffusion-Based Approach for Human De-Occlusion | Seung Young Noh et.al. | 2508.12663 | null |
| 2025-08-15 | Unifying Scale-Aware Depth Prediction and Perceptual Priors for Monocular Endoscope Pose Estimation and Tissue Reconstruction | Muzammil Khan et.al. | 2508.11282 | null |
| 2025-08-15 | A Coarse-to-Fine Human Pose Estimation Method based on Two-stage Distillation and Progressive Graph Neural Network | Zhangjian Ji et.al. | 2508.11212 | null |
| 2025-08-12 | ViPE: Video Pose Engine for 3D Geometric Perception | Jiahui Huang et.al. | 2508.10934 | null |
| 2025-08-14 | The SET Perceptual Factors Framework: Towards Assured Perception for Autonomous Systems | Troi Williams et.al. | 2508.10798 | null |
| 2025-08-14 | Lameness detection in dairy cows using pose estimation and bidirectional LSTMs | Helena Russello et.al. | 2508.10643 | null |
| 2025-08-14 | EgoMusic-driven Human Dance Motion Estimation with Skeleton Mamba | Quang Nguyen et.al. | 2508.10522 | null |
| 2025-08-14 | eMamba: Efficient Acceleration Framework for Mamba Models in Edge Computing | Jiyong Kim et.al. | 2508.10370 | null |
| 2025-08-13 | Predictive Uncertainty for Runtime Assurance of a Real-Time Computer Vision-Based Landing System | Romeo Valentin et.al. | 2508.09732 | null |
| 2025-08-13 | Enhancing Monocular 3D Hand Reconstruction with Learned Texture Priors | Giorgos Karvounas et.al. | 2508.09629 | null |
| 2025-08-12 | DiffPose-Animal: A Language-Conditioned Diffusion Framework for Animal Pose Estimation | Tianyu Xiong et.al. | 2508.08783 | null |
| 2025-08-12 | QoE-Aware Service Provision for Mobile AR Rendering: An Agent-Driven Approach | Conghao Zhou et.al. | 2508.08627 | null |
| 2025-08-11 | Forecasting Continuous Non-Conservative Dynamical Systems in SO(3) | Lennart Bastian et.al. | 2508.07775 | null |
| 2025-08-10 | Generic Calibration: Pose Ambiguity/Linear Solution and Parametric-hybrid Pipeline | Yuqi Han et.al. | 2508.07217 | null |
| 2025-08-09 | AugLift: Boosting Generalization in Lifting-based 3D Human Pose Estimation | Nikolai Warner et.al. | 2508.07112 | null |
| 2025-08-09 | VOccl3D: A Video Benchmark Dataset for 3D Human Pose and Shape Estimation under real Occlusions | Yash Garg et.al. | 2508.06757 | null |
| 2025-08-08 | DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera | Shaohua Pan et.al. | 2508.06139 | null |
| 2025-08-06 | Surf3R: Rapid Surface Reconstruction from Sparse RGB Views in Seconds | Haodong Zhu et.al. | 2508.04508 | null |
| 2025-08-06 | RiemanLine: Riemannian Manifold Representation of 3D Lines for Factor Graph Optimization | Yanyan Li et.al. | 2508.04335 | null |
| 2025-08-05 | OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World | Katherine Liu et.al. | 2508.03669 | null |
| 2025-08-05 | FPG-NAS: FLOPs-Aware Gated Differentiable Neural Architecture Search for Efficient 6DoF Pose Estimation | Nassim Ali Ousalah et.al. | 2508.03618 | null |
| 2025-08-05 | RadProPoser: A Framework for Human Pose Estimation with Uncertainty Quantification from Raw Radar Data | Jonas Leo Mueller et.al. | 2508.03578 | null |
| 2025-08-05 | Vision-based Perception System for Automated Delivery Robot-Pedestrians Interactions | Ergi Tushe et.al. | 2508.03541 | null |
| 2025-08-05 | Semantic Mosaicing of Histo-Pathology Image Fragments using Visual Foundation Models | Stefan Brandstätter et.al. | 2508.03524 | null |
| 2025-08-05 | BaroPoser: Real-time Human Motion Tracking from IMUs and Barometers in Everyday Devices | Libo Zhang et.al. | 2508.03313 | null |
| 2025-08-05 | MVTOP: Multi-View Transformer-based Object Pose-Estimation | Lukas Ranftl et.al. | 2508.03243 | null |
| 2025-08-05 | COFFEE: A Shadow-Resilient Real-Time Pose Estimator for Unknown Tumbling Asteroids using Sparse Neural Networks | Arion Zimmermann et.al. | 2508.03132 | null |
| 2025-08-04 | PyCAT4: A Hierarchical Vision Transformer-based Framework for 3D Human Pose Estimation | Zongyou Yang et.al. | 2508.02806 | null |
| 2025-08-04 | PMGS: Reconstruction of Projectile Motion across Large Spatiotemporal Spans via 3D Gaussian Splatting | Yijun Xu et.al. | 2508.02660 | null |
| 2025-08-04 | SGAD: Semantic and Geometric-aware Descriptor for Local Feature Matching | Xiangzeng Liu et.al. | 2508.02278 | null |
| 2025-08-04 | Unified Category-Level Object Detection and Pose Estimation from RGB Images using 3D Prototypes | Tom Fischer et.al. | 2508.02157 | null |
| 2025-08-04 | YOLOv1 to YOLOv11: A Comprehensive Survey of Real-Time Object Detection Innovations and Challenges | Manikanta Kotthapalli et.al. | 2508.02067 | null |
| 2025-08-04 | StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion | Haoxin Yang et.al. | 2508.02056 | link |
| 2025-08-03 | CVD-SfM: A Cross-View Deep Front-end Structure-from-Motion System for Sparse Localization in Multi-Altitude Scenes | Yaxuan Li et.al. | 2508.01936 | null |
| 2025-08-03 | IMUCoCo: Enabling Flexible On-Body IMU Placement for Human Pose Estimation and Activity Recognition | Haozhe Zhou et.al. | 2508.01894 | null |
| 2025-08-03 | ChairPose: Pressure-based Chair Morphology Grounded Sitting Pose Estimation through Simulation-Assisted Training | Lala Shakti Swarup Ray et.al. | 2508.01850 | null |
| 2025-08-02 | No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views | Ranran Huang et.al. | 2508.01171 | link |
| 2025-08-01 | CoProU-VO: Combining Projected Uncertainty for End-to-End Unsupervised Monocular Visual Odometry | Jingchao Xie et.al. | 2508.00568 | null |
| 2025-07-31 | Mitigating Resolution-Drift in Federated Learning: Case of Keypoint Detection | Taeheon Lim et.al. | 2507.23461 | null |
| 2025-07-31 | FASTopoWM: Fast-Slow Lane Segment Topology Reasoning with Latent World Models | Yiming Yang et.al. | 2507.23325 | null |
| 2025-07-30 | From Sharp to Blur: Unsupervised Domain Adaptation for 2D Human Pose Estimation Under Extreme Motion Blur Using Event Cameras | Youngho Kim et.al. | 2507.22438 | null |
| 2025-07-29 | LiteFat: Lightweight Spatio-Temporal Graph Learning for Real-Time Driver Fatigue Detection | Jing Ren et.al. | 2507.21756 | null |
| 2025-07-29 | Adaptive Prior Scene-Object SLAM for Dynamic Environments | Haolan Zhang et.al. | 2507.21709 | null |
| 2025-07-28 | PixelNav: Towards Model-based Vision-Only Navigation with Topological Graphs | Sergey Bakulin et.al. | 2507.20892 | null |
| 2025-07-28 | Beyond Line-of-Sight: Cooperative Localization Using Vision and V2X Communication | Annika Wong et.al. | 2507.20772 | null |
| 2025-07-28 | KASportsFormer: Kinematic Anatomy Enhanced Transformer for 3D Human Pose Estimation on Short Sports Scene Video | Zhuoer Yin et.al. | 2507.20763 | null |
| 2025-07-28 | Automated 3D-GS Registration and Fusion via Skeleton Alignment and Gaussian-Adaptive Features | Shiyang Liu et.al. | 2507.20480 | null |
| 2025-07-26 | A Structure-aware and Motion-adaptive Framework for 3D Human Pose Estimation with Mamba | Ye Lu et.al. | 2507.19852 | null |
| 2025-07-25 | Efficient Lines Detection for Robot Soccer | João G. Melo et.al. | 2507.19469 | null |
| 2025-07-25 | Fast Learning of Non-Cooperative Spacecraft 3D Models through Primitive Initialization | Pol Francesch Huc et.al. | 2507.19459 | null |
| 2025-07-24 | Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping | Chong Cheng et.al. | 2507.18541 | null |
| 2025-07-24 | NLML-HPE: Head Pose Estimation with Limited Data via Manifold Learning | Mahdi Ghafourian et.al. | 2507.18429 | null |
| 2025-07-24 | AF-RLIO: Adaptive Fusion of Radar-LiDAR-Inertial Information for Robust Odometry in Challenging Environments | Chenglong Qian et.al. | 2507.18317 | null |
| 2025-07-24 | Evaluation of facial landmark localization performance in a surgical setting | Ines Frajtag et.al. | 2507.18248 | null |
| 2025-07-24 | Emotion Recognition from Skeleton Data: A Comprehensive Survey | Haifeng Lu et.al. | 2507.18026 | null |
| 2025-07-23 | RemixFusion: Residual-based Mixed Representation for Large-scale Online RGB-D Reconstruction | Yuqing Lan et.al. | 2507.17594 | null |
| 2025-07-23 | Physics-based Human Pose Estimation from a Single Moving RGB Camera | Ayce Idil Aytekin et.al. | 2507.17406 | null |
| 2025-07-21 | Toward a Real-Time Framework for Accurate Monocular 3D Human Pose Estimation with Geometric Priors | Mohamed Adjel et.al. | 2507.16850 | null |
| 2025-07-22 | Adaptive Relative Pose Estimation Framework with Dual Noise Tuning for Safe Approaching Maneuvers | Batu Candan et.al. | 2507.16214 | null |
| 2025-07-21 | TONUS: Neuromorphic human pose estimation for artistic sound co-creation | Jules Lecomte et.al. | 2507.15734 | null |
| 2025-07-21 | Hi^2-GSLoc: Dual-Hierarchical Gaussian-Specific Visual Relocalization for Remote Sensing | Boni Hu et.al. | 2507.15683 | null |
| 2025-07-21 | Dense-depth map guided deep Lidar-Visual Odometry with Sparse Point Clouds and Images | JunYing Huang et.al. | 2507.15496 | null |
| 2025-07-20 | 3-Dimensional CryoEM Pose Estimation and Shift Correction Pipeline | Kaishva Chintan Shah et.al. | 2507.14924 | null |
| 2025-07-20 | An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks | Xinyi Wu et.al. | 2507.14798 | null |
| 2025-07-22 | AI-Enhanced Precision in Sport Taekwondo: Increasing Fairness, Speed, and Trust in Competition (FST.ai) | Keivan Shariatmadar et.al. | 2507.14657 | null |
| 2025-07-18 | C-DOG: Training-Free Multi-View Multi-Object Association in Dense Scenes Without Visual Feature via Connected δ-Overlap Graphs | Yung-Hong Sun et.al. | 2507.14095 | null |
| 2025-07-21 | PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations | Yu Wei et.al. | 2507.13891 | null |
| 2025-07-18 | MaskHOI: Robust 3D Hand-Object Interaction Estimation via Masked Pre-training | Yuechen Xie et.al. | 2507.13673 | null |
| 2025-07-17 | $π^3$ : Scalable Permutation-Equivariant Visual Geometry Learning | Yifan Wang et.al. | 2507.13347 | link |
| 2025-07-17 | Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark | Junsu Kim et.al. | 2507.13314 | null |
| 2025-07-17 | DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model | Maulana Bisyir Azhari et.al. | 2507.13145 | null |
| 2025-07-17 | AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability | Tomohiro Suzuki et.al. | 2507.12905 | null |
| 2025-07-17 | From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation | Mengxi Liu et.al. | 2507.12884 | null |
| 2025-07-19 | SpatialTrackerV2: 3D Point Tracking Made Easy | Yuxi Xiao et.al. | 2507.12462 | link |
| 2025-07-16 | Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation | Antonio Finocchiaro et.al. | 2507.12292 | null |
| 2025-07-16 | UniLGL: Learning Uniform Place Recognition for FOV-limited/Panoramic LiDAR Global Localization | Hongming Shen et.al. | 2507.12194 | null |
| 2025-07-16 | BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images | Davide Di Nucci et.al. | 2507.12095 | null |
| 2025-07-16 | SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation | Beining Xu et.al. | 2507.12027 | null |
| 2025-07-16 | SEPose: A Synthetic Event-based Human Pose Estimation Dataset for Pedestrian Monitoring | Kaustav Chanda et.al. | 2507.11910 | null |
| 2025-07-15 | GKNet: Graph-based Keypoints Network for Monocular Pose Estimation of Non-cooperative Spacecraft | Weizhao Ma et.al. | 2507.11077 | null |
| 2025-07-15 | Joint angle model based learning to refine kinematic human pose estimation | Chang Peng et.al. | 2507.11075 | null |
| 2025-07-14 | Raci-Net: Ego-vehicle Odometry Estimation in Adverse Weather Conditions | Mohammadhossein Talebi et.al. | 2507.10376 | null |
| 2025-07-14 | Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures | Xinlong Ding et.al. | 2507.10265 | null |
| 2025-07-14 | ProGait: A Multi-Purpose Video Dataset and Benchmark for Transfemoral Prosthesis Users | Xiangyu Yin et.al. | 2507.10223 | link |
| 2025-07-13 | VST-Pose: A Velocity-Integrated Spatiotem-poral Attention Network for Human WiFi Pose Estimation | Xinyu Zhang et.al. | 2507.09672 | null |
| 2025-07-13 | EHPE: A Segmented Architecture for Enhanced Hand Pose Estimation | Bolun Zheng et.al. | 2507.09560 | null |
| 2025-07-13 | Self-supervised pretraining of vision transformers for animal behavioral analysis and neural encoding | Yanchen Wang et.al. | 2507.09513 | null |
| 2025-07-12 | PoseLLM: Enhancing Language-Guided Human Pose Estimation with MLP Alignment | Dewen Zhang et.al. | 2507.09139 | null |
| 2025-07-10 | RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration | Chong Cheng et.al. | 2507.08136 | null |
| 2025-07-10 | SCREP: Scene Coordinate Regression and Evidential Learning-based Perception-Aware Trajectory Generation | Juyeop Han et.al. | 2507.07467 | null |
| 2025-07-09 | g2o vs. Ceres: Optimizing Scan Matching in Cartographer SLAM | Quanjie Qiu et.al. | 2507.07142 | null |
| 2025-07-09 | Smartphone Exergames with Real-Time Markerless Motion Capture: Challenges and Trade-offs | Mathieu Phosanarack et.al. | 2507.06669 | null |
| 2025-07-09 | MK-Pose: Category-Level Object Pose Estimation via Multimodal-Based Keypoint Learning | Yifan Yang et.al. | 2507.06662 | null |
| 2025-07-09 | Mask6D: Masked Pose Priors For 6D Object Pose Estimation | Yuechen Xie et.al. | 2507.06486 | null |
| 2025-07-08 | SenseShift6D: Multimodal RGB-D Benchmarking for Robust 6D Pose Estimation across Environment and Sensor Variations | Yegyu Han et.al. | 2507.05751 | null |
| 2025-07-08 | Event-RGB Fusion for Spacecraft Pose Estimation Under Harsh Lighting | Mohsi Jawaid et.al. | 2507.05698 | null |
| 2025-07-07 | W2W: A Simulated Exploration of IMU Placement Across the Human Body for Designing Smarter Wearable | Lala Shakti Swarup Ray et.al. | 2507.05532 | null |
| 2025-07-07 | UDF-GMA: Uncertainty Disentanglement and Fusion for General Movement Assessment | Zeqi Luo et.al. | 2507.04814 | null |
| 2025-07-06 | Thousand-Brains Systems: Sensorimotor Intelligence for Rapid, Robust Learning and Inference | Niels Leadholm et.al. | 2507.04494 | null |
| 2025-07-09 | Gaussian-LIC2: LiDAR-Inertial-Camera Gaussian Splatting SLAM | Xiaolei Lang et.al. | 2507.04004 | null |
| 2025-07-05 | Accurate Pose Estimation Using Contact Manifold Sampling for Safe Peg-in-Hole Insertion of Complex Geometries | Abhay Negi et.al. | 2507.03925 | null |
| 2025-07-02 | Markerless Stride Length estimation in Athletic using Pose Estimation with monocular vision | Patryk Skorupski et.al. | 2507.03016 | null |
| 2025-07-03 | Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning | Buzhen Huang et.al. | 2507.02565 | null |
| 2025-07-03 | IMASHRIMP: Automatic White Shrimp (Penaeus vannamei) Biometrical Analysis from Laboratory Images Using Computer Vision and Deep Learning | Abiam Remache González et.al. | 2507.02519 | null |
| 2025-07-03 | 3D Heart Reconstruction from Sparse Pose-agnostic 2D Echocardiographic Slices | Zhurong Chen et.al. | 2507.02411 | null |
| 2025-07-03 | LMPNet for Weakly-supervised Keypoint Discovery | Pei Guo et.al. | 2507.02308 | null |
| 2025-07-02 | What does really matter in image goal navigation? | Gianluca Monaci et.al. | 2507.01667 | null |
| 2025-07-01 | 2024 NASA SUITS Report: LLM-Driven Immersive Augmented Reality User Interface for Robotics and Space Exploration | Kathy Zhuang et.al. | 2507.01206 | null |
| 2025-07-01 | Multi-Modal Graph Convolutional Network with Sinusoidal Encoding for Robust Human Action Segmentation | Hao Xing et.al. | 2507.00752 | null |
| 2025-07-01 | LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment | Juelin Zhu et.al. | 2507.00659 | null |
| 2025-06-30 | Computer Vision for Objects used in Group Work: Challenges and Opportunities | Changsoo Jung et.al. | 2507.00224 | null |
| 2025-06-30 | Validation of AI-Based 3D Human Pose Estimation in a Cyber-Physical Environment | Lisa Marie Otto et.al. | 2506.23739 | null |
| 2025-06-30 | MGPRL: Distributed Multi-Gaussian Processes for Wi-Fi-based Multi-Robot Relative Localization in Large Indoor Environments | Sai Krishna Ghanta et.al. | 2506.23514 | null |
| 2025-06-29 | TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints | Zhen Tan et.al. | 2506.23207 | null |
| 2025-06-28 | Deterministic Object Pose Confidence Region Estimation | Jinghao Wang et.al. | 2506.22720 | null |
| 2025-06-27 | Evaluating Pointing Gestures for Target Selection in Human-Robot Collaboration | Noora Sassali et.al. | 2506.22116 | null |
| 2025-06-27 | Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras | Petr Hruby et.al. | 2506.22069 | null |
| 2025-06-24 | ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes | Chenhao Zhang et.al. | 2506.21629 | link |
| 2025-06-26 | EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting | Taoyu Wu et.al. | 2506.21420 | null |
| 2025-06-26 | CURL-SLAM: Continuous and Compact LiDAR Mapping | Kaicheng Zhang et.al. | 2506.21077 | null |
| 2025-06-27 | DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation | Wenzhou Lyu et.al. | 2506.21034 | null |
| 2025-06-25 | How do Foundation Models Compare to Skeleton-Based Approaches for Gesture Recognition in Human-Robot Interaction? | Stephanie Käs et.al. | 2506.20795 | null |
| 2025-06-26 | Consensus-Driven Uncertainty for Robotic Grasping based on RGB Perception | Eric C. Joyce et.al. | 2506.20045 | null |
| 2025-06-24 | Systematic Comparison of Projection Methods for Monocular 3D Human Pose Estimation on Fisheye Images | Stephanie Käs et.al. | 2506.19747 | null |
| 2025-06-23 | RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base | Kuanning Wang et.al. | 2506.18856 | null |
| 2025-06-19 | Reproducible Evaluation of Camera Auto-Exposure Methods in the Field: Platform, Benchmark and Lessons Learned | Olivier Gamache et.al. | 2506.18844 | null |
| 2025-06-23 | SViP: Sequencing Bimanual Visuomotor Policies with Object-Centric Motion Primitives | Yizhou Chen et.al. | 2506.18825 | null |
| 2025-06-20 | RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking | Teng Guo et.al. | 2506.17119 | link |
| 2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110 | null |
| 2025-06-20 | LunarLoc: Segment-Based Global Localization on the Moon | Annika Thomas et.al. | 2506.16940 | link |
| 2025-06-19 | ControlVLA: Few-shot Object-centric Adaptation for Pre-trained Vision-Language-Action Models | Puhao Li et.al. | 2506.16211 | null |
| 2025-06-19 | STAR-Pose: Efficient Low-Resolution Video Human Pose Estimation via Spatial-Temporal Adaptive Super-Resolution | Yucheng Jin et.al. | 2506.16061 | null |
| 2025-06-19 | KARL: Kalman-Filter Assisted Reinforcement Learner for Dynamic Object Tracking and Grasping | Kowndinya Boyalakuntla et.al. | 2506.15945 | null |
| 2025-06-19 | Beyond Audio and Pose: A General-Purpose Framework for Video Synchronization | Yosub Shin et.al. | 2506.15937 | null |
| 2025-06-18 | Improving Robotic Manipulation: Techniques for Object Pose Estimation, Accommodating Positional Uncertainty, and Disassembly Tasks from Examples | Viral Rasik Galaiya et.al. | 2506.15865 | null |
| 2025-06-18 | PRISM-Loc: a Lightweight Long-range LiDAR Localization in Urban Environments with Topological Maps | Kirill Muravyev et.al. | 2506.15849 | null |
| 2025-06-18 | Human Motion Capture from Loose and Sparse Inertial Sensors with Garment-aware Diffusion Models | Andela Ilic et.al. | 2506.15290 | null |
| 2025-06-18 | RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories | Qingsong Yan et.al. | 2506.15242 | null |
| 2025-06-17 | PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation | Ming Xu et.al. | 2506.14596 | null |
| 2025-06-17 | MOL: Joint Estimation of Micro-Expression, Optical Flow, and Landmark via Transformer-Graph-Style Convolution | Zhiwen Shao et.al. | 2506.14511 | null |
| 2025-06-17 | Non-Overlap-Aware Egocentric Pose Estimation for Collaborative Perception in Connected Autonomy | Hong Huang et.al. | 2506.14180 | null |
| 2025-06-17 | TACS-Graphs: Traversability-Aware Consistent Scene Graphs for Ground Robot Indoor Localization and Mapping | Jeewon Kim et.al. | 2506.14178 | null |
| 2025-06-16 | Diffusion-based Inverse Observation Model for Artificial Skin | Ante Maric et.al. | 2506.13986 | null |
| 2025-06-16 | PF-LHM: 3D Animatable Avatar Reconstruction from Pose-free Articulated Human Images | Lingteng Qiu et.al. | 2506.13766 | null |
| 2025-06-16 | JENGA: Object selection and pose estimation for robotic grasping from a stack | Sai Srinivas Jeevanandam et.al. | 2506.13425 | null |
| 2025-06-16 | Automatic Multi-View X-Ray/CT Registration Using Bone Substructure Contours | Roman Flepp et.al. | 2506.13292 | null |
| 2025-06-16 | DETRPose: Real-time end-to-end transformer model for multi-person pose estimation | Sebastian Janampa et.al. | 2506.13027 | link |
| 2025-06-15 | A large-scale, physically-based synthetic dataset for satellite pose estimation | Szabolcs Velkei et.al. | 2506.12782 | null |
| 2025-06-13 | ViTaSCOPE: Visuo-tactile Implicit Representation for In-hand Pose and Extrinsic Contact Estimation | Jayjun Lee et.al. | 2506.12239 | null |
| 2025-06-10 | Monocular 3D Hand Pose Estimation with Implicit Camera Alignment | Christos Pantazopoulos et.al. | 2506.11133 | null |
| 2025-06-12 | Occlusion-Aware 3D Hand-Object Pose Estimation with Masked AutoEncoders | Hui Yang et.al. | 2506.10816 | null |
| 2025-06-12 | In-Hand Object Pose Estimation via Visual-Tactile Fusion | Felix Nonnengießer et.al. | 2506.10787 | null |
| 2025-06-11 | Fluoroscopic Shape and Pose Tracking of Catheters with Custom Radiopaque Markers | Jared Lawson et.al. | 2506.09934 | null |
| 2025-06-11 | EquiCaps: Predictor-Free Pose-Aware Pre-Trained Capsule Networks | Athinoulla Konstantinou et.al. | 2506.09895 | link |
| 2025-06-11 | Accurate and efficient zero-shot 6D pose estimation with frozen foundation models | Andrea Caraffa et.al. | 2506.09784 | null |
| 2025-06-11 | CHIP: A multi-sensor dataset for 6D pose estimation of chairs in industrial settings | Mattia Nardon et.al. | 2506.09699 | null |
| 2025-06-10 | Princeton365: A Diverse Dataset with Accurate Camera Pose | Karhan Kayan et.al. | 2506.09035 | null |
| 2025-06-10 | ArrowPose: Segmentation, Detection, and 5 DoF Pose Estimation Network for Colorless Point Clouds | Frederik Hagelskjaer et.al. | 2506.08699 | null |
| 2025-06-09 | UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References | Ming-Feng Li et.al. | 2506.07996 | null |
| 2025-06-09 | Hierarchical Scoring with 3D Gaussian Splatting for Instance Image-Goal Navigation | Yijie Deng et.al. | 2506.07338 | null |
| 2025-06-10 | From Generation to Generalization: Emergent Few-Shot Learning in Video Diffusion Models | Pablo Acuaviva et.al. | 2506.07280 | null |
| 2025-06-08 | GoTrack: Generic 6DoF Object Pose Refinement and Tracking | Van Nguyen Nguyen et.al. | 2506.07155 | null |
| 2025-06-08 | UNO: Unified Self-Supervised Monocular Odometry for Platform-Agnostic Deployment | Wentao Zhao et.al. | 2506.07013 | null |
| 2025-06-07 | Deep Inertial Pose: A deep learning approach for human pose estimation | Sara M. Cerqueira et.al. | 2506.06850 | null |
| 2025-06-06 | Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments | Mingrui Li et.al. | 2506.05965 | null |
| 2025-06-06 | SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction | Yuchao Zheng et.al. | 2506.05935 | null |
| 2025-06-06 | CryoFastAR: Fast Cryo-EM Ab Initio Reconstruction Made Easy | Jiakai Zhang et.al. | 2506.05864 | null |
| 2025-06-06 | You Only Estimate Once: Unified, One-stage, Real-Time Category-level Articulated Object 6D Pose Estimation for Robotic Grasping | Jingshun Huang et.al. | 2506.05719 | null |
| 2025-06-05 | On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images | Andreas Meuleman et.al. | 2506.05558 | null |
| 2025-06-05 | Rectified Point Flow: Generic Point Cloud Pose Estimation | Tao Sun et.al. | 2506.05282 | null |
| 2025-06-05 | Realizing Text-Driven Motion Generation on NAO Robot: A Reinforcement Learning-Optimized Control Pipeline | Zihan Xu et.al. | 2506.05117 | link |
| 2025-06-05 | CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx | Lukas Picek et.al. | 2506.04931 | null |
| 2025-06-05 | SupeRANSAC: One RANSAC to Rule Them All | Daniel Barath et.al. | 2506.04803 | null |
| 2025-06-05 | LGM-Pose: A Lightweight Global Modeling Network for Real-time Human Pose Estimation | Biao Guo et.al. | 2506.04561 | null |
| 2025-06-04 | Photoreal Scene Reconstruction from an Egocentric Device | Zhaoyang Lv et.al. | 2506.04444 | link |
| 2025-06-04 | cuVSLAM: CUDA accelerated visual odometry | Alexander Korovko et.al. | 2506.04359 | null |
| 2025-06-04 | Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation | Tianyu Huang et.al. | 2506.04225 | null |
| 2025-06-04 | Accelerating SfM-based Pose Estimation with Dominating Set | Joji Joseph et.al. | 2506.03667 | null |
| 2025-06-03 | OpenFace 3.0: A Lightweight Multitask System for Comprehensive Facial Behavior Analysis | Jiewen Hu et.al. | 2506.02891 | null |
| 2025-06-03 | Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation | Mingjie Wei et.al. | 2506.02853 | null |
| 2025-06-03 | GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal | Shufan Qing et.al. | 2506.02736 | link |
| 2025-06-02 | Rig3R: Rig-Aware Conditioning for Learned 3D Reconstruction | Samuel Li et.al. | 2506.02265 | null |
| 2025-06-02 | E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models | Wenyan Cong et.al. | 2506.01933 | null |
| 2025-06-02 | SteerPose: Simultaneous Extrinsic Camera Calibration and Matching from Articulation | Sang-Eun Lee et.al. | 2506.01691 | null |
| 2025-06-02 | Sheep Facial Pain Assessment Under Weighted Graph Neural Networks | Alam Noor et.al. | 2506.01468 | null |
| 2025-06-01 | TIGeR: Text-Instructed Generation and Refinement for Template-Free Hand-Object Interaction | Yiyao Huang et.al. | 2506.00953 | null |
| 2025-05-31 | XYZ-IBD: High-precision Bin-picking Dataset for Object 6D Pose Estimation Capturing Real-world Industrial Complexity | Junwen Huang et.al. | 2506.00599 | null |
| 2025-05-30 | Lazy Heuristic Search for Solving POMDPs with Expensive-to-Compute Belief Transitions | Muhammad Suhail Saleem et.al. | 2506.00285 | null |
| 2025-05-30 | 6D Pose Estimation on Point Cloud Data through Prior Knowledge Integration: A Case Study in Autonomous Disassembly | Chengzhi Wu et.al. | 2505.24669 | null |
| 2025-05-30 | Category-Level 6D Object Pose Estimation in Agricultural Settings Using a Lattice-Deformation Framework and Diffusion-Augmented Synthetic Data | Marios Glytsos et.al. | 2505.24636 | null |
| 2025-05-30 | PCIE_Pose Solution for EgoExo4D Pose and Proficiency Estimation Challenge | Feng Chen et.al. | 2505.24411 | null |
| 2025-05-29 | Pose-free 3D Gaussian splatting via shape-ray estimation | Youngju Na et.al. | 2505.22978 | null |
| 2025-05-28 | TwinTrack: Bridging Vision and Contact Physics for Real-Time Tracking of Unknown Dynamic Objects | Wen Yang et.al. | 2505.22882 | null |
| 2025-05-28 | 4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians | Hidenobu Matsuki et.al. | 2505.22859 | null |
| 2025-05-28 | MultiFormer: A Multi-Person Pose Estimation System Based on CSI and Attention Mechanism | Yanyi Qu et.al. | 2505.22555 | null |
| 2025-05-28 | Event-based Egocentric Human Pose Estimation in Dynamic Environment | Wataru Ikeda et.al. | 2505.22007 | null |
| 2025-05-27 | Spectral Compression Transformer with Line Pose Graph for Monocular 3D Human Pose Estimation | Zenghao Zheng et.al. | 2505.21309 | null |
| 2025-05-29 | ReassembleNet: Learnable Keypoints and Diffusion for 2D Fresco Reconstruction | Adeela Islam et.al. | 2505.21117 | null |
| 2025-05-27 | HS-SLAM: A Fast and Hybrid Strategy-Based SLAM Approach for Low-Speed Autonomous Driving | Bingxiang Kang et.al. | 2505.20906 | null |
| 2025-05-27 | Mamba-Driven Topology Fusion for Monocular 3-D Human Pose Estimation | Zenghao Zheng et.al. | 2505.20611 | null |
| 2025-05-28 | HAND Me the Data: Fast Robot Adaptation via Hand Path Retrieval | Matthew Hong et.al. | 2505.20455 | null |
| 2025-05-25 | Learning the Contact Manifold for Accurate Pose Estimation During Peg-in-Hole Insertion of Complex Geometries | Abhay Negi et.al. | 2505.19215 | null |
| 2025-05-24 | Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU | Yicheng Lin et.al. | 2505.18652 | null |
| 2025-05-24 | An Inertial Sequence Learning Framework for Vehicle Speed Estimation via Smartphone IMU | Xuan Xiao et.al. | 2505.18490 | null |
| 2025-05-23 | Pose Splatter: A 3D Gaussian Splatting Model for Quantifying Animal Pose and Appearance | Jack Goffinet et.al. | 2505.18342 | null |
| 2025-05-23 | To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models | Simone Gaisbauer et.al. | 2505.17973 | null |
| 2025-05-23 | Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery | Ming Hu et.al. | 2505.17677 | null |
| 2025-05-23 | PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation | Uyoung Jeong et.al. | 2505.17475 | link |
| 2025-05-22 | Towards Texture- And Shape-Independent 3D Keypoint Estimation in Birds | Valentin Schmuker et.al. | 2505.16633 | null |
| 2025-05-22 | GMatch: Geometry-Constrained Feature Matching for RGB-D Object Pose Estimation | Ming Yang et.al. | 2505.16144 | null |
| 2025-05-21 | Object-Focus Actor for Data-efficient Robot Generalization Dexterous Manipulation | Yihang Li et.al. | 2505.15098 | null |
| 2025-05-20 | UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction | Nisarga Nilavadi et.al. | 2505.14866 | null |
| 2025-05-19 | Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos | Ruoyu Wang et.al. | 2505.13440 | link |
| 2025-05-19 | KinTwin: Imitation Learning with Torque and Muscle Driven Biomechanical Models Enables Precise Replication of Able-Bodied and Impaired Movement from Markerless Motion Capture | R. James Cotton et.al. | 2505.13436 | null |
| 2025-05-19 | The Way Up: A Dataset for Hold Usage Detection in Sport Climbing | Anna Maschek et.al. | 2505.12854 | null |
| 2025-05-17 | Keypoints as Dynamic Centroids for Unified Human Pose and Segmentation | Niaz Ahmad et.al. | 2505.12130 | null |
| 2025-05-17 | Black-box Adversaries from Latent Space: Unnoticeable Attacks on Human Pose and Shape Estimation | Zhiying Li et.al. | 2505.12009 | null |
| 2025-05-17 | ElderFallGuard: Real-Time IoT and Computer Vision-Based Fall Detection System for Elderly Safety | Tasrifur Riahi et.al. | 2505.11845 | null |
| 2025-05-16 | SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision | Utsav Rai et.al. | 2505.11439 | null |
| 2025-05-16 | MTevent: A Multi-Task Event Camera Dataset for 6D Pose Estimation and Moving Object Detection | Shrutarv Awasthi et.al. | 2505.11282 | null |
| 2025-05-16 | PoseBench3D: A Cross-Dataset Analysis Framework for 3D Human Pose Estimation | Saad Manzur et.al. | 2505.10888 | null |
| 2025-05-16 | RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects | Jaeguk Kim et.al. | 2505.10841 | null |
| 2025-05-14 | UMotion: Uncertainty-driven Human Motion Estimation from Inertial and Ultra-wideband Units | Huakun Liu et.al. | 2505.09393 | link |
| 2025-05-14 | APR-Transformer: Initial Pose Estimation for Localization in Complex Environments through Absolute Pose Regression | Srinivas Ravuri et.al. | 2505.09356 | link |
| 2025-05-13 | Real-time Capable Learning-based Visual Tool Pose Correction via Differentiable Simulation | Shuyuan Yang et.al. | 2505.08875 | null |
| 2025-05-12 | Sleep Position Classification using Transfer Learning for Bed-based Pressure Sensors | Olivier Papillon et.al. | 2505.08111 | null |
| 2025-05-12 | Enabling Privacy-Aware AI-Based Ergonomic Analysis | Sander De Coninck et.al. | 2505.07306 | null |
| 2025-05-13 | Human Motion Prediction via Test-domain-aware Adaptation with Easily-available Human Motions Estimated from Videos | Katsuki Shimbo et.al. | 2505.07301 | null |
| 2025-05-12 | When Dance Video Archives Challenge Computer Vision | Philippe Colantoni et.al. | 2505.07249 | null |
| 2025-05-10 | CompSLAM: Complementary Hierarchical Multi-Modal Localization and Mapping for Robot Autonomy in Underground Environments | Shehryar Khattak et.al. | 2505.06483 | null |
| 2025-05-09 | Active Perception for Tactile Sensing: A Task-Agnostic Attention-Based Approach | Tim Schneider et.al. | 2505.06182 | null |
| 2025-05-08 | Semantic Style Transfer for Enhancing Animal Facial Landmark Detection | Anadil Hussein et.al. | 2505.05640 | null |
| 2025-05-08 | Progressive Inertial Poser: Progressive Real-Time Kinematic Chain Estimation for 3D Full-Body Pose from Three IMU Sensors | Zunjie Zhu et.al. | 2505.05336 | null |
| 2025-05-08 | Improving Global Motion Estimation in Sparse IMU-based Motion Capture with Physics | Xinyu Yi et.al. | 2505.05010 | null |
| 2025-05-08 | An Efficient Method for Accurate Pose Estimation and Error Correction of Cuboidal Objects | Utsav Rai et.al. | 2505.04962 | null |
| 2025-05-07 | Comparison of Visual Trackers for Biomechanical Analysis of Running | Luis F. Gomez et.al. | 2505.04713 | null |
| 2025-05-07 | Do We Still Need to Work on Odometry for Autonomous Driving? | Cedric Le Gentil et.al. | 2505.04438 | null |
| 2025-05-07 | HDiffTG: A Lightweight Hybrid Diffusion-Transformer-GCN Architecture for 3D Human Pose Estimation | Yajie Fu et.al. | 2505.04276 | link |
| 2025-05-07 | One2Any: One-Reference 6D Pose Estimation for Any Object | Mengya Liu et.al. | 2505.04109 | null |
| 2025-05-06 | Polar Coordinate-Based 2D Pose Prior with Neural Distance Field | Qi Gan et.al. | 2505.03445 | null |
| 2025-05-06 | LiftFeat: 3D Geometry-Aware Local Feature Matching | Yepeng Liu et.al. | 2505.03422 | link |
| 2025-05-06 | Artificial Behavior Intelligence: Technology, Challenges, and Future Directions | Kanghyun Jo et.al. | 2505.03315 | null |
| 2025-05-05 | Dance of Fireworks: An Interactive Broadcast Gymnastics Training System Based on Pose Estimation | Haotian Chen et.al. | 2505.02690 | null |
| 2025-05-05 | Corr2Distrib: Making Ambiguous Correspondences an Ally to Predict Reliable 6D Pose Distributions | Asma Brazi et.al. | 2505.02501 | null |
| 2025-05-05 | Finger Pose Estimation for Under-screen Fingerprint Sensor | Xiongjun Guan et.al. | 2505.02481 | link |
| 2025-05-05 | 6D Pose Estimation on Spoons and Hands | Kevin Tan et.al. | 2505.02335 | null |
| 2025-05-04 | Continuous Normalizing Flows for Uncertainty-Aware Human Pose Estimation | Shipeng Liu et.al. | 2505.02287 | null |
| 2025-05-04 | A Birotation Solution for Relative Pose Problems | Hongbo Zhao et.al. | 2505.02025 | null |
| 2025-05-03 | Near-field 5D Pose Estimation using Reconfigurable Intelligent Surfaces | Srikar Sharma Sadhu et.al. | 2505.01829 | null |
| 2025-05-03 | AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting | Junhao Shi et.al. | 2505.01799 | null |
| 2025-05-03 | PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth | Bu Jin et.al. | 2505.01729 | null |
| 2025-05-02 | T-Graph: Enhancing Sparse-view Camera Pose Estimation by Pairwise Translation Graph | Qingyu Xian et.al. | 2505.01207 | null |
| 2025-05-02 | 3D Human Pose Estimation via Spatial Graph Order Attention and Temporal Body Aware Transformer | Kamel Aouaidjia et.al. | 2505.01003 | null |
| 2025-05-01 | Are Minimal Radial Distortion Solvers Really Necessary for Relative Pose Estimation? | Viktor Kocur et.al. | 2505.00866 | null |
| 2025-05-01 | P2P-Insole: Human Pose Estimation Using Foot Pressure Distribution and Motion Sensors | Atsuya Watanabe et.al. | 2505.00755 | null |
| 2025-05-01 | Dietary Intake Estimation via Continuous 3D Reconstruction of Food | Wallace Lee et.al. | 2505.00606 | null |
| 2025-05-02 | InterLoc: LiDAR-based Intersection Localization using Road Segmentation with Automated Evaluation Method | Nguyen Hoang Khoi Tran et.al. | 2505.00512 | null |
| 2025-04-30 | Self-Supervised Monocular Visual Drone Model Identification through Improved Occlusion Handling | Stavrow A. Bahnam et.al. | 2504.21695 | null |
| 2025-04-29 | Dance Style Recognition Using Laban Movement Analysis | Muhammad Turab et.al. | 2504.21166 | null |
| 2025-04-29 | Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining | Weizhen He et.al. | 2504.20800 | null |
| 2025-04-29 | A Survey on Event-based Optical Marker Systems | Nafiseh Jabbari Tofighi et.al. | 2504.20736 | null |
| 2025-04-29 | Large-scale visual SLAM for in-the-wild videos | Shuo Sun et.al. | 2504.20496 | null |
| 2025-05-01 | GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting | Jongwon Lee et.al. | 2504.20379 | null |
| 2025-05-01 | PRISM-DP: Spatial Pose-based Observations for Diffusion-Policies via Segmentation, Mesh Generation, and Pose Tracking | Xiatao Sun et.al. | 2504.20359 | null |
| 2025-04-28 | Transformation & Translation Occupancy Grid Mapping: 2-Dimensional Deep Learning Refined SLAM | Leon Davies et.al. | 2504.19654 | null |
| 2025-04-28 | GAN-SLAM: Real-Time GAN Aided Floor Plan Creation Through SLAM | Leon Davies et.al. | 2504.19653 | null |
| 2025-04-28 | Category-Level and Open-Set Object Pose Estimation for Robotics | Peter Hönig et.al. | 2504.19572 | null |
| 2025-04-25 | Certifiably-Correct Mapping for Safe Navigation Despite Odometry Drift | Devansh R. Agrawal et.al. | 2504.18713 | null |
| 2025-04-25 | SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations | Shuting Zhao et.al. | 2504.18332 | null |
| 2025-04-25 | S3MOT: Monocular 3D Object Tracking with Selective State Space Model | Zhuohao Yan et.al. | 2504.18068 | null |
| 2025-04-22 | SmallGS: Gaussian Splatting-based Camera Pose Estimation for Small-Baseline Videos | Yuxin Yao et.al. | 2504.17810 | null |
| 2025-04-24 | Dynamic Camera Poses and Where to Find Them | Chris Rockwell et.al. | 2504.17788 | null |
| 2025-04-24 | A Guide to Structureless Visual Localization | Vojtech Panek et.al. | 2504.17636 | null |
| 2025-04-24 | Object Pose Estimation by Camera Arm Control Based on the Next Viewpoint Estimation | Tomoki Mizuno et.al. | 2504.17424 | null |
| 2025-04-24 | Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization | Guangyang Zeng et.al. | 2504.17410 | null |
| 2025-04-23 | WiFi based Human Fall and Activity Recognition using Transformer based Encoder Decoder and Graph Neural Networks | Younggeol Cho et.al. | 2504.16655 | null |
| 2025-04-23 | Assessing the Feasibility of Internet-Sourced Video for Automatic Cattle Lameness Detection | Md Fahimuzzman Sohan et.al. | 2504.16404 | null |
| 2025-04-22 | SignX: The Foundation Model for Sign Recognition | Sen Fang et.al. | 2504.16315 | null |
| 2025-04-22 | GADS: A Super Lightweight Model for Head Pose Estimation | Menan Velayuthan et.al. | 2504.15751 | null |
| 2025-04-21 | Field Report on Ground Penetrating Radar for Localization at the Mars Desert Research Station | Anja Sheppard et.al. | 2504.15455 | null |
| 2025-04-21 | Vision6D: 3D-to-2D Interactive Visualization and Annotation Tool for 6D Pose Estimation | Yike Zhang et.al. | 2504.15329 | null |
| 2025-04-21 | Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs | Chun-Hsiao Yeh et.al. | 2504.15280 | link |
| 2025-04-21 | Instance-Adaptive Keypoint Learning with Local-to-Global Geometric Aggregation for Category-Level Object Pose Estimation | Xiao Zhang et.al. | 2504.15134 | null |
| 2025-04-20 | Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction | Weirong Chen et.al. | 2504.14516 | null |
| 2025-04-20 | SG-Reg: Generalizable and Efficient Scene Graph Registration | Chuhao Liu et.al. | 2504.14440 | link |
| 2025-04-18 | Imitation Learning with Precisely Labeled Human Demonstrations | Yilong Song et.al. | 2504.13803 | null |
| 2025-04-18 | Mono3R: Exploiting Monocular Cues for Geometric 3D Reconstruction | Wenyu Li et.al. | 2504.13419 | null |
| 2025-04-17 | ViTa-Zero: Zero-shot Visuotactile Object 6D Pose Estimation | Hongyu Li et.al. | 2504.13179 | null |
| 2025-04-18 | ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos | Zetong Zhang et.al. | 2504.13167 | null |
| 2025-04-17 | Unsupervised Cross-Domain 3D Human Pose Estimation via Pseudo-Label-Guided Global Transforms | Jingjing Liu et.al. | 2504.12699 | null |
| 2025-04-16 | MobilePoser: Real-Time Full-Body Pose Estimation and 3D Human Translation from IMUs in Mobile Consumer Devices | Vasco Xu et.al. | 2504.12492 | link |
| 2025-04-16 | Diffusion Based Robust LiDAR Place Recognition | Benjamin Krummenacher et.al. | 2504.12412 | null |
| 2025-04-16 | Regist3R: Incremental Registration with Stereo Foundation Model | Sidun Liu et.al. | 2504.12356 | null |
| 2025-04-16 | CoMotion: Concurrent Multi-person 3D Motion | Alejandro Newell et.al. | 2504.12186 | link |
| 2025-04-16 | No Fuss, Just Function – A Proposal for Non-Intrusive Full Body Tracking in XR for Meaningful Spatial Interactions | Elisabeth Mayer et.al. | 2504.11987 | null |
| 2025-04-16 | An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open World | Xingwu Ji et.al. | 2504.11698 | link |
| 2025-04-17 | CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image | Jingshun Huang et.al. | 2504.11230 | null |
| 2025-04-15 | DMAGaze: Gaze Estimation Based on Feature Disentanglement and Multi-Scale Attention | Haohan Chen et.al. | 2504.11160 | null |
| 2025-04-14 | MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model | Jian Liu et.al. | 2504.10433 | null |
| 2025-04-14 | Benchmarking 3D Human Pose Estimation Models Under Occlusions | Filipa Lino et.al. | 2504.10350 | null |
| 2025-04-15 | Differentially Private 2D Human Pose Estimation | Kaushik Bhargav Sivangi et.al. | 2504.10190 | null |
| 2025-04-14 | TT3D: Table Tennis 3D Reconstruction | Thomas Gossard et.al. | 2504.10035 | null |
| 2025-04-14 | Efficient 2D to Full 3D Human Pose Uplifting including Joint Rotations | Katja Ludwig et.al. | 2504.09953 | null |
| 2025-04-14 | NeRF-Based Transparent Object Grasping Enhanced by Shape Priors | Yi Han et.al. | 2504.09868 | null |
| 2025-04-13 | EasyREG: Easy Depth-Based Markerless Registration and Tracking using Augmented Reality Device for Surgical Guidance | Yue Yang et.al. | 2504.09498 | null |
| 2025-04-12 | SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow | Qingyuan Wang et.al. | 2504.09160 | null |
| 2025-04-12 | A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds | Jizong Peng et.al. | 2504.09129 | null |
| 2025-04-12 | BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting | Jeongwan On et.al. | 2504.09097 | null |
| 2025-04-11 | The Invisible EgoHand: 3D Hand Forecasting through EgoBody Pose Estimation | Masashi Hatano et.al. | 2504.08654 | null |
| 2025-04-11 | MBE-ARI: A Multimodal Dataset Mapping Bi-directional Engagement in Animal-Robot Interaction | Ian Noronha et.al. | 2504.08646 | null |
| 2025-04-11 | Hardware, Algorithms, and Applications of the Neuromorphic Vision Sensor: a Review | Claudio Cimarelli et.al. | 2504.08588 | null |
| 2025-04-11 | Multi-person Physics-based Pose Estimation for Combat Sports | Hossein Feiz et.al. | 2504.08175 | null |
| 2025-04-10 | Towards Unconstrained 2D Pose Estimation of the Human Spine | Muhammad Saif Ullah Khan et.al. | 2504.08110 | link |
| 2025-04-10 | BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation | Yuanhong Yu et.al. | 2504.07955 | link |
| 2025-04-09 | DLTPose: 6DoF Pose Estimation From Accurate Dense Surface Point Estimates | Akash Jadhav et.al. | 2504.07335 | null |
| 2025-04-09 | Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation | Yu Qi et.al. | 2504.06961 | null |
| 2025-04-09 | GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered Scenes | Seunghyeok Back et.al. | 2504.06866 | link |
| 2025-04-09 | Setup-Invariant Augmented Reality for Teaching by Demonstration with Surgical Robots | Alexandre Banks et.al. | 2504.06677 | link |
| 2025-04-09 | HGMamba: Enhancing 3D Human Pose Estimation with a HyperGCN-Mamba Network | Hu Cui et.al. | 2504.06638 | null |
| 2025-04-08 | Leveraging Synthetic Adult Datasets for Unsupervised Infant Pose Estimation | Sarosij Bose et.al. | 2504.05789 | null |
| 2025-04-08 | SAP-CoPE: Social-Aware Planning using Cooperative Pose Estimation with Infrastructure Sensor Nodes | Minghao Ning et.al. | 2504.05727 | link |
| 2025-04-08 | POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction | Songyan Zhang et.al. | 2504.05692 | link |
| 2025-04-10 | Learning Affine Correspondences by Integrating Geometric Constraints | Pengju Sun et.al. | 2504.04834 | link |
| 2025-04-06 | A Convex and Global Solution for the P $n$ P Problem in 2D Forward-Looking Sonar | Jiayi Su et.al. | 2504.04445 | null |
| 2025-04-05 | 3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS | Zhisheng Huang et.al. | 2504.04294 | null |
| 2025-04-02 | A Geometric Approach For Pose and Velocity Estimation Using IMU and Inertial/Body-Frame Measurements | Sifeddine Benahmed et.al. | 2504.03764 | null |
| 2025-04-04 | Robust Human Registration with Body Part Segmentation on Noisy Point Clouds | Kai Lascheit et.al. | 2504.03602 | null |
| 2025-04-04 | Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video | Jiaxin Guo et.al. | 2504.03198 | null |
| 2025-04-03 | Cooperative Inference for Real-Time 3D Human Pose Estimation in Multi-Device Edge Networks | Hyun-Ho Choi et.al. | 2504.03052 | null |
| 2025-04-03 | BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation | Van Nguyen Nguyen et.al. | 2504.02812 | link |
| 2025-04-03 | PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation | Lihua Liu et.al. | 2504.02617 | null |
| 2025-04-02 | Dual-stream Transformer-GCN Model with Contextualized Representations Learning for Monocular 3D Human Pose Estimation | Mingrui Ye et.al. | 2504.01764 | link |
| 2025-04-02 | ForestVO: Enhancing Visual Odometry in Forest Environments through ForestGlue | Thomas Pritchard et.al. | 2504.01261 | link |
| 2025-04-01 | AP-CAP: Advancing High-Quality Data Synthesis for Animal Pose Estimation via a Controllable Image Generation Pipeline | Lei Wang et.al. | 2504.00394 | null |
| 2025-03-31 | Easi3R: Estimating Disentangled Motion from DUSt3R Without Training | Xingyu Chen et.al. | 2503.24391 | link |
| 2025-03-31 | LiM-Loc: Visual Localization with Dense and Accurate 3D Reference Maps Directly Corresponding 2D Keypoints to 3D LiDAR Point Clouds | Masahiko Tsuji et.al. | 2503.23664 | null |
| 2025-03-30 | PhysPose: Refining 6D Object Poses with Physical Constraints | Martin Malenický et.al. | 2503.23587 | null |
| 2025-03-30 | Improving Indoor Localization Accuracy by Using an Efficient Implicit Neural Map Representation | Haofei Kuang et.al. | 2503.23480 | link |
| 2025-03-30 | SparseLoc: Sparse Open-Set Landmark-based Global Localization for Autonomous Navigation | Pranjal Paul et.al. | 2503.23465 | null |
| 2025-03-30 | HiPART: Hierarchical Pose AutoRegressive Transformer for Occluded 3D Human Pose Estimation | Hongwei Zheng et.al. | 2503.23331 | null |
| 2025-03-29 | Incorporating GNSS Information with LIDAR-Inertial Odometry for Accurate Land-Vehicle Localization | Jintao Cheng et.al. | 2503.23199 | null |
| 2025-03-28 | ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection | Nandakishor M et.al. | 2503.22363 | null |
| 2025-03-28 | GCRayDiffusion: Pose-Free Surface Reconstruction via Geometric Consistent Ray Diffusion | Li-Heng Chen et.al. | 2503.22349 | null |
| 2025-03-27 | NeRF-based Point Cloud Reconstruction using a Stationary Camera for Agricultural Applications | Kibon Ku et.al. | 2503.21958 | null |
| 2025-03-27 | Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video | David Yifan Yao et.al. | 2503.21761 | link |
| 2025-03-27 | Reconstructing Humans with a Biomechanically Accurate Skeleton | Yan Xia et.al. | 2503.21751 | link |
| 2025-03-27 | OccRobNet : Occlusion Robust Network for Accurate 3D Interacting Hand-Object Pose Estimation | Mallika Garg et.al. | 2503.21723 | null |
| 2025-03-27 | RapidPoseTriangulation: Multi-view Multi-person Whole-body Human Pose Triangulation in a Millisecond | Daniel Bermuth et.al. | 2503.21692 | null |
| 2025-03-27 | STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM | Yongxu Wang et.al. | 2503.21425 | null |
| 2025-03-27 | Lidar-only Odometry based on Multiple Scan-to-Scan Alignments over a Moving Window | Aaron Kurda et.al. | 2503.21293 | null |
| 2025-03-27 | Recurrent Feature Mining and Keypoint Mixup Padding for Category-Agnostic Pose Estimation | Junjie Chen et.al. | 2503.21140 | link |
| 2025-03-26 | DINeMo: Learning Neural Mesh Models with no 3D Annotations | Weijie Guo et.al. | 2503.20220 | null |
| 2025-03-25 | Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors | Yuke Lou et.al. | 2503.20118 | null |
| 2025-03-25 | Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image Encoders | Paul Koch et.al. | 2503.19947 | null |
| 2025-03-25 | Visuo-Tactile Object Pose Estimation for a Multi-Finger Robot Hand with Low-Resolution In-Hand Tactile Sensing | Lukas Mack et.al. | 2503.19893 | null |
| 2025-03-25 | Semi-SD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving | Yusen Xie et.al. | 2503.19713 | null |
| 2025-03-25 | DynOPETs: A Versatile Benchmark for Dynamic Object Pose Estimation and Tracking in Moving Camera Scenarios | Xiangting Meng et.al. | 2503.19625 | null |
| 2025-03-25 | Pose-Based Fall Detection System: Efficient Monitoring on Standard CPUs | Vinayak Mali et.al. | 2503.19501 | null |
| 2025-03-25 | Multi-modal 3D Pose and Shape Estimation with Computed Tomography | Mingxiao Tu et.al. | 2503.19405 | null |
| 2025-03-25 | From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting | Zhiwei Huang et.al. | 2503.19358 | null |
| 2025-03-25 | Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation | Zhuoran Zhao et.al. | 2503.19307 | link |
| 2025-03-25 | Any6D: Model-free 6D Pose Estimation of Novel Objects | Taeyeop Lee et.al. | 2503.18673 | link |
| 2025-03-24 | Structure-Aware Correspondence Learning for Relative Pose Estimation | Yihan Chen et.al. | 2503.18671 | null |
| 2025-03-24 | TrackID3x3: A Dataset and Algorithm for Multi-Player Tracking with Identification and Pose Estimation in 3x3 Basketball Full-court Videos | Kazuhiro Yamada et.al. | 2503.18282 | null |
| 2025-03-23 | Selecting and Pruning: A Differentiable Causal Sequentialized State-Space Model for Two-View Correspondence Learning | Xiang Fang et.al. | 2503.17938 | null |
| 2025-03-22 | Co-op: Correspondence-based Novel Object Pose Estimation | Sungphill Moon et.al. | 2503.17731 | null |
| 2025-03-21 | Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image | Jerred Chen et.al. | 2503.17358 | null |
| 2025-03-21 | Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors | Wonbong Jang et.al. | 2503.17316 | null |
| 2025-03-20 | ContactFusion: Stochastic Poisson Surface Maps from Visual and Contact Sensing | Aditya Kamireddypalli et.al. | 2503.16592 | null |
| 2025-03-19 | A Comprehensive Survey on Architectural Advances in Deep CNNs: Challenges, Applications, and Emerging Research Directions | Saddam Hussain Khan et.al. | 2503.16546 | null |
| 2025-03-20 | Probabilistic Prompt Distribution Learning for Animal Pose Estimation | Jiyong Rao et.al. | 2503.16120 | link |
| 2025-03-20 | Automating 3D Dataset Generation with Neural Radiance Fields | P. Schulz et.al. | 2503.15997 | link |
| 2025-03-20 | Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras | Beilei Cui et.al. | 2503.15917 | null |
| 2025-03-19 | EdgeRegNet: Edge Feature-based Multimodal Registration Network between Images and LiDAR Point Clouds | Yuanchao Yue et.al. | 2503.15284 | null |
| 2025-03-20 | GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose Estimation | Zinqin Huang et.al. | 2503.15110 | link |
| 2025-03-20 | Distilling 3D distinctive local descriptors for 6D pose estimation | Amir Hamza et.al. | 2503.15106 | null |
| 2025-03-18 | Validation of Human Pose Estimation and Human Mesh Recovery for Extracting Clinically Relevant Motion Data from Videos | Kai Armstrong et.al. | 2503.14760 | null |
| 2025-03-18 | SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model | Yucheng Mao et.al. | 2503.14463 | null |
| 2025-03-18 | SCJD: Sparse Correlation and Joint Distillation for Efficient 3D Human Pose Estimation | Weihong Chen et.al. | 2503.14097 | null |
| 2025-03-18 | Foundation Feature-Driven Online End-Effector Pose Estimation: A Marker-Free and Learning-Free Approach | Tianshu Wu et.al. | 2503.14051 | null |
| 2025-03-19 | Learning Shape-Independent Transformation via Spherical Representations for Category-Level Object Pose Estimation | Huan Ren et.al. | 2503.13926 | null |
| 2025-03-17 | STEP: Simultaneous Tracking and Estimation of Pose for Animals and Humans | Shashikant Verma et.al. | 2503.13344 | null |
| 2025-03-17 | UniHOPE: A Unified Approach for Hand-Only and Hand-Object Pose Estimation | Yinqiao Wang et.al. | 2503.13303 | null |
| 2025-03-17 | Uncertainty-Aware Knowledge Distillation for Compact and Efficient 6DoF Pose Estimation | Nassim Ali Ousalah et.al. | 2503.13053 | null |
| 2025-03-17 | PoseSyn: Synthesizing Diverse 3D Pose Data from In-the-Wild 2D Data | ChangHee Yang et.al. | 2503.13025 | null |
| 2025-03-15 | Gun Detection Using Combined Human Pose and Weapon Appearance | Amulya Reddy Maligireddy et.al. | 2503.12215 | null |
| 2025-03-15 | TACO: Taming Diffusion for in-the-wild Video Amodal Completion | Ruijie Lu et.al. | 2503.12049 | null |
| 2025-03-14 | Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation | Hiroyasu Akada et.al. | 2503.11652 | null |
| 2025-03-14 | Online Test-time Adaptation for 3D Human Pose Estimation: A Practical Perspective with Estimated 2D Poses | Qiuxia Lin et.al. | 2503.11194 | null |
| 2025-03-14 | Fast and Robust Localization for Humanoid Soccer Robot via Iterative Landmark Matching | Ruochen Hou et.al. | 2503.11020 | null |
| 2025-03-13 | Clothes-Changing Person Re-identification Based On Skeleton Dynamics | Asaf Joseph et.al. | 2503.10759 | null |
| 2025-03-13 | Consistent multi-animal pose estimation in cattle using dynamic Kalman filter based tracking | Maarten Perneel et.al. | 2503.10450 | null |
| 2025-03-13 | 6D Object Pose Tracking in Internet Videos for Robotic Manipulation | Georgy Ponimatkin et.al. | 2503.10307 | null |
| 2025-03-13 | VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames | Zhiqi Li et.al. | 2503.10286 | null |
| 2025-03-12 | Physics-Aware Human-Object Rendering from Sparse Views via 3D Gaussian Splatting | Weiquan Wang et.al. | 2503.09640 | null |
| 2025-03-12 | GenHPE: Generative Counterfactuals for 3D Human Pose Estimation with Radio Frequency Signals | Shuokang Huang et.al. | 2503.09537 | null |
| 2025-03-12 | MonoSLAM: Robust Monocular SLAM with Global Structure Optimization | Bingzheng Jiang et.al. | 2503.09296 | null |
| 2025-03-12 | Better Together: Unified Motion Capture and 3D Avatar Reconstruction | Arthur Moreau et.al. | 2503.09293 | null |
| 2025-03-11 | Acoustic Neural 3D Reconstruction Under Pose Drift | Tianxiang Lin et.al. | 2503.08930 | null |
| 2025-03-11 | Keypoint Semantic Integration for Improved Feature Matching in Outdoor Agricultural Environments | Rajitha de Silva et.al. | 2503.08843 | null |
| 2025-03-11 | Keypoint Detection and Description for Raw Bayer Images | Jiakai Lin et.al. | 2503.08673 | null |
| 2025-03-11 | SGNetPose+: Stepwise Goal-Driven Networks with Pose Information for Trajectory Prediction in Autonomous Driving | Akshat Ghiya et.al. | 2503.08016 | null |
| 2025-03-10 | Better Pose Initialization for Fast and Robust 2D/3D Pelvis Registration | Yehyun Suh et.al. | 2503.07767 | null |
| 2025-03-10 | HumanMM: Global Human Motion Recovery from Multi-shot Videos | Yuhong Zhang et.al. | 2503.07597 | link |
| 2025-03-11 | AthletePose3D: A Benchmark Dataset for 3D Human Pose Estimation and Kinematic Validation in Athletic Movements | Calvin Yeung et.al. | 2503.07499 | null |
| 2025-03-10 | Multi-Robot System for Cooperative Exploration in Unknown Environments: A Survey | Chuqi Wang et.al. | 2503.07278 | null |
| 2025-03-10 | Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion | Mona Sheikh Zeinoddin et.al. | 2503.07204 | null |
| 2025-03-10 | Multi-Modal 3D Mesh Reconstruction from Images and Text | Melvin Reka et.al. | 2503.07190 | null |
| 2025-03-11 | PoseLess: Depth-Free Vision-to-Joint Control via Direct Image Mapping with VLM | Alan Dao et.al. | 2503.07111 | link |
| 2025-03-09 | AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation | Yang Zou et.al. | 2503.06660 | null |
| 2025-03-08 | NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features | Hongjia Zhai et.al. | 2503.06117 | null |
| 2025-03-08 | Fish2Mesh Transformer: 3D Human Mesh Recovery from Egocentric Vision | David C. Jeong et.al. | 2503.06089 | null |
| 2025-03-08 | ReJSHand: Efficient Real-Time Hand Pose Estimation and Mesh Reconstruction Using Refined Joint and Skeleton Features | Shan An et.al. | 2503.05995 | link |
| 2025-03-07 | Differentiable Rendering-based Pose Estimation for Surgical Robotic Instruments | Zekai Liang et.al. | 2503.05953 | null |
| 2025-03-07 | Novel Object 6D Pose Estimation with a Single Reference View | Jian Liu et.al. | 2503.05578 | link |
| 2025-03-07 | Multi-Grained Feature Pruning for Video-Based Human Pose Estimation | Zhigang Wang et.al. | 2503.05365 | null |
| 2025-03-07 | Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects | Justin Yu et.al. | 2503.05189 | null |
| 2025-03-07 | SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting | Linqi Yang et.al. | 2503.05174 | null |
| 2025-03-07 | GaussianCAD: Robust Self-Supervised CAD Reconstruction from Three Orthographic Views Using 3D Gaussian Splatting | Zheng Zhou et.al. | 2503.05161 | null |
| 2025-03-06 | MarsLGPR: Mars Rover Localization with Ground Penetrating Radar | Anja Sheppard et.al. | 2503.04944 | null |
| 2025-03-06 | ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem | Yu-Hsi Chen et.al. | 2503.04500 | link |
| 2025-03-05 | Active 6D Pose Estimation for Textureless Objects using Multi-View RGB Frames | Jun Yang et.al. | 2503.03726 | null |
| 2025-03-05 | Machine Learning in Biomechanics: Key Applications and Limitations in Walking, Running, and Sports Movements | Carlo Dindorf et.al. | 2503.03717 | null |
| 2025-03-05 | Improving 6D Object Pose Estimation of metallic Household and Industry Objects | Thomas Pöllabauer et.al. | 2503.03655 | null |
| 2025-03-05 | Tiny Lidars for Manipulator Self-Awareness: Sensor Characterization and Initial Localization Experiments | Giammarco Caroleo et.al. | 2503.03449 | null |
| 2025-03-05 | Direct Sparse Odometry with Continuous 3D Gaussian Maps for Indoor Environments | Jie Deng et.al. | 2503.03373 | null |
| 2025-03-05 | Supervised Visual Docking Network for Unmanned Surface Vehicles Using Auto-labeling in Real-world Water Environments | Yijie Chu et.al. | 2503.03282 | null |
| 2025-03-05 | SCORE: Saturated Consensus Relocalization in Semantic Line Maps | Haodong Jiang et.al. | 2503.03254 | null |
| 2025-03-04 | Monocular Person Localization under Camera Ego-motion | Yu Zhan et.al. | 2503.02916 | null |
| 2025-03-04 | PIDLoc: Cross-View Pose Optimization Network Inspired by PID Controllers | Wooju Lee et.al. | 2503.02388 | null |
| 2025-03-04 | DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting | Haoyuan Li et.al. | 2503.02223 | null |
| 2025-03-04 | Zero-Shot Sim-to-Real Visual Quadrotor Control with Hard Constraints | Yan Miao et.al. | 2503.02198 | null |
| 2025-03-03 | Constraint-Based Modeling of Dynamic Entities in 3D Scene Graphs for Robust SLAM | Marco Giberna et.al. | 2503.02050 | null |
| 2025-03-03 | Category-level Meta-learned NeRF Priors for Efficient Object Mapping | Saad Ejaz et.al. | 2503.01582 | null |
| 2025-03-03 | RUSSO: Robust Underwater SLAM with Sonar Optimization against Visual Degradation | Shu Pan et.al. | 2503.01434 | null |
| 2025-03-03 | ecg2o: A Seamless Extension of g2o for Equality-Constrained Factor Graph Optimization | Anas Abdelkarim et.al. | 2503.01311 | null |
| 2025-03-03 | Convex Hull-based Algebraic Constraint for Visual Quadric SLAM | Xiaolong Yu et.al. | 2503.01254 | link |
| 2025-03-04 | Floorplan-SLAM: A Real-Time, High-Accuracy, and Long-Term Multi-Session Point-Plane SLAM for Efficient Floorplan Reconstruction | Haolin Wang et.al. | 2503.00397 | null |
| 2025-03-01 | BGM2Pose: Active 3D Human Pose Estimation with Non-Stationary Sounds | Yuto Shibata et.al. | 2503.00389 | null |
| 2025-02-28 | BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports | Jing-Yuan Chang et.al. | 2502.21085 | null |
| 2025-02-28 | Two-Stream Spatial-Temporal Transformer Framework for Person Identification via Natural Conversational Keypoints | Masoumeh Chapariniya et.al. | 2502.20803 | null |
| 2025-02-27 | Cutting-edge 3D reconstruction solutions for underwater coral reef images: A review and comparison | Jiageng Zhong et.al. | 2502.20154 | null |
| 2025-02-27 | BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground | Yufei Wei et.al. | 2502.20078 | null |
| 2025-02-28 | SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird’s-Eye-View Segmentation | Zijie Zhou et.al. | 2502.20077 | link |
| 2025-02-27 | RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges | Thibaut Loiseau et.al. | 2502.19955 | null |
| 2025-02-27 | QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects | Elkhan Ismayilzada et.al. | 2502.19769 | null |
| 2025-02-27 | Accurate Pose Estimation for Flight Platforms based on Divergent Multi-Aperture Imaging System | Shunkun Liang et.al. | 2502.19708 | null |
| 2025-02-26 | Increasing the Task Flexibility of Heavy-Duty Manipulators Using Visual 6D Pose Estimation of Objects | Petri Mäkinen et.al. | 2502.19169 | null |
| 2025-02-25 | EgoSim: An Egocentric Multi-view Simulator and Real Dataset for Body-worn Cameras during Motion and Activity | Dominik Hollidt et.al. | 2502.18373 | link |
| 2025-02-25 | Learning Structure-Supporting Dependencies via Keypoint Interactive Transformer for General Mammal Pose Estimation | Tianyang Xu et.al. | 2502.18214 | link |
| 2025-02-24 | V-HOP: Visuo-Haptic 6D Object Pose Tracking | Hongyu Li et.al. | 2502.17434 | null |
| 2025-02-23 | Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM | Yao Zhang et.al. | 2502.16495 | null |
| 2025-02-23 | DeProPose: Deficiency-Proof 3D Human Pose Estimation via Adaptive Multi-View Fusion | Jianbin Jiao et.al. | 2502.16419 | link |
| 2025-02-21 | RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes | Sicheng Yu et.al. | 2502.15633 | null |
| 2025-02-21 | SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training | Nie Lin et.al. | 2502.15251 | null |
| 2025-02-21 | Nonlinear Dynamical Systems for Automatic Face Annotation in Head Tracking and Pose Estimation | Thoa Thieu et.al. | 2502.15179 | null |
| 2025-02-20 | Design of a Visual Pose Estimation Algorithm for Moon Landing | Atakan Süslü et.al. | 2502.14942 | null |
| 2025-02-20 | Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting | Boying Li et.al. | 2502.14931 | null |
| 2025-02-19 | EfficientPose 6D: Scalable and Efficient 6D Object Pose Estimation | Zixuan Fang et.al. | 2502.14061 | null |
| 2025-02-19 | Active Illumination for Visual Ego-Motion Estimation in the Dark | Francesco Crocetti et.al. | 2502.13708 | null |
| 2025-02-19 | Object-Pose Estimation With Neural Population Codes | Heiko Hoffmann et.al. | 2502.13403 | null |
| 2025-02-18 | Spatiotemporal Multi-Camera Calibration using Freely Moving People | Sang-Eun Lee et.al. | 2502.12546 | null |
| 2025-02-18 | Learning Transformation-Isomorphic Latent Space for Accurate Hand Pose Estimation | Kaiwen Ren et.al. | 2502.12535 | null |
| 2025-02-19 | FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views | Shangzhan Zhang et.al. | 2502.12138 | null |
| 2025-02-17 | Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection | Tessa Pulli et.al. | 2502.12027 | null |
| 2025-02-17 | SurgPose: a Dataset for Articulated Robotic Surgical Tool Pose Estimation and Tracking | Zijian Wu et.al. | 2502.11534 | null |
| 2025-02-18 | VarGes: Improving Variation in Co-Speech 3D Gesture Generation via StyleCLIPS | Ming Meng et.al. | 2502.10729 | link |
| 2025-02-15 | Semantics-aware Test-time Adaptation for 3D Human Pose Estimation | Qiuxia Lin et.al. | 2502.10724 | null |
| 2025-02-15 | Learning semantical dynamics and spatiotemporal collaboration for human pose estimation in video | Runyang Feng et.al. | 2502.10616 | null |
| 2025-02-14 | HIPPo: Harnessing Image-to-3D Priors for Model-free Zero-shot 6D Pose Estimation | Yibo Liu et.al. | 2502.10606 | null |
| 2025-02-14 | Manual2Skill: Learning to Read Manuals and Acquire Robotic Skills for Furniture Assembly Using Vision-Language Models | Chenrui Tie et.al. | 2502.10090 | null |
| 2025-02-13 | Metamorphic Testing for Pose Estimation Systems | Matias Duran et.al. | 2502.09460 | null |
| 2025-02-13 | BevSplat: Resolving Height Ambiguity via Feature-Based Gaussian Primitives for Weakly-Supervised Cross-View Localization | Qiwei Wang et.al. | 2502.09080 | null |
| 2025-02-14 | Siren Song: Manipulating Pose Estimation in XR Headsets Using Acoustic Attacks | Zijian Huang et.al. | 2502.08865 | null |
| 2025-02-12 | LIR-LIVO: A Lightweight,Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep Features | Shujie Zhou et.al. | 2502.08676 | link |
| 2025-02-12 | CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World | Yankai Fu et.al. | 2502.08449 | null |
| 2025-02-11 | GaRLIO: Gravity enhanced Radar-LiDAR-Inertial Odometry | Chiyun Noh et.al. | 2502.07703 | link |
| 2025-02-11 | Matrix3D: Large Photogrammetry Model All-in-One | Yuanxun Lu et.al. | 2502.07685 | null |
| 2025-02-08 | Vision-in-the-loop Simulation for Deep Monocular Pose Estimation of UAV in Ocean Environment | Maneesha Wickramasuriya et.al. | 2502.05409 | null |
| 2025-02-06 | Measuring Physical Plausibility of 3D Human Poses Using Physics Simulation | Nathan Louis et.al. | 2502.04483 | link |
| 2025-02-06 | GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation | Weihang Li et.al. | 2502.04293 | null |
| 2025-02-06 | Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks | Yuhui Jin et.al. | 2502.03877 | null |
| 2025-02-05 | Mapping and Localization Using LiDAR Fiducial Markers | Yibo Liu et.al. | 2502.03510 | null |
| 2025-02-04 | Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation | Jian Liu et.al. | 2502.02525 | link |
| 2025-02-03 | CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation | Xiao Lin et.al. | 2502.01312 | null |
| 2025-02-03 | Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter | Dabin Kim et.al. | 2502.01092 | null |
| 2025-02-03 | ZeroBP: Learning Position-Aware Correspondence for Zero-shot 6D Pose Estimation in Bin-Picking | Jianqiu Chen et.al. | 2502.01004 | null |
| 2025-01-31 | A Direct Semi-Exhaustive Search Method for Robust, Partial-to-Full Point Cloud Registration | Richard Cheng et.al. | 2502.00115 | null |
| 2025-01-31 | XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses | Bo Lan et.al. | 2501.19034 | link |
| 2025-01-30 | SimpleDepthPose: Fast and Reliable Human Pose Estimation with RGBD-Images | Daniel Bermuth et.al. | 2501.18478 | null |
| 2025-01-29 | Online Trajectory Replanner for Dynamically Grasping Irregular Objects | Minh Nhat Vu et.al. | 2501.17968 | null |
| 2025-01-28 | DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging | Muxi Chen et.al. | 2501.16751 | null |
| 2025-01-27 | Toward Efficient Generalization in 3D Human Pose Estimation via a Canonical Domain Approach | Hoosang Lee et.al. | 2501.16146 | null |
| 2025-01-27 | NanoHTNet: Nano Human Topology Network for Efficient 3D Human Pose Estimation | Jialun Cai et.al. | 2501.15763 | null |
| 2025-01-25 | Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos | Zhen-Hui Dong et.al. | 2501.15096 | null |
| 2025-01-25 | SpatioTemporal Learning for Human Pose Estimation in Sparsely-Labeled Videos | Yingying Jiao et.al. | 2501.15073 | null |
| 2025-01-24 | 3D/2D Registration of Angiograms using Silhouette-based Differentiable Rendering | Taewoong Lee et.al. | 2501.14918 | link |
| 2025-01-24 | Light3R-SfM: Towards Feed-forward Structure-from-Motion | Sven Elflein et.al. | 2501.14914 | null |
| 2025-01-24 | Glissando-Net: Deep sinGLe vIew category level poSe eStimation ANd 3D recOnstruction | Bo Sun et.al. | 2501.14896 | null |
| 2025-01-24 | Optimizing Grasping Precision for Industrial Pick-and-Place Tasks Through a Novel Visual Servoing Approach | Khairidine Benali et.al. | 2501.14557 | null |
| 2025-01-24 | LiDAR-Based Vehicle Detection and Tracking for Autonomous Racing | Marcello Cellina et.al. | 2501.14502 | null |
| 2025-01-24 | Optimizing Human Pose Estimation Through Focused Human and Joint Regions | Yingying Jiao et.al. | 2501.14439 | null |
| 2025-01-24 | Causal-Inspired Multitask Learning for Video-Based Human Pose Estimation | Haipeng Chen et.al. | 2501.14356 | null |
| 2025-01-24 | HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting | Javier Yu et.al. | 2501.14147 | null |
| 2025-01-23 | Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass | Jianing Yang et.al. | 2501.13928 | null |
| 2025-01-23 | EgoHand: Ego-centric Hand Pose Estimation and Gesture Recognition with Head-mounted Millimeter-wave Radar and IMUs | Yizhe Lv et.al. | 2501.13805 | link |
| 2025-01-23 | VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM | Gyuhyeon Pak et.al. | 2501.13402 | null |
| 2025-01-22 | Deep Learning-Based Image Recovery and Pose Estimation for Resident Space Objects | Louis Aberdeen et.al. | 2501.13009 | null |
| 2025-01-21 | BlanketGen2-Fit3D: Synthetic Blanket Augmentation Towards Improving Real-World In-Bed Blanket Occluded Human Pose Estimation | Tamás Karácsony et.al. | 2501.12318 | null |
| 2025-01-19 | Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation | Shibang Liu et.al. | 2501.11069 | null |
| 2025-01-17 | landmarker: a Toolkit for Anatomical Landmark Localization in 2D/3D Images | Jef Jonkers et.al. | 2501.10098 | link |
| 2025-01-16 | A New Teacher-Reviewer-Student Framework for Semi-supervised 2D Human Pose Estimation | Wulian Yun et.al. | 2501.09565 | null |
| 2025-01-21 | Towards Robust and Realistic Human Pose Estimation via WiFi Signals | Yang Chen et.al. | 2501.09411 | link |
| 2025-01-16 | RoboReflect: Robotic Reflective Reasoning for Grasping Ambiguous-Condition Objects | Zhen Luo et.al. | 2501.09307 | null |
| 2025-01-16 | BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module | Dongzhihan Wang et.al. | 2501.08659 | null |
| 2025-01-14 | Poseidon: A ViT-based Architecture for Multi-Frame Pose Estimation with Adaptive Frame Weighting and Multi-Scale Feature Fusion | Cesare Davide Pace et.al. | 2501.08446 | link |
| 2025-01-14 | Leveraging 2D Masked Reconstruction for Domain Adaptation of 3D Pose Estimation | Hansoo Park et.al. | 2501.08408 | null |
| 2025-01-14 | Predicting 4D Hand Trajectory from Monocular Videos | Yufei Ye et.al. | 2501.08329 | null |
| 2025-01-14 | A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation | Steven Landgraf et.al. | 2501.08188 | null |
| 2025-01-14 | AgentPose: Progressive Distribution Alignment via Feature Agent for Human Pose Distillation | Feng Zhang et.al. | 2501.08088 | null |
| 2025-01-14 | Robust Low-Light Human Pose Estimation through Illumination-Texture Modulation | Feng Zhang et.al. | 2501.08038 | null |
| 2025-01-14 | BioPose: Biomechanically-accurate 3D Pose Estimation from Monocular Videos | Farnoosh Koleini et.al. | 2501.07800 | null |
| 2025-01-13 | Fixing the Scale and Shift in Monocular Depth For Camera Pose Estimation | Yaqing Ding et.al. | 2501.07742 | link |
| 2025-01-13 | Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps | Saurabh Gupta et.al. | 2501.07399 | null |
| 2025-01-13 | Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics | Tze Ho Elden Tse et.al. | 2501.07100 | null |
| 2025-01-10 | eKalibr: Dynamic Intrinsic Calibration for Event Cameras From First Principles of Events | Shuolong Chen et.al. | 2501.05688 | null |
| 2025-01-09 | Relative Pose Estimation through Affine Corrections of Monocular Depth Priors | Yifan Yu et.al. | 2501.05446 | link |
| 2025-01-09 | From Simple to Complex Skills: The Case of In-Hand Object Reorientation | Haozhi Qi et.al. | 2501.05439 | null |
| 2025-01-11 | Towards Balanced Continual Multi-Modal Learning in Human Pose Estimation | Jiaxuan Peng et.al. | 2501.05264 | null |
| 2025-01-08 | KN-LIO: Geometric Kinematics and Neural Field Coupled LiDAR-Inertial Odometry | Zhong Wang et.al. | 2501.04263 | null |
| 2025-01-10 | MC-VTON: Minimal Control Virtual Try-On Diffusion Transformer | Junsheng Luan et.al. | 2501.03630 | null |
| 2025-01-07 | TexHOI: Reconstructing Textures of 3D Unknown Objects in Monocular Hand-Object Interaction Scenes | Alakh Aggarwal et.al. | 2501.03525 | link |
| 2025-01-06 | Mobile Augmented Reality Framework with Fusional Localization and Pose Estimation | Songlin Hou et.al. | 2501.03336 | null |
| 2025-01-06 | SurgRIPE challenge: Benchmark of Surgical Robot Instrument Pose Estimation | Haozheng Xu et.al. | 2501.02990 | null |
| 2025-01-06 | HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos | Jinglei Zhang et.al. | 2501.02973 | null |
| 2025-01-06 | Spiking monocular event based 6D pose estimation for space application | Jonathan Courtois et.al. | 2501.02916 | null |
| 2025-01-06 | Universal Features Guided Zero-Shot Category-Level Object Pose Estimation | Wentian Qu et.al. | 2501.02831 | null |
| 2025-01-06 | Unsupervised Domain Adaptation for Occlusion Resilient Human Pose Estimation | Arindam Dutta et.al. | 2501.02773 | null |
| 2025-01-06 | WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation | Tianjian Jiang et.al. | 2501.02771 | null |
| 2025-01-05 | LP-ICP: General Localizability-Aware Point Cloud Registration for Robust Localization in Extreme Unstructured Environments | Haosong Yue et.al. | 2501.02580 | null |
| 2025-01-04 | ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground Vehicle | Yinchuan Wang et.al. | 2501.02166 | link |
| 2025-01-03 | TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose Estimation | Jiajie Liu et.al. | 2501.01770 | null |
| 2025-01-03 | Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery | Baoru Huang et.al. | 2501.01752 | null |
| 2025-01-02 | On Unifying Video Generation and Camera Pose Estimation | Chun-Hao Paul Huang et.al. | 2501.01409 | null |
| 2025-01-02 | L3D-Pose: Lifting Pose for 3D Avatars from a Single Camera in the Wild | Soumyaratna Debnath et.al. | 2501.01174 | null |
| 2024-12-31 | Relative Pose Observability Analysis Using Dual Quaternions | Nicholas B. Andrews et.al. | 2501.00657 | null |
| 2024-12-31 | VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception | Zhaoliang Wan et.al. | 2501.00510 | null |
| 2024-12-30 | Hierarchical Pose Estimation and Mapping with Multi-Scale Neural Feature Fields | Evgenii Kruzhkov et.al. | 2412.20976 | null |
| 2024-12-30 | ReFlow6D: Refraction-Guided Transparent Object 6D Pose Estimation via Intermediate Representation Learning | Hrishikesh Gupta et.al. | 2412.20830 | link |
| 2024-12-30 | Frequency-aware Event Cloud Network | Hongwei Ren et.al. | 2412.20803 | null |
| 2024-12-30 | KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences | Keng-Wei Chang et.al. | 2412.20767 | null |
| 2024-12-30 | Towards nation-wide analytical healthcare infrastructures: A privacy-preserving augmented knee rehabilitation case study | Boris Bačić et.al. | 2412.20733 | null |
| 2024-12-29 | Exploiting Aggregation and Segregation of Representations for Domain Adaptive Human Pose Estimation | Qucheng Peng et.al. | 2412.20538 | link |
| 2024-12-28 | MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing | Shuo Wang et.al. | 2412.20082 | null |
| 2024-12-28 | GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting | Atticus J. Zeller et.al. | 2412.20056 | link |
| 2024-12-27 | Optimizing Local-Global Dependencies for Accurate 3D Human Pose Estimation | Guangsheng Xu et.al. | 2412.19676 | link |
| 2024-12-27 | Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images | Xudong Cai et.al. | 2412.19518 | null |
| 2024-12-26 | Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos | Changwoon Choi et.al. | 2412.19089 | null |
| 2024-12-23 | Reconstructing People, Places, and Cameras | Lea Müller et.al. | 2412.17806 | null |
| 2024-12-22 | Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry | Zhaoxing Zhang et.al. | 2412.16923 | null |
| 2024-12-21 | EasyVis2: A Real Time Multi-view 3D Visualization for Laparoscopic Surgery Training Enhanced by a Deep Neural Network YOLOv8-Pose | Yung-Hong Sun et.al. | 2412.16742 | null |
| 2024-12-21 | FACTS: Fine-Grained Action Classification for Tactical Sports | Christopher Lai et.al. | 2412.16454 | null |
| 2024-12-20 | Can Generative Video Models Help Pose Estimation? | Ruojin Cai et.al. | 2412.16155 | null |
| 2024-12-20 | Monkey Transfer Learning Can Improve Human Pose Estimation | Bradley Scott et.al. | 2412.15966 | null |
| 2024-12-19 | Scaling 4D Representations | João Carreira et.al. | 2412.15212 | link |
| 2024-12-13 | IMPROVE: Impact of Mobile Phones on Remote Online Virtual Education | Roberto Daza et.al. | 2412.14195 | link |
| 2024-12-18 | Level-Set Parameters: Novel Representation for 3D Shape Analysis | Huan Lei et.al. | 2412.13502 | null |
| 2024-12-18 | Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation | Xiaoqi An et.al. | 2412.13454 | null |
| 2024-12-17 | ORFormer: Occlusion-Robust Transformer for Accurate Facial Landmark Detection | Jui-Che Chiang et.al. | 2412.13174 | link |
| 2024-12-17 | CondiMen: Conditional Multi-Person Mesh Recovery | Brégier Romain et.al. | 2412.13058 | null |
| 2024-12-17 | ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries | Wangyu Xue et.al. | 2412.12675 | null |
| 2024-12-16 | Category Level 6D Object Pose Estimation from a Single RGB Image using Diffusion | Adam Bethell et.al. | 2412.11420 | null |
| 2024-12-13 | ExeChecker: Where Did I Go Wrong? | Yiwen Gu et.al. | 2412.10573 | null |
| 2024-12-11 | CUPS: Improving Human Pose-Shape Estimators with Conformalized Deep Uncertainty | Harry Zhang et.al. | 2412.10431 | null |
| 2024-12-13 | RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting | Lizhi Bai et.al. | 2412.09868 | null |
| 2024-12-12 | Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos | Linyi Jin et.al. | 2412.09621 | link |
| 2024-12-12 | FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction | Jiale Xu et.al. | 2412.09573 | link |
| 2024-12-11 | BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation | Shengze Wang et.al. | 2412.08640 | null |
| 2024-12-12 | Drift-free Visual SLAM using Digital Twins | Roxane Merat et.al. | 2412.08496 | null |
| 2024-12-11 | Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization | Siyan Dong et.al. | 2412.08376 | link |
| 2024-12-10 | LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models | Ziqi Lu et.al. | 2412.07746 | null |
| 2024-12-09 | MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds | Zhenggang Tang et.al. | 2412.06974 | link |
| 2024-12-09 | An Efficient Scene Coordinate Encoding and Relocalization Method | Kuan Xu et.al. | 2412.06488 | link |
| 2024-12-09 | Attention-Enhanced Lightweight Hourglass Network for Human Pose Estimation | Marsha Mariya Kappan et.al. | 2412.06227 | null |
| 2024-12-06 | CCS: Continuous Learning for Customized Incremental Wireless Sensing Services | Qunhang Fu et.al. | 2412.04821 | null |
| 2024-12-05 | ProPLIKS: Probablistic 3D human body pose estimation | Karthik Shetty et.al. | 2412.04665 | null |
| 2024-12-05 | DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction | Ben Kaye et.al. | 2412.04464 | link |
| 2024-12-05 | Targeted Hard Sample Synthesis Based on Estimated Pose and Occlusion Error for Improved Object Pose Estimation | Alan Li et.al. | 2412.04279 | null |
| 2024-12-04 | Sparse-view Pose Estimation and Reconstruction via Analysis by Generative Synthesis | Qitao Zhao et.al. | 2412.03570 | null |
| 2024-12-06 | NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images | Lingen Li et.al. | 2412.03517 | link |
| 2024-12-05 | A Bidirectional Siamese Recurrent Neural Network for Accurate Gait Recognition Using Body Landmarks | Proma Hossain Progga et.al. | 2412.03498 | null |
| 2024-12-04 | MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras | Huai Yu et.al. | 2412.03146 | link |
| 2024-12-04 | An indoor DSO-based ceiling-vision odometry system for indoor industrial environments | Abdelhak Bougouffa et.al. | 2412.02950 | null |
| 2024-12-03 | EgoCast: Forecasting Egocentric Human Pose in the Wild | Maria Escobar et.al. | 2412.02903 | null |
| 2024-12-02 | emg2pose: A Large and Diverse Benchmark for Surface Electromyographic Hand Pose Estimation | Sasha Salter et.al. | 2412.02725 | null |
| 2024-12-03 | ProbPose: A Probabilistic Approach to 2D Human Pose Estimation | Miroslav Purkrabek et.al. | 2412.02254 | link |
| 2024-12-03 | Cascaded Multi-Scale Attention for Enhanced Multi-Scale Feature Extraction and Interaction with Low-Resolution Images | Xiangyong Lu et.al. | 2412.02197 | link |
| 2024-12-03 | CLERF: Contrastive LEaRning for Full Range Head Pose Estimation | Ting-Ruen Wei et.al. | 2412.02066 | null |
| 2024-12-02 | Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle | Miroslav Purkrabek et.al. | 2412.01562 | link |
| 2024-12-02 | 6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting | Yufeng Jin et.al. | 2412.01543 | null |
| 2024-12-02 | HandOS: 3D Hand Reconstruction in One Stage | Xingyu Chen et.al. | 2412.01537 | null |
| 2024-12-02 | SF-Loc: A Visual Mapping and Geo-Localization System based on Sparse Visual Structure Frames | Yuxuan Zhou et.al. | 2412.01500 | null |
| 2024-12-02 | MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection | Yonghao Dang et.al. | 2412.01422 | null |
| 2024-12-02 | Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures | Qiyuan Shen et.al. | 2412.01299 | null |
| 2024-12-02 | CRISP: Object Pose and Shape Estimation with Test-Time Adaptation | Jingnan Shi et.al. | 2412.01052 | null |
| 2024-11-29 | Diorama: Unleashing Zero-shot Single-view 3D Scene Modeling | Qirui Wu et.al. | 2411.19492 | null |
| 2024-11-29 | Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature Finetuning | Yang You et.al. | 2411.19458 | link |
| 2024-11-28 | GMS-VINS:Multi-category Dynamic Objects Semantic Segmentation for Enhanced Visual-Inertial Odometry Using a Promptable Foundation Model | Rui Zhou et.al. | 2411.19289 | null |
| 2024-11-28 | HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos | Prithviraj Banerjee et.al. | 2411.19167 | null |
| 2024-11-28 | Lost & Found: Updating Dynamic 3D Scene Graphs from Egocentric Observations | Tjark Behrens et.al. | 2411.19162 | link |
| 2024-11-28 | Distributed Dual Quaternion Extended Kalman Filtering for Spacecraft Pose Estimation | Mathias Hudoba de Badyn et.al. | 2411.19033 | null |
| 2024-11-28 | Waterfall Transformer for Multi-person Pose Estimation | Navin Ranjan et.al. | 2411.18944 | null |
| 2024-12-02 | AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers | Sherwin Bahmani et.al. | 2411.18673 | null |
| 2024-11-27 | XR-MBT: Multi-modal Full Body Tracking for XR through Self-Supervision with Learned Depth Point Cloud Registration | Denys Rozumnyi et.al. | 2411.18377 | null |
| 2024-11-26 | Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Generative Latent Priors | Ziang Xu et.al. | 2411.17790 | null |
| 2024-11-26 | Geometric Point Attention Transformer for 3D Shape Reassembly | Jiahan Li et.al. | 2411.17788 | null |
| 2024-11-26 | RoboPEPP: Vision-Based Robot Pose and Joint Angle Estimation through Embedding Predictive Pre-Training | Raktim Gautam Goswami et.al. | 2411.17662 | null |
| 2024-11-26 | Communication-Efficient Cooperative SLAMMOT via Determining the Number of Collaboration Vehicles | Susu Fang et.al. | 2411.17432 | null |
| 2024-11-26 | Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration | Junyuan Deng et.al. | 2411.17240 | link |
| 2024-11-27 | SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting | Gyeongjin Kang et.al. | 2411.17190 | link |
| 2024-11-26 | GMFlow: Global Motion-Guided Recurrent Flow for 6D Object Pose Estimation | Xin Liu et.al. | 2411.17174 | null |
| 2024-11-25 | Diffusion Features for Zero-Shot 6DoF Object Pose Estimation | Bernd Von Gimborn et.al. | 2411.16668 | null |
| 2024-11-25 | Edge Weight Prediction For Category-Agnostic Pose Estimation | Or Hirschorn et.al. | 2411.16665 | link |
| 2024-11-25 | SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis | Hyojun Go et.al. | 2411.16443 | link |
| 2024-11-25 | One Diffusion to Generate Them All | Duong H. Le et.al. | 2411.16318 | link |
| 2024-11-25 | UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image | Xingyu Liu et.al. | 2411.16106 | link |
| 2024-11-24 | Generalizable Single-view Object Pose Estimation by Two-side Generating and Matching | Yujing Sun et.al. | 2411.15860 | link |
| 2024-11-24 | PEnG: Pose-Enhanced Geo-Localisation | Tavis Shore et.al. | 2411.15742 | link |
| 2024-11-22 | Personalization of Wearable Sensor-Based Joint Kinematic Estimation Using Computer Vision for Hip Exoskeleton Applications | Changseob Song et.al. | 2411.15366 | null |
| 2024-11-22 | mmWave Radar for Sit-to-Stand Analysis: A Comparative Study with Wearables and Kinect | Shuting Hu et.al. | 2411.14656 | null |
| 2024-11-21 | DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding | Tianhe Ren et.al. | 2411.14347 | link |
| 2024-11-21 | SEMPose: A Single End-to-end Network for Multi-object Pose Estimation | Xin Liu et.al. | 2411.14002 | null |
| 2024-11-21 | Dehazing-aided Multi-Rate Multi-Modal Pose Estimation Framework for Mitigating Visual Disturbances in Extreme Underwater Domain | Vidya Sudevan et.al. | 2411.13988 | null |
| 2024-11-21 | Hybrid-Neuromorphic Approach for Underwater Robotics Applications: A Conceptual Framework | Vidya Sudevan et.al. | 2411.13962 | null |
| 2024-11-20 | Developing Normative Gait Cycle Parameters for Clinical Analysis Using Human Pose Estimation | Rahm Ranjan et.al. | 2411.13716 | null |
| 2024-11-20 | Robust SG-NeRF: Robust Scene Graph Aided Neural Surface Reconstruction | Yi Gu et.al. | 2411.13620 | null |
| 2024-11-19 | VioPose: Violin Performance 4D Pose Estimation by Hierarchical Audiovisual Inference | Seong Jong Yoo et.al. | 2411.13607 | link |
| 2024-11-20 | DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild | Weicai Ye et.al. | 2411.13291 | null |
| 2024-11-20 | X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation | Yuchen Yang et.al. | 2411.13026 | link |
| 2024-11-19 | IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose | Fei Ren et.al. | 2411.12676 | null |
| 2024-11-15 | SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction | Yutao Tang et.al. | 2411.12592 | link |
| 2024-11-19 | GLOVER: Generalizable Open-Vocabulary Affordance Reasoning for Task-Oriented Grasping | Teli Ma et.al. | 2411.12286 | null |
| 2024-11-18 | IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos | Yunong Liu et.al. | 2411.11409 | link |
| 2024-11-15 | USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting | Kang Chen et.al. | 2411.10504 | link |
| 2024-11-13 | ReMP: Reusable Motion Prior for Multi-domain 3D Human Pose Estimation and Motion Inbetweening | Hojun Jang et.al. | 2411.09435 | null |
| 2024-11-13 | Generalized Pose Space Embeddings for Training In-the-Wild using Anaylis-by-Synthesis | Dominik Borer et.al. | 2411.08603 | null |
| 2024-11-13 | DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization | Yueming Xu et.al. | 2411.08373 | null |
| 2024-11-16 | RINO: Accurate, Robust Radar-Inertial Odometry with Non-Iterative Estimation | Shuocheng Yang et.al. | 2411.07699 | link |
| 2024-11-12 | Human Arm Pose Estimation with a Shoulder-worn Force-Myography Device for Human-Robot Interaction | Rotem Atari et.al. | 2411.07644 | null |
| 2024-11-12 | Towards Seamless Integration of Magnetic Tracking into Fluoroscopy-guided Interventions | Shuwei Xing et.al. | 2411.07495 | null |
| 2024-11-08 | Acoustic-based 3D Human Pose Estimation Robust to Human Position | Yusuke Oumi et.al. | 2411.07165 | null |
| 2024-11-11 | CapeLLM: Support-Free Category-Agnostic Pose Estimation with Multimodal Large Language Models | Junho Kim et.al. | 2411.06869 | null |
| 2024-11-11 | GenZ-ICP: Generalizable and Degeneracy-Robust LiDAR Odometry Using an Adaptive Weighting | Daehan Lee et.al. | 2411.06766 | null |
| 2024-11-11 | GTA-Net: An IoT-Integrated 3D Human Pose Estimation System for Real-Time Adolescent Sports Posture Correction | Shizhe Yuan et.al. | 2411.06725 | null |
| 2024-11-10 | Magnetic Field Aided Vehicle Localization with Acceleration Correction | Mrunmayee Deshpande et.al. | 2411.06543 | null |
| 2024-11-10 | Visuotactile-Based Learning for Insertion with Compliant Hands | Osher Azulay et.al. | 2411.06408 | null |
| 2024-11-08 | Poze: Sports Technique Feedback under Data Constraints | Agamdeep Singh et.al. | 2411.05734 | null |
| 2024-11-08 | DeepArUco++: Improved detection of square fiducial markers in challenging lighting conditions | Rafael Berral-Soler et.al. | 2411.05552 | link |
| 2024-11-08 | Tightly-Coupled, Speed-aided Monocular Visual-Inertial Localization in Topological Map | Chanuk Yang et.al. | 2411.05497 | null |
| 2024-11-08 | Relative Pose Estimation for Nonholonomic Robot Formation with UWB-IO Measurements | Kunrui Ze et.al. | 2411.05481 | null |
| 2024-11-07 | Social EgoMesh Estimation | Luca Scofano et.al. | 2411.04598 | link |
| 2024-11-07 | Pose2Trajectory: Using Transformers on Body Pose to Predict Tennis Player’s Trajectory | Ali K. AlShami et.al. | 2411.04501 | null |
| 2024-11-07 | SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation | Xun Tu et.al. | 2411.04386 | null |
| 2024-11-08 | GS2Pose: Two-stage 6D Object Pose Estimation Guided by Gaussian Splatting | Jilan Mei et.al. | 2411.03807 | null |
| 2024-11-06 | Estimation of Psychosocial Work Environment Exposures Through Video Object Detection. Proof of Concept Using CCTV Footage | Claus D. Hansen et.al. | 2411.03724 | null |
| 2024-11-05 | Estimating Ego-Body Pose from Doubly Sparse Egocentric Video Data | Seunggeun Chi et.al. | 2411.03561 | null |
| 2024-11-05 | HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features | Arnab Dey et.al. | 2411.03086 | null |
| 2024-11-04 | Semantic Masking and Visual Feature Matching for Robust Localization | Luisa Mao et.al. | 2411.01804 | null |
| 2024-11-03 | Activating Self-Attention for Multi-Scene Absolute Pose Regression | Miso Lee et.al. | 2411.01443 | link |
| 2024-11-04 | 3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction | Jongmin Lee et.al. | 2411.00543 | null |
| 2024-10-31 | Whole-Herd Elephant Pose Estimation from Drone Data for Collective Behavior Analysis | Brody McNutt et.al. | 2411.00196 | null |
| 2024-10-31 | No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images | Botao Ye et.al. | 2410.24207 | link |
| 2024-11-06 | SceneComplete: Open-World 3D Scene Completion in Complex Real World Environments for Robot Manipulation | Aditya Agarwal et.al. | 2410.23643 | null |
| 2024-10-30 | SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark | HyunJun Jung et.al. | 2410.22715 | null |
| 2024-10-29 | LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues | Hanqing Jiang et.al. | 2410.22213 | null |
| 2024-10-29 | PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting | Sunghwan Hong et.al. | 2410.22128 | link |
| 2024-10-29 | HRPVT: High-Resolution Pyramid Vision Transformer for medium and small-scale human pose estimation | Zhoujie Xu et.al. | 2410.22079 | null |
| 2024-10-29 | EI-Nexus: Towards Unmediated and Flexible Inter-Modality Local Feature Extraction and Matching for Event-Image Data | Zhonghua Yi et.al. | 2410.21743 | null |
| 2024-10-28 | Synthetica: Large Scale Synthetic Data for Robot Perception | Ritvik Singh et.al. | 2410.21153 | null |
| 2024-10-29 | BLAPose: Enhancing 3D Human Pose Estimation with Bone Length Adjustment | Chih-Hsiang Hsu et.al. | 2410.20731 | link |
| 2024-11-01 | RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior | Mingjiang Liang et.al. | 2410.20358 | null |
| 2024-10-27 | Harmony4D: A Video Dataset for In-The-Wild Close Human Interactions | Rawal Khirodkar et.al. | 2410.20294 | null |
| 2024-10-26 | Neural Fields in Robotics: A Survey | Muhammad Zubair Irshad et.al. | 2410.20220 | null |
| 2024-10-25 | DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems | Muhammad Zaeem Shahzad et.al. | 2410.19336 | null |
| 2024-10-24 | Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction | Junyi Chen et.al. | 2410.18962 | null |
| 2024-10-24 | VoxelKeypointFusion: Generalizable Multi-View Multi-Person Pose Estimation | Daniel Bermuth et.al. | 2410.18723 | null |
| 2024-10-23 | Robust Two-View Geometry Estimation with Implicit Differentiation | Vladislav Pyatov et.al. | 2410.17983 | link |
| 2024-10-23 | YOLOv11: An Overview of the Key Architectural Enhancements | Rahima Khanam et.al. | 2410.17725 | null |
| 2024-10-21 | Assisted Physical Interaction: Autonomous Aerial Robots with Neural Network Detection, Navigation, and Safety Layers | Andrea Berra et.al. | 2410.15802 | null |
| 2024-10-21 | ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recovery from Videos | Tao Tang et.al. | 2410.15582 | link |
| 2024-10-20 | Neural Active Structure-from-Motion in Dark and Textureless Environment | Kazuto Ichimaru et.al. | 2410.15378 | null |
| 2024-10-20 | POSE: Pose estimation Of virtual Sync Exhibit system | Hao-Tang Tsui et.al. | 2410.15343 | link |
| 2024-10-18 | Graph Optimality-Aware Stochastic LiDAR Bundle Adjustment with Progressive Spatial Smoothing | Jianping Li et.al. | 2410.14565 | null |
| 2024-10-18 | Multi-modal Pose Diffuser: A Multimodal Generative Conditional Pose Prior | Calvin-Khang Ta et.al. | 2410.14540 | null |
| 2024-10-18 | Sim2real Cattle Joint Estimation in 3D point clouds | Okour Mohammad et.al. | 2410.14419 | null |
| 2024-10-18 | Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping | Renguang Chen et.al. | 2410.14161 | null |
| 2024-10-15 | From Real Artifacts to Virtual Reference: A Robust Framework for Translating Endoscopic Images | unyang Wu et.al. | 2410.13896 | null |
| 2024-10-17 | DualQuat-LOAM: LiDAR Odometry and Mapping parametrized on Dual Quaternions | Edison P. Velasco-Sánchez et.al. | 2410.13541 | null |
| 2024-10-17 | Object Pose Estimation Using Implicit Representation For Transparent Objects | Varun Burde et.al. | 2410.13465 | null |
| 2024-10-16 | Optimizing Multi-Task Learning for Accurate Spacecraft Pose Estimation | Francesco Evangelisti et.al. | 2410.12679 | null |
| 2024-10-15 | Contrastive Touch-to-Touch Pretraining | Samanta Rodriguez et.al. | 2410.11834 | null |
| 2024-10-18 | X-Fi: A Modality-Invariant Foundation Model for Multimodal Human Sensing | Xinyan Chen et.al. | 2410.10167 | null |
| 2024-10-13 | Occluded Human Pose Estimation based on Limb Joint Augmentation | Gangtao Han et.al. | 2410.09885 | null |
| 2024-10-15 | POPoS: Improving Efficient and Robust Facial Landmark Detection with Parallel Optimal Position Search | Chong-Yang Xiang et.al. | 2410.09583 | null |
| 2024-10-12 | Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors | Hritam Basak et.al. | 2410.09467 | null |
| 2024-10-12 | Towards Multi-Modal Animal Pose Estimation: An In-Depth Analysis | Qianyi Deng et.al. | 2410.09312 | link |
| 2024-10-11 | CVAM-Pose: Conditional Variational Autoencoder for Multi-Object Monocular Pose Estimation | Jianyu Zhao et.al. | 2410.09010 | link |
| 2024-10-11 | Look Gauss, No Pose: Novel View Synthesis using Gaussian Splatting without Accurate Pose Initialization | Christian Schmidt et.al. | 2410.08743 | link |
| 2024-10-10 | Generalizing Stochastic Smoothing for Differentiation and Gradient Estimation | Felix Petersen et.al. | 2410.08125 | null |
| 2024-10-10 | Robotic framework for autonomous manipulation of laboratory equipment with different degrees of transparency via 6D pose estimation | Maria Makarova et.al. | 2410.07801 | null |
| 2024-10-10 | Optimal-State Dynamics Estimation for Physics-based Human Motion Capture from Videos | Cuong Le et.al. | 2410.07795 | link |
| 2024-10-10 | Autonomous Driving in Unstructured Environments: How Far Have We Come? | Chen Min et.al. | 2410.07701 | null |
| 2024-10-10 | Invisibility Cloak: Disappearance under Human Pose Estimation via Backdoor Attacks | Minxing Zhang et.al. | 2410.07670 | null |
| 2024-10-09 | OmniPose6D: Towards Short-Term Object Pose Tracking in Dynamic Scenes from Monocular RGB | Yunzhi Lin et.al. | 2410.06694 | null |
| 2024-10-08 | Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach | Sha Guo et.al. | 2410.06149 | null |
| 2024-10-08 | SpecTrack: Learned Multi-Rotation Tracking via Speckle Imaging | Ziyang Chen et.al. | 2410.06028 | null |
| 2024-10-08 | AIVIO: Closed-loop, Object-relative Navigation of UAVs with AI-aided Visual Inertial Odometry | Thomas Jantos et.al. | 2410.05996 | null |
| 2024-10-08 | Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation? | Charalambos Tzamos et.al. | 2410.05984 | link |
| 2024-10-08 | FürElise: Capturing and Physically Synthesizing Hand Motions of Piano Performance | Ruocheng Wang et.al. | 2410.05791 | null |
| 2024-10-07 | Comparison of marker-less 2D image-based methods for infant pose estimation | Lennart Jahn et.al. | 2410.04980 | null |
| 2024-10-06 | Enhancing 3D Human Pose Estimation Amidst Severe Occlusion with Dual Transformer Fusion | Mehwish Ghafoor et.al. | 2410.04574 | link |
| 2024-10-06 | LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation | Jianhao Jiao et.al. | 2410.04419 | null |
| 2024-10-05 | Test-Time Adaptation for Keypoint-Based Spacecraft Pose Estimation Based on Predicted-View Synthesis | Juan Ignacio Bravo Pérez-Villar et.al. | 2410.04298 | link |
| 2024-10-05 | A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems | Nikola Radulov et.al. | 2410.04242 | link |
| 2024-10-04 | Unsupervised Prior Learning: Discovering Categorical Pose Priors from Videos | Ziyu Wang et.al. | 2410.03858 | null |
| 2024-10-04 | Universal Global State Estimation for Inertial Navigation Systems | Sifeddine Benahmed et.al. | 2410.03846 | null |
| 2024-10-04 | MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion | Junyi Zhang et.al. | 2410.03825 | null |
| 2024-10-04 | Dessie: Disentanglement for Articulated 3D Horse Shape and Pose Estimation from Images | Ci Li et.al. | 2410.03438 | null |
| 2024-10-04 | HRVMamba: High-Resolution Visual State Space Model for Dense Prediction | Hao Zhang et.al. | 2410.03174 | null |
| 2024-10-04 | CLIP-Clique: Graph-based Correspondence Matching Augmented by Vision Language Models for Object-based Global Localization | Shigemichi Matsuzaki et.al. | 2410.03054 | null |
| 2024-10-03 | Why Sample Space Matters: Keyframe Sampling Optimization for LiDAR-based Place Recognition | Nikolaos Stathoulopoulos et.al. | 2410.02643 | null |
| 2024-10-03 | Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features | Chengkai Hou et.al. | 2410.02237 | null |
| 2024-10-02 | SGBA: Semantic Gaussian Mixture Model-Based LiDAR Bundle Adjustment | Xingyu Ji et.al. | 2410.01618 | null |
| 2024-10-02 | SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network | Ahmed Tawfik Aboukhadra et.al. | 2410.01293 | null |
| 2024-10-01 | Pose Estimation of Buried Deep-Sea Objects using 3D Vision Deep Learning Models | Jerry Yan et.al. | 2410.01061 | null |
| 2024-10-01 | RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations | Kaichen Zhou et.al. | 2410.00713 | link |
| 2024-10-01 | GERA: Geometric Embedding for Efficient Point Registration Analysis | Geng Li et.al. | 2410.00589 | null |
| 2024-09-30 | Continual Human Pose Estimation for Incremental Integration of Keypoints and Pose Variations | Muhammad Saif Ullah Khan et.al. | 2409.20469 | null |
| 2024-09-30 | Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies | Shalini Sarode et.al. | 2409.20237 | null |
| 2024-09-30 | PuzzleBoard: A New Camera Calibration Pattern with Position Encoding | Peer Stelldinger et.al. | 2409.20127 | link |
| 2024-09-30 | Robust Gaussian Splatting SLAM by Leveraging Loop Closure | Zunjie Zhu et.al. | 2409.20111 | null |
| 2024-09-30 | GearTrack: Automating 6D Pose Estimation | Yu Deng et.al. | 2409.19986 | null |
| 2024-09-29 | PPLNs: Parametric Piecewise Linear Networks for Event-Based Temporal Modeling and Beyond | Chen Song et.al. | 2409.19772 | null |
| 2024-09-29 | GelSlim 4.0: Focusing on Touch and Reproducibility | Andrea Sipos et.al. | 2409.19770 | null |
| 2024-09-27 | Robust Proximity Operations using Probabilistic Markov Models | Deep Parikh et.al. | 2409.19062 | null |
| 2024-09-27 | Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras | Yipeng Lu et.al. | 2409.18673 | null |
| 2024-09-27 | DynaWeightPnP: Toward global real-time 3D-2D solver in PnP without correspondences | Jingwei Song et.al. | 2409.18457 | null |
| 2024-09-26 | Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation | Mengchen Zhang et.al. | 2409.18261 | null |
| 2024-09-26 | AI-Powered Augmented Reality for Satellite Assembly, Integration and Test | Alvaro Patricio et.al. | 2409.18101 | null |
| 2024-09-27 | Leveraging Anthropometric Measurements to Improve Human Mesh Estimation and Ensure Consistent Body Shapes | Katja Ludwig et.al. | 2409.17671 | null |
| 2024-09-25 | Safe Leaf Manipulation for Accurate Shape and Pose Estimation of Occluded Fruits | Shaoxiong Yao et.al. | 2409.17389 | null |
| 2024-09-25 | Hierarchical Tri-manual Planning for Vision-assisted Fruit Harvesting with Quadrupedal Robots | Zhichao Liu et.al. | 2409.17116 | null |
| 2024-09-25 | Self-Sensing for Proprioception and Contact Detection in Soft Robots Using Shape Memory Alloy Artificial Muscles | Ran Jing et.al. | 2409.17111 | null |
| 2024-09-25 | Online 6DoF Pose Estimation in Forests using Cross-View Factor Graph Optimisation and Deep Learned Re-localisation | Lucas Carvalho de Lima et.al. | 2409.16680 | null |
| 2024-09-25 | FAFA: Frequency-Aware Flow-Aided Self-Supervision for Underwater Object Pose Estimation | Jingyi Tang et.al. | 2409.16600 | null |
| 2024-09-25 | Robo-Platform: A Robotic System for Recording Sensors and Controlling Robots | Masoud Dayani Najafabadi et.al. | 2409.16595 | null |
| 2024-09-24 | PseudoNeg-MAE: Self-Supervised Point Cloud Learning using Conditional Pseudo-Negative Embeddings | Sutharsan Mahendren et.al. | 2409.15832 | null |
| 2024-09-24 | LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation | Ruida Zhang et.al. | 2409.15727 | null |
| 2024-09-23 | Framework for Robust Localization of UUVs and Mapping of Net Pens | David Botta et.al. | 2409.15475 | null |
| 2024-09-23 | FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera | Guoyang Zhao et.al. | 2409.15054 | link |
| 2024-09-23 | BranchPoseNet: Characterizing tree branching with a deep learning-based pose estimation approach | Stefano Puliti et.al. | 2409.14755 | link |
| 2024-09-23 | ERPoT: Effective and Reliable Pose Tracking for Mobile Robots Based on Lightweight and Compact Polygon Maps | Haiming Gao et.al. | 2409.14723 | null |
| 2024-09-22 | Tactile Functasets: Neural Implicit Representations of Tactile Datasets | Sikai Li et.al. | 2409.14592 | null |
| 2024-09-22 | AR Overlay: Training Image Pose Estimation on Curved Surface in a Synthetic Way | Sining Huang et.al. | 2409.14577 | null |
| 2024-09-22 | DROP: Dexterous Reorientation via Online Planning | Albert H. Li et.al. | 2409.14562 | null |
| 2024-09-21 | Combining Absolute and Semi-Generalized Relative Poses for Visual Localization | Vojtech Panek et.al. | 2409.14269 | null |
| 2024-09-18 | SpotLight: Robotic Scene Understanding through Interaction and Affordance Detection | Tim Engelbracht et.al. | 2409.11870 | null |
| 2024-09-18 | End-to-End Probabilistic Geometry-Guided Regression for 6DoF Object Pose Estimation | Thomas Pöllabauer et.al. | 2409.11819 | null |
| 2024-09-18 | Bridging Domain Gap for Flight-Ready Spaceborne Vision | Tae Ha Park et.al. | 2409.11661 | null |
| 2024-09-17 | Good Grasps Only: A data engine for self-supervised fine-tuning of pose estimation using grasp poses for verification | Frederik Hagelskjær et.al. | 2409.11512 | null |
| 2024-09-17 | Training Datasets Generation for Machine Learning: Application to Vision Based Navigation | Jérémy Lebreton et.al. | 2409.11383 | null |
| 2024-09-17 | OmniGen: Unified Image Generation | Shitao Xiao et.al. | 2409.11340 | link |
| 2024-09-17 | ULOC: Learning to Localize in Complex Large-Scale Environments with Ultra-Wideband Ranges | Thien-Minh Nguyen et.al. | 2409.11122 | link |
| 2024-09-17 | Depth-based Privileged Information for Boosting 3D Human Pose Estimation on RGB | Alessandro Simoni et.al. | 2409.11104 | null |
| 2024-09-21 | HGSLoc: 3DGS-based Heuristic Camera Pose Refinement | Zhongyan Niu et.al. | 2409.10925 | null |
| 2024-09-17 | Pose estimation of CubeSats via sensor fusion and Error-State Extended Kalman Filter | Deep Parikh et.al. | 2409.10815 | null |
| 2024-09-16 | CtRNet-X: Camera-to-Robot Pose Estimation in Real-world Conditions Using a Single Camera | Jingpei Lu et.al. | 2409.10441 | null |
| 2024-09-16 | HiFi-CS: Towards Open Vocabulary Visual Grounding For Robotic Grasping Using Vision-Language Models | Vineet Bhat et.al. | 2409.10419 | null |
| 2024-09-16 | 2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation? | Téo Guichoux et.al. | 2409.10357 | null |
| 2024-09-16 | Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference | Huy-Dung Nguyen et.al. | 2409.10095 | null |
| 2024-09-15 | Precise Pick-and-Place using Score-Based Diffusion Networks | Shih-Wei Guo et.al. | 2409.09725 | null |
| 2024-09-15 | Pre-Training for 3D Hand Pose Estimation with Contrastive Learning on Large-Scale Hand Images in the Wild | Nie Lin et.al. | 2409.09714 | null |
| 2024-09-15 | Proximity operations of CubeSats via sensor fusion of ultra-wideband range measurements with rate gyroscopes, accelerometers and monocular vision | Deep Parikh et.al. | 2409.09665 | null |
| 2024-09-15 | A Scalable Tabletop Satellite Automation Testbed:Design And Experiments | Deep Parikh et.al. | 2409.09633 | null |
| 2024-09-14 | MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry | Yuheng Qiu et.al. | 2409.09479 | null |
| 2024-09-14 | Distributed Invariant Kalman Filter for Object-level Multi-robot Pose SLAM | Haoying Li et.al. | 2409.09410 | null |
| 2024-09-13 | Causal Transformer for Fusion and Pose Estimation in Deep Visual Inertial Odometry | Yunus Bilge Kurt et.al. | 2409.08769 | link |
| 2024-09-13 | WheelPoser: Sparse-IMU Based Body Pose Estimation for Wheelchair Users | Yunzhi Li et.al. | 2409.08494 | null |
| 2024-09-12 | Bayesian Inverse Graphics for Few-Shot Concept Learning | Octavio Arriaga et.al. | 2409.08351 | null |
| 2024-09-12 | Touch2Touch: Cross-Modal Tactile Generation for Object Manipulation | Samanta Rodriguez et.al. | 2409.08269 | null |
| 2024-09-12 | Covariance Intersection-based Invariant Kalman Filtering(DInCIKF) for Distributed Pose Estimation | Haoying Li et.al. | 2409.07933 | null |
| 2024-09-12 | GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions | Liang Feng et.al. | 2409.07798 | null |
| 2024-09-12 | GatedUniPose: A Novel Approach for Pose Estimation Combining UniRepLKNet and Gated Convolution | Liang Feng et.al. | 2409.07752 | null |
| 2024-09-11 | FaVoR: Features via Voxel Rendering for Camera Relocalization | Vincenzo Polizzi et.al. | 2409.07571 | null |
| 2024-09-11 | Benchmarking 2D Egocentric Hand Pose Datasets | Olga Taran et.al. | 2409.07337 | null |
| 2024-09-11 | iKalibr-RGBD: Partially-Specialized Target-Free Visual-Inertial Spatiotemporal Calibration For RGBDs via Continuous-Time Velocity Estimation | Shuolong Chen et.al. | 2409.07116 | link |
| 2024-09-11 | Equivariant Filter for Tightly Coupled LiDAR-Inertial Odometry | Anbo Tao et.al. | 2409.06948 | null |
| 2024-09-10 | A Bayesian framework for active object recognition, pose estimation and shape transfer learning through touch | Haodong Zheng et.al. | 2409.06912 | null |
| 2024-09-11 | Alignist: CAD-Informed Orientation Distribution Estimation by Fusing Shape and Correspondences | Shishir Reddy Vutukur et.al. | 2409.06683 | null |
| 2024-09-10 | PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation | Ginger Delmas et.al. | 2409.06535 | null |
| 2024-09-10 | Test-Time Certifiable Self-Supervision to Bridge the Sim2Real Gap in Event-Based Satellite Pose Estimation | Mohsi Jawaid et.al. | 2409.06240 | null |
| 2024-09-09 | From Words to Poses: Enhancing Novel Object Pose Estimation with Vision Language Models | Tessa Pulli et.al. | 2409.05413 | null |
| 2024-09-08 | HelmetPoser: A Helmet-Mounted IMU Dataset for Data-Driven Estimation of Human Head Motion in Diverse Conditions | Jianping Li et.al. | 2409.05006 | null |
| 2024-09-06 | Casper DPM: Cascaded Perceptual Dynamic Projection Mapping onto Hands | Yotam Erel et.al. | 2409.04397 | null |
| 2024-09-06 | GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers | Lorenza Prospero et.al. | 2409.04196 | null |
| 2024-09-06 | Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics | Woojin Cho et.al. | 2409.04033 | null |
| 2024-09-06 | Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments | Therese Joseph et.al. | 2409.03998 | null |
| 2024-09-09 | The Influence of Faulty Labels in Data Sets on Human Pose Estimation | Arnold Schwarz et.al. | 2409.03887 | null |
| 2024-09-05 | MaskVal: Simple but Effective Uncertainty Quantification for 6D Pose Estimation | Philipp Quentin et.al. | 2409.03556 | null |
| 2024-09-05 | UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking | Md. Mahfuzur Rahman et.al. | 2409.03245 | null |
| 2024-09-01 | Recoverable Anonymization for Pose Estimation: A Privacy-Enhancing Approach | Wenjun Huang et.al. | 2409.02715 | null |
| 2024-09-04 | Object Gaussian for Monocular 6D Pose Estimation from Sparse Views | Luqing Luo et.al. | 2409.02581 | null |
| 2024-09-03 | EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision | Yiming Zhao et.al. | 2409.02224 | null |
| 2024-09-03 | Deep learning for objective estimation of Parkinsonian tremor severity | Felipe Duque-Quiceno et.al. | 2409.02011 | null |
| 2024-09-03 | SPiKE: 3D Human Pose from Point Cloud Sequences | Irene Ballester et.al. | 2409.01879 | link |
| 2024-09-02 | Kalman Filtering for Precise Indoor Position and Orientation Estimation Using IMU and Acoustics on Riemannian Manifolds | Mohammed H. AlSharif et.al. | 2409.01002 | null |
| 2024-09-01 | Detection, Recognition and Pose Estimation of Tabletop Objects | Sanjuksha Nirgude et.al. | 2409.00869 | null |
| 2024-09-01 | DSLO: Deep Sequence LiDAR Odometry Based on Inconsistent Spatio-temporal Propagation | Huixin Zhang et.al. | 2409.00744 | link |
| 2024-09-01 | MoManifold: Learning to Measure 3D Human Motion via Decoupled Joint Acceleration Manifolds | Ziqiang Dang et.al. | 2409.00736 | null |
| 2024-08-31 | ActionPose: Pretraining 3D Human Pose Estimation with the Dark Knowledge of Action | Longyun Liao et.al. | 2409.00449 | null |
| 2024-09-02 | Augmented Reality without Borders: Achieving Precise Localization Without Maps | Albert Gassol Puigjaner et.al. | 2408.17373 | null |
| 2024-08-30 | BOP-D: Revisiting 6D Pose Estimation Benchmark for Better Evaluation under Visual Ambiguities | Boris Meden et.al. | 2408.17297 | null |
| 2024-08-30 | EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs | Zhen Fan et.al. | 2408.17168 | null |
| 2024-09-01 | Generic Objects as Pose Probes for Few-Shot View Synthesis | Zhirui Gao et.al. | 2408.16690 | null |
| 2024-08-29 | OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation | Yuchen Che et.al. | 2408.16547 | link |
| 2024-08-29 | GRPose: Learning Graph Relations for Human Image Generation with Pose Priors | Xiangchen Yin et.al. | 2408.16540 | null |
| 2024-08-28 | Are Pose Estimators Ready for the Open World? STAGE: Synthetic Data Generation Toolkit for Auditing 3D Human Pose Estimators | Nikita Kister et.al. | 2408.16536 | null |
| 2024-08-28 | Multi-view Pose Fusion for Occlusion-Aware 3D Human Pose Estimation | Laura Bragagnolo et.al. | 2408.15810 | link |
| 2024-08-30 | Addressing the challenges of loop detection in agricultural environments | Nicolás Soncini et.al. | 2408.15761 | link |
| 2024-08-28 | Str-L Pose: Integrating Point and Structured Line for Relative Pose Estimation in Dual-Graph | Zherong Zhang et.al. | 2408.15750 | null |
| 2024-08-28 | Benchmarking ML Approaches to UWB-Based Range-Only Posture Recognition for Human Robot-Interaction | Salma Salimi et.al. | 2408.15717 | null |
| 2024-08-26 | Bengali Sign Language Recognition through Hand Pose Estimation using Multi-Branch Spatial-Temporal Attention Model | Abu Saleh Musa Miah et.al. | 2408.14111 | null |
| 2024-08-25 | InterTrack: Tracking Human Object Interaction without Object Templates | Xianghui Xie et.al. | 2408.13953 | null |
| 2024-08-24 | Temporally-consistent 3D Reconstruction of Birds | Johannes Hägerlind et.al. | 2408.13629 | null |
| 2024-08-24 | Explainable Convolutional Networks for Crater Detection and Lunar Landing Navigation | Jianing Song et.al. | 2408.13587 | null |
| 2024-08-27 | Sapiens: Foundation for Human Vision Models | Rawal Khirodkar et.al. | 2408.12569 | null |
| 2024-08-20 | GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting | Changkun Liu et.al. | 2408.11085 | null |
| 2024-08-20 | ZebraPose: Zebra Detection and Pose Estimation using only Synthetic Data | Elia Bonetto et.al. | 2408.10831 | null |
| 2024-08-20 | MPL: Lifting 3D Human Pose from Multi-view 2D Poses | Seyed Abolfazl Ghasemzadeh et.al. | 2408.10805 | link |
| 2024-08-19 | RUMI: Rummaging Using Mutual Information | Sheng Zhong et.al. | 2408.10450 | null |
| 2024-08-19 | SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views | Chao Xu et.al. | 2408.10195 | null |
| 2024-08-19 | SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition | Wiktor Mucha et.al. | 2408.10037 | link |
| 2024-08-19 | Pose-GuideNet: Automatic Scanning Guidance for Fetal Head Ultrasound from Pose Estimation | Qianhui Men et.al. | 2408.09931 | null |
| 2024-08-18 | OPPH: A Vision-Based Operator for Measuring Body Movements for Personal Healthcare | Chen Long-fei et.al. | 2408.09409 | null |
| 2024-08-17 | An Open-Source American Sign Language Fingerspell Recognition and Semantic Pose Retrieval Interface | Kevin Jose Thomas et.al. | 2408.09311 | link |
| 2024-08-16 | ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation | Hao Tang et.al. | 2408.09042 | null |
| 2024-08-16 | Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS | Wei Sun et.al. | 2408.08723 | null |
| 2024-08-16 | SketchRef: A Benchmark Dataset and Evaluation Metrics for Automated Sketch Synthesis | Xingyue Lin et.al. | 2408.08623 | null |
| 2024-08-15 | HyperTaxel: Hyper-Resolution for Taxel-Based Tactile Signals Through Contrastive Learning | Hongyu Li et.al. | 2408.08312 | null |
| 2024-08-15 | Comparative Evaluation of 3D Reconstruction Methods for Object Pose Estimation | Varun Burde et.al. | 2408.08234 | link |
| 2024-08-15 | Towards Practical Human Motion Prediction with LiDAR Point Clouds | Xiao Han et.al. | 2408.08202 | null |
| 2024-08-15 | Your Turn: Real-World Turning Angle Estimation for Parkinson’s Disease Severity Assessment | Qiushuo Cheng et.al. | 2408.08182 | null |
| 2024-08-15 | Polaris: Open-ended Interactive Robotic Manipulation via Syn2Real Visual Grounding and Large Language Models | Tianyu Wang et.al. | 2408.07975 | null |
| 2024-08-15 | GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Yutong Wang et.al. | 2408.07917 | link |
| 2024-08-13 | A Miniature Vision-Based Localization System for Indoor Blimps | Shicong Ma et.al. | 2408.06648 | null |
| 2024-08-12 | UniT: Unified Tactile Representation for Robot Learning | Zhengtong Xu et.al. | 2408.06481 | link |
| 2024-08-12 | Moo-ving Beyond Tradition: Revolutionizing Cattle Behavioural Phenotyping with Pose Estimation Techniques | Navid Ghassemi et.al. | 2408.06336 | null |
| 2024-08-12 | CAD-Mesher: A Convenient, Accurate, Dense Mesh-based Mapping Module in SLAM for Dynamic Environments | Yanpeng Jia et.al. | 2408.05981 | null |
| 2024-08-12 | PAFormer: Part Aware Transformer for Person Re-identification | Hyeono Jung et.al. | 2408.05918 | null |
| 2024-08-11 | SABER-6D: Shape Representation Based Implicit Object Pose Estimation | Shishir Reddy Vutukur et.al. | 2408.05867 | null |
| 2024-08-11 | Real-Time Drowsiness Detection Using Eye Aspect Ratio and Facial Landmark Detection | Varun Shiva Krishna Rupani et.al. | 2408.05836 | null |
| 2024-08-10 | Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis | Zhongche Qu et.al. | 2408.05635 | null |
| 2024-08-10 | Anticipation through Head Pose Estimation: a preliminary study | Federico Figari Tomenotti et.al. | 2408.05516 | null |
| 2024-08-09 | Mesh-based Object Tracking for Dynamic Semantic 3D Scene Graphs via Ray Tracing | Lennart Niecksch et.al. | 2408.04979 | null |
| 2024-08-07 | PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model | Yunlong Huang et.al. | 2408.03540 | null |
| 2024-08-06 | Line-based 6-DoF Object Pose Estimation and Tracking With an Event Camera | Zibin Liu et.al. | 2408.03225 | link |
| 2024-08-06 | Training on the Fly: On-device Self-supervised Learning aboard Nano-drones within 20 mW | Elia Cereda et.al. | 2408.03168 | null |
| 2024-08-06 | BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications | G. Manni et.al. | 2408.03078 | link |
| 2024-08-07 | Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network | Xinyi Zhang et.al. | 2408.02922 | null |
| 2024-08-05 | Analyzing Data Efficiency and Performance of Machine Learning Algorithms for Assessing Low Back Pain Physical Rehabilitation Exercises | Aleksa Marusic et.al. | 2408.02855 | null |
| 2024-08-05 | Joint-Motion Mutual Learning for Pose Estimation in Videos | Sifan Wu et.al. | 2408.02285 | null |
| 2024-08-04 | AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos | Feichi Lu et.al. | 2408.02110 | null |
| 2024-08-04 | Generalized Maximum Likelihood Estimation for Perspective-n-Point Problem | Tian Zhan et.al. | 2408.01945 | null |
| 2024-08-03 | MotionTrace: IMU-based Field of View Prediction for Smartphone AR Interactions | Rahul Islam et.al. | 2408.01850 | null |
| 2024-08-03 | BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles | Lun Luo et.al. | 2408.01841 | null |
| 2024-08-03 | E $^3$ NeRF: Efficient Event-Enhanced Neural Radiance Fields from Blurry Images | Yunshan Qi et.al. | 2408.01840 | null |
| 2024-08-03 | Survey on Emotion Recognition through Posture Detection and the possibility of its application in Virtual Reality | Leina Elansary et.al. | 2408.01728 | null |
| 2024-08-03 | Stimulating Imagination: Towards General-purpose Object Rearrangement | Jianyang Wu et.al. | 2408.01655 | null |
| 2024-08-02 | Full-range Head Pose Geometric Data Augmentations | Huei-Chung Hu et.al. | 2408.01566 | null |
| 2024-07-31 | Adapting Skills to Novel Grasps: A Self-Supervised Approach | Georgios Papagiannis et.al. | 2408.00178 | null |
| 2024-07-31 | Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods | Xusheng Luo et.al. | 2408.00117 | null |
| 2024-07-30 | HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose Estimation | Wencan Cheng et.al. | 2407.20542 | link |
| 2024-07-30 | Markers Identification for Relative Pose Estimation of an Uncooperative Target | Batu Candan et.al. | 2407.20515 | null |
| 2024-07-29 | BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation | Kieran Saunders et.al. | 2407.20437 | null |
| 2024-07-28 | Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph | Zhengcen Li et.al. | 2407.19497 | null |
| 2024-07-26 | Flexible graph convolutional network for 3D human pose estimation | Abu Taib Mohammed Shahjahan et.al. | 2407.19077 | null |
| 2024-07-26 | From 2D to 3D: AISG-SLA Visual Localization Challenge | Jialin Gao et.al. | 2407.18590 | null |
| 2024-07-28 | HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation | Zhenzhi Wang et.al. | 2407.17438 | link |
| 2024-07-24 | Active Loop Closure for OSM-guided Robotic Mapping in Large-Scale Urban Environments | Wei Gao et.al. | 2407.17078 | null |
| 2024-07-30 | DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction | Xiaobiao Du et.al. | 2407.16988 | link |
| 2024-07-24 | Pose Estimation from Camera Images for Underwater Inspection | Luyuan Peng et.al. | 2407.16961 | null |
| 2024-07-23 | COALA: A Practical and Vision-Centric Federated Learning Platform | Weiming Zhuang et.al. | 2407.16560 | link |
| 2024-07-23 | Probabilistic Parameter Estimators and Calibration Metrics for Pose Estimation from Image Features | Romeo Valentin et.al. | 2407.16223 | null |
| 2024-07-23 | Optimal camera-robot pose estimation in linear time from points and lines | Guangyang Zeng et.al. | 2407.16151 | null |
| 2024-07-23 | 3D-UGCN: A Unified Graph Convolutional Network for Robust 3D Human Pose Estimation from Monocular RGB Images | Jie Zhao et.al. | 2407.16137 | null |
| 2024-07-21 | CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models | Zheng Chong et.al. | 2407.15886 | link |
| 2024-07-22 | RADA: Robust and Accurate Feature Learning with Domain Adaptation | Jingtai He et.al. | 2407.15791 | null |
| 2024-07-22 | Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection | Kangqi Ma et.al. | 2407.15771 | null |
| 2024-07-22 | 6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model | Matteo Bortolon et.al. | 2407.15484 | null |
| 2024-07-23 | Domain-Adaptive 2D Human Pose Estimation via Dual Teachers in Extremely Low-Light Conditions | Yihao Ai et.al. | 2407.15451 | null |
| 2024-07-22 | avaTTAR: Table Tennis Stroke Training with On-body and Detached Visualization in Augmented Reality | Dizhi Ma et.al. | 2407.15373 | null |
| 2024-07-20 | From Underground Mines to Offices: A Versatile and Robust Framework for Range-Inertial SLAM | Lorenzo Montano-Oliván et.al. | 2407.14797 | null |
| 2024-07-19 | ESCAPE: Energy-based Selective Adaptive Correction for Out-of-distribution 3D Human Pose Estimation | Luke Bidulka et.al. | 2407.14605 | null |
| 2024-07-19 | 6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry | Sungho Chun et.al. | 2407.14136 | link |
| 2024-07-18 | RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark | Yuan-Hao Ho et.al. | 2407.13930 | null |
| 2024-07-19 | GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation | Bangyan Liao et.al. | 2407.13537 | null |
| 2024-07-18 | SCAPE: A Simple and Strong Category-Agnostic Pose Estimator | Yujia Liang et.al. | 2407.13483 | link |
| 2024-07-17 | SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization | Yiyang Chen et.al. | 2407.12667 | link |
| 2024-07-17 | Invertible Neural Warp for NeRF | Shin-Fang Chng et.al. | 2407.12354 | null |
| 2024-07-16 | NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models | Francesco Milano et.al. | 2407.12207 | link |
| 2024-07-16 | Monocular pose estimation of articulated surgical instruments in open surgery | Robert Spektor et.al. | 2407.12138 | null |
| 2024-07-17 | GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection | Jingwen Yu et.al. | 2407.11736 | link |
| 2024-07-16 | TCFormer: Visual Recognition via Token Clustering Transformer | Wang Zeng et.al. | 2407.11321 | link |
| 2024-07-15 | A BlueROV2-based platform for underwater mapping experiments | Tudor Alinei-Poiana et.al. | 2407.10901 | null |
| 2024-07-15 | LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning | Zhuozhu Jian et.al. | 2407.10782 | null |
| 2024-07-15 | Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis | Antoine Legrand et.al. | 2407.10762 | null |
| 2024-07-16 | GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation | Haonan Wang et.al. | 2407.10756 | null |
| 2024-07-15 | Learning to Estimate the Pose of a Peer Robot in a Camera Image by Predicting the States of its LEDs | Nicholas Carlotti et.al. | 2407.10661 | null |
| 2024-07-15 | Deep-Learning-Based Markerless Pose Estimation Systems in Gait Analysis: DeepLabCut Custom Training and the Refinement Function | Giulia Panconi et.al. | 2407.10590 | null |
| 2024-07-14 | 3D Foundation Models Enable Simultaneous Geometry and Pose Estimation of Grasped Objects | Weiming Zhi et.al. | 2407.10331 | null |
| 2024-07-16 | psifx – Psychological and Social Interactions Feature Extraction Package | Guillaume Rochette et.al. | 2407.10266 | null |
| 2024-07-14 | Efficient Facial Landmark Detection for Embedded Systems | Ji-Jia Wu et.al. | 2407.10228 | null |
| 2024-07-14 | PAFUSE: Part-based Diffusion for 3D Whole-Body Pose Estimation | Nermin Samet et.al. | 2407.10220 | null |
| 2024-07-12 | iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning | Tom Fischer et.al. | 2407.09271 | link |
| 2024-07-12 | HUP-3D: A 3D multi-view synthetic dataset for assisted-egocentric hand-ultrasound pose estimation | Manuel Birlo et.al. | 2407.09215 | null |
| 2024-07-12 | KGpose: Keypoint-Graph Driven End-to-End Multi-Object 6D Pose Estimation via Point-Wise Pose Voting | Andrew Jeong et.al. | 2407.08909 | null |
| 2024-07-11 | RTMW: Real-Time Multi-Person 2D and 3D Whole-body Pose Estimation | Tao Jiang et.al. | 2407.08634 | link |
| 2024-07-11 | SRPose: Two-view Relative Pose Estimation with Sparse Keypoints | Rui Yin et.al. | 2407.08199 | link |
| 2024-07-11 | SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM | Neng Wang et.al. | 2407.08106 | link |
| 2024-07-10 | RoCap: A Robotic Data Collection Pipeline for the Pose Estimation of Appearance-Changing Objects | Jiahao Nick Li et.al. | 2407.08081 | null |
| 2024-07-10 | Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization | Jinjie Mai et.al. | 2407.08023 | link |
| 2024-07-10 | Greit-HRNet: Grouped Lightweight High-Resolution Network for Human Pose Estimation | Junjia Han et.al. | 2407.07389 | null |
| 2024-07-09 | Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images | Chuanrui Zhang et.al. | 2407.06984 | null |
| 2024-07-09 | Computer vision tasks for intelligent aerospace missions: An overview | Huilin Chen et.al. | 2407.06513 | null |
| 2024-07-08 | GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields | Weiyi Xue et.al. | 2407.05597 | null |
| 2024-07-10 | On the power of data augmentation for head pose estimation | Michael Welter et.al. | 2407.05357 | null |
| 2024-07-07 | SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning | Yi Feng et.al. | 2407.05283 | link |
| 2024-07-05 | Unsupervised Learning of Category-Level 3D Pose from Object-Centric Videos | Leonhard Sommer et.al. | 2407.04384 | link |
| 2024-07-04 | Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation | Laiyan Ding et.al. | 2407.04041 | link |
| 2024-07-04 | Markerless Multi-view 3D Human Pose Estimation: a survey | Ana Filipa Rodrigues Nogueira et.al. | 2407.03817 | null |
| 2024-07-04 | A Fast Dynamic Point Detection Method for LiDAR-Inertial Odometry in Driving Scenarios | Zikang Yuan et.al. | 2407.03590 | null |
| 2024-07-03 | Graph and Skipped Transformer: Exploiting Spatial and Temporal Modeling Capacities for Efficient 3D Human Pose Estimation | Mengmeng Cui et.al. | 2407.02990 | null |
| 2024-07-03 | Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction | Jiaxin Guo et.al. | 2407.02918 | link |
| 2024-07-02 | SUPER: Seated Upper Body Pose Estimation using mmWave Radars | Bo Zhang et.al. | 2407.02455 | null |
| 2024-07-02 | ReliaAvatar: A Robust Real-Time Avatar Animator with Integrated Motion Prediction | Bo Qian et.al. | 2407.02129 | null |
| 2024-07-02 | Joint-Dataset Learning and Cross-Consistent Regularization for Text-to-Motion Retrieval | Nicola Messina et.al. | 2407.02104 | null |
| 2024-07-01 | Active Human Pose Estimation via an Autonomous UAV Agent | Jingxi Chen et.al. | 2407.01811 | null |
| 2024-07-01 | RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields | Haochen Jiang et.al. | 2407.01303 | null |
| 2024-07-01 | Collaborative Graph Exploration with Reduced Pose-SLAM Uncertainty via Submodular Optimization | Ruofei Bai et.al. | 2407.01013 | null |
| 2024-06-30 | Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation | Adnan Abdullah et.al. | 2407.00848 | null |
| 2024-06-29 | When Robots Get Chatty: Grounding Multimodal Human-Robot Conversation and Collaboration | Philipp Allgeuer et.al. | 2407.00518 | null |
| 2024-06-28 | Assistive Image Annotation Systems with Deep Learning and Natural Language Capabilities: A Review | Moseli Mots’oehli et.al. | 2407.00252 | null |
| 2024-06-28 | EPOCH: Jointly Estimating the 3D Pose of Cameras and Humans | Nicola Garau et.al. | 2406.19726 | null |
| 2024-06-28 | CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services | DongKi Noh et.al. | 2406.19634 | null |
| 2024-06-27 | Multimodal Visual-haptic pose estimation in the presence of transient occlusion | Michael Zechmair et.al. | 2406.19323 | null |
| 2024-06-27 | Human Modelling and Pose Estimation Overview | Pawel Knap et.al. | 2406.19290 | null |
| 2024-06-26 | Towards Human-Level 3D Relative Pose Estimation: Generalizable, Training-Free, with Single Reference | Yuan Gao et.al. | 2406.18453 | link |
| 2024-06-27 | Automatic infant 2D pose estimation from videos: comparing seven deep neural network methods | Filipe Gama et.al. | 2406.17382 | null |
| 2024-06-24 | High-resolution open-vocabulary object 6D pose estimation | Jaime Corsetti et.al. | 2406.16384 | null |
| 2024-06-23 | Breaking the Frame: Image Retrieval by Visual Overlap Prediction | Tong Wei et.al. | 2406.16204 | link |
| 2024-06-21 | Efficient Human Pose Estimation: Leveraging Advanced Techniques with MediaPipe | Sandeep Singh Sengar et.al. | 2406.15649 | link |
| 2024-06-24 | Investigating the impact of 2D gesture representation on co-speech gesture generation | Teo Guichoux et.al. | 2406.15111 | null |
| 2024-06-20 | Benchmarking Monocular 3D Dog Pose Estimation Using In-The-Wild Motion Capture Data | Moira Shooter et.al. | 2406.14412 | null |
| 2024-06-20 | PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions | Sihan Ma et.al. | 2406.14367 | null |
| 2024-06-19 | NeRF-Feat: 6D Object Pose Estimation using Feature Rendering | Shishir Reddy Vutukur et.al. | 2406.13796 | null |
| 2024-06-19 | CNN Based Flank Predictor for Quadruped Animal Species | Vanessa Suessle et.al. | 2406.13588 | null |
| 2024-06-19 | MVSBoost: An Efficient Point Cloud-based 3D Reconstruction | Umair Haroon et.al. | 2406.13515 | null |
| 2024-06-19 | An Efficient yet High-Performance Method for Precise Radar-Based Imaging of Human Hand Poses | Johanna Bräunig et.al. | 2406.13464 | null |
| 2024-06-18 | Head Pose Estimation and 3D Neural Surface Reconstruction via Monocular Camera in situ for Navigation and Safe Insertion into Natural Openings | Ruijie Tang et.al. | 2406.13048 | null |
| 2024-06-17 | Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization | Huaiji Zhou et.al. | 2406.11766 | null |
| 2024-06-17 | Domain Generalization for In-Orbit 6D Pose Estimation | Antoine Legrand et.al. | 2406.11743 | null |
| 2024-06-17 | SeamPose: Repurposing Seams as Capacitive Sensors in a Shirt for Upper-Body Pose Tracking | Tianhong Catherine Yu et.al. | 2406.11645 | null |
| 2024-06-14 | Galibr: Targetless LiDAR-Camera Extrinsic Calibration Method via Ground Plane Initialization | Wonho Song et.al. | 2406.11599 | null |
| 2024-06-15 | MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception | M. Mahbubur Rahman et.al. | 2406.10708 | null |
| 2024-06-15 | Improving Ab-Initio Cryo-EM Reconstruction with Semi-Amortized Pose Inference | Shayan Shekarforoush et.al. | 2406.10455 | null |
| 2024-06-14 | The BabyView dataset: High-resolution egocentric videos of infants’ and young children’s everyday experiences | Bria Long et.al. | 2406.10447 | null |
| 2024-06-14 | OpenCapBench: A Benchmark to Bridge Pose Estimation and Biomechanics | Yoni Gozlan et.al. | 2406.09788 | null |
| 2024-06-13 | ImageNet3D: Towards General-Purpose Object-Level 3D Understanding | Wufei Ma et.al. | 2406.09613 | link |
| 2024-06-13 | Deep Transformer Network for Monocular Pose Estimation of Ship-Based UAV | Maneesha Wickramasuriya et.al. | 2406.09260 | link |
| 2024-06-14 | Language-Driven Closed-Loop Grasping with Model-Predictive Trajectory Replanning | Huy Hoang Nguyen et.al. | 2406.09039 | null |
| 2024-06-14 | VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks | Jiannan Wu et.al. | 2406.08394 | link |
| 2024-06-12 | Asymptotic Unbiased Sample Sampling to Speed Up Sharpness-Aware Minimization | Jiaxin Deng et.al. | 2406.08001 | null |
| 2024-06-12 | IFTD: Image Feature Triangle Descriptor for Loop Detection in Driving Scenes | Fengtian Lang et.al. | 2406.07937 | link |
| 2024-06-12 | From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers | Swaminathan Gurumurthy et.al. | 2406.07785 | link |
| 2024-06-12 | SPIN: Spacecraft Imagery for Navigation | Javier Montalvo et.al. | 2406.07500 | link |
| 2024-06-11 | Realistic Data Generation for 6D Pose Estimation of Surgical Instruments | Juan Antonio Barragan et.al. | 2406.07328 | link |
| 2024-06-11 | SignMusketeers: An Efficient Multi-Stream Approach for Sign Language Translation at Scale | Shester Gueuwou et.al. | 2406.06907 | null |
| 2024-06-10 | Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation | Shenghao Li et.al. | 2406.06374 | link |
| 2024-06-08 | A preprocessing-based planning framework for utilizing contacts in high-precision insertion tasks | Muhammad Suhail Saleem et.al. | 2406.05522 | null |
| 2024-06-06 | GLACE: Global Local Accelerated Coordinate Encoding | Fangjinhua Wang et.al. | 2406.04340 | link |
| 2024-06-06 | Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking | Jiyao Zhang et.al. | 2406.04316 | null |
| 2024-06-05 | Hi5: 2D Hand Pose Estimation with Zero Human Annotation | Masum Hasan et.al. | 2406.03599 | null |
| 2024-06-05 | Sparse Color-Code Net: Real-Time RGB-Based 6D Object Pose Estimation on Edge Devices | Xingjian Yang et.al. | 2406.02977 | null |
| 2024-06-04 | CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation | Dejia Xu et.al. | 2406.02509 | null |
| 2024-06-04 | HPE-CogVLM: New Head Pose Grounding Task Exploration on Vision Language Model | Yu Tian et.al. | 2406.01914 | null |
| 2024-06-03 | A Robust Filter for Marker-less Multi-person Tracking in Human-Robot Interaction Scenarios | Enrico Martini et.al. | 2406.01832 | link |
| 2024-06-01 | Equivariant amortized inference of poses for cryo-EM | Larissa de Ruijter et.al. | 2406.01630 | null |
| 2024-06-03 | 3D WholeBody Pose Estimation based on Semantic Graph Attention Network and Distance Information | Sihan Wen et.al. | 2406.01196 | null |
| 2024-06-01 | CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation | Matan Rusanovsky et.al. | 2406.00384 | link |
| 2024-05-30 | Infinite 3D Landmarks: Improving Continuous 2D Facial Landmark Detection | Prashanth Chandran et.al. | 2405.20117 | null |
| 2024-05-30 | Estimating Human Poses Across Datasets: A Unified Skeleton and Multi-Teacher Distillation Approach | Muhammad Saif Ullah Khan et.al. | 2405.20084 | null |
| 2024-05-30 | TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM | Peifeng Jiang et.al. | 2405.19614 | null |
| 2024-05-29 | Real-Time Dynamic Robot-Assisted Hand-Object Interaction via Motion Primitives | Mingqi Yuan et.al. | 2405.19531 | null |
| 2024-05-29 | Exploring AI-based Anonymization of Industrial Image and Video Data in the Context of Feature Preservation | Sabrina Cynthia Triess et.al. | 2405.19173 | null |
| 2024-05-28 | World Models for General Surgical Grasping | Hongbin Lin et.al. | 2405.17940 | null |
| 2024-05-27 | MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds | Jiahui Lei et.al. | 2405.17421 | link |
| 2024-05-27 | Occlusion Handling in 3D Human Pose Estimation with Perturbed Positional Encoding | Niloofar Azizi et.al. | 2405.17397 | null |
| 2024-05-27 | $\text{Di}^2\text{Pose}$ : Discrete Diffusion Model for Occluded 3D Human Pose Estimation | Weiquan Wang et.al. | 2405.17016 | null |
| 2024-05-27 | Clustering-based Learning for UAV Tracking and Pose Estimation | Jiaping Xiao et.al. | 2405.16867 | null |
| 2024-05-26 | Multi-Modal UAV Detection, Classification and Tracking Algorithm – Technical Report for CVPR 2024 UG2 Challenge | Tianchen Deng et.al. | 2405.16464 | link |
| 2024-05-25 | Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality | Hakim Ikebayashi et.al. | 2405.16008 | null |
| 2024-05-23 | CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments | Yang Zhou et.al. | 2405.14731 | link |
| 2024-05-23 | Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation | Daniel Kienzle et.al. | 2405.14467 | link |
| 2024-05-21 | Geometric Transformation Uncertainty for Improving 3D Fetal Brain Pose Prediction from Freehand 2D Ultrasound Videos | Jayroop Ramesh et.al. | 2405.13235 | null |
| 2024-05-21 | Leveraging Neural Radiance Fields for Pose Estimation of an Unknown Space Object during Proximity Operations | Antoine Legrand et.al. | 2405.12728 | null |
| 2024-05-21 | PoseGravity: Pose Estimation from Points and Lines with Axis Prior | Akshay Chandrasekhar et.al. | 2405.12646 | link |
| 2024-05-19 | Focus on Low-Resolution Information: Multi-Granular Information-Lossless Model for Low-Resolution Human Pose Estimation | Zejun Gu et.al. | 2405.12247 | null |
| 2024-05-20 | AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot Movements | Calvin Yeung et.al. | 2405.12070 | link |
| 2024-05-19 | Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries | Christiaan G. A. Viviers et.al. | 2405.11677 | link |
| 2024-05-19 | Cross-Domain Knowledge Distillation for Low-Resolution Human Pose Estimation | Zejun Gu et.al. | 2405.11448 | null |
| 2024-05-18 | PS6D: Point Cloud Based Symmetry-Aware 6D Object Pose Estimation in Robot Bin-Picking | Yifan Yang et.al. | 2405.11257 | null |
| 2024-05-18 | MotionGS : Compact Gaussian Splatting SLAM by Motion Filter | Xinli Guo et.al. | 2405.11129 | link |
| 2024-05-17 | Resolving Symmetry Ambiguity in Correspondence-based Methods for Instance-level Object Pose Estimation | Yongliang Lin et.al. | 2405.10557 | null |
| 2024-05-16 | Diversity-Aware Sign Language Production through a Pose Encoding Variational Autoencoder | Mohamed Ilyes Lakhal et.al. | 2405.10423 | null |
| 2024-05-17 | Toon3D: Seeing Cartoons from a New Perspective | Ethan Weber et.al. | 2405.10320 | null |
| 2024-05-15 | Task-adaptive Q-Face | Haomiao Sun et.al. | 2405.09059 | null |
| 2024-05-14 | RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images | Zong-Wei Hong et.al. | 2405.08483 | link |
| 2024-05-14 | TP3M: Transformer-based Pseudo 3D Image Matching with Reference | Liming Han et.al. | 2405.08434 | null |
| 2024-05-13 | Deep Learning-Based Object Pose Estimation: A Comprehensive Survey | Jian Liu et.al. | 2405.07801 | link |
| 2024-05-13 | JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation | Xubo Luo et.al. | 2405.07429 | link |
| 2024-05-11 | TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization | Zhen Tan et.al. | 2405.07027 | null |
| 2024-05-11 | AHPPEBot: Autonomous Robot for Tomato Harvesting based on Phenotyping and Pose Estimation | Xingxu Li et.al. | 2405.06959 | null |
| 2024-05-10 | CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras | James Tang et.al. | 2405.06845 | link |
| 2024-05-10 | MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization | Pengcheng Zhu et.al. | 2405.06241 | null |
| 2024-05-10 | Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera | Haixin Shi et.al. | 2405.05858 | null |
| 2024-05-09 | Semi-Autonomous Laparoscopic Robot Docking with Learned Hand-Eye Information Fusion | Huanyu Tian et.al. | 2405.05817 | null |
| 2024-05-09 | NeuRSS: Enhancing AUV Localization and Bathymetric Mapping with Neural Rendering for Sidescan SLAM | Yiping Xie et.al. | 2405.05807 | null |
| 2024-05-09 | Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview | Yuhang Ming et.al. | 2405.05526 | null |
| 2024-05-08 | Adversary-Guided Motion Retargeting for Skeleton Anonymization | Thomas Carr et.al. | 2405.05428 | null |
| 2024-05-08 | FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models | Jinglin Xu et.al. | 2405.05216 | link |
| 2024-05-08 | ProbRadarM3F: mmWave Radar based Human Skeletal Pose Estimation with Probability Map Guided Multi-Format Feature Fusion | Bing Zhu et.al. | 2405.05164 | null |
| 2024-05-08 | GISR: Geometric Initialization and Silhouette-based Refinement for Single-View Robot Pose and Configuration Estimation | Ivan Bilić et.al. | 2405.04890 | null |
| 2024-05-07 | Learning Distributional Demonstration Spaces for Task-Specific Cross-Pose Estimation | Jenny Wang et.al. | 2405.04609 | null |
| 2024-05-07 | Speak the Same Language: Global LiDAR Registration on BIM Using Pose Hough Transform | Zhijian Qiao et.al. | 2405.03969 | null |
| 2024-05-07 | Joint Estimation of Identity Verification and Relative Pose for Partial Fingerprints | Xiongjun Guan et.al. | 2405.03959 | null |
| 2024-05-06 | Pose Priors from Language Models | Sanjay Subramanian et.al. | 2405.03689 | null |
| 2024-05-06 | Optimizing Hand Region Detection in MediaPipe Holistic Full-Body Pose Estimation to Improve Accuracy and Avoid Downstream Errors | Amit Moryossef et.al. | 2405.03545 | link |
| 2024-05-05 | Multi-hop graph transformer network for 3D human pose estimation | Zaedul Islam et.al. | 2405.03055 | null |
| 2024-05-05 | Blending Distributed NeRFs with Tri-stage Robust Pose Optimization | Baijun Ye et.al. | 2405.02880 | null |
| 2024-05-03 | WeightedPose: Generalizable Cross-Pose Estimation via Weighted SVD | Xuxin Cheng et.al. | 2405.02241 | null |
| 2024-05-03 | Probablistic Restoration with Adaptive Noise Sampling for 3D Human Pose Estimation | Xianzhou Zeng et.al. | 2405.02114 | link |
| 2024-05-03 | An Onboard Framework for Staircases Modeling Based on Point Clouds | Chun Qing et.al. | 2405.01918 | null |
| 2024-05-06 | ShadowNav: Autonomous Global Localization for Lunar Navigation in Darkness | Deegan Atha et.al. | 2405.01673 | null |
| 2024-05-02 | IntervenGen: Interventional Data Generation for Robust and Data-Efficient Robot Imitation Learning | Ryan Hoque et.al. | 2405.01472 | null |
| 2024-05-02 | Behavior Imitation for Manipulator Control and Grasping with Deep Reinforcement Learning | Liu Qiyuan et.al. | 2405.01284 | null |
| 2024-05-02 | Sports Analysis and VR Viewing System Based on Player Tracking and Pose Estimation with Multimodal and Multiview Sensors | Wenxuan Guo et.al. | 2405.01112 | null |
| 2024-05-02 | CoViS-Net: A Cooperative Visual Spatial Foundation Model for Multi-Robot Applications | Jan Blumenkamp et.al. | 2405.01107 | null |
| 2024-05-04 | HandSSCA: 3D Hand Mesh Reconstruction with State Space Channel Attention from RGB images | Zixun Jiao et.al. | 2405.01066 | null |
| 2024-05-01 | Radar-Based Localization For Autonomous Ground Vehicles In Suburban Neighborhoods | Andrew J. Kramer et.al. | 2405.00600 | null |
| 2024-04-30 | Ultra Inertial Poser: Scalable Motion Capture and Tracking from Sparse Inertial Sensors and Ultra-Wideband Ranging | Rayan Armani et.al. | 2404.19541 | link |
| 2024-04-30 | UniFS: Universal Few-shot Instance Perception with Point Representations | Sheng Jin et.al. | 2404.19401 | null |
| 2024-04-30 | Quater-GCN: Enhancing 3D Human Pose Estimation with Orientation and Semi-supervised Training | Xingyu Song et.al. | 2404.19279 | null |
| 2024-04-30 | XFeat: Accelerated Features for Lightweight Image Matching | Guilherme Potje et.al. | 2404.19174 | link |
| 2024-04-29 | Self-Avatar Animation in Virtual Reality: Impact of Motion Signals Artifacts on the Full-Body Pose Reconstruction | Antoine Maiorca et.al. | 2404.18628 | null |
| 2024-04-29 | Mesh-based Photorealistic and Real-time 3D Mapping for Robust Visual Perception of Autonomous Underwater Vehicle | Jungwoo Lee et.al. | 2404.18395 | null |
| 2024-04-29 | Reconstructing Satellites in 3D from Amateur Telescope Images | Zhiming Chang et.al. | 2404.18394 | null |
| 2024-04-27 | Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs | Yiming Bao et.al. | 2404.17837 | null |
| 2024-04-26 | Localization Through Particle Filter Powered Neural Network Estimated Monocular Camera Poses | Yi Shen et.al. | 2404.17685 | null |
| 2024-04-26 | SLAM for Indoor Mapping of Wide Area Construction Environments | Vincent Ress et.al. | 2404.17215 | null |
| 2024-04-25 | WheelPose: Data Synthesis Techniques to Improve Pose Estimation Performance on Wheelchair Users | William Huang et.al. | 2404.17063 | link |
| 2024-04-25 | Transformer-Based Local Feature Matching for Multimodal Image Registration | Remi Delaunay et.al. | 2404.16802 | null |
| 2024-04-25 | DeepKalPose: An Enhanced Deep-Learning Kalman Filter for Temporally Consistent Monocular Vehicle Pose Estimation | Leandro Di Bella et.al. | 2404.16558 | null |
| 2024-04-25 | Efficient Solution of Point-Line Absolute Pose | Petr Hruby et.al. | 2404.16552 | link |
| 2024-04-25 | COBRA – COnfidence score Based on shape Regression Analysis for method-independent quality assessment of object pose estimation from single images | Panagiotis Sapoutzoglou et.al. | 2404.16471 | link |
| 2024-04-25 | MegaParticles: Range-based 6-DoF Monte Carlo Localization with GPU-Accelerated Stein Particle Filter | Kenji Koide et.al. | 2404.16370 | null |
| 2024-04-24 | 3D Human Pose Estimation with Occlusions: Introducing BlendMimic3D Dataset and GCN Refinement | Filipa Lino et.al. | 2404.16136 | null |
| 2024-04-23 | SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation | Xiangyu Xu et.al. | 2404.15276 | link |
| 2024-04-25 | Domain adaptive pose estimation via multi-level alignment | Yugan Chen et.al. | 2404.14885 | link |
| 2024-04-23 | Semi-supervised 2D Human Pose Estimation via Adaptive Keypoint Masking | Kexin Meng et.al. | 2404.14835 | null |
| 2024-04-23 | UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues | Vandad Davoodnia et.al. | 2404.14634 | null |
| 2024-04-22 | DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation | Yonghao Dang et.al. | 2404.14025 | null |
| 2024-04-23 | CT-NeRF: Incremental Optimizing Neural Radiance Field and Poses with Complex Trajectory | Yunlong Ran et.al. | 2404.13896 | null |
| 2024-04-21 | Resampling-free Particle Filters in High-dimensions | Akhilan Boopathy et.al. | 2404.13698 | null |
| 2024-04-20 | EC-SLAM: Real-time Dense Neural RGB-D SLAM System with Effectively Constrained Global Bundle Adjustment | Guanghao Li et.al. | 2404.13346 | link |
| 2024-04-18 | Spot-Compose: A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds | Oliver Lemke et.al. | 2404.12440 | null |
| 2024-04-18 | Gait Recognition from Highly Compressed Videos | Andrei Niculae et.al. | 2404.12183 | null |
| 2024-04-17 | Mushroom Segmentation and 3D Pose Estimation from Point Clouds using Fully Convolutional Geometric Features and Implicit Pose Encoding | George Retsinas et.al. | 2404.12144 | link |
| 2024-04-17 | Kathakali Hand Gesture Recognition With Minimal Data | Kavitha Raju et.al. | 2404.11205 | null |
| 2024-04-17 | GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement | Linfang Zheng et.al. | 2404.11139 | null |
| 2024-04-17 | CorrNet+: Sign Language Recognition and Translation via Spatial-Temporal Correlation | Lianyu Hu et.al. | 2404.11111 | link |
| 2024-04-16 | HumMUSS: Human Motion Understanding using State Space Models | Arnab Kumar Mondal et.al. | 2404.10880 | null |
| 2024-04-16 | Invariant Kalman Filtering with Noise-Free Pseudo-Measurements | Sven Goffin et.al. | 2404.10687 | null |
| 2024-04-16 | The Unreasonable Effectiveness of Pre-Trained Features for Camera Pose Refinement | Gabriele Trivigno et.al. | 2404.10438 | null |
| 2024-04-16 | GaitPoint+: A Gait Recognition Network Incorporating Point Cloud Analysis and Recycling | Huantao Ren et.al. | 2404.10213 | null |
| 2024-04-16 | LWIRPOSE: A novel LWIR Thermal Image Dataset and Benchmark | Avinash Upadhyay et.al. | 2404.10212 | link |
| 2024-04-15 | LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives | Jiadi Cui et.al. | 2404.09748 | null |
| 2024-04-14 | In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition | Wiktor Mucha et.al. | 2404.09308 | null |
| 2024-04-13 | DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector | Johan Edstedt et.al. | 2404.08928 | link |
| 2024-04-16 | 3D Human Scan With A Moving Event Camera | Kai Kohyama et.al. | 2404.08504 | null |
| 2024-04-11 | Separated Attention: An Improved Cycle GAN Based Under Water Image Enhancement Method | Tashmoy Ghosh et.al. | 2404.07649 | null |
| 2024-04-11 | GLID: Pre-training a Generalist Encoder-Decoder Vision Model | Jihao Liu et.al. | 2404.07603 | null |
| 2024-04-10 | Measuring proximity to standard planes during fetal brain ultrasound scanning | Chiara Di Vece et.al. | 2404.07124 | null |
| 2024-04-10 | MoCap-to-Visual Domain Adaptation for Efficient Human Mesh Estimation from 2D Keypoints | Bedirhan Uguz et.al. | 2404.07094 | null |
| 2024-04-10 | Gaussian-LIC: Photo-realistic LiDAR-Inertial-Camera SLAM with 3D Gaussian Splatting | Xiaolei Lang et.al. | 2404.06926 | link |
| 2024-04-09 | Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences | Axel Barroso-Laguna et.al. | 2404.06337 | link |
| 2024-04-09 | Incremental Joint Learning of Depth, Pose and Implicit Scene Representation on Monocular Camera in Large-scale Scenes | Tianchen Deng et.al. | 2404.06050 | null |
| 2024-04-09 | Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation | Zong-Wei Hong et.al. | 2404.06029 | null |
| 2024-04-08 | Learning 3D-Aware GANs from Unposed Images with Template Feature Field | Xinya Chen et.al. | 2404.05705 | null |
| 2024-04-08 | Learning a Category-level Object Pose Estimator without Pose Annotations | Fengrui Tian et.al. | 2404.05626 | null |
| 2024-04-08 | DepthMOT: Depth Cues Lead to a Strong Multi-Object Tracker | Jiapeng Wu et.al. | 2404.05518 | link |
| 2024-04-08 | Two Hands Are Better Than One: Resolving Hand to Hand Intersections via Occupancy Networks | Maksym Ivashechkin et.al. | 2404.05414 | null |
| 2024-04-08 | STITCH: Augmented Dexterity for Suture Throws Including Thread Coordination and Handoffs | Kush Hari et.al. | 2404.05151 | null |
| 2024-04-05 | ToolEENet: Tool Affordance 6D Pose Estimation | Yunlong Wang et.al. | 2404.04193 | null |
| 2024-04-04 | SDPose: Tokenized Pose Estimation via Circulation-Guide Self-Distillation | Sichen Chen et.al. | 2404.03518 | link |
| 2024-04-04 | Multi Positive Contrastive Learning with Pose-Consistent Generated Images | Sho Inayoshi et.al. | 2404.03256 | null |
| 2024-04-04 | HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud | Wencan Cheng et.al. | 2404.03159 | link |
| 2024-04-03 | Fusing Multi-sensor Input with State Information on TinyML Brains for Autonomous Nano-drones | Luca Crupi et.al. | 2404.02567 | null |
| 2024-04-03 | Semi-Supervised Unconstrained Head Pose Estimation in the Wild | Huayi Zhou et.al. | 2404.02544 | link |
| 2024-04-02 | 3D Congealing: 3D-Aware Image Alignment in the Wild | Yunzhi Zhang et.al. | 2404.02125 | null |
| 2024-04-02 | SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation | Vinkle Srivastav et.al. | 2404.02041 | link |
| 2024-04-01 | Marrying NeRF with Feature Matching for One-step Pose Estimation | Ronghan Chen et.al. | 2404.00891 | null |
| 2024-03-31 | Graph-Based vs. Error State Kalman Filter-Based Fusion Of 5G And Inertial Data For MAV Indoor Pose Estimation | Meisam Kabiri et.al. | 2404.00691 | null |
| 2024-03-31 | OmniLocalRF: Omnidirectional Local Radiance Fields from Dynamic Videos | Dongyoung Choi et.al. | 2404.00676 | null |
| 2024-04-02 | KTPFormer: Kinematics and Trajectory Prior Knowledge-Enhanced Transformer for 3D Human Pose Estimation | Jihua Peng et.al. | 2404.00658 | link |
| 2024-03-29 | FetalDiffusion: Pose-Controllable 3D Fetal MRI Synthesis with Conditional Diffusion Model | Molin Zhang et.al. | 2404.00132 | null |
| 2024-03-29 | Latent Embedding Clustering for Occlusion Robust Head Pose Estimation | José Celestino et.al. | 2403.20251 | null |
| 2024-03-29 | A Unified Framework for Human-centric Point Cloud Video Understanding | Yiteng Xu et.al. | 2403.20031 | null |
| 2024-04-01 | Video-Based Human Pose Regression via Decoupled Space-Time Aggregation | Jijie He et.al. | 2403.19926 | link |
| 2024-03-28 | Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation | Xiao Lin et.al. | 2403.19527 | link |
| 2024-03-27 | Object Pose Estimation via the Aggregation of Diffusion Features | Tianfu Wang et.al. | 2403.18791 | link |
| 2024-03-27 | RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation | Yang Tian et.al. | 2403.18259 | null |
| 2024-03-26 | Mathematical Foundation and Corrections for Full Range Head Pose Estimation | Huei-Chung Hu et.al. | 2403.18104 | null |
| 2024-03-26 | EgoPoseFormer: A Simple Baseline for Egocentric 3D Human Pose Estimation | Chenhongyi Yang et.al. | 2403.18080 | link |
| 2024-03-26 | A Survey on 3D Egocentric Human Pose Estimation | Md Mushfiqur Azam et.al. | 2403.17893 | null |
| 2024-03-26 | GTA-HDR: A Large-Scale Synthetic Dataset for HDR Image Reconstruction | Hrishav Bakul Barua et.al. | 2403.17837 | link |
| 2024-03-26 | DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions | Sammy Christen et.al. | 2403.17827 | null |
| 2024-03-26 | System Calibration of a Field Phenotyping Robot with Multiple High-Precision Profile Laser Scanners | Felix Esser et.al. | 2403.17788 | null |
| 2024-03-25 | Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos | Remy Sabathier et.al. | 2403.17103 | link |
| 2024-03-25 | Characterisation of the Intel RealSense D415 Stereo Depth Camera for Motion-Corrected CT Perfusion Imaging | Mahdieh Dashtbani Moghari et.al. | 2403.16490 | null |
| 2024-03-25 | Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects | Zicong Fan et.al. | 2403.16428 | null |
| 2024-03-25 | A Geometric Perspective on Fusing Gaussian Distributions on Lie Groups | Yixiao Ge et.al. | 2403.16411 | null |
| 2024-03-25 | ASDF: Assembly State Detection Utilizing Late Fusion by Integrating 6D Pose Estimation | Hannah Schieber et.al. | 2403.16400 | link |
| 2024-03-24 | KITchen: A Real-World Benchmark and Dataset for 6D Object Pose Estimation in Kitchen Environments | Abdelrahman Younes et.al. | 2403.16238 | null |
| 2024-03-24 | Diffusion Model is a Good Pose Estimator from 3D RF-Vision | Junqiao Fan et.al. | 2403.16198 | null |
| 2024-03-23 | UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation | Yuliang Guo et.al. | 2403.15705 | link |
| 2024-03-22 | InterFusion: Text-Driven Generation of 3D Human-Object Interaction | Sisi Dai et.al. | 2403.15612 | link |
| 2024-03-22 | Augmented Reality Warnings in Roadway Work Zones: Evaluating the Effect of Modality on Worker Reaction Times | Sepehr Sabeti et.al. | 2403.15571 | null |
| 2024-03-22 | Gesture-Controlled Aerial Robot Formation for Human-Swarm Interaction in Safety Monitoring Applications | Vít Krátký et.al. | 2403.15333 | null |
| 2024-03-22 | WSCLoc: Weakly-Supervised Sparse-View Camera Relocalization | Jialu Wang et.al. | 2403.15272 | null |
| 2024-03-22 | DITTO: Demonstration Imitation by Trajectory Transformation | Nick Heppert et.al. | 2403.15203 | link |
| 2024-03-22 | Cartoon Hallucinations Detection: Pose-aware In Context Visual Learning | Bumsoo Kim et.al. | 2403.15048 | null |
| 2024-03-22 | Trajectory Regularization Enhances Self-Supervised Geometric Representation | Jiayun Wang et.al. | 2403.14973 | link |
| 2024-03-21 | VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding | Ahmad Mahmood et.al. | 2403.14743 | link |
| 2024-03-21 | Visibility-Aware Keypoint Localization for 6DoF Object Pose Estimation | Ruyi Lian et.al. | 2403.14559 | null |
| 2024-03-21 | Exploring 3D Human Pose Estimation and Forecasting from the Robot’s Perspective: The HARPER Dataset | Andrea Avogaro. Andrea Toaiari et.al. | 2403.14447 | null |
| 2024-03-21 | Evaluation and Deployment of LiDAR-based Place Recognition in Dense Forests | Haedam Oh et.al. | 2403.14326 | null |
| 2024-03-21 | Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation | Francesco Di Felice et.al. | 2403.14279 | null |
| 2024-03-20 | DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses | Chen Zhao et.al. | 2403.13683 | link |
| 2024-03-20 | Meta-Point Learning and Refining for Category-Agnostic Pose Estimation | Junjie Chen et.al. | 2403.13647 | link |
| 2024-03-20 | Advancing 6D Pose Estimation in Augmented Reality – Overcoming Projection Ambiguity with Uncontrolled Imagery | Mayura Manawadu et.al. | 2403.13434 | null |
| 2024-03-20 | DOR3D-Net: Dense Ordinal Regression Network for 3D Hand Pose Estimation | Yamin Mao et.al. | 2403.13405 | null |
| 2024-03-20 | ManiPose: A Comprehensive Benchmark for Pose-aware Object Manipulation in Robotics | Qiaojun Yu et.al. | 2403.13365 | null |
| 2024-03-20 | MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination | Weiying Wang et.al. | 2403.13348 | null |
| 2024-03-19 | FaceXFormer: A Unified Transformer for Facial Analysis | Kartik Narayan et.al. | 2403.12960 | null |
| 2024-03-19 | WHAC: World-grounded Humans and Cameras | Wanqi Yin et.al. | 2403.12959 | link |
| 2024-03-19 | Diffusion-Driven Self-Supervised Learning for Shape Reconstruction and Pose Estimation | Jingtao Sun et.al. | 2403.12728 | link |
| 2024-03-19 | IFFNeRF: Initialisation Free and Fast 6DoF pose estimation from a single image and a NeRF model | Matteo Bortolon et.al. | 2403.12682 | null |
| 2024-03-19 | In-Hand Following of Deformable Linear Objects Using Dexterous Fingers with Tactile Sensing | Mingrui Yu et.al. | 2403.12676 | null |
| 2024-03-19 | Self-learning Canonical Space for Multi-view 3D Human Pose Estimation | Xiaoben Li et.al. | 2403.12440 | null |
| 2024-03-19 | Human Mesh Recovery from Arbitrary Multi-view Images | Xiaoben Li et.al. | 2403.12434 | null |
| 2024-03-19 | XPose: eXplainable Human Pose Estimation | Luyu Qiu et.al. | 2403.12370 | null |
| 2024-03-18 | HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data | Mengqi Zhang et.al. | 2403.12011 | null |
| 2024-03-18 | Normalized Validity Scores for DNNs in Regression based Eye Feature Extraction | Wolfgang Fuhl et.al. | 2403.11665 | null |
| 2024-03-18 | An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation | Zewen Xu et.al. | 2403.11639 | null |
| 2024-03-18 | LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models | Yang Yang et.al. | 2403.11627 | link |
| 2024-03-18 | GenFlow: Generalizable Recurrent Flow for 6D Pose Refinement of Novel Objects | Sungphill Moon et.al. | 2403.11510 | null |
| 2024-03-17 | A Dual-Augmentor Framework for Domain Generalization in 3D Human Pose Estimation | Qucheng Peng et.al. | 2403.11310 | null |
| 2024-03-17 | Compact 3D Gaussian Splatting For Dense Visual SLAM | Tianchen Deng et.al. | 2403.11247 | null |
| 2024-03-16 | Robotic Task Success Evaluation Under Multi-modal Non-Parametric Object Pose Uncertainty | Lakshadeep Naik et.al. | 2403.10874 | null |
| 2024-03-16 | DPPE: Dense Pose Estimation in a Plenoxels Environment using Gradient Approximation | Christopher Kolios et.al. | 2403.10773 | null |
| 2024-03-15 | GS-Pose: Cascaded Framework for Generalizable Segmentation-based 6D Object Pose Estimation | Dingding Cai et.al. | 2403.10683 | null |
| 2024-03-15 | CLOSURE: Fast Quantification of Pose Uncertainty Sets | Yihuai Gao et.al. | 2403.09990 | null |
| 2024-03-14 | Scalable Autonomous Drone Flight in the Forest with Visual-Inertial SLAM and Dense Submaps Built without LiDAR | Sebastián Barbas Laina et.al. | 2403.09596 | null |
| 2024-03-14 | Improving Real-Time Omnidirectional 3D Multi-Person Human Pose Estimation with People Matching and Unsupervised 2D-3D Lifting | Pawel Knap et.al. | 2403.09437 | null |
| 2024-03-14 | LM2D: Lyrics- and Music-Driven Dance Synthesis | Wenjie Yin et.al. | 2403.09407 | null |
| 2024-03-14 | SD-Net: Symmetric-Aware Keypoint Prediction and Domain Adaptation for 6D Pose Estimation In Bin-picking Scenarios | Ding-Tao Huang et.al. | 2403.09317 | link |
| 2024-03-14 | MOTPose: Multi-object 6D Pose Estimation for Dynamic Video Sequences using Attention-based Temporal Fusion | Arul Selvam Periyasamy et.al. | 2403.09309 | null |
| 2024-03-13 | Data Augmentation in Human-Centric Vision | Wentao Jiang et.al. | 2403.08650 | null |
| 2024-03-13 | PRAGO: Differentiable Multi-View Pose Optimization From Objectness Detections | Matteo Taiana et.al. | 2403.08586 | null |
| 2024-03-13 | NeRF-Supervised Feature Point Detection and Description | Ali Youssef et.al. | 2403.08156 | null |
| 2024-03-12 | Q-SLAM: Quadric Representations for Monocular SLAM | Chensheng Peng et.al. | 2403.08125 | null |
| 2024-03-12 | MRC-Net: 6-DoF Pose Estimation with MultiScale Residual Correlation | Yuelong Li et.al. | 2403.08019 | null |
| 2024-03-12 | Uncertainty Quantification with Deep Ensembles for 6D Object Pose Estimation | Kira Wursthorn et.al. | 2403.07741 | null |
| 2024-03-12 | Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving | JunDa Cheng et.al. | 2403.07535 | null |
| 2024-03-12 | Category-Agnostic Pose Estimation for Point Clouds | Bowen Liu et.al. | 2403.07437 | null |
| 2024-03-12 | Monocular Microscope to CT Registration using Pose Estimation of the Incus for Augmented Reality Cochlear Implant Surgery | Yike Zhang et.al. | 2403.07219 | null |
| 2024-03-11 | Real-Time Simulated Avatar from Head-Mounted Sensors | Zhengyi Luo et.al. | 2403.06862 | null |
| 2024-03-11 | Transformer-based Fusion of 2D-pose and Spatio-temporal Embeddings for Distracted Driver Action Recognition | Erkut Akdag et.al. | 2403.06577 | null |
| 2024-03-10 | Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation | Paweł A. Pierzchlewicz et.al. | 2403.06164 | link |
| 2024-03-10 | Diffusion Models Trained with Large Data Are Transferable Visual Models | Guangkai Xu et.al. | 2403.06090 | link |
| 2024-03-08 | Prepared for the Worst: A Learning-Based Adversarial Attack for Resilience Analysis of the ICP Algorithm | Ziyu Zhang et.al. | 2403.05666 | null |
| 2024-03-11 | Exploiting polar symmetry in designing equivariant observers for vision-based motion estimation | Tarek Bouazza et.al. | 2403.05450 | null |
| 2024-03-07 | Real-Time Planning Under Uncertainty for AUVs Using Virtual Maps | Ivana Collado-Gonzalez et.al. | 2403.04936 | null |
| 2024-03-07 | That’s My Point: Compact Object-centric LiDAR Pose Estimation for Large-scale Outdoor Localisation | Georgi Pramatarov et.al. | 2403.04755 | null |
| 2024-03-07 | Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser | Qingyuan Cai et.al. | 2403.04444 | link |
| 2024-03-09 | Single-to-Dual-View Adaptation for Egocentric 3D Hand Pose Estimation | Ruicong Liu et.al. | 2403.04381 | null |
| 2024-03-05 | FAR: Flexible, Accurate and Robust 6DoF Relative Camera Pose Estimation | Chris Rockwell et.al. | 2403.03221 | null |
| 2024-03-05 | NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors | Yannan He et.al. | 2403.03122 | null |
| 2024-03-05 | Improved LiDAR Odometry and Mapping using Deep Semantic Segmentation and Novel Outliers Detection | Mohamed Afifi et.al. | 2403.03111 | null |
| 2024-03-05 | Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps | Timothy Chen et.al. | 2403.02751 | null |
| 2024-03-04 | PowerSkel: A Device-Free Framework Using CSI Signal for Human Skeleton Estimation in Power Station | Cunyi Yin et.al. | 2403.01913 | link |
| 2024-03-04 | A Simple Baseline for Efficient Hand Mesh Reconstruction | Zhishan Zhou et.al. | 2403.01813 | null |
| 2024-03-03 | MatchU: Matching Unseen Objects for 6D Pose Estimation from RGB-D Images | Junwen Huang et.al. | 2403.01517 | null |
| 2024-03-02 | Single-image camera calibration with model-free distortion correction | Katia Genovese et.al. | 2403.01263 | null |
| 2024-03-02 | Grid-based Fast and Structural Visual Odometry | Zhang Zhihe et.al. | 2403.01110 | null |
| 2024-03-01 | Optimal Robot Formations: Balancing Range-Based Observability and User-Defined Configurations | Syed Shabbir Ahmed et.al. | 2403.00988 | null |
| 2024-03-04 | TEXterity – Tactile Extrinsic deXterity: Simultaneous Tactile Estimation and Control for Extrinsic Dexterity | Sangwoon Kim et.al. | 2403.00049 | null |
| 2024-03-01 | Graph Convolutional Neural Networks for Automated Echocardiography View Recognition: A Holistic Approach | Sarina Thomas et.al. | 2402.19062 | null |
| 2024-02-29 | Deep Learning for 3D Human Pose Estimation and Mesh Recovery: A Survey | Yang Liu et.al. | 2402.18844 | link |
| 2024-02-28 | Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting | Taeho Kang et.al. | 2402.18330 | link |
| 2024-02-28 | Location-guided Head Pose Estimation for Fisheye Image | Bing Li et.al. | 2402.18320 | null |
| 2024-02-28 | NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images | Jingrui Yu et.al. | 2402.18196 | link |
| 2024-02-28 | Six-Point Method for Multi-Camera Systems with Reduced Solution Space | Banglei Guan et.al. | 2402.18066 | null |
| 2024-02-27 | Real-Time Estimation of Relative Pose for UAVs Using a Dual-Channel Feature Association | Zhaoying Wang et.al. | 2402.17504 | null |
| 2024-02-26 | HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields | Haozhe Qi et.al. | 2402.17062 | link |
| 2024-02-26 | DRSI-Net: Dual-Residual Spatial Interaction Network for Multi-Person Pose Estimation | Shang Wu et.al. | 2402.16640 | null |
| 2024-02-26 | GEA: Reconstructing Expressive 3D Gaussian Avatar from Monocular Video | Xinqi Liu et.al. | 2402.16607 | null |
| 2024-02-26 | DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer | Yizhe Wu et.al. | 2402.16308 | null |
| 2024-02-25 | XAI-based gait analysis of patients walking with Knee-Ankle-Foot orthosis using video cameras | Arnav Mishra et.al. | 2402.16175 | null |
(<a href=../README.md>back to main</a>)