Optical Flow
Optical Flow
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2025-12-18 | A2VISR: An Active and Adaptive Ground-Aerial Localization System Using Visual Inertial and Single-Range Fusion | Sijia Chen et.al. | 2512.16367 | null |
| 2025-12-17 | GenAI-enabled Residual Motion Estimation for Energy-Efficient Semantic Video Communication | Shavbo Salehi et.al. | 2512.15481 | null |
| 2025-12-16 | Investigating the Efficacy of Topologically Derived Time Series for Flare Forecasting. II. XGBoost Model | Thomas Williams et.al. | 2512.14840 | null |
| 2025-12-16 | The Alignment of High-resolution Solar Prominence Images Observed by the New Vacuum Solar Telescope | Yunfang Cai et.al. | 2512.14201 | null |
| 2025-12-15 | Charge: A Comprehensive Novel View Synthesis Benchmark and Dataset to Bind Them All | Michal Nazarczuk et.al. | 2512.13639 | null |
| 2025-12-15 | Motus: A Unified Latent Action World Model | Hongzhe Bi et.al. | 2512.13030 | null |
| 2025-12-13 | A multi-viewpoint comparison of the velocity field of coronal propagating disturbances | Nina Stankovic et.al. | 2512.12333 | null |
| 2025-12-13 | SMRABooth: Subject and Motion Representation Alignment for Customized Video Generation | Xuancheng Xu et.al. | 2512.12193 | null |
| 2025-12-12 | Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation | Yang Fei et.al. | 2512.11792 | null |
| 2025-12-12 | Surveillance Video-Based Traffic Accident Detection Using Transformer Architecture | Tanu Singh et.al. | 2512.11350 | null |
| 2025-12-12 | Physics-Informed Video Flare Synthesis and Removal Leveraging Motion Independence between Flare and Scene | Junqiao Wang et.al. | 2512.11327 | null |
| 2025-12-11 | Any4D: Unified Feed-Forward Metric 4D Reconstruction | Jay Karhade et.al. | 2512.10935 | null |
| 2025-12-11 | Error-Propagation-Free Learned Video Compression With Dual-Domain Progressive Temporal Alignment | Han Li et.al. | 2512.10450 | null |
| 2025-12-11 | RaLiFlow: Scene Flow Estimation with 4D Radar and LiDAR Point Clouds | Jingyun Fu et.al. | 2512.10376 | null |
| 2025-12-10 | VHOI: Controllable Video Generation of Human-Object Interactions from Sparse Trajectories via Motion Densification | Wanyue Zhang et.al. | 2512.09646 | null |
| 2025-12-10 | Label-free Motion-Conditioned Diffusion Model for Cardiac Ultrasound Synthesis | Zhe Li et.al. | 2512.09418 | null |
| 2025-12-09 | GeoDiffMM: Geometry-Guided Conditional Diffusion for Motion Magnification | Xuedeng Liu et.al. | 2512.08325 | null |
| 2025-12-08 | UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation | Jiehui Huang et.al. | 2512.07831 | null |
| 2025-12-08 | UltrasODM: A Dual Stream Optical Flow Mamba Network for 3D Freehand Ultrasound Reconstruction | Mayank Anand et.al. | 2512.07756 | null |
| 2025-12-07 | Overcoming Small Data Limitations in Video-Based Infant Respiration Estimation | Liyang Song et.al. | 2512.06888 | null |
| 2025-12-07 | Boosting Unsupervised Video Instance Segmentation with Automatic Quality-Guided Self-Training | Kaixuan Lu et.al. | 2512.06864 | null |
| 2025-12-04 | Towards Adaptive Fusion of Multimodal Deep Networks for Human Action Recognition | Novanto Yudistira et.al. | 2512.04943 | null |
| 2025-12-04 | SDG-Track: A Heterogeneous Observer-Follower Framework for High-Resolution UAV Tracking on Embedded Platforms | Jiawen Wen et.al. | 2512.04883 | null |
| 2025-12-04 | Vertical Planetary Landing on Sloped Terrain Using Optical Flow Divergence Estimates | Hann Woei Ho et.al. | 2512.04373 | null |
| 2025-12-04 | MAFNet:Multi-frequency Adaptive Fusion Network for Real-time Stereo Matching | Ao Xu et.al. | 2512.04358 | null |
| 2025-12-03 | Beyond Boundary Frames: Audio-Visual Semantic Guidance for Context-Aware Video Interpolation | Yuchen Deng et.al. | 2512.03590 | null |
| 2025-12-03 | Generalization Evaluation of Deep Stereo Matching Methods for UAV-Based Forestry Applications | Yida Lin et.al. | 2512.03427 | null |
| 2025-12-02 | LoVoRA: Text-guided and Mask-free Video Object Removal and Addition with Learnable Object-aware Localization | Zhihan Xiao et.al. | 2512.02933 | null |
| 2025-11-30 | PanFlow: Decoupled Motion Control for Panoramic Video Generation | Cheng Zhang et.al. | 2512.00832 | null |
| 2025-11-30 | CircleFlow: Flow-Guided Camera Blur Estimation using a Circle Grid Target | Jiajian He et.al. | 2512.00796 | null |
| 2025-11-30 | Dynamic-eDiTor: Training-Free Text-Driven 4D Scene Editing with Multimodal Diffusion Transformer | Dong In Lee et.al. | 2512.00677 | null |
| 2025-11-29 | What about gravity in video generation? Post-Training Newton’s Laws with Verifiable Rewards | Minh-Quan Le et.al. | 2512.00425 | null |
| 2025-11-29 | Odometry Without Correspondence from Inertially Constrained Ruled Surfaces | Chenqi Zhu et.al. | 2512.00327 | null |
| 2025-11-25 | Conceptual Evaluation of Deep Visual Stereo Odometry for the MARWIN Radiation Monitoring Robot in Accelerator Tunnels | André Dehne et.al. | 2512.00080 | null |
| 2025-11-27 | Gaussians on Fire: High-Frequency Reconstruction of Flames | Jakob Nazarenus et.al. | 2511.22459 | null |
| 2025-11-27 | Prompt-based Consistent Video Colorization | Silvia Dani et.al. | 2511.22330 | null |
| 2025-11-27 | IMTalker: Efficient Audio-driven Talking Face Generation with Implicit Motion Transfer | Bo Chen et.al. | 2511.22167 | null |
| 2025-11-26 | MoGAN: Improving Motion Quality in Video Diffusion via Few-Step Motion Adversarial Post-Training | Haotian Xue et.al. | 2511.21592 | null |
| 2025-11-25 | ACIT: Attention-Guided Cross-Modal Interaction Transformer for Pedestrian Crossing Intention Prediction | Yuanzhe Li et.al. | 2511.20020 | null |
| 2025-11-23 | UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization | Siyi Li et.al. | 2511.18254 | null |
| 2025-11-21 | Vision-Guided Optic Flow Navigation for Small Lunar Missions | Sean Cowan et.al. | 2511.17720 | null |
| 2025-11-21 | Flow-Guided Implicit Neural Representation for Motion-Aware Dynamic MRI Reconstruction | Baoqing Li et.al. | 2511.16948 | null |
| 2025-11-20 | EOGS++: Earth Observation Gaussian Splatting with Internal Camera Refinement and Direct Panchromatic Rendering | Pierrick Bournez et.al. | 2511.16542 | null |
| 2025-11-20 | Investigating Optical Flow Computation: From Local Methods to a Multiresolution Horn-Schunck Implementation with Bilinear Interpolation | Haytham Ziani et.al. | 2511.16535 | null |
| 2025-11-20 | LAOF: Robust Latent Action Learning with Optical Flow Constraints | Xizhou Bu et.al. | 2511.16407 | null |
| 2025-11-18 | FlowRoI A Fast Optical Flow Driven Region of Interest Extraction Framework for High-Throughput Image Compression in Immune Cell Migration Analysis | Xiaowei Xu et.al. | 2511.14419 | null |
| 2025-11-17 | Inertia-Informed Orientation Priors for Event-Based Optical Flow Estimation | Pritam P. Karmokar et.al. | 2511.12961 | null |
| 2025-11-16 | DensePercept-NCSSD: Vision Mamba towards Real-time Dense Visual Perception with Non-Causal State Space Duality | Tushar Anand et.al. | 2511.12671 | null |
| 2025-11-15 | RadarMP: Motion Perception for 4D mmWave Radar in Autonomous Driving | Ruiqi Cheng et.al. | 2511.12117 | null |
| 2025-11-14 | DEFT-LLM: Disentangled Expert Feature Tuning for Micro-Expression Recognition | Ren Zhang et.al. | 2511.10948 | null |
| 2025-11-12 | Density Estimation and Crowd Counting | Balachandra Devarangadi Sunil et.al. | 2511.09723 | null |
| 2025-11-12 | SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields | Sangheon Yang et.al. | 2511.09072 | null |
| 2025-11-12 | Neural B-frame Video Compression with Bi-directional Reference Harmonization | Yuxi Liu et.al. | 2511.08938 | null |
| 2025-11-11 | Visual Bridge: Universal Visual Perception Representations Generating | Yilin Gao et.al. | 2511.07877 | null |
| 2025-11-11 | ViPRA: Video Prediction for Robot Actions | Sandeep Routray et.al. | 2511.07732 | null |
| 2025-11-10 | FlowFeat: Pixel-Dense Embedding of Motion Profiles | Nikita Araslanov et.al. | 2511.07696 | null |
| 2025-11-10 | ConsistTalk: Intensity Controllable Temporally Consistent Talking Head Generation with Diffusion Noise Search | Zhenjie Liu et.al. | 2511.06833 | null |
| 2025-11-09 | VDNeRF: Vision-only Dynamic Neural Radiance Field for Urban Scenes | Zhengyu Zou et.al. | 2511.06408 | null |
| 2025-11-09 | Physics-Informed Deformable Gaussian Splatting: Towards Unified Constitutive Laws for Time-Evolving Material Field | Haoqin Hong et.al. | 2511.06299 | null |
| 2025-11-08 | MiVID: Multi-Strategic Self-Supervision for Video Frame Interpolation using Diffusion Model | Priyansh Srivastava et.al. | 2511.06019 | null |
| 2025-11-07 | Precipitation nowcasting of satellite data using physically-aligned neural networks | Antônio Catão et.al. | 2511.05471 | null |
| 2025-11-06 | Hadronic Processes in Advection-Dominated Accretion Flow as the Origin of TeV Excesses in BL Lac Objects | Ji-Shun Lian et.al. | 2511.04202 | null |
| 2025-11-06 | Murray’s Law as an Entropy-per-Information-Cost Extremum | Justin Bennett et.al. | 2511.04022 | null |
| 2025-11-05 | DentalSplat: Dental Occlusion Novel View Synthesis from Sparse Intra-Oral Photographs | Yiyi Miao et.al. | 2511.03099 | null |
| 2025-11-04 | Estimation of Segmental Longitudinal Strain in Transesophageal Echocardiography by Deep Learning | Anders Austlid Taskén et.al. | 2511.02210 | null |
| 2025-11-03 | UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback | Ropeway Liu et.al. | 2511.01678 | link |
| 2025-11-03 | Discriminately Treating Motion Components Evolves Joint Depth and Ego-Motion Learning | Mengtan Zhang et.al. | 2511.01502 | null |
| 2025-11-03 | CaRLi-V: Camera-RADAR-LiDAR Point-Wise 3D Velocity Estimation | Landson Guo et.al. | 2511.01383 | null |
| 2025-11-02 | Cosmic Ray Acceleration by Turbulence-Driven Magnetic Reconnection and the Origin of the Neutrinos in NGC 1068 | Luana Passos-Reis et.al. | 2511.01112 | null |
| 2025-11-01 | GDROS: A Geometry-Guided Dense Registration Framework for Optical-SAR Images under Large Geometric Transformations | Zixuan Sun et.al. | 2511.00598 | null |
| 2025-10-31 | Optical Micromanipulations based on Model Predictive Control of Thermoviscous Flows | Elena Erben et.al. | 2510.27609 | null |
| 2025-10-31 | Towards a Multi-Embodied Grasping Agent | Roman Freiberg et.al. | 2510.27420 | null |
| 2025-10-31 | Generative Semantic Coding for Ultra-Low Bitrate Visual Communication and Analysis | Weiming Chen et.al. | 2510.27324 | null |
| 2025-10-30 | The Quest for Generalizable Motion Generation: Data, Model, and Evaluation | Jing Lin et.al. | 2510.26794 | link |
| 2025-10-30 | Towards Reliable Sea Ice Drift Estimation in the Arctic Deep Learning Optical Flow on RADARSAT-2 | Daniela Martin et.al. | 2510.26653 | null |
| 2025-10-30 | LoCoT2V-Bench: A Benchmark for Long-Form and Complex Text-to-Video Generation | Xiangqing Zheng et.al. | 2510.26412 | null |
| 2025-10-30 | Microwave Cytometry with Machine Learning for Shape-Resolved Microplastic Detection | Sayedus Salehin et.al. | 2510.26377 | null |
| 2025-10-30 | MoTDiff: High-resolution Motion Trajectory estimation from a single blurred image using Diffusion models | Wontae Choi et.al. | 2510.26173 | null |
| 2025-10-30 | JOGS: Joint Optimization of Pose Estimation and 3D Gaussian Splatting | Yuxuan Li et.al. | 2510.26117 | null |
| 2025-10-29 | Photoacoustics on the go: An Embedded Photoacoustic Sensing Platform | Talia Xu et.al. | 2510.25256 | null |
| 2025-10-14 | DrivingScene: A Multi-Task Online Feed-Forward 3D Gaussian Splatting Method for Dynamic Driving Scenes | Qirui Hou et.al. | 2510.24734 | null |
| 2025-10-28 | LEVITAS: Levitodynamics for Accurate Individual Particle Sensing in Space | Rafal Gajewski et.al. | 2510.24524 | null |
| 2025-10-28 | Benchmarking Microsaccade Recognition with Event Cameras: A Novel Dataset and Evaluation | Waseem Shariff et.al. | 2510.24231 | null |
| 2025-10-28 | Radiatively-Cooled Mass Transfer: Disk Properties and L2 outflows across Mass Transfer Rates | Peter Scherbak et.al. | 2510.24127 | null |
| 2025-10-27 | Yesnt: Are Diffusion Relighting Models Ready for Capture Stage Compositing? A Hybrid Alternative to Bridge the Gap | Elisabeth Jüttner et.al. | 2510.23494 | null |
| 2025-10-27 | FlowCapX: Physics-Grounded Flow Capture with Long-Term Consistency | Ningxiao Tao et.al. | 2510.23122 | null |
| 2025-10-27 | EndoWave: Rational-Wavelet 4D Gaussian Splatting for Endoscopic Reconstruction | Taoyu Wu et.al. | 2510.23087 | null |
| 2025-10-25 | Glymphatic Clearance in the Optic Nerve: A Multidomain Electro-osmostic Model | Shanfeng Xiao et.al. | 2510.22271 | null |
| 2025-10-25 | ACG: Action Coherence Guidance for Flow-based VLA models | Minho Park et.al. | 2510.22201 | link |
| 2025-10-25 | STG-Avatar: Animatable Human Avatars via Spacetime Gaussian | Guangan Jiang et.al. | 2510.22140 | link |
| 2025-10-25 | CogStereo: Neural Stereo Matching with Implicit Spatial Cognition Embedding | Lihuang Fang et.al. | 2510.22119 | null |
| 2025-10-24 | Epipolar Geometry Improves Video Generation Models | Orest Kupyn et.al. | 2510.21615 | null |
| 2025-10-24 | Shadow and Polarization Images of Rotating Black Holes in Kalb-Ramond Gravity Illuminated by Several Thick Accretion Disks | Chen-Yu Yang et.al. | 2510.21229 | null |
| 2025-10-23 | CUPID: Generative 3D Reconstruction via Joint Object and Pose Modeling | Binbin Huang et.al. | 2510.20776 | null |
| 2025-10-23 | RAPO++: Cross-Stage Prompt Optimization for Text-to-Video Generation via Data Alignment and Test-Time Scaling | Bingjie Gao et.al. | 2510.20206 | link |
| 2025-10-23 | Inverse Image-Based Rendering for Light Field Generation from Single Images | Hyunjun Jung et.al. | 2510.20132 | null |
| 2025-10-22 | Magnetic flux cancellation in the solar atmosphere through 3D realistic numerical modeling | F. Moreno-Insertis et.al. | 2510.19993 | null |
| 2025-10-21 | MoAlign: Motion-Centric Representation Alignment for Video Diffusion Models | Aritra Bhowmik et.al. | 2510.19022 | null |
| 2025-10-21 | Characterizing primary atomization of cryogenic LOX/Nitrogen and LOX/Helium sprays by visualizations coupled to Phase Doppler Interferometry | Nicolas Fdida et.al. | 2510.18543 | null |
| 2025-10-21 | VelocityNet: Real-Time Crowd Anomaly Detection via Person-Specific Velocity Analysis | Fatima AlGhamdi et.al. | 2510.18187 | null |
| 2025-10-20 | Clumpy Outflows from Super-Eddington Accreting Black Holes I: Radiation Hydrodynamics Simulations and Observational Implications | Haojie Hu et.al. | 2510.17696 | null |
| 2025-10-20 | AV1 Motion Vector Fidelity and Application for Efficient Optical Flow | Julien Zouein et.al. | 2510.17427 | null |
| 2025-10-20 | Real critical exponents from the $\varepsilon$-expansion in an interacting $U(1)$ model with non-Hermitian $Z_4$ anisotropy | Eduard Naichuk et.al. | 2510.17224 | null |
| 2025-10-18 | CAZ catalog and optical light curves of 7918 blazar-selected AGN | Pouya M. Kouch et.al. | 2510.16584 | null |
| 2025-10-18 | Multiwavelength spectroscopic observations of a quiescent prominence | Jianchao Xue et.al. | 2510.16288 | null |
| 2025-10-17 | Tracking optical variability and outflows across the accretion states of the black hole transient MAXI J1820+070 | M. C. Baglio et.al. | 2510.16124 | null |
| 2025-10-17 | DGME-T: Directional Grid Motion Encoding for Transformer-Based Historical Camera Movement Classification | Tingyu Lin et.al. | 2510.15725 | null |
| 2025-10-17 | Experimental and simulation study of resin infiltration in carbon fiber rovings | Dominik Burr et.al. | 2510.15648 | null |
| 2025-10-17 | Iterative Motion Compensation for Canonical 3D Reconstruction from UAV Plant Images Captured in Windy Conditions | Andre Rochow et.al. | 2510.15491 | null |
| 2025-10-17 | A Novel Combined Optical Flow Approach for Comprehensive Micro-Expression Recognition | Vu Tram Anh Khuong et.al. | 2510.15471 | null |
| 2025-10-17 | MAVR-Net: Robust Multi-View Learning for MAV Action Recognition with Cross-View Attention | Nengbo Zhang et.al. | 2510.15448 | null |
| 2025-10-16 | A low-cost, open-source maskless photolithography stepper for microfabrication | B. Joel Gonzalez et.al. | 2510.15082 | null |
| 2025-10-16 | Terra: Explorable Native 3D World Model with Point Latents | Yuanhui Huang et.al. | 2510.14977 | null |
| 2025-10-16 | C4D: 4D Made from 3D through Dual Correspondences | Shizun Wang et.al. | 2510.14960 | link |
| 2025-10-15 | An Explicit M1 Radiation-hydrodynamics Scheme for Three-dimensional Protostellar Evolution | Kazutaka Kimura et.al. | 2510.13949 | null |
| 2025-10-15 | Removing Cost Volumes from Optical Flow Estimators | Simon Kiefhaber et.al. | 2510.13317 | link |
| 2025-10-15 | Scalable Generalized Meta-Spanners Enabling Parallel Multitasking Optical Manipulation | Tianyue Li et.al. | 2510.13146 | null |
| 2025-10-15 | Macroscopic Self-Trapping and Dynamical Phase Transition in Momentum Space Bose-Einstein Condensates | Colby Schimelfenig et.al. | 2510.13056 | null |
| 2025-10-14 | What If : Understanding Motion Through Sparse Interactions | Stefan Andreas Baumann et.al. | 2510.12777 | link |
| 2025-10-14 | E-MoFlow: Learning Egomotion and Optical Flow from Event Data via Implicit Regularization | Wenpu Li et.al. | 2510.12753 | null |
| 2025-10-14 | SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding | Zhiliu Yang et.al. | 2510.12749 | null |
| 2025-10-14 | JWST and Keck Observations of the Off-Nuclear TDE AT 2024tvd: A Massive Nuclear Star Cluster and Minor-Merger Origin for its Black Hole | Kishore C. Patra et.al. | 2510.12572 | null |
| 2025-10-14 | CurriFlow: Curriculum-Guided Depth Fusion with Optical Flow-Based Temporal Alignment for 3D Semantic Scene Completion | Jinzhou Lin et.al. | 2510.12362 | null |
| 2025-10-13 | Canalized hyperbolic magnetoexciton polaritons by Shubnikov-de Haas effect in van der Waals semiconductors | Guangyi Jia et.al. | 2510.11163 | null |
| 2025-10-13 | Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling | Tianyi Tan et.al. | 2510.11083 | null |
| 2025-10-13 | IUT-Plug: A Plug-in tool for Interleaved Image-Text Generation | Zeteng Lin et.al. | 2510.10969 | null |
| 2025-10-12 | Black Holes in the Shadow: The Missing High-Ionization Lines in the Earliest JWST AGNs | Greta Zucchi et.al. | 2510.10772 | null |
| 2025-10-12 | Injecting Frame-Event Complementary Fusion into Diffusion for Optical Flow in Challenging Scenes | Haonan Wang et.al. | 2510.10577 | null |
| 2025-10-11 | Ortho-Fuse: Orthomosaic Generation for Sparse High-Resolution Crop Health Maps Through Intermediate Optical Flow Estimation | Rugved Katole et.al. | 2510.10360 | null |
| 2025-10-11 | Gesplat: Robust Pose-Free 3D Reconstruction via Geometry-Guided Gaussian Splatting | Jiahui Lu et.al. | 2510.10097 | null |
| 2025-10-10 | Dynamic Weight-based Temporal Aggregation for Low-light Video Enhancement | Ruirui Lin et.al. | 2510.09450 | null |
| 2025-10-10 | Mono4DEditor: Text-Driven 4D Scene Editing from Monocular Video via Point-Level Localization of Language-Embedded Gaussians | Jin-Chuan Shi et.al. | 2510.09438 | null |
| 2025-10-10 | Stable Video Infinity: Infinite-Length Video Generation with Error Recycling | Wuyang Li et.al. | 2510.09212 | null |
| 2025-10-09 | Re-Identifying Kākā with AI-Automated Video Key Frame Extraction | Paula Maddigan et.al. | 2510.08775 | null |
| 2025-10-09 | When Light Bends to the Collective Will: A Theory and Vision for Adaptive Photonic Scale-up Domains | Vamsi Addanki et.al. | 2510.08072 | null |
| 2025-10-09 | FMANet: A Novel Dual-Phase Optical Flow Approach with Fusion Motion Attention Network for Robust Micro-expression Recognition | Luu Tu Nguyen et.al. | 2510.07810 | null |
| 2025-10-09 | Trajectory Conditioned Cross-embodiment Skill Transfer | YuHang Tang et.al. | 2510.07773 | null |
| 2025-10-09 | DEGS: Deformable Event-based 3D Gaussian Splatting from RGB and Event Stream | Junhao He et.al. | 2510.07752 | null |
| 2025-10-07 | Mitigating Surgical Data Imbalance with Dual-Prediction Video Diffusion Model | Danush Kumar Venkatesh et.al. | 2510.07345 | null |
| 2025-10-08 | Content-Adaptive Inference for State-of-the-art Learned Video Compression | Ahmet Bilican et.al. | 2510.07283 | null |
| 2025-10-07 | Flow4Agent: Long-form Video Understanding via Motion Prior from Optical Flow | Ruyang Liu et.al. | 2510.05836 | null |
| 2025-10-07 | Midway Network: Learning Representations for Recognition and Motion from Latent Dynamics | Christopher Hoang et.al. | 2510.05558 | null |
| 2025-10-06 | AvatarVTON: 4D Virtual Try-On for Animatable Avatars | Zicheng Jiang et.al. | 2510.04822 | null |
| 2025-10-05 | Learning Efficient Meshflow and Optical Flow from Event Cameras | Xinglong Luo et.al. | 2510.04111 | null |
| 2025-10-03 | Test-Time Defense Against Adversarial Attacks via Stochastic Resonance of Latent Ensembles | Dong Lao et.al. | 2510.03224 | null |
| 2025-09-26 | Temporal-Aware Iterative Speech Model for Dementia Detection | Chukwuemeka Ugwu et.al. | 2510.00030 | null |
| 2025-09-30 | Uncovering Zero-Shot Generalization Gaps in Time-Series Foundation Models Using Real-World Videos | Lujun Li et.al. | 2509.26347 | null |
| 2025-09-30 | Rare-event detection in a backward-facing-step flow using live optical-flow velocimetry: observation of an upstream jet burst | Juan Pimienta et.al. | 2509.25983 | null |
| 2025-09-30 | High Resolution and High-Speed Live Optical Flow Velocimetry | Juan Pimienta et.al. | 2509.25924 | null |
| 2025-09-29 | Fast Feature Field ( $\text{F}^3$ ): A Predictive Representation of Events | Richeek Das et.al. | 2509.25146 | null |
| 2025-09-29 | Forge4D: Feed-Forward 4D Human Reconstruction and Interpolation from Uncalibrated Sparse-view Videos | Yingdong Hu et.al. | 2509.24209 | null |
| 2025-09-26 | DeLiVR: Differential Spatiotemporal Lie Bias for Efficient Video Deraining | Shuning Sun et.al. | 2509.21719 | null |
| 2025-09-25 | MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation | Xinyu Liu et.al. | 2509.21265 | link |
| 2025-09-23 | Surgical Video Understanding with Label Interpolation | Garam Kim et.al. | 2509.18802 | null |
| 2025-09-22 | I2VWM: Robust Watermarking for Image to Video Generation | Guanjie Wang et.al. | 2509.17773 | null |
| 2025-09-19 | Global Regulation and Excitation via Attention Tuning for Stereo Matching | Jiahao Li et.al. | 2509.15891 | null |
| 2025-09-19 | Interpretable Modeling of Articulatory Temporal Dynamics from real-time MRI for Phoneme Recognition | Jay Park et.al. | 2509.15689 | null |
| 2025-09-18 | WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance | Chenxi Song et.al. | 2509.15130 | link |
| 2025-09-18 | BEV-ODOM2: Enhanced BEV-based Monocular Visual Odometry with PV-BEV Fusion and Dense Flow Supervision for Ground Robots | Yufei Wei et.al. | 2509.14636 | null |
| 2025-09-17 | UM-Depth : Uncertainty Masked Self-Supervised Monocular Depth Estimation with Visual Odometry | Tae-Wook Um et.al. | 2509.13713 | null |
| 2025-09-16 | ROOM: A Physics-Based Continuum Robot Simulator for Photorealistic Medical Datasets Generation | Salvatore Esposito et.al. | 2509.13177 | link |
| 2025-09-16 | 4DRadar-GS: Self-Supervised Dynamic Driving Scene Reconstruction with 4D Radar | Xiao Tang et.al. | 2509.12931 | null |
| 2025-09-15 | The Filter Echo: A General Tool for Filter Visualisation | Daniel Gaa et.al. | 2509.11932 | null |
| 2025-09-10 | World Modeling with Probabilistic Structure Integration | Klemen Kotar et.al. | 2509.09737 | null |
| 2025-09-10 | Computational Imaging for Enhanced Computer Vision | Humera Shaikh et.al. | 2509.08712 | null |
| 2025-09-10 | FractalPINN-Flow: A Fractal-Inspired Network for Unsupervised Optical Flow Estimation with Total Variation Regularization | Sara Behnamian et.al. | 2509.08670 | null |
| 2025-09-10 | Deep Visual Odometry for Stereo Event Cameras | Sheng Zhong et.al. | 2509.08235 | null |
| 2025-09-08 | MIORe & VAR-MIORe: Benchmarks to Push the Boundaries of Restoration | George Ciubotariu et.al. | 2509.06803 | null |
| 2025-09-07 | Motion Aware ViT-based Framework for Monocular 6-DoF Spacecraft Pose Estimation | Jose Sosa et.al. | 2509.06000 | null |
| 2025-09-05 | FlowSeek: Optical Flow Made Easier with Depth Foundation Models and Motion Bases | Matteo Poggi et.al. | 2509.05297 | null |
| 2025-09-05 | A biologically inspired separable learning vision model for real-time traffic object perception in Dark | Hulin Li et.al. | 2509.05012 | null |
| 2025-09-02 | Motion-Refined DINOSAUR for Unsupervised Multi-Object Discovery | Xinrui Gong et.al. | 2509.02545 | null |
| 2025-08-30 | Encoder-Only Image Registration | Xiang Chen et.al. | 2509.00451 | null |
| 2025-08-25 | MESTI-MEGANet: Micro-expression Spatio-Temporal Image and Micro-expression Gradient Attention Networks for Micro-expression Recognition | Luu Tu Nguyen et.al. | 2509.00056 | null |
| 2025-08-28 | Observer Design for Optical Flow-Based Visual-Inertial Odometry with Almost-Global Convergence | Tarek Bouazza et.al. | 2508.21163 | null |
| 2025-08-27 | AutoQ-VIS: Improving Unsupervised Video Instance Segmentation via Automatic Quality Assessment | Kaixuan Lu et.al. | 2508.19808 | null |
| 2025-08-27 | Context-aware Sparse Spatiotemporal Learning for Event-based Vision | Shenqi Wang et.al. | 2508.19806 | null |
| 2025-08-25 | DoGFlow: Self-Supervised LiDAR Scene Flow via Cross-Modal Doppler Guidance | Ajinkya Khoche et.al. | 2508.18506 | null |
| 2025-08-25 | FlowVLA: Visual Chain of Thought-based Motion Reasoning for Vision-Language-Action Models | Zhide Zhong et.al. | 2508.18269 | null |
| 2025-08-23 | DeltaFlow: An Efficient Multi-frame Scene Flow Estimation Method | Qingwen Zhang et.al. | 2508.17054 | null |
| 2025-08-19 | MF-LPR $^2$ : Multi-Frame License Plate Image Restoration and Recognition using Optical Flow | Kihyun Na et.al. | 2508.14797 | null |
| 2025-08-20 | 6-DoF Object Tracking with Event-based Optical Flow and Frames | Zhichao Li et.al. | 2508.14776 | null |
| 2025-08-20 | Multiscale Video Transformers for Class Agnostic Segmentation in Autonomous Driving | Leila Cheshmi et.al. | 2508.14729 | null |
| 2025-08-20 | Reliable Smoke Detection via Optical Flow-Guided Feature Fusion and Transformer-Based Uncertainty Modeling | Nitish Kumar Mahala et.al. | 2508.14597 | null |
| 2025-08-18 | Deformation of the panoramic sphere into an ellipsoid to induce self-motion in telepresence users | Eetu Laukka et.al. | 2508.12925 | null |
| 2025-08-18 | Discrete Approximate Circle Bundles | Brad Turow et.al. | 2508.12914 | null |
| 2025-08-14 | Cooperative Face Liveness Detection from Optical Flow | Artem Sokolov et.al. | 2508.10786 | null |
| 2025-08-14 | Beyond conventional vision: RGB-event fusion for robust object detection in dynamic traffic scenarios | Zhanwen Liu et.al. | 2508.10704 | null |
| 2025-08-13 | HKT: A Biologically Inspired Framework for Modular Hereditary Knowledge Transfer in Neural Networks | Yanick Chistian Tchenko et.al. | 2508.09743 | null |
| 2025-08-11 | DiTVR: Zero-Shot Diffusion Transformer for Video Restoration | Sicheng Gao et.al. | 2508.07811 | null |
| 2025-08-08 | Fast Motion Estimation and Context-Aware Refinement for Efficient Bayer-Domain Video Vision | Haichao Wang et.al. | 2508.05990 | null |
| 2025-08-07 | Revealing Latent Information: A Physics-inspired Self-supervised Pre-training Framework for Noisy and Sparse Events | Lin Zhu et.al. | 2508.05507 | null |
| 2025-08-07 | TRKT: Weakly Supervised Dynamic Scene Graph Generation with Temporal-enhanced Relation-aware Knowledge Transferring | Zhu Xu et.al. | 2508.04943 | null |
| 2025-08-06 | Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline | Linqing Zhao et.al. | 2508.04597 | null |
| 2025-08-06 | Improving Tactile Gesture Recognition with Optical Flow | Shaohong Zhong et.al. | 2508.04338 | null |
| 2025-08-05 | Video Demoireing using Focused-Defocused Dual-Camera System | Xuan Dong et.al. | 2508.03449 | null |
| 2025-08-05 | ActionSink: Toward Precise Robot Manipulation with Dynamic Integration of Action Flow | Shanshan Guo et.al. | 2508.03218 | null |
| 2025-08-01 | PMR: Physical Model-Driven Multi-Stage Restoration of Turbulent Dynamic Videos | Tao Wu et.al. | 2508.00406 | null |
| 2025-08-01 | Occlusion-robust Stylization for Drawing-based 3D Animation | Sunjae Yoon et.al. | 2508.00398 | null |
| 2025-08-01 | Video Forgery Detection with Optical Flow Residuals and Spatial-Temporal Consistency | Xi Xue et.al. | 2508.00397 | null |
| 2025-08-01 | Omni-Scan: Creating Visually-Accurate Digital Twin Object Models Using a Bimanual Robot with Handover and Gaussian Splat Merging | Tianshuang Qiu et.al. | 2508.00354 | null |
| 2025-07-31 | World Consistency Score: A Unified Metric for Video Generation Quality | Akshat Rakheja et.al. | 2508.00144 | null |
| 2025-07-31 | Enhanced Velocity Field Modeling for Gaussian Video Reconstruction | Zhenyang Li et.al. | 2507.23704 | null |
| 2025-07-30 | Learning to Prune Branches in Modern Tree-Fruit Orchards | Abhinav Jain et.al. | 2507.23015 | null |
| 2025-07-30 | Estimating 2D Camera Motion with Hybrid Motion Basis | Haipeng Li et.al. | 2507.22480 | null |
| 2025-07-29 | Unleashing the Power of Motion and Depth: A Selective Fusion Strategy for RGB-D Video Salient Object Detection | Jiahao He et.al. | 2507.21857 | null |
| 2025-07-25 | Event-Based De-Snowing for Autonomous Driving | Manasi Muglikar et.al. | 2507.20901 | null |
| 2025-07-28 | Hanging Around: Cognitive Inspired Reasoning for Reactive Robotics | Mihai Pomarlan et.al. | 2507.20832 | null |
| 2025-07-26 | DepthFlow: Exploiting Depth-Flow Structural Correlations for Unsupervised Video Object Segmentation | Suhwan Cho et.al. | 2507.19790 | null |
| 2025-07-26 | TransFlow: Motion Knowledge Transfer from Video Diffusion Models to Video Salient Object Detection | Suhwan Cho et.al. | 2507.19789 | null |
| 2025-07-25 | Video Self-Distillation for Single-Image Encoders: A Step Toward Physically Plausible Perception | Marcel Simon et.al. | 2507.19272 | null |
| 2025-07-20 | Systole-Conditioned Generative Cardiac Motion | Shahar Zuler et.al. | 2507.15894 | null |
| 2025-07-23 | EndoControlMag: Robust Endoscopic Vascular Motion Magnification with Periodic Reference Resetting and Hierarchical Tissue-aware Dual-Mask Contro | An Wang et.al. | 2507.15292 | null |
| 2025-07-19 | Motion Segmentation and Egomotion Estimation from Event-Based Normal Flow | Zhiyuan Hua et.al. | 2507.14500 | null |
| 2025-07-18 | DUSTrack: Semi-automated point tracking in ultrasound videos | Praneeth Namburi et.al. | 2507.14368 | null |
| 2025-07-18 | Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation | Masahiro Ogawa et.al. | 2507.13628 | null |
| 2025-07-17 | Latent Policy Steering with Embodiment-Agnostic Pretrained World Models | Yiqi Wang et.al. | 2507.13340 | null |
| 2025-07-17 | Channel-wise Motion Features for Efficient Motion Segmentation | Riku Inoue et.al. | 2507.13082 | null |
| 2025-07-16 | Understanding visual attention beehind bee-inspired UAV navigation | Pranav Rajbhandari et.al. | 2507.11992 | null |
| 2025-07-14 | Well-posedness of an optical flow based optimal control formulation for image registration | Johannes Haubner et.al. | 2507.10188 | null |
| 2025-07-14 | Taming Modern Point Tracking for Speckle Tracking Echocardiography via Impartial Motion | Md Abulkalam Azad et.al. | 2507.10127 | null |
| 2025-07-14 | MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second | Chenguo Lin et.al. | 2507.10065 | link |
| 2025-07-11 | Taming generative video models for zero-shot optical flow extraction | Seungwoo Kim et.al. | 2507.09082 | link |
| 2025-07-11 | An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan | Mengyuan Liu et.al. | 2507.08690 | null |
| 2025-07-11 | PanMatch: Unleashing the Potential of Large Vision Models for Unified Matching Models | Yongjian Zhang et.al. | 2507.08400 | null |
| 2025-07-11 | MM-Gesture: Towards Precise Micro-Gesture Recognition through Multimodal Fusion | Jihao Gu et.al. | 2507.08344 | null |
| 2025-07-10 | X-RAFT: Cross-Modal Non-Rigid Registration of Blue and White Light Neurosurgical Hyperspectral Images | Charlie Budd et.al. | 2507.07747 | null |
| 2025-07-09 | mmFlux: Crowd Flow Analytics with Commodity mmWave MIMO Radar | Anurag Pallaprolu et.al. | 2507.07331 | null |
| 2025-07-08 | Learning to Track Any Points from Human Motion | Inès Hyeonsu Kim et.al. | 2507.06233 | null |
| 2025-07-07 | MoDiT: Learning Highly Consistent 3D Motion Coefficients with Diffusion Transformer for Talking Head Generation | Yucheng Wang et.al. | 2507.05092 | null |
| 2025-07-07 | TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation | Zonglin Lyu et.al. | 2507.04984 | link |
| 2025-07-10 | MCFormer: A Multi-Cost-Volume Network and Comprehensive Benchmark for Particle Image Velocimetry | Zicheng Lin et.al. | 2507.04750 | null |
| 2025-07-06 | FB-Diff: Fourier Basis-guided Diffusion for Temporal Interpolation of 4D Medical Imaging | Xin You et.al. | 2507.04547 | null |
| 2025-07-05 | VISC: mmWave Radar Scene Flow Estimation using Pervasive Visual-Inertial Supervision | Kezhong Liu et.al. | 2507.03938 | null |
| 2025-07-03 | Flow-CDNet: A Novel Network for Detecting Both Slow and Fast Changes in Bitemporal Images | Haoxuan Li et.al. | 2507.02307 | null |
| 2025-07-01 | TRACE: Temporally Reliable Anatomically-Conditioned 3D CT Generation with Enhanced Efficiency | Minye Shao et.al. | 2507.00802 | link |
| 2025-07-01 | DIJE: Dense Image Jacobian Estimation for Robust Robotic Self-Recognition and Visual Servoing | Yasunori Toshimitsu et.al. | 2507.00446 | null |
| 2025-06-30 | C3VDv2 – Colonoscopy 3D video dataset with enhanced realism | Mayank V. Golhar et.al. | 2506.24074 | null |
| 2025-07-03 | PriOr-Flow: Enhancing Primitive Panoramic Optical Flow with Orthogonal View | Longliang Liu et.al. | 2506.23897 | null |
| 2025-06-30 | Proteus-ID: ID-Consistent and Motion-Coherent Video Customization | Guiyu Zhang et.al. | 2506.23729 | null |
| 2025-06-29 | MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation | Vladislav Bargatin et.al. | 2506.23151 | link |
| 2025-06-26 | WAFT: Warping-Alone Field Transforms for Optical Flow | Yihan Wang et.al. | 2506.21526 | null |
| 2025-06-26 | EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting | Taoyu Wu et.al. | 2506.21420 | null |
| 2025-06-25 | Feature Hallucination for Self-supervised Action Recognition | Lei Wang et.al. | 2506.20342 | null |
| 2025-06-24 | Online camera-pose-free stereo endoscopic tissue deformation recovery with tissue-invariant vision-biomechanics consistency | Jiahe Chen et.al. | 2506.19388 | null |
| 2025-06-23 | Flow-Aware Diffusion for Real-Time VR Restoration: Enhancing Spatiotemporal Coherence and Efficiency | Yitong Zhu et.al. | 2506.18786 | null |
| 2025-06-24 | Multimodal Fusion SLAM with Fourier Attention | Youjie Zhou et.al. | 2506.18204 | null |
| 2025-06-19 | EndoMUST: Monocular Depth Estimation for Robotic Endoscopy via End-to-end Multi-step Self-supervised Training | Liangjing Shao et.al. | 2506.16017 | link |
| 2025-06-17 | MOL: Joint Estimation of Micro-Expression, Optical Flow, and Landmark via Transformer-Graph-Style Convolution | Zhiwen Shao et.al. | 2506.14511 | link |
| 2025-06-17 | KDMOS:Knowledge Distillation for Motion Segmentation | Chunyu Cao et.al. | 2506.14130 | link |
| 2025-06-21 | Inference-Time Gaze Refinement for Micro-Expression Recognition: Enhancing Event-Based Eye Tracking with Motion-Aware Post-Processing | Nuwan Bandara et.al. | 2506.12524 | link |
| 2025-06-13 | MambaVSR: Content-Aware Scanning State Space Model for Video Super-Resolution | Linfeng He et.al. | 2506.11768 | null |
| 2025-06-12 | Post-Training Quantization for Video Matting | Tianrui Zhu et.al. | 2506.10840 | null |
| 2025-06-11 | DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos | Chieh Hubert Lin et.al. | 2506.09997 | null |
| 2025-06-10 | UFM: A Simple Path towards Unified Dense Correspondence with Flow | Yuchen Zhang et.al. | 2506.09278 | link |
| 2025-06-10 | Princeton365: A Diverse Dataset with Accurate Camera Pose | Karhan Kayan et.al. | 2506.09035 | null |
| 2025-06-09 | Spatio-Temporal State Space Model For Efficient Event-Based Optical Flow | Muhammad Ahmed Humais et.al. | 2506.07878 | link |
| 2025-06-09 | Flow-Anything: Learning Real-World Optical Flow Estimation from Large-Scale Single-view Images | Yingping Liang et.al. | 2506.07740 | null |
| 2025-06-13 | Consistent Video Editing as Flow-Driven Image-to-Video Generation | Ge Wang et.al. | 2506.07713 | null |
| 2025-06-08 | AllTracker: Efficient Dense Point Tracking at High Resolution | Adam W. Harley et.al. | 2506.07310 | null |
| 2025-06-08 | GoTrack: Generic 6DoF Object Pose Refinement and Tracking | Van Nguyen Nguyen et.al. | 2506.07155 | link |
| 2025-06-07 | EV-LayerSegNet: Self-supervised Motion Segmentation using Event Cameras | Youssef Farah et.al. | 2506.06596 | null |
| 2025-06-06 | 3DFlowAction: Learning Cross-Embodiment Manipulation from 3D Flow World Model | Hongyan Zhi et.al. | 2506.06199 | link |
| 2025-06-06 | Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments | Mingrui Li et.al. | 2506.05965 | null |
| 2025-06-05 | VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction | Ziyue Zhu et.al. | 2506.05563 | null |
| 2025-06-05 | DualX-VSR: Dual Axial Spatial $\times$ Temporal Transformer for Real-World Video Super-Resolution without Motion Compensation | Shuo Cao et.al. | 2506.04830 | null |
| 2025-06-04 | JointSplat: Probabilistic Joint Flow-Depth Optimization for Sparse-View Gaussian Splatting | Yang Xiao et.al. | 2506.03872 | null |
| 2025-06-04 | EDCFlow: Exploring Temporally Dense Difference Maps for Event-based Optical Flow Estimation | Daikun Liu et.al. | 2506.03512 | null |
| 2025-06-03 | Learning Optical Flow Field via Neural Ordinary Differential Equation | Leyla Mirvakhabova et.al. | 2506.03290 | null |
| 2025-06-03 | LinkTo-Anime: A 2D Animation Optical Flow Dataset from 3D Model Rendering | Xiaoyi Feng et.al. | 2506.02733 | null |
| 2025-06-03 | LumosFlow: Motion-Guided Long Video Generation | Jiahao Chen et.al. | 2506.02497 | null |
| 2025-06-02 | MS-RAFT-3D: A Multi-Scale Architecture for Recurrent Image-Based Scene Flow | Jakob Schmid et.al. | 2506.01443 | null |
| 2025-06-01 | MOOSE: Pay Attention to Temporal Dynamics for Video Understanding via Optical Flows | Hong Nguyen et.al. | 2506.01119 | null |
| 2025-05-31 | Flying Co-Stereo: Enabling Long-Range Aerial Dense Mapping via Collaborative Stereo Vision of Dynamic-Baseline | Zhaoying Wang et.al. | 2506.00546 | null |
| 2025-05-31 | Improving Optical Flow and Stereo Depth Estimation by Leveraging Uncertainty-Based Learning Difficulties | Jisoo Jeong et.al. | 2506.00324 | null |
| 2025-05-30 | Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction | Chenyou Fan et.al. | 2505.24156 | null |
| 2025-05-29 | Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing | Tongtong Su et.al. | 2505.23134 | link |
| 2025-05-27 | Object Concepts Emerge from Motion | Haoqian Liang et.al. | 2505.21635 | null |
| 2025-05-26 | A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking | Zixiang Zhao et.al. | 2505.19858 | null |
| 2025-05-23 | Brightness-Invariant Tracking Estimation in Tagged MRI | Zhangxing Bian et.al. | 2505.18365 | null |
| 2025-05-31 | CTRL-GS: Cascaded Temporal Residue Learning for 4D Gaussian Splatting | Karly Hou et.al. | 2505.18306 | null |
| 2025-05-23 | Real-time Traffic Accident Anticipation with Feature Reuse | Inpyo Song et.al. | 2505.17449 | null |
| 2025-05-22 | Efficient Correlation Volume Sampling for Ultra-High-Resolution Optical Flow Estimation | Karlis Martins Briedis et.al. | 2505.16942 | null |
| 2025-05-22 | V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation | Hanyue Lou et.al. | 2505.16797 | null |
| 2025-05-21 | SENSE – Sensor-Enhanced Neural Shear Stress Estimation for Quantitative Oilfilm Visualizations | Lennart Rohlfs et.al. | 2505.15697 | null |
| 2025-05-19 | RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers | Ahmet Berke Gokmen et.al. | 2505.13344 | link |
| 2025-05-19 | eStonefish-scenes: A synthetically generated dataset for underwater event-based optical flow prediction tasks | Jad Mansour et.al. | 2505.13309 | null |
| 2025-05-19 | FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching | Alp Eren Sari et.al. | 2505.13174 | null |
| 2025-05-19 | Just Dance with $π$ ! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection | Snehashis Majhi et.al. | 2505.13123 | null |
| 2025-05-17 | MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos | Hongyi Zhou et.al. | 2505.11868 | null |
| 2025-05-16 | Planar Velocity Estimation for Fast-Moving Mobile Robots Using Event-Based Optical Flow | Liam Boyle et.al. | 2505.11116 | null |
| 2025-05-15 | TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation | Manthan Patel et.al. | 2505.10696 | null |
| 2025-05-15 | A label-free sub-diffractive technique for 3D intracellular tomography using thermally induced convection currents | Jayesh Goswami et.al. | 2505.10112 | null |
| 2025-05-15 | FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation | Jun Guo et.al. | 2505.10075 | null |
| 2025-05-14 | FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static Decoupling | Yue Wen et.al. | 2505.09406 | null |
| 2025-05-14 | RobustSpring: Benchmarking Robustness to Image Corruptions for Optical Flow, Scene Flow and Stereo | Jenny Schmalfuss et.al. | 2505.09368 | null |
| 2025-05-13 | Reinforcement Learning meets Masked Video Modeling : Trajectory-Guided Adaptive Token Selection | Ayush K. Rai et.al. | 2505.08561 | null |
| 2025-05-13 | TT-DF: A Large-Scale Diffusion-Based Dataset and Benchmark for Human Body Forgery Detection | Wenkui Yang et.al. | 2505.08437 | link |
| 2025-05-13 | EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation | Hanle Zheng et.al. | 2505.08235 | null |
| 2025-05-13 | Monocular Depth Guided Occlusion-Aware Disparity Refinement via Semi-supervised Learning in Laparoscopic Images | Ziteng Liu et.al. | 2505.08178 | null |
| 2025-05-12 | Asynchronous Multi-Object Tracking with an Event Camera | Angus Apps et.al. | 2505.08126 | link |
| 2025-05-11 | MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception | Zhengye Zhang et.al. | 2505.07007 | link |
| 2025-05-13 | Detection of Moving Objects Using Self-motion Constraints on Optic Flow | Hope Lutwak et.al. | 2505.06686 | null |
| 2025-05-08 | Nonlinear Motion-Guided and Spatio-Temporal Aware Network for Unsupervised Event-Based Optical Flow | Zuntao Liu et.al. | 2505.05089 | null |
| 2025-05-08 | A Simple Detector with Frame Dynamics is a Strong Tracker | Chenxu Peng et.al. | 2505.04917 | link |
| 2025-05-06 | Read My Ears! Horse Ear Movement Detection for Equine Affective State Assessment | João Alves et.al. | 2505.03554 | link |
| 2025-05-06 | TimeTracker: Event-based Continuous Point Tracking for Video Frame Interpolation with Non-linear Motion | Haoyue Liu et.al. | 2505.03116 | null |
| 2025-05-04 | Unaligned RGB Guided Hyperspectral Image Super-Resolution with Spatial-Spectral Concordance | Yingkai Zhang et.al. | 2505.02109 | null |
| 2025-05-02 | Rethinking RGB-Event Semantic Segmentation with a Novel Bidirectional Motion-enhanced Event Representation | Zhen Yao et.al. | 2505.01548 | link |
| 2025-04-30 | AnimalMotionCLIP: Embedding motion in CLIP for Animal Behavior Analysis | Enmin Zhong et.al. | 2505.00569 | null |
| 2025-04-29 | LPVIMO-SAM: Tightly-coupled LiDAR/Polarization Vision/Inertial/Magnetometer/Optical Flow Odometry via Smoothing and Mapping | Derui Shan et.al. | 2504.20380 | null |
| 2025-04-28 | STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction | Zhimin Liao et.al. | 2504.19749 | null |
| 2025-04-25 | RapidPIV: Full Flow-Field kHz PIV for Real-Time Display and Control | Scott A. Bollt et.al. | 2504.17987 | null |
| 2025-04-22 | Motion-Enhanced Nonlocal Similarity Implicit Neural Representation for Infrared Dim and Small Target Detection | Pei Liu et.al. | 2504.15665 | null |
| 2025-04-22 | DiTPainter: Efficient Video Inpainting with Diffusion Transformers | Xian Wu et.al. | 2504.15661 | null |
| 2025-04-21 | PIV-FlowDiffuser:Transfer-learning-based denoising diffusion models for PIV | Qianyu Zhu et.al. | 2504.14952 | link |
| 2025-04-21 | Multimodal Non-Semantic Feature Fusion for Predicting Segment Access Frequency in Lecture Archives | Ruozhu Sheng et.al. | 2504.14927 | null |
| 2025-04-20 | FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models | Kuanting Wu et.al. | 2504.14535 | null |
| 2025-04-18 | Neural Ganglion Sensors: Learning Task-specific Event Cameras Inspired by the Neural Circuit of the Human Retina | Haley M. So et.al. | 2504.13457 | null |
| 2025-04-18 | MicroFlow: Domain-Specific Optical Flow for Ground Deformation Estimation in Seismic Events | Juliette Bertrand et.al. | 2504.13452 | null |
| 2025-04-18 | Event-Enhanced Blurry Video Super-Resolution | Dachun Kai et.al. | 2504.13042 | link |
| 2025-04-17 | SC3EF: A Joint Self-Correlation and Cross-Correspondence Estimation Framework for Visible and Thermal Image Registration | Xi Tong et.al. | 2504.12869 | null |
| 2025-04-17 | SAM-Based Building Change Detection with Distribution-Aware Fourier Adaptation and Edge-Constrained Warping | Yun-Cheng Li et.al. | 2504.12619 | null |
| 2025-04-14 | Perturbed State Space Feature Encoders for Optical Flow with Event Cameras | Gokul Raju Govinda Raju et.al. | 2504.10669 | null |
| 2025-04-15 | WildLive: Near Real-time Visual Wildlife Tracking onboard UAVs | Nguyen Ngoc Dat et.al. | 2504.10165 | null |
| 2025-04-12 | SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow | Qingyuan Wang et.al. | 2504.09160 | null |
| 2025-04-11 | Hardware, Algorithms, and Applications of the Neuromorphic Vision Sensor: a Review | Claudio Cimarelli et.al. | 2504.08588 | null |
| 2025-04-10 | Extending Visual Dynamics for Video-to-Music Generation | Xiaohao Liu et.al. | 2504.07594 | null |
| 2025-04-08 | Intrinsic Saliency Guided Trunk-Collateral Network for Unsupervised Video Object Segmentation | Xiangyu Zheng et.al. | 2504.05904 | null |
| 2025-04-07 | Towards Efficient Real-Time Video Motion Transfer via Generative Time Series Modeling | Tasmiah Haque et.al. | 2504.05537 | null |
| 2025-04-06 | FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency | Shiyan Liu et.al. | 2504.04427 | null |
| 2025-04-05 | Simultaneous Motion And Noise Estimation with Event Cameras | Shintaro Shiba et.al. | 2504.04029 | null |
| 2025-04-04 | 3D Scene Understanding Through Local Random Access Sequence Modeling | Wanhee Lee et.al. | 2504.03875 | link |
| 2025-04-03 | L-LBVC: Long-Term Motion Estimation and Prediction for Learned Bi-Directional Video Compression | Yongqi Zhai et.al. | 2504.02560 | null |
| 2025-04-03 | Estimating Scene Flow in Robot Surroundings with Distributed Miniaturized Time-of-Flight Sensors | Jack Sander et.al. | 2504.02439 | null |
| 2025-04-01 | Beyond Wide-Angle Images: Unsupervised Video Portrait Correction via Spatiotemporal Diffusion Adaptation | Wenbo Nie et.al. | 2504.00401 | null |
| 2025-04-01 | Hierarchical Flow Diffusion for Efficient Frame Interpolation | Yang Hai et.al. | 2504.00380 | null |
| 2025-03-31 | Easi3R: Estimating Disentangled Motion from DUSt3R Without Training | Xingyu Chen et.al. | 2503.24391 | link |
| 2025-04-03 | Towards Mobile Sensing with Event Cameras on High-agility Resource-constrained Devices: A Survey | Haoyang Wang et.al. | 2503.22943 | null |
| 2025-03-28 | Endo-TTAP: Robust Endoscopic Tissue Tracking via Multi-Facet Guided Attention and Hybrid Flow-point Supervision | Rulin Zhou et.al. | 2503.22394 | null |
| 2025-03-28 | VoteFlow: Enforcing Local Rigidity in Self-Supervised Scene Flow | Yancong Lin et.al. | 2503.22328 | link |
| 2025-03-28 | Segment Any Motion in Videos | Nan Huang et.al. | 2503.22268 | null |
| 2025-03-28 | Synergistic Bleeding Region and Point Detection in Surgical Videos | Jialun Pei et.al. | 2503.22174 | null |
| 2025-03-27 | VADMamba: Exploring State Space Models for Fast Video Anomaly Detection | Jiahao Lyu et.al. | 2503.21169 | link |
| 2025-03-27 | Can Video Diffusion Model Reconstruct 4D Geometry? | Jinjie Mai et.al. | 2503.21082 | null |
| 2025-03-25 | Burst Image Super-Resolution with Mamba | Ozan Unal et.al. | 2503.19634 | null |
| 2025-03-24 | NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting | Yulong Zheng et.al. | 2503.18794 | link |
| 2025-03-27 | MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion | Yikun Ma et.al. | 2503.17695 | null |
| 2025-03-21 | Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks | Bhishma Dedhia et.al. | 2503.17539 | null |
| 2025-03-21 | Unsupervised Joint Learning of Optical Flow and Intensity with Event Cameras | Shuang Guo et.al. | 2503.17262 | link |
| 2025-03-20 | 4D Gaussian Splatting SLAM | Yanyan Li et.al. | 2503.16710 | null |
| 2025-03-20 | Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction | Edgar Sucar et.al. | 2503.16318 | null |
| 2025-03-20 | EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation | Zihao Zhang et.al. | 2503.15831 | null |
| 2025-03-19 | Toward Scalable, Flexible Scene Flow for Point Clouds | Kyle Vedder et.al. | 2503.15666 | null |
| 2025-03-19 | DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework | Henrique Morimitsu et.al. | 2503.14880 | link |
| 2025-03-19 | Temporal-Consistent Video Restoration with Pre-trained Diffusion Models | Hengkang Wang et.al. | 2503.14863 | null |
| 2025-03-19 | SemanticFlow: A Self-Supervised Framework for Joint Scene Flow Prediction and Instance Segmentation in Dynamic Environments | Yinqi Chen et.al. | 2503.14837 | null |
| 2025-03-18 | GeoFlow-SLAM: A Robust Tightly-Coupled RGBD-Inertial Fusion SLAM for Dynamic Legged Robotics | Tingyang Xiao et.al. | 2503.14247 | link |
| 2025-03-17 | UCF-Crime-DVS: A Novel Event-Based Dataset for Video Anomaly Detection with Spiking Neural Networks | Yuanbin Qian et.al. | 2503.12905 | link |
| 2025-03-16 | ProbDiffFlow: An Efficient Learning-Free Framework for Probabilistic Single-Image Optical Flow Estimation | Mo Zhou et.al. | 2503.12348 | null |
| 2025-03-17 | EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation | Zengyu Wan et.al. | 2503.11371 | null |
| 2025-03-14 | FG-DFPN: Flow Guided Deformable Frame Prediction Network | M. Akın Yılmaz et.al. | 2503.11343 | link |
| 2025-03-14 | Zero-TIG: Temporal Consistency-Aware Zero-Shot Illumination-Guided Low-light Video Enhancement | Yini Li et.al. | 2503.11175 | null |
| 2025-03-14 | A High-Accuracy Alignment Approach for Solar Images of Different Wavelengths | Yun Wang et.al. | 2503.11035 | null |
| 2025-03-13 | Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations | Xunzhi Zheng et.al. | 2503.10464 | null |
| 2025-03-13 | Markerless Tracking-Based Registration for Medical Image Motion Correction | Luisa Neubig et.al. | 2503.10260 | null |
| 2025-03-13 | TARS: Traffic-Aware Radar Scene Flow Estimation | Jialong Wu et.al. | 2503.10210 | null |
| 2025-03-13 | ST-FlowNet: An Efficient Spiking Neural Network for Event-Based Optical Flow Estimation | Hongze Sun et.al. | 2503.10195 | null |
| 2025-03-12 | Investigation of Frame Differences as Motion Cues for Video Object Segmentation | Sota Kawamura et.al. | 2503.09132 | null |
| 2025-03-11 | TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting | Fengyi Zhang et.al. | 2503.08485 | null |
| 2025-03-11 | Feature Alignment with Equivariant Convolutions for Burst Image Super-Resolution | Xinyi Liu et.al. | 2503.08300 | null |
| 2025-03-10 | MambaFlow: A Mamba-Centric Architecture for End-to-End Optical Flow Estimation | Juntian Du et.al. | 2503.07046 | null |
| 2025-03-11 | Bridge Frame and Event: Common Spatiotemporal Fusion for High-Dynamic Scene Optical Flow | Hanyu Zhou et.al. | 2503.06992 | null |
| 2025-03-09 | Online Dense Point Tracking with Streaming Memory | Qiaole Dong et.al. | 2503.06471 | link |
| 2025-03-10 | VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control | Yuxuan Bian et.al. | 2503.05639 | link |
| 2025-03-07 | Stereo Any Video: Temporally Consistent Stereo Matching | Junpeng Jing et.al. | 2503.05549 | null |
| 2025-03-06 | Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation | David T. Hoffmann et.al. | 2503.04718 | null |
| 2025-03-06 | Implicit Neural Representation for Video and Image Super-Resolution | Mary Aiyetigbo et.al. | 2503.04665 | null |
| 2025-03-09 | ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem | Yu-Hsi Chen et.al. | 2503.04500 | link |
| 2025-03-05 | Video Super-Resolution: All You Need is a Video Diffusion Model | Zhihao Zhan et.al. | 2503.03355 | null |
| 2025-03-05 | BAT: Learning Event-based Optical Flow with Bidirectional Adaptive Temporal Correlation | Gangwei Xu et.al. | 2503.03256 | null |
| 2025-03-05 | Car-STAGE: Automated framework for large-scale high-dimensional simulated time-series data generation based on user-defined criteria | Asma A. Almutairi et.al. | 2503.03100 | null |
| 2025-03-04 | Anomaly detection in non-stationary videos using time-recursive differencing network based prediction | Gargi V. Pillai et.al. | 2503.02234 | null |
| 2025-03-03 | MLINE-VINS: Robust Monocular Visual-Inertial SLAM With Flow Manhattan and Line Features | Chao Ye et.al. | 2503.01571 | link |
| 2025-03-03 | AI-Driven Relocation Tracking in Dynamic Kitchen Environments | Arash Nasr Esfahani et.al. | 2503.01547 | link |
| 2025-03-02 | Vid2Fluid: 3D Dynamic Fluid Assets from Single-View Videos with Generative Gaussian Splatting | Zhiwei Zhao et.al. | 2503.00868 | null |
| 2025-03-02 | HiMo: High-Speed Objects Motion Compensation in Point Clouds | Qingwen Zhang et.al. | 2503.00803 | null |
| 2025-02-28 | EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth Registration | Kuangyi Chen et.al. | 2503.00167 | null |
| 2025-02-24 | MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow Estimation | Jiehao Luo et.al. | 2502.16907 | link |
| 2025-02-21 | Peripheral Teleportation: A Rest Frame Design to Mitigate Cybersickness During Virtual Locomotion | Tongyu Nie et.al. | 2502.15227 | null |
| 2025-02-20 | Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance | Meng Wang et.al. | 2502.14520 | null |
| 2025-02-18 | L4P: Low-Level 4D Vision Perception Unified | Abhishek Badki et.al. | 2502.13078 | null |
| 2025-02-18 | Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection | Zijian Cao et.al. | 2502.12735 | null |
| 2025-02-17 | Robust 6DoF Pose Tracking Considering Contour and Interior Correspondence Uncertainty for AR Assembly Guidance | Jixiang Chen et.al. | 2502.11971 | null |
| 2025-02-17 | Stonefish: Supporting Machine Learning Research in Marine Robotics | Michele Grimaldi et.al. | 2502.11887 | null |
| 2025-02-15 | Super Resolution image reconstructs via total variation-based image deconvolution: a majorization-minimization approach | Mouhamad Chehaitly et.al. | 2502.10876 | null |
| 2025-02-15 | Learning semantical dynamics and spatiotemporal collaboration for human pose estimation in video | Runyang Feng et.al. | 2502.10616 | null |
| 2025-02-11 | A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision | Hao Ai et.al. | 2502.10444 | null |
| 2025-02-12 | FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis | Wonjoon Jin et.al. | 2502.08244 | null |
| 2025-02-11 | Flow Distillation Sampling: Regularizing 3D Gaussians with Pre-trained Matching Priors | Lin-Zhuo Chen et.al. | 2502.07615 | null |
| 2025-02-18 | A Physical Coherence Benchmark for Evaluating Video Generation Models via Optical Flow-guided Frame Prediction | Yongfan Chen et.al. | 2502.05503 | link |
| 2025-02-05 | MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent | Xinyao Liao et.al. | 2502.03207 | null |
| 2025-02-03 | XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications | Shangjin Zhai et.al. | 2502.01297 | null |
| 2025-01-28 | Image Velocimetry using Direct Displacement Field estimation with Neural Networks for Fluids | Efraín Magaña et.al. | 2501.18641 | null |
| 2025-02-02 | REMOTE: Real-time Ego-motion Tracking for Various Endoscopes via Multimodal Visual Feature Learning | Liangjing Shao et.al. | 2501.18124 | null |
| 2025-01-29 | SSF: Sparse Long-Range Scene Flow for Autonomous Driving | Ajinkya Khoche et.al. | 2501.17821 | link |
| 2025-01-28 | Improved Encoding for Overfitted Video Codecs | Thomas Leguay et.al. | 2501.16976 | null |
| 2025-01-28 | Assessing ultrasonic and optical flow velocimetry in a millifluidic device using oil-in-water emulsions as blood mimicking fluid | Estelle Lu et.al. | 2501.16959 | null |
| 2025-01-28 | Extending Information Bottleneck Attribution to Video Sequences | Veronika Solopova et.al. | 2501.16889 | link |
| 2025-02-04 | Event-Based Adaptive Koopman Framework for Optic Flow-Guided Landing on Moving Platforms | Bazeela Banday et.al. | 2501.16868 | null |
| 2025-01-28 | SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios | Yinqi Chen et.al. | 2501.16754 | null |
| 2025-01-23 | GC-ConsFlow: Leveraging Optical Flow Residuals and Global Context for Robust Deepfake Detection | Jiaxin Chen et.al. | 2501.13435 | null |
| 2025-01-22 | MONA: Moving Object Detection from Videos Shot by Dynamic Camera | Boxun Hu et.al. | 2501.13183 | null |
| 2025-01-22 | Machine Learning Modeling for Multi-order Human Visual Motion Processing | Zitang Sun et.al. | 2501.12810 | link |
| 2025-01-21 | Efficient Dynamic Image Reconstruction with motion estimation | Toluwani Okunola et.al. | 2501.12497 | null |
| 2025-01-21 | Learning segmentation from point trajectories | Laurynas Karazija et.al. | 2501.12392 | link |
| 2025-01-22 | Video Depth Anything: Consistent Depth Estimation for Super-Long Videos | Sili Chen et.al. | 2501.12375 | link |
| 2025-01-21 | VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models | Chaohao Xie et.al. | 2501.12267 | null |
| 2025-01-20 | Event-based vision for egomotion estimation using precise event timing | Hugh Greatorex et.al. | 2501.11554 | null |
| 2025-01-19 | BF-STVSR: B-Splines and Fourier-Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution | Eunjin Kim et.al. | 2501.11043 | null |
| 2025-01-25 | Quadcopter Position Hold Function using Optical Flow in a Smartphone-based Flight Computer | Noel P. Caliston et.al. | 2501.10752 | null |
| 2025-01-18 | Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection | Yifang Xu et.al. | 2501.10692 | null |
| 2025-01-20 | Zero-Shot Monocular Scene Flow Estimation in the Wild | Yiqing Liang et.al. | 2501.10357 | null |
| 2025-01-20 | GSTAR: Gaussian Surface Tracking and Reconstruction | Chengwei Zheng et.al. | 2501.10283 | null |
| 2025-01-17 | DiffuEraser: A Diffusion Model for Video Inpainting | Xiaowen Li et.al. | 2501.10018 | link |
| 2025-01-16 | VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization | Zixun Fang et.al. | 2501.09499 | null |
| 2025-01-16 | DEFOM-Stereo: Depth Foundation Model Based Stereo Matching | Hualie Jiang et.al. | 2501.09466 | link |
| 2025-01-16 | Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise | Ryan Burgert et.al. | 2501.08331 | link |
| 2025-01-13 | Aligning First, Then Fusing: A Novel Weakly Supervised Multimodal Violence Detection Method | Wenping Jin et.al. | 2501.07496 | link |
| 2025-01-08 | Edit as You See: Image-guided Video Editing via Masked Motion Modeling | Zhi-Lin Huang et.al. | 2501.04325 | null |
| 2025-01-06 | TinySense: A Lighter Weight and More Power-efficient Avionics System for Flying Insect-scale Robots | Zhitao Yu et.al. | 2501.03416 | null |
| 2025-01-06 | ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking | Tingyang Zhang et.al. | 2501.03220 | null |
| 2025-01-05 | AHMSA-Net: Adaptive Hierarchical Multi-Scale Attention Network for Micro-Expression Recognition | Lijun Zhang et.al. | 2501.02539 | null |
| 2025-01-01 | Spatially-guided Temporal Aggregation for Robust Event-RGB Optical Flow Estimation | Qianang Zhou et.al. | 2501.00838 | null |
| 2025-01-05 | How Honeybees Perceive and Traverse Apertures | Timothy Jakobi et.al. | 2501.00646 | null |
| 2024-12-31 | STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes | Jiawei Yang et.al. | 2501.00602 | null |
| 2024-12-29 | Motion Transfer-Driven intra-class data augmentation for Finger Vein Recognition | Xiu-Feng Huang et.al. | 2412.20327 | link |
| 2024-12-28 | Enhancing Marine Debris Acoustic Monitoring by Optical Flow-Based Motion Vector Analysis | Xiaoteng Zhou et.al. | 2412.20085 | null |
| 2024-12-27 | Zero-shot Hazard Identification in Autonomous Driving: A Case Study on the COOOL Benchmark | Lukas Picek et.al. | 2412.19944 | null |
| 2024-12-27 | Generalized Uncertainty-Based Evidential Fusion with Hybrid Multi-Head Attention for Weak-Supervised Temporal Action Localization | Yuanpeng He et.al. | 2412.19418 | link |
| 2024-12-23 | FlowMamba: Learning Point Cloud Scene Flow with Global Motion Propagation | Min Lin et.al. | 2412.17366 | null |
| 2025-01-03 | Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry | Zhaoxing Zhang et.al. | 2412.16923 | null |
| 2024-12-20 | SOUS VIDE: Cooking Visual Drone Navigation Policies in a Gaussian Splatting Vacuum | JunEn Low et.al. | 2412.16346 | null |
| 2024-12-20 | MotiF: Making Text Count in Image Animation with Motion Focal Loss | Shijie Wang et.al. | 2412.16153 | null |
| 2024-12-18 | Dynamic semantic VSLAM with known and unknown objects | Sanghyoup Gu et.al. | 2412.14359 | null |
| 2024-12-18 | SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation | Tong Chen et.al. | 2412.14018 | null |
| 2024-12-17 | CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices | Andrei Znobishchev et.al. | 2412.13273 | null |
| 2024-12-17 | Complex extension of optical flow and its practical evaluation for undersampled dynamic MRI | Matthias J. Ehrhardt et.al. | 2412.12711 | null |
| 2024-12-17 | GG-SSMs: Graph-Generating State Space Models | Nikola Zubić et.al. | 2412.12423 | null |
| 2024-12-16 | Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising | Zikang Chen et.al. | 2412.11820 | link |
| 2024-12-16 | Exploring More from Multiple Gait Modalities for Human Identification | Dongyang Jin et.al. | 2412.11495 | link |
| 2024-12-16 | BiM-VFI: directional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions | Wonyong Seo et.al. | 2412.11365 | null |
| 2024-12-15 | Learning Normal Flow Directly From Event Neighborhoods | Dehao Yuan et.al. | 2412.11284 | link |
| 2024-12-13 | BatDeck – Ultra Low-power Ultrasonic Ego-velocity Estimation and Obstacle Avoidance on Nano-drones | Hanna Müller et.al. | 2412.10048 | null |
| 2024-12-12 | A Plug-and-Play Algorithm for 3D Video Super-Resolution of Single-Photon LiDAR data | Alice Ruget et.al. | 2412.09427 | null |
| 2024-12-12 | eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction | Jad Mansour et.al. | 2412.09209 | link |
| 2024-12-12 | ResFlow: Fine-tuning Residual Optical Flow for Event-based High Temporal Resolution Motion Estimation | Qianang Zhou et.al. | 2412.09105 | null |
| 2024-12-12 | Mojito: Motion Trajectory and Intensity Control for Video Generation | Xuehai He et.al. | 2412.08948 | null |
| 2024-12-12 | Labits: Layered Bidirectional Time Surfaces Representation for Event Camera-based Continuous Dense Trajectory Estimation | Zhongyang Zhang et.al. | 2412.08849 | null |
| 2024-12-11 | Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation | Zhigang Cen et.al. | 2412.08034 | null |
| 2024-12-10 | EvRepSL: Event-Stream Representation via Self-Supervised Learning for Event-Based Vision | Qiang Qu et.al. | 2412.07080 | link |
| 2024-12-09 | Local Attention Transformers for High-Detail Optical Flow Upsampling | Alexander Gielisse et.al. | 2412.06439 | null |
| 2024-12-08 | MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation | Shuwei Shi et.al. | 2412.05848 | null |
| 2024-12-05 | Deep Learning and Hybrid Approaches for Dynamic Scene Analysis, Object Detection and Motion Tracking | Shahran Rahman Alve et.al. | 2412.05331 | null |
| 2024-12-04 | Advancing Auto-Regressive Continuation for Video Frames | Ruibo Ming et.al. | 2412.03758 | null |
| 2024-12-03 | Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | Hiroki Furuta et.al. | 2412.02617 | null |
| 2024-12-02 | STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation | Sunghun Yang et.al. | 2412.01090 | null |
| 2024-12-01 | Advanced Video Inpainting Using Optical Flow-Guided Efficient Diffusion | Bohai Gu et.al. | 2412.00857 | null |
| 2024-11-30 | A conditional Generative Adversarial network model for the Weather4Cast 2024 Challenge | Atharva Deshpande et.al. | 2412.00451 | null |
| 2024-11-30 | Hybrid Local-Global Context Learning for Neural Video Compression | Yongqi Zhai et.al. | 2412.00446 | null |
| 2024-11-27 | RoMo: Robust Motion Segmentation Improves Structure from Motion | Lily Goli et.al. | 2411.18650 | null |
| 2024-11-27 | ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching | Yangrui Dong et.al. | 2411.18174 | null |
| 2024-11-27 | An End-to-End Two-Stream Network Based on RGB Flow and Representation Flow for Human Action Recognition | Song-Jiang Lai et.al. | 2411.18002 | null |
| 2024-11-26 | Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors | Zhengfei Kuang et.al. | 2411.17249 | null |
| 2024-11-25 | Context-Aware Input Orchestration for Video Inpainting | Hoyoung Kim et.al. | 2411.16926 | null |
| 2024-11-22 | TSkips: Efficiency Through Explicit Temporal Delay Connections in Spiking Neural Networks | Prajna G. Malettira et.al. | 2411.16711 | null |
| 2024-11-24 | PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments | Haoang Li et.al. | 2411.15800 | null |
| 2024-11-23 | Optical-Flow Guided Prompt Optimization for Coherent Video Generation | Hyelin Nam et.al. | 2411.15540 | null |
| 2024-11-22 | Benchmarking the Robustness of Optical Flow Estimation to Corruptions | Zhonghua Yi et.al. | 2411.14865 | null |
| 2024-11-21 | EdgeFlowNet: 100FPS@1W Dense Optical Flow For Tiny Mobile Robots | Sai Ramana Kiran Pinnama Raju et.al. | 2411.14576 | null |
| 2024-11-21 | Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation | Zhuoman Liu et.al. | 2411.14423 | null |
| 2024-11-21 | Transforming Static Images Using Generative Models for Video Salient Object Detection | Suhwan Cho et.al. | 2411.13975 | link |
| 2024-11-20 | Sparse Input View Synthesis: 3D Representations and Reliable Priors | Nagabhushan Somraj et.al. | 2411.13631 | null |
| 2024-11-20 | DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild | Weicai Ye et.al. | 2411.13291 | null |
| 2024-11-20 | Efficient Masked AutoEncoder for Video Object Counting and A Large-Scale Benchmark | Bing Cao et.al. | 2411.13056 | null |
| 2024-11-16 | AnimateAnything: Consistent and Controllable Animation for Video Generation | Guojun Lei et.al. | 2411.10836 | null |
| 2024-11-15 | OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models | Mathis Koroglu et.al. | 2411.10501 | null |
| 2024-11-14 | Adversarial Attacks Using Differentiable Rendering: A Survey | Matthew Hull et.al. | 2411.09749 | null |
| 2024-11-14 | MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation | Jonas Serych et.al. | 2411.09551 | link |
| 2024-11-15 | UniHOI: Learning Fast, Dense and Generalizable 4D Reconstruction for Egocentric Hand Object Interaction Videos | Chengbo Yuan et.al. | 2411.09145 | null |
| 2024-11-13 | 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization | Mijeong Kim et.al. | 2411.08879 | null |
| 2024-11-12 | DPU: Dynamic Prototype Updating for Multimodal Out-of-Distribution Detection | Shawn Li et.al. | 2411.08227 | link |
| 2024-11-17 | Scaling Properties of Diffusion Models for Perceptual Tasks | Rahul Ravishankar et.al. | 2411.08034 | null |
| 2024-11-11 | Breaking The Ice: Video Segmentation for Close-Range Ice-Covered Waters | Corwin Grant Jeon MacMillan et.al. | 2411.05225 | null |
| 2024-11-07 | Seeing Through Pixel Motion: Learning Obstacle Avoidance from Optical Flow with One Camera | Yu Hu et.al. | 2411.04413 | null |
| 2024-11-07 | AMNCutter: Affinity-Attention-Guided Multi-View Normalized Cutter for Unsupervised Surgical Instrument Segmentation | Mingyu Sheng et.al. | 2411.03695 | link |
| 2024-11-04 | Neural optical flow for planar and stereo PIV | Andrew I. Masker et.al. | 2411.02373 | null |
| 2024-11-03 | Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation | Zhenbin Wang et.al. | 2411.01647 | null |
| 2024-11-03 | Object segmentation from common fate: Motion energy processing enables human-like zero-shot generalization to random dot stimuli | Matthias Tangemann et.al. | 2411.01505 | null |
| 2024-11-02 | Optimizing Violence Detection in Video Classification Accuracy through 3D Convolutional Neural Networks | Aarjav Kavathia et.al. | 2411.01348 | null |
| 2024-10-29 | Motion Graph Unleashed: A Novel Approach to Video Prediction | Yiqi Zhong et.al. | 2410.22288 | link |
| 2024-10-29 | FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives | Qizhi Chen et.al. | 2410.22070 | null |
| 2024-10-29 | Investigation of moving objects through atmospheric turbulence from a non-stationary platform | Nicholas Ferrante et.al. | 2410.21639 | null |
| 2024-10-27 | CloudCast – Total Cloud Cover Nowcasting with Machine Learning | Mikko Partio et.al. | 2410.21329 | link |
| 2024-10-28 | Enhancing Action Recognition by Leveraging the Hierarchical Structure of Actions and Textual Context | Manuel Benavent-Lledo et.al. | 2410.21275 | link |
| 2024-10-27 | BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events | Yijin Li et.al. | 2410.20451 | null |
| 2024-10-26 | UniVST: A Unified Framework for Training-free Localized Video Style Transfer | Quanjian Song et.al. | 2410.20084 | null |
| 2024-10-25 | FastPCI: Motion-Structure Guided Fast Point Cloud Frame Interpolation | Tianyu Zhang et.al. | 2410.19573 | link |
| 2024-10-23 | Separating edges from microstructure in X-ray dark-field imaging: Evolving and devolving perspectives via the X-ray Fokker-Planck equation | Samantha J. Alloo et.al. | 2410.18317 | null |
| 2024-10-17 | Self-Supervised Scene Flow Estimation with Point-Voxel Fusion and Surface Representation | Xuezhi Xiang et.al. | 2410.13355 | null |
| 2024-10-16 | Imagine2Servo: Intelligent Visual Servoing with Diffusion-Driven Goal Generation for Robotic Tasks | Pranjali Pathre et.al. | 2410.12432 | null |
| 2024-10-14 | Self-Assessed Generation: Trustworthy Label Generation for Optical Flow and Stereo Matching in Real-world | Han Ling et.al. | 2410.10453 | link |
| 2024-10-12 | A Collaborative Team of UAV-Hexapod for an Autonomous Retrieval System in GNSS-Denied Maritime Environments | Seungwook Lee et.al. | 2410.09606 | null |
| 2024-10-12 | Robust Optical Flow Computation: A Higher-Order Differential Approach | Chanuka Algama et.al. | 2410.09563 | null |
| 2024-10-10 | MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting | Ruijie Zhu et.al. | 2410.07707 | link |
| 2024-10-09 | Z-upscaling: Optical Flow Guided Frame Interpolation for Isotropic Reconstruction of 3D EM Volumes | Fisseha A. Ferede et.al. | 2410.07043 | link |
| 2024-10-08 | Future frame prediction in chest cine MR imaging using the PCA respiratory motion model and dynamically trained recurrent neural networks | Michel Pohl et.al. | 2410.05882 | null |
| 2024-10-02 | Scene Flow as a Partial Differential Equation | Kyle Vedder et.al. | 2410.02031 | null |
| 2024-10-01 | Descriptor: Face Detection Dataset for Programmable Threshold-Based Sparse-Vision | Riadul Islam et.al. | 2410.00368 | link |
| 2024-10-08 | DressRecon: Freeform 4D Human Reconstruction from Monocular Video | Jeff Tan et.al. | 2409.20563 | null |
| 2024-10-06 | Visual collective behaviors on spherical robots | Diego Castro et.al. | 2409.20539 | null |
| 2024-09-26 | Subjective and Objective Quality-of-Experience Evaluation Study for Live Video Streaming | Zehao Zhu et.al. | 2409.17596 | null |
| 2024-09-26 | TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic Scene | Sandika Biswas et.al. | 2409.17459 | null |
| 2024-09-25 | EventHDR: from Event to High-Speed HDR Videos and Beyond | Yunhao Zou et.al. | 2409.17029 | null |
| 2024-09-25 | Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation | Hanyu Zhou et.al. | 2409.17001 | null |
| 2024-09-25 | Pose-Guided Fine-Grained Sign Language Video Generation | Tongkai Shi et.al. | 2409.16709 | null |
| 2024-09-24 | FSF-Net: Enhance 4D Occupancy Forecasting with Coarse BEV Scene Flow for Autonomous Driving | Erxin Guo et.al. | 2409.15841 | null |
| 2024-09-21 | BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow | EungGu Kang et.al. | 2409.15384 | link |
| 2024-09-23 | Skills Made to Order: Efficient Acquisition of Robot Cooking Skills Guided by Multiple Forms of Internet Data | Mrinal Verghese et.al. | 2409.15172 | null |
| 2024-09-22 | Secrets of Edge-Informed Contrast Maximization for Event-Based Vision | Pritam P. Karmokar et.al. | 2409.14611 | null |
| 2024-09-18 | Optical Flow Matters: an Empirical Comparative Study on Fusing Monocular Extracted Modalities for Better Steering | Fouad Makiyeh et.al. | 2409.12716 | null |
| 2024-09-16 | ScaleFlow++: Robust and Accurate Estimation of 3D Motion from Video | Han Ling et.al. | 2409.12202 | link |
| 2024-09-16 | Continual Learning of Conjugated Visual Representations through Higher-order Motion Flows | Simone Marullo et.al. | 2409.11441 | null |
| 2024-09-17 | Training Datasets Generation for Machine Learning: Application to Vision Based Navigation | Jérémy Lebreton et.al. | 2409.11383 | null |
| 2024-09-17 | Multimodal Attention-Enhanced Feature Fusion-based Weekly Supervised Anomaly Violence Detection | Yuta Kaneko et.al. | 2409.11223 | null |
| 2024-09-16 | Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference | Huy-Dung Nguyen et.al. | 2409.10095 | null |
| 2024-09-16 | Embodiment-Agnostic Action Planning via Object-Part Scene Flow | Weiliang Tang et.al. | 2409.10032 | null |
| 2024-09-16 | SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning | Amogh Joshi et.al. | 2409.09990 | null |
| 2024-09-15 | Dynamic Layer Detection of a Thin Silk Cloth using DenseTact Optical Tactile Sensors | Ankush Kundan Dhawan et.al. | 2409.09849 | null |
| 2024-09-15 | Tracking Virtual Meetings in the Wild: Re-identification in Multi-Participant Virtual Meetings | Oriel Perl et.al. | 2409.09841 | null |
| 2024-09-13 | InstantDrag: Improving Interactivity in Drag-based Image Editing | Joonghyuk Shin et.al. | 2409.08857 | null |
| 2024-09-11 | Violence detection in videos using deep recurrent and convolutional neural networks | Abdarahmane Traoré et.al. | 2409.07581 | null |
| 2024-09-11 | Distance Measurement for UAVs in Deep Hazardous Tunnels | Vishal Choudhary et.al. | 2409.07160 | null |
| 2024-09-09 | LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow | Hongyu Wen et.al. | 2409.05688 | null |
| 2024-09-11 | Real-Time Human Action Recognition on Embedded Platforms | Ruiqi Wang et.al. | 2409.05662 | null |
| 2024-09-09 | HMAFlow: Learning More Accurate Optical Flow via Hierarchical Motion Field Alignment | Dianbo Ma et.al. | 2409.05531 | link |
| 2024-09-09 | FacialFlowNet: Advancing Facial Optical Flow Estimation with a Diverse Dataset and a Decomposed Model | Jianzhi Lu et.al. | 2409.05396 | link |
| 2024-09-06 | Hybrid Cost Volume for Memory-Efficient Optical Flow | Yang Zhao et.al. | 2409.04243 | link |
| 2024-09-06 | SDformerFlow: Spatiotemporal swin spikeformer for event-based optical flow estimation | Yi Tian et.al. | 2409.04082 | link |
| 2024-09-03 | DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos | Wenbo Hu et.al. | 2409.02095 | null |
| 2024-09-01 | IGEV++: Iterative Multi-range Geometry Encoding Volumes for Stereo Matching | Gangwei Xu et.al. | 2409.00638 | link |
| 2024-08-29 | FlowRetrieval: Flow-Guided Data Retrieval for Few-Shot Imitation Learning | Li-Heng Lin et.al. | 2408.16944 | null |
| 2024-08-29 | Estimating Dynamic Flow Features in Groups of Tracked Objects | Tanner D. Harms et.al. | 2408.16190 | null |
| 2024-08-28 | MMASD+: A Novel Dataset for Privacy-Preserving Behavior Analysis of Children with Autism Spectrum Disorder | Pavan Uttej Ravva et.al. | 2408.15077 | link |
| 2024-08-21 | Enhanced Visual SLAM for Collision-free Driving with Lightweight Autonomous Cars | Zhihao Lin et.al. | 2408.11582 | null |
| 2024-08-21 | SelfDRSC++: Self-Supervised Learning for Dual Reversed Rolling Shutter Correction | Wei Shang et.al. | 2408.11411 | link |
| 2024-09-02 | Video Diffusion Models are Strong Video Inpainter | Minhyeok Lee et.al. | 2408.11402 | null |
| 2024-08-20 | PooDLe: Pooled and dense self-supervised learning from naturalistic videos | Alex N. Wang et.al. | 2408.11208 | null |
| 2024-08-21 | NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices | Zhiyong Zhang et.al. | 2408.10161 | link |
| 2024-08-19 | Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data | Tao Yang et.al. | 2408.10119 | null |
| 2024-08-18 | Contactless seismocardiography via Gunnar-Farneback optical flow | Mohammad Muntasir Rahman et.al. | 2408.09512 | null |
| 2024-08-18 | OPPH: A Vision-Based Operator for Measuring Body Movements for Personal Healthcare | Chen Long-fei et.al. | 2408.09409 | null |
| 2024-08-16 | CoSEC: A Coaxial Stereo Event Camera Dataset for Autonomous Driving | Shihan Peng et.al. | 2408.08500 | null |
| 2024-08-15 | MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing | Chenjie Cao et.al. | 2408.08000 | null |
| 2024-08-12 | FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework | Lukas Meyer et.al. | 2408.06190 | link |
| 2024-08-12 | Toward Pedestrian Head Tracking: A Benchmark Dataset and an Information Fusion Network | Kailai Sun et.al. | 2408.05877 | null |
| 2024-08-11 | Egocentric Vision Language Planning | Zhirui Fang et.al. | 2408.05802 | null |
| 2024-08-08 | MultiViPerFrOG: A Globally Optimized Multi-Viewpoint Perception Framework for Camera Motion and Tissue Deformation | Guido Caccianiga et.al. | 2408.04367 | null |
| 2024-08-08 | KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance | Jingxian Lu et.al. | 2408.02912 | null |
| 2024-08-05 | Gaussian Mixture based Evidential Learning for Stereo Matching | Weide Liu et.al. | 2408.02796 | null |
| 2024-08-02 | NOLO: Navigate Only Look Once | Bohan Zhou et.al. | 2408.01384 | null |
| 2024-07-31 | RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining | Hongtao Wu et.al. | 2407.21773 | link |
| 2024-07-31 | Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching | Pengjie Zhang et.al. | 2407.21735 | null |
| 2024-07-30 | SpotFormer: Multi-Scale Spatio-Temporal Transformer for Facial Expression Spotting | Yicheng Deng et.al. | 2407.20799 | null |
| 2024-07-29 | Event-based Optical Flow on Neuromorphic Processor: ANN vs. SNN Comparison based on Activation Sparsification | Yingfu Xu et.al. | 2407.20421 | null |
| 2024-07-26 | Revisit Event Generation Model: Self-Supervised Learning of Event-to-Video Reconstruction with Implicit Neural Representations | Zipeng Wang et.al. | 2407.18500 | null |
| 2024-07-23 | Occlusion-Aware 3D Motion Interpretation for Abnormal Behavior Detection | Su Li et.al. | 2407.16788 | null |
| 2024-07-23 | SAFNet: Selective Alignment Fusion Network for Efficient HDR Imaging | Lingtong Kong et.al. | 2407.16308 | link |
| 2024-07-18 | Many Perception Tasks are Highly Redundant Functions of their Input Data | Rahul Ramesh et.al. | 2407.13841 | null |
| 2024-07-18 | Long-Term 3D Point Tracking By Cost Volume Fusion | Hung Nguyen et.al. | 2407.13337 | null |
| 2024-07-18 | Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain | Bach Nguyen Gia et.al. | 2407.13159 | link |
| 2024-07-17 | Fusion Flow-enhanced Graph Pooling Residual Networks for Unmanned Aerial Vehicles Surveillance in Day and Night Dual Visions | Alam Noor et.al. | 2407.12647 | null |
| 2024-07-16 | Improving Unsupervised Video Object Segmentation via Fake Flow Generation | Suhwan Cho et.al. | 2407.11714 | link |
| 2024-07-16 | ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment | Xinyi Wang et.al. | 2407.11496 | link |
| 2024-07-16 | Hybrid physics-AI outperforms numerical weather prediction for extreme precipitation nowcasting | Puja Das et.al. | 2407.11317 | null |
| 2024-07-16 | Gaussian Splatting LK | Liuyue Xie et.al. | 2407.11309 | null |
| 2024-07-15 | Temporal Event Stereo via Joint Learning with Stereoscopic Flow | Hoonhee Cho et.al. | 2407.10831 | null |
| 2024-07-15 | Motion-prior Contrast Maximization for Dense Continuous-Time Motion Estimation | Friedhelm Hamann et.al. | 2407.10802 | link |
| 2024-07-14 | Research Experience of an Undergraduate Student in Computer Vision and Robotics | Ayush V. Gowda et.al. | 2407.10044 | null |
| 2024-07-13 | ScaleRAFT: Cross-Scale Recurrent All-Pairs Field Transforms for 3D Motion Estimation | Han Ling et.al. | 2407.09797 | link |
| 2024-07-11 | Generalizable Implicit Motion Modeling for Video Frame Interpolation | Zujin Guo et.al. | 2407.08680 | null |
| 2024-07-11 | Event-based vision on FPGAs – a survey | Tomasz Kryjak et.al. | 2407.08356 | null |
| 2024-07-10 | Flow4D: Leveraging 4D Voxel Network for LiDAR Scene Flow Estimation | Jaeyeul Kim et.al. | 2407.07995 | link |
| 2024-07-10 | Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction | Yili Liu et.al. | 2407.07587 | null |
| 2024-07-05 | Unsupervised 4D Cardiac Motion Tracking with Spatiotemporal Optical Flow Networks | Long Teng et.al. | 2407.04663 | null |
| 2024-07-04 | CardioSpectrum: Comprehensive Myocardium Motion Analysis with 3D Deep Learning and Geometric Insights | Shahar Zuler et.al. | 2407.03794 | link |
| 2024-07-03 | Towards High Resolution Real-Time Optical Flow Particle Image Velocimetry | Juan Pimienta et.al. | 2407.03057 | null |
| 2024-07-03 | EgoFlowNet: Non-Rigid Scene Flow from Point Clouds with Ego-Motion Support | Ramy Battrawy et.al. | 2407.02920 | null |
| 2024-07-03 | Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction | Jiaxin Guo et.al. | 2407.02918 | link |
| 2024-07-01 | SeFlow: A Self-Supervised Scene Flow Method in Autonomous Driving | Qingwen Zhang et.al. | 2407.01702 | link |
| 2024-07-01 | DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models | Chang-Han Yeh et.al. | 2407.01519 | null |
| 2024-07-01 | RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields | Haochen Jiang et.al. | 2407.01303 | link |
| 2024-07-01 | RMS-FlowNet++: Efficient and Robust Multi-Scale Scene Flow Estimation for Large-Scale Point Clouds | Ramy Battrawy et.al. | 2407.01129 | null |
| 2024-06-27 | What Matters in Detecting AI-Generated Videos like Sora? | Chirui Chang et.al. | 2406.19568 | null |
| 2024-06-27 | A Universal Railway Obstacle Detection System based on Semi-supervised Segmentation And Optical Flow | Qiushi Guo et.al. | 2406.18908 | null |
| 2024-06-27 | Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach | Yuxiang Huang et.al. | 2406.18837 | null |
| 2024-06-25 | Disentangled Motion Modeling for Video Frame Interpolation | Jaihyun Lew et.al. | 2406.17256 | link |
| 2024-06-19 | Simultaneous Map and Object Reconstruction | Nathaniel Chodosh et.al. | 2406.13896 | null |
| 2024-06-26 | Splatter a Video: Video Gaussian Representation for Versatile Processing | Yang-Tian Sun et.al. | 2406.13870 | null |
| 2024-06-19 | Low Latency Visual Inertial Odometry with On-Sensor Accelerated Optical Flow for Resource-Constrained UAVs | Jonas Kühne et.al. | 2406.13345 | null |
| 2024-06-17 | MEDeA: Multi-view Efficient Depth Adjustment | Mikhail Artemyev et.al. | 2406.12048 | null |
| 2024-06-15 | NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows | Zhenggang Tang et.al. | 2406.10543 | link |
| 2024-06-13 | Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion | Linzhan Mou et.al. | 2406.09402 | null |
| 2024-06-11 | PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow | Joshua Tokarsky et.al. | 2406.07667 | null |
| 2024-06-11 | Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang et.al. | 2406.07551 | link |
| 2024-06-07 | DVOS: Self-Supervised Dense-Pattern Video Object Segmentation | Keyhan Najafian et.al. | 2406.05131 | null |
| 2024-06-07 | Ada-VE: Training-Free Consistent Video Editing Using Adaptive Motion Prior | Tanvir Mahmud et.al. | 2406.04873 | null |
| 2024-06-07 | Interplay between preconditioning and regularization for linear ill-posed problems solved by conjugate gradient. Application to optical flow estimation | Ahmed Chabib et.al. | 2406.04695 | null |
| 2024-06-04 | Neural Representations of Dynamic Visual Stimuli | Jacob Yeung et.al. | 2406.02659 | null |
| 2024-06-03 | DeNVeR: Deformable Neural Vessel Representations for Unsupervised Video Vessel Segmentation | Chun-Hung Wu et.al. | 2406.01591 | null |
| 2024-06-03 | Prototypical Transformer as Unified Motion Learners | Cheng Han et.al. | 2406.01559 | null |
| 2024-06-03 | Enhancing Dynamic CT Image Reconstruction with Neural Fields Through Explicit Motion Regularizers | Pablo Arratia et.al. | 2406.01299 | null |
| 2024-06-03 | Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting | Fang Li et.al. | 2406.01042 | link |
| 2024-06-03 | Synthetic Data Generation for 3D Myocardium Deformation Analysis | Shahar Zuler et.al. | 2406.01040 | link |
| 2024-05-30 | EMAG: Ego-motion Aware and Generalizable 2D Hand Forecasting from Egocentric Videos | Masashi Hatano et.al. | 2405.20030 | null |
| 2024-05-30 | May the Dance be with You: Dance Generation Framework for Non-Humanoids | Hyemin Ahn et.al. | 2405.19743 | null |
| 2024-05-28 | GFlow: Recovering 4D World from Monocular Video | Shizun Wang et.al. | 2405.18426 | null |
| 2024-05-28 | Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition | Muhammad Adi Nugroho et.al. | 2405.18012 | null |
| 2024-05-27 | DCPI-Depth: Explicitly Infusing Dense Correspondence Prior to Unsupervised Monocular Depth Estimation | Mengtan Zhang et.al. | 2405.16960 | null |
| 2024-05-27 | SCSim: A Realistic Spike Cameras Simulator | Liwen Hu et.al. | 2405.16790 | link |
| 2024-05-26 | Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition | Tong Shi et.al. | 2405.16701 | null |
| 2024-05-26 | Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion Perception | Shuangpeng Han et.al. | 2405.16493 | null |
| 2024-05-24 | UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes | Ted Lentsch et.al. | 2405.15688 | link |
| 2024-05-24 | Time-Harmonic Optical Flow with Applications in Elastography | Oleh Melnyk et.al. | 2405.15507 | null |
| 2024-05-24 | Distinguish Any Fake Videos: Unleashing the Power of Large-scale Data and Motion Features | Lichuan Ji et.al. | 2405.15343 | null |
| 2024-05-24 | Unsupervised Motion Segmentation for Neuromorphic Aerial Surveillance | Sami Arja et.al. | 2405.15209 | null |
| 2024-05-23 | SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow | Yihan Wang et.al. | 2405.14793 | null |
| 2024-05-23 | OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance | Shuheng Ge et.al. | 2405.14709 | null |
| 2024-05-23 | Neuroexplicit Diffusion Models for Inpainting of Optical Flow Fields | Tom Fischer et.al. | 2405.14599 | null |
| 2024-05-22 | MotionCraft: Physics-based Zero-Shot Video Generation | Luca Savant Aira et.al. | 2405.13557 | null |
| 2024-05-21 | Weakly supervised alignment and registration of MR-CT for cervical cancer radiotherapy | Jjahao Zhang et.al. | 2405.12850 | null |
| 2024-05-21 | Rethink Predicting the Optical Flow with the Kinetics Perspective | Yuhao Cheng et.al. | 2405.12512 | link |
| 2024-05-18 | GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition | Mallika Garg et.al. | 2405.11180 | link |
| 2024-05-17 | MicroBundlePillarTrack, A Python package for automated segmentation, tracking, and analysis of pillar deflection in cardiac microbundles | Hiba Kobeissi et.al. | 2405.11096 | null |
| 2024-05-16 | Physics-incorporated Graph Neural Network for Multivariate Time Series Imputation | Guojun Liang et.al. | 2405.10995 | link |
| 2024-05-15 | Dance Any Beat: Blending Beats with Visuals in Dance Video Generation | Xuanchen Wang et.al. | 2405.09266 | null |
| 2024-05-11 | DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation | Volodymyr Fedynyak et.al. | 2405.08715 | null |
| 2024-05-14 | EchoTracker: Advancing Myocardial Point Tracking in Echocardiography | Md Abulkalam Azad et.al. | 2405.08587 | null |
| 2024-05-15 | Vector-Symbolic Architecture for Event-Based Optical Flow | Hongzhi You et.al. | 2405.08300 | null |
| 2024-05-12 | NGD-SLAM: Towards Real-Time SLAM for Dynamic Environments without GPU | Yuhao Zhang et.al. | 2405.07392 | link |
| 2024-05-11 | Global Motion Understanding in Large-Scale Video Object Segmentation | Volodymyr Fedynyak et.al. | 2405.07031 | null |
| 2024-05-09 | A Survey on Backbones for Deep Video Action Recognition | Zixuan Tang et.al. | 2405.05584 | null |
| 2024-05-08 | Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence Detection | Shengyang Sun et.al. | 2405.05130 | link |
| 2024-05-07 | Visually Guided Swarm Motion Coordination via Insect-inspired Small Target Motion Reactions | Md Arif Billah et.al. | 2405.04591 | null |
| 2024-05-06 | Diffeomorphic Template Registration for Atmospheric Turbulence Mitigation | Dong Lao et.al. | 2405.03662 | null |
| 2024-05-06 | Hierarchical Space-Time Attention for Micro-Expression Recognition | Haihong Hao et.al. | 2405.03202 | link |
| 2024-05-05 | JOSENet: A Joint Stream Embedding Network for Violence Detection in Surveillance Videos | Pietro Nardelli et.al. | 2405.02961 | null |
| 2024-05-04 | UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model | Shuai Yuan et.al. | 2405.02608 | link |
| 2024-05-03 | DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos | Wen-Hsuan Chu et.al. | 2405.02280 | link |
| 2024-05-03 | Self-Supervised Learning for Real-World Super-Resolution from Dual and Multiple Zoomed Observations | Zhilu Zhang et.al. | 2405.02171 | link |
| 2024-04-30 | Semantically Consistent Video Inpainting with Conditional Diffusion Models | Dylan Green et.al. | 2405.00251 | null |
| 2024-04-29 | $ν$ -DBA: Neural Implicit Dense Bundle Adjustment Enables Image-Only Driving Scene Reconstruction | Yunxuan Mao et.al. | 2404.18439 | null |
| 2024-04-28 | Event-based Video Frame Interpolation with Edge Guided Motion Refinement | Yuhan Liu et.al. | 2404.18156 | null |
| 2024-04-26 | Camera Motion Estimation from RGB-D-Inertial Scene Flow | Samuel Cerezo et.al. | 2404.17251 | null |
| 2024-04-25 | Motor Focus: Ego-Motion Prediction with All-Pixel Matching | Hao Wang et.al. | 2404.17031 | link |
| 2024-04-26 | Deep-learning Optical Flow Outperforms PIV in Obtaining Velocity Fields from Active Nematics | Phu N. Tran et.al. | 2404.15497 | link |
| 2024-04-23 | Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization | Lahav Lipson et.al. | 2404.15263 | link |
| 2024-04-23 | FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent | Cameron Smith et.al. | 2404.15259 | link |
| 2024-04-22 | Structure-Aware Human Body Reshaping with Adaptive Affinity-Graph Network | Qiwen Deng et.al. | 2404.13983 | null |
| 2024-04-28 | Attack on Scene Flow using Point Clouds | Haniyeh Ehsani Oskouie et.al. | 2404.13621 | null |
| 2024-04-21 | Turb-Seg-Res: A Segment-then-Restore Pipeline for Dynamic Videos with Atmospheric Turbulence | Ripon Kumar Saha et.al. | 2404.13605 | null |
| 2024-04-19 | ConCLVD: Controllable Chinese Landscape Video Generation via Diffusion Model | Dingming Liu et.al. | 2404.12903 | null |
| 2024-04-19 | 3D Multi-frame Fusion for Video Stabilization | Zhan Peng et.al. | 2404.12887 | null |
| 2024-04-18 | Moving Object Segmentation: All You Need Is SAM (and Flow) | Junyu Xie et.al. | 2404.12389 | link |
| 2024-04-17 | TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation | Thomas Monninger et.al. | 2404.11803 | null |
| 2024-04-17 | Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection | Deepti Hegde et.al. | 2404.11737 | null |
| 2024-04-17 | Vision-based control for landing an aerial vehicle on a marine vessel | Haohua Dong et.al. | 2404.11336 | null |
| 2024-04-16 | CMU-Flownet: Exploring Point Cloud Scene Flow Estimation in Occluded Scenario | Jingze Chen et.al. | 2404.10571 | null |
| 2024-04-12 | SEVD: Synthetic Event-based Vision Dataset for Ego and Fixed Traffic Perception | Manideep Reddy Aliminati et.al. | 2404.10540 | null |
| 2024-04-16 | Improving Bracket Image Restoration and Enhancement with Flow-guided Alignment and Enhanced Feature Aggregation | Wenjie Lin et.al. | 2404.10358 | null |
| 2024-04-15 | Table tennis ball spin estimation with an event camera | Thomas Gossard et.al. | 2404.09870 | null |
| 2024-04-15 | FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features | Andre Rochow et.al. | 2404.09736 | null |
| 2024-04-13 | Rethinking Iterative Stereo Matching from Diffusion Bridge Model Perspective | Yuguang Shi et.al. | 2404.09051 | null |
| 2024-04-12 | Let It Flow: Simultaneous Optimization of 3D Flow and Object Clustering | Patrik Vacek et.al. | 2404.08363 | null |
| 2024-04-11 | SciFlow: Empowering Lightweight Optical Flow Models with Self-Cleaning Iterations | Jamie Menjay Lin et.al. | 2404.08135 | null |
| 2024-04-11 | Chaos in Motion: Unveiling Robustness in Remote Heart Rate Measurement through Brain-Inspired Skin Tracking | Jie Wang et.al. | 2404.07687 | null |
| 2024-04-07 | MemFlow: Optical Flow Estimation and Prediction with Memory | Qiaole Dong et.al. | 2404.04808 | null |
| 2024-04-06 | Salient Sparse Visual Odometry With Pose-Only Supervision | Siyu Chen et.al. | 2404.04677 | null |
| 2024-04-04 | A primal-dual adaptive finite element method for total variation based motion estimation | Martin Alkämper et.al. | 2404.03125 | null |
| 2024-04-01 | LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization | Akshita Gupta et.al. | 2404.01282 | null |
| 2024-04-01 | BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks | Zhiyuan Cheng et.al. | 2404.00924 | null |
| 2024-03-29 | SceneTracker: Long-term Scene Flow Estimation Network | Bo Wang et.al. | 2403.19924 | null |
| 2024-03-28 | FlowDepth: Decoupling Optical Flow for Self-Supervised Monocular Depth Estimation | Yiyang Sun et.al. | 2403.19294 | null |
| 2024-03-28 | Uncertainty-Aware Deep Video Compression with Ensembles | Wufei Ma et.al. | 2403.19158 | null |
| 2024-03-27 | The Correlations of Scene Complexity, Workload, Presence, and Cybersickness in a Task-Based VR Game | Mohammadamin Sanaei et.al. | 2403.19019 | null |
| 2024-03-27 | $\mathrm{F^2Depth}$ : Self-supervised Indoor Monocular Depth Estimation via Optical Flow Consistency and Feature Map Synthesis | Xiaotong Guo et.al. | 2403.18443 | null |
| 2024-03-27 | DVLO: Deep Visual-LiDAR Odometry with Local-to-Global Feature Fusion and Bi-Directional Structure Alignment | Jiuming Liu et.al. | 2403.18274 | null |
| 2024-03-26 | OCAI: Improving Optical Flow Estimation by Occlusion and Consistency Aware Interpolation | Jisoo Jeong et.al. | 2403.18092 | null |
| 2024-03-26 | Optical Flow Based Detection and Tracking of Moving Objects for Autonomous Vehicles | MReza Alipour Sormoli et.al. | 2403.17779 | null |
| 2024-03-25 | AI-Generated Video Detection via Spatio-Temporal Anomaly Learning | Jianfa Bai et.al. | 2403.16638 | null |
| 2024-03-24 | Emotion Recognition from the perspective of Activity Recognition | Savinay Nagendra et.al. | 2403.16263 | null |
| 2024-03-24 | Self-Supervised Multi-Frame Neural Scene Flow | Dongrui Liu et.al. | 2403.16116 | null |
| 2024-03-23 | DS-NeRV: Implicit Neural Video Representation with Decomposed Static and Dynamic Codes | Hao Yan et.al. | 2403.15679 | null |
| 2024-03-21 | CathFlow: Self-Supervised Segmentation of Catheters in Interventional Ultrasound Using Optical Flow and Transformers | Alex Ranne et.al. | 2403.14465 | null |
| 2024-03-20 | DBA-Fusion: Tightly Integrating Deep Dense Visual Bundle Adjustment with Multiple Sensors for Large-Scale Localization and Mapping | Yuxuan Zhou et.al. | 2403.13714 | link |
| 2024-03-22 | S2DM: Sector-Shaped Diffusion Models for Video Generation | Haoran Lang et.al. | 2403.13408 | null |
| 2024-03-19 | TAPTR: Tracking Any Point with Transformers as Detection | Hongyang Li et.al. | 2403.13042 | null |
| 2024-03-19 | GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation | Quankai Gao et.al. | 2403.12365 | null |
| 2024-03-18 | GenFlow: Generalizable Recurrent Flow for 6D Pose Refinement of Novel Objects | Sungphill Moon et.al. | 2403.11510 | null |
| 2024-03-18 | Motion-aware 3D Gaussian Splatting for Efficient Dynamic Scene Reconstruction | Zhiyang Guo et.al. | 2403.11447 | null |
| 2024-03-17 | Enhancing Bandwidth Efficiency for Video Motion Transfer Applications using Deep Learning Based Keypoint Prediction | Xue Bai et.al. | 2403.11337 | null |
| 2024-03-15 | NeuFlow: Real-time, High-accuracy Optical Flow Estimation on Robots Using Edge Devices | Zhiyong Zhang et.al. | 2403.10425 | link |
| 2024-03-15 | Exploring Optical Flow Inclusion into nnU-Net Framework for Surgical Instrument Segmentation | Marcos Fernández-Rodríguez et.al. | 2403.10216 | null |
| 2024-03-15 | Rethinking Low-quality Optical Flow in Unsupervised Surgical Instrument Segmentation | Peiran Wu et.al. | 2403.10039 | link |
| 2024-03-17 | Intention-driven Ego-to-Exo Video Generation | Hongchen Luo et.al. | 2403.09194 | null |
| 2024-03-13 | MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning | Jialv Zou et.al. | 2403.08760 | link |
| 2024-03-12 | Flow-Based Visual Stream Compression for Event Cameras | Daniel C. Stumpp et.al. | 2403.08086 | null |
| 2024-03-12 | Bring Event into RGB and LiDAR: Hierarchical Visual-Motion Fusion for Scene Flow | Hanyu Zhou et.al. | 2403.07432 | null |
| 2024-03-11 | LISO: Lidar-only Self-Supervised 3D Object Detection | Stefan Baur et.al. | 2403.07071 | null |
| 2024-03-11 | STARFlow: Spatial Temporal Feature Re-embedding with Attentive Learning for Real-world Scene Flow | Zhiyang Lu et.al. | 2403.07032 | link |
| 2024-03-11 | HDA-LVIO: A High-Precision LiDAR-Visual-Inertial Odometry in Urban Environments with Hybrid Data Association | Jian Shi et.al. | 2403.06590 | null |
| 2024-03-11 | Ada-Tracker: Soft Tissue Tracking via Inter-Frame and Adaptive-Template Matching | Jiaxin Guo et.al. | 2403.06479 | null |
| 2024-03-09 | Fast Kernel Scene Flow | Xueqian Li et.al. | 2403.05896 | link |
| 2024-03-09 | DO3D: Self-supervised Learning of Decomposed Object-aware 3D Motion and Depth from Monocular Videos | Xiuzhe Wu et.al. | 2403.05895 | null |
| 2024-03-08 | DiffSF: Diffusion Models for Scene Flow Estimation | Yushan Zhang et.al. | 2403.05327 | null |
| 2024-03-11 | LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map | Xinrui Wu et.al. | 2403.05002 | link |
| 2024-03-08 | PIPsUS: Self-Supervised Dense Point Tracking in Ultrasound | Wanwen Chen et.al. | 2403.04969 | null |
| 2024-03-07 | I Can’t Believe It’s Not Scene Flow! | Ishan Khatri et.al. | 2403.04739 | link |
| 2024-03-07 | Out of the Room: Generalizing Event-Based Dynamic Motion Segmentation for Complex Scenes | Stamatios Georgoulis et.al. | 2403.04562 | null |
| 2024-03-06 | HDRFlow: Real-Time HDR Video Reconstruction with Large Motions | Gangwei Xu et.al. | 2403.03447 | null |
| 2024-03-05 | Motion-Corrected Moving Average: Including Post-Hoc Temporal Information for Improved Video Segmentation | Robert Mendel et.al. | 2403.03120 | null |
| 2024-03-04 | Explicit Motion Handling and Interactive Prompting for Video Camouflaged Object Detection | Xin Zhang et.al. | 2403.01968 | null |
| 2024-03-01 | Trustworthy Self-Attention: Enabling the Network to Focus Only on the Most Relevant References | Yu Jing et.al. | 2403.00211 | null |
| 2024-02-29 | From Flies to Robots: Inverted Landing in Small Quadcopters with Dynamic Perching | Bryan Habas et.al. | 2403.00128 | null |
| 2024-02-29 | SeMoLi: What Moves Together Belongs Together | Jenny Seidenschwarz et.al. | 2402.19463 | null |
| 2024-02-28 | Digging Into Normal Incorporated Stereo Matching | Zihua Liu et.al. | 2402.18171 | link |
| 2024-03-01 | 3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling | Chaokang Jiang et.al. | 2402.18146 | link |
| 2024-02-27 | ICP-Flow: LiDAR Scene Flow Estimation with ICP | Yancong Lin et.al. | 2402.17351 | link |
| 2024-02-25 | LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding | Yuxuan Wang et.al. | 2402.16050 | link |
| 2024-02-18 | TDE-3: An improved prior for optical flow computation in spiking neural networks | Matthew Yedutenko et.al. | 2402.11662 | null |
| 2024-02-17 | Dense Matchers for Dense Tracking | Tomáš Jelínek et.al. | 2402.11287 | null |
| 2024-02-16 | Multi-Model 3D Registration: Finding Multiple Moving Objects in Cluttered Point Clouds | David Jin et.al. | 2402.10865 | null |
| 2024-02-14 | Moving Object Proposals with Deep Learned Optical Flow for Video Object Segmentation | Ge Shi et.al. | 2402.08882 | null |
| 2024-02-12 | A Flow-based Credibility Metric for Safety-critical Pedestrian Detection | Maria Lyssenko et.al. | 2402.07642 | null |
| 2024-02-09 | Image-based Deep Learning for the time-dependent prediction of fresh concrete properties | Max Meyer et.al. | 2402.06611 | null |
(<a href=../README.md>back to main</a>)