Xiaojuan Qi

UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation

May 30, 2025
Via arXiv

MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO

May 19, 2025
Via arXiv

DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation

Mar 19, 2025
Via arXiv

QDM: Quadtree-Based Region-Adaptive Sparse Diffusion Models for Efficient Image Super-Resolution

Mar 15, 2025
Via arXiv

MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction

Mar 13, 2025
Via arXiv

ObjectMover: Generative Object Movement with Video Prior

Mar 11, 2025
Via arXiv

"Principal Components" Enable A New Language of Images

Mar 11, 2025
Via arXiv

Generalized Kullback-Leibler Divergence Loss

Mar 11, 2025
Via arXiv

A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning

Mar 10, 2025
Via arXiv

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Feb 27, 2025
Via arXiv