Picture for Ying Shan

Ying Shan

Aligning Latent Spaces with Flow Priors

Add code
Jun 05, 2025
Viaarxiv icon

Sci-Fi: Symmetric Constraint for Frame Inbetweening

Add code
May 27, 2025
Viaarxiv icon

Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

Add code
May 27, 2025
Viaarxiv icon

TensorAR: Refinement is All You Need in Autoregressive Image Generation

Add code
May 22, 2025
Viaarxiv icon

MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO

Add code
May 19, 2025
Viaarxiv icon

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Add code
May 08, 2025
Viaarxiv icon

FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios

Add code
May 06, 2025
Viaarxiv icon

Cobra: Efficient Line Art COlorization with BRoAder References

Add code
Apr 16, 2025
Viaarxiv icon

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Add code
Apr 01, 2025
Viaarxiv icon

GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors

Add code
Apr 01, 2025
Viaarxiv icon