Picture for Xiaoyu Li

Xiaoyu Li

Proactive Guidance of Multi-Turn Conversation in Industrial Search

Add code
May 30, 2025
Viaarxiv icon

Sci-Fi: Symmetric Constraint for Frame Inbetweening

Add code
May 27, 2025
Viaarxiv icon

ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations

Add code
May 20, 2025
Viaarxiv icon

Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese

Add code
May 16, 2025
Viaarxiv icon

GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors

Add code
Apr 01, 2025
Viaarxiv icon

Exploring the Limits of KV Cache Compression in Visual Autoregressive Transformers

Add code
Mar 19, 2025
Viaarxiv icon

SMILE: a Scale-aware Multiple Instance Learning Method for Multicenter STAS Lung Cancer Histopathology Diagnosis

Add code
Mar 18, 2025
Viaarxiv icon

BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

Add code
Mar 17, 2025
Viaarxiv icon

Limits of KV Cache Compression for Tensor Attention based Autoregressive Transformers

Add code
Mar 14, 2025
Viaarxiv icon

Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows

Add code
Mar 12, 2025
Viaarxiv icon