Picture for Li Yuan

Li Yuan

Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning

Add code
Jun 06, 2025
Viaarxiv icon

LeanPO: Lean Preference Optimization for Likelihood Alignment in Video-LLMs

Add code
Jun 05, 2025
Viaarxiv icon

Multi-objective Aligned Bidword Generation Model for E-commerce Search Advertising

Add code
Jun 04, 2025
Viaarxiv icon

OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

Add code
May 28, 2025
Viaarxiv icon

Beyond Chemical QA: Evaluating LLM's Chemical Reasoning with Modular Chemical Operations

Add code
May 27, 2025
Viaarxiv icon

Sci-Fi: Symmetric Constraint for Frame Inbetweening

Add code
May 27, 2025
Viaarxiv icon

ImgEdit: A Unified Image Editing Dataset and Benchmark

Add code
May 26, 2025
Viaarxiv icon

Rethinking Text-based Protein Understanding: Retrieval or LLM?

Add code
May 26, 2025
Viaarxiv icon

GS2E: Gaussian Splatting is an Effective Data Generator for Event Stream Generation

Add code
May 21, 2025
Viaarxiv icon

CAD: A General Multimodal Framework for Video Deepfake Detection via Cross-Modal Alignment and Distillation

Add code
May 21, 2025
Viaarxiv icon