Qingxiu Dong

Reinforcement Pre-Training

Jun 09, 2025
Via arXiv

Think Only When You Need with Large Hybrid-Reasoning Models

May 21, 2025
Via arXiv

Reward Reasoning Model

May 20, 2025
Via arXiv

RICo: Refined In-Context Contribution for Automatic Instruction-Tuning Data Selection

May 18, 2025
Via arXiv

SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning

May 16, 2025
Via arXiv

ICon: In-Context Contribution for Automatic Data Selection

May 08, 2025
Via arXiv

Scaling Laws of Synthetic Data for Language Models

Mar 26, 2025
Via arXiv

MPO: Boosting LLM Agents with Meta Plan Optimization

Mar 04, 2025
Via arXiv

How Far are LLMs from Being Our Digital Twins? A Benchmark for Persona-Based Behavior Chain Simulation

Feb 20, 2025
Via arXiv

Beyond Single Frames: Can LMMs Comprehend Temporal and Contextual Narratives in Image Sequences?

Feb 19, 2025
Via arXiv