
Jinyang Gao

Incentivizing Strong Reasoning from Weak Supervision

May 28, 2025

Incentivizing Reasoning from Weak Supervision

May 26, 2025

Evaluation Report on MCP Servers

Apr 15, 2025

RePO: ReLU-based Preference Optimization

Mar 10, 2025

ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models

Feb 17, 2025

XiYan-SQL: A Multi-Generator Ensemble Framework for Text-to-SQL

Nov 13, 2024

What is Wrong with Perplexity for Long-context Language Modeling?

Oct 31, 2024

MoMQ: Mixture-of-Experts Enhances Multi-Dialect Query Generation across Relational and Non-Relational Databases

Oct 24, 2024

$α$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs

Oct 14, 2024

Semantic Alignment for Multimodal Large Language Models

Aug 23, 2024