Zejun Ma

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

May 29, 2025
Via arXiv

General-Reasoner: Advancing LLM Reasoning Across All Domains

May 21, 2025
Via arXiv

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale

Apr 22, 2025
Via arXiv

ACVUBench: Audio-Centric Video Understanding Benchmark

Mar 25, 2025
Via arXiv

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Mar 24, 2025
Via arXiv

Improving LLM Video Understanding with 16 Frames Per Second

Mar 18, 2025
Via arXiv

Video Instruction Tuning With Synthetic Data

Oct 03, 2024
Via arXiv

LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models

Jul 10, 2024
Via arXiv

video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models

Jun 22, 2024
Via arXiv

Can Large Language Models Understand Spatial Audio?

Jun 12, 2024
Via arXiv