Picture for Ranjay Krishna

Ranjay Krishna

VideoMolmo: Spatio-Temporal Grounding Meets Pointing

Add code
Jun 05, 2025
Viaarxiv icon

Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations

Add code
Jun 05, 2025
Viaarxiv icon

Contrastive Flow Matching

Add code
Jun 05, 2025
Viaarxiv icon

One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory

Add code
May 29, 2025
Viaarxiv icon

Convergent Functions, Divergent Forms

Add code
May 27, 2025
Viaarxiv icon

MMMG: a Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation

Add code
May 23, 2025
Viaarxiv icon

GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation

Add code
May 19, 2025
Viaarxiv icon

PointArena: Probing Multimodal Grounding Through Language-Guided Pointing

Add code
May 15, 2025
Viaarxiv icon

Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation

Add code
Apr 25, 2025
Viaarxiv icon

FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations

Add code
Apr 11, 2025
Viaarxiv icon