Picture for Beyza Ermis

Beyza Ermis

The State of Multilingual LLM Safety Research: From Measuring the Language Gap to Mitigating It

Add code
May 30, 2025
Viaarxiv icon

The Multilingual Divide and Its Impact on Global AI Safety

Add code
May 27, 2025
Viaarxiv icon

How to Improve the Robustness of Closed-Source Models on NLI

Add code
May 26, 2025
Viaarxiv icon

Aya Vision: Advancing the Frontier of Multilingual Multimodality

Add code
May 13, 2025
Viaarxiv icon

The Leaderboard Illusion

Add code
Apr 29, 2025
Viaarxiv icon

Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation

Add code
Apr 09, 2025
Viaarxiv icon

Command A: An Enterprise-Ready Large Language Model

Add code
Apr 01, 2025
Viaarxiv icon

Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier

Add code
Dec 05, 2024
Figure 1 for Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier
Figure 2 for Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier
Figure 3 for Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier
Figure 4 for Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier
Viaarxiv icon

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Add code
Dec 04, 2024
Figure 1 for Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
Figure 2 for Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
Figure 3 for Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
Figure 4 for Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
Viaarxiv icon

Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning

Add code
Oct 14, 2024
Figure 1 for Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
Figure 2 for Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
Figure 3 for Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
Figure 4 for Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
Viaarxiv icon