WMT25: Slavic LLMs for MT & QA

The task focuses on developing and improving LLMs under limited data and compute resources for three Slavic languages: Ukrainian (uk), Upper Sorbian (hsb) and Lower Sorbian (dsb). It comprises two tasks that are evaluated jointly: Machine Translation (MT) and Multiple-Choice Question Answering (QA).

Official website: https://www2.statmt.org/wmt25/limited-resources-slavic-llm.html

In each leaderboard, the anonymous submission with the lowest submission number (the number after #) is the baseline model.
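The baseline rule above can be applied mechanically. The following is a minimal sketch, not part of the competition tooling: the function name `baseline_entry` is hypothetical, and it assumes each entry name ends in `#<number>` as in the tables on this page. The sample names are the two de-dsb entries listed further down.

```python
import re


def baseline_entry(names):
    """Return the entry whose '#<number>' is smallest (hypothetical helper)."""

    def submission_number(name):
        # Assumption: every leaderboard name contains '#<digits>'.
        match = re.search(r"#(\d+)", name)
        if match is None:
            raise ValueError(f"no submission number in {name!r}")
        return int(match.group(1))

    return min(names, key=submission_number)


# Entry names copied from the de-dsb leaderboard on this page.
de_dsb = ["Anonymous submission #135", "Anonymous submission #124"]
print(baseline_entry(de_dsb))  # Anonymous submission #124
```

Note that the baseline is identified by its submission number, not by its rank: in the de-dsb table it is ranked second.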


Leaderboard of WMT25: Slavic LLMs for MT & QA

wmtslavicllm2025-mt.cs-uk test set (cs-uk)

#  Name                       BLEU  chrF  Date
1  Anonymous submission #129  ---   ---   July 11, 2025, 7:21 p.m.
BLEU and chrF are sacreBLEU scores. Systems in bold face are your submissions. Only the top-10 submissions per language pair are displayed. Submission validation errors are denoted by a score of -1.0.

wmtslavicllm2025-mt.de-dsb test set (de-dsb)

#  Name                       BLEU  chrF  Date
1  Anonymous submission #135  58.2  76.4  July 16, 2025, 9:06 p.m.
2  Anonymous submission #124  1.7   13.7  July 11, 2025, 12:39 p.m.

wmtslavicllm2025-mt.de-hsb test set (de-hsb)

#  Name                       BLEU  chrF  Date
1  Anonymous submission #123  1.3   15.6  July 11, 2025, 12:38 p.m.

wmtslavicllm2025-mt.en-uk test set (en-uk)

#  Name                       BLEU  chrF  Date
1  Anonymous submission #133  0.0   0.4   July 15, 2025, 11:08 p.m.

wmtslavicllm2025-qa.dsb test set (dsb-dsb)

#  Name                       Accuracy  Date
1  Anonymous submission #121  45.9      July 11, 2025, 12:37 p.m.
2  Anonymous submission #134  33.2      July 16, 2025, 9:02 p.m.

wmtslavicllm2025-qa.hsb test set (hsb-hsb)

#  Name                       Accuracy  Date
1  Anonymous submission #125  42.9      July 11, 2025, 12:40 p.m.

wmtslavicllm2025-qa.uk test set (uk-uk)

#  Name                       Accuracy  Date
1  Anonymous submission #126  31.2      July 11, 2025, 12:41 p.m.