Table 1. Dataset statistics.
Dataset
Language pair
Training size
Test size
IWSLT
English-German
230k
1k
Europarl
English-French
2.0M
2k
News commentary
English-Czech
300k
1k
WMT
English-Finnish
5.0M
3k