Table 1. Dataset statistics.

Dataset          Language pair    Training size  Test size
IWSLT            English-German   230k           1k
Europarl         English-French   2.0M           2k
News commentary  English-Czech    300k           1k
WMT              English-Finnish  5.0M           3k