Table 2. The dataset for train and test.

The number of sentences The number of words
Train data 53,832 602,315
Test data 5,817 57,738