Table 2. Hyperparameters used for model training.
Hyperparameter             Value
Batch size                 32
Max sequence length        128
Epochs                     100
Learning rate              2×10⁻⁵
Dropout                    0.1
Optimizer                  AdamW
Early stopping patience    3 epochs
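
The settings in Table 2 could be captured in a small configuration object, as in the minimal sketch below. The field names and the `stopping_epoch` helper are hypothetical and only illustrate how the values interact. The sketch also assumes that early stopping monitors validation loss, which the table does not specify.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainConfig:
    # Values copied from Table 2; the field names are hypothetical.
    batch_size: int = 32
    max_seq_length: int = 128
    epochs: int = 100
    learning_rate: float = 2e-5
    dropout: float = 0.1
    optimizer: str = "AdamW"
    early_stopping_patience: int = 3


def stopping_epoch(val_losses, cfg):
    """Return the 1-based epoch at which training would stop.

    Assumption: early stopping tracks validation loss (lower is better).
    """
    best = float("inf")
    stale = 0  # consecutive epochs without improvement
    for epoch, loss in enumerate(val_losses[: cfg.epochs], start=1):
        if loss < best:
            best, stale = loss, 0
        else:
            stale += 1
            if stale >= cfg.early_stopping_patience:
                return epoch
    # No early stop: training ran through every available epoch.
    return min(len(val_losses), cfg.epochs)
```

Under this reading, the 100 epochs act as an upper bound: training ends earlier once three consecutive epochs pass without an improvement in the monitored metric.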