Table 2. Hyperparameters used for model training.

Hyperparameter Value
Batch size 32
Max sequence length 128
Epochs 100
Learning rate 2×10-5
Dropout 0.1
Optimizer AdamW
Early stopping patience 3 epochs