-
Notifications
You must be signed in to change notification settings - Fork 130
Description
Hi, I am running on the original code task_steer.py file with the arguments below, but struggling to reproduce the same results on the GLUE mrpc dataset. Are these the correct arguments to use:
epochs: int = 40
lr: float = 0.0003
position: str = "f3"
rank: int = 1
dropout: float = 0.05
weight_decay: float = 0.0
warmup_ratio: float = 0.0
reft_intervention: str = "ConditionedSourceLowRankRotatedSpaceIntervention"
layers: str = "all"
batch_size: int = 32
eval_batch_size: int = 32
accumulation_steps: int = 1
max_grad_norm: float = 1.0
logging_steps: int = 20
max_length: int = 256
seeds: list[int] = [42, 43, 44, 45, 46]
task: str = "glue"
train_dataset: str = "mrpc"
model_name: str = "FacebookAI/roberta-base"
test_split: str = "test"
metric: str = "accuracy"