[P1] difficulty in reproducing mrpc results #178

@NitayGitHub

Hi, I am running the original task_steer.py with the arguments below, but I am struggling to reproduce the reported results on the GLUE MRPC dataset. Are these the correct arguments to use?

epochs: int = 40
lr: float = 0.0003
position: str = "f3"
rank: int = 1
dropout: float = 0.05
weight_decay: float = 0.0
warmup_ratio: float = 0.0
reft_intervention: str = "ConditionedSourceLowRankRotatedSpaceIntervention"
layers: str = "all"
batch_size: int = 32
eval_batch_size: int = 32
accumulation_steps: int = 1
max_grad_norm: float = 1.0
logging_steps: int = 20
max_length: int = 256
seeds: list[int] = [42, 43, 44, 45, 46]
task: str = "glue"
train_dataset: str = "mrpc"
model_name: str = "FacebookAI/roberta-base"
test_split: str = "test"
metric: str = "accuracy"
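
For anyone who wants to rerun this, the arguments above can be captured in a self-contained dataclass so the exact configuration is easy to share and diff. This is only a sketch: the class name `MrpcRunConfig` is hypothetical and not taken from the repository, and it does not show how task_steer.py actually parses its arguments. The `seeds` default uses `field(default_factory=...)` because `dataclasses` rejects a bare list as a default value.

```python
from dataclasses import asdict, dataclass, field


@dataclass
class MrpcRunConfig:  # hypothetical name, not from the repository
    """Values copied verbatim from the argument list in this issue."""

    epochs: int = 40
    lr: float = 0.0003
    position: str = "f3"
    rank: int = 1
    dropout: float = 0.05
    weight_decay: float = 0.0
    warmup_ratio: float = 0.0
    reft_intervention: str = "ConditionedSourceLowRankRotatedSpaceIntervention"
    layers: str = "all"
    batch_size: int = 32
    eval_batch_size: int = 32
    accumulation_steps: int = 1
    max_grad_norm: float = 1.0
    logging_steps: int = 20
    max_length: int = 256
    # A mutable default must go through default_factory in a dataclass.
    seeds: list[int] = field(default_factory=lambda: [42, 43, 44, 45, 46])
    task: str = "glue"
    train_dataset: str = "mrpc"
    model_name: str = "FacebookAI/roberta-base"
    test_split: str = "test"
    metric: str = "accuracy"


config = MrpcRunConfig()

# Dump the full configuration, one field per line, for logging or comparison.
for name, value in asdict(config).items():
    print(f"{name} = {value!r}")
```

Printing the configuration at the start of each run makes it easier to compare against someone else's settings when results differ.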
