[P1] difficulty in reproducing mrpc results

Hi, I am running the original task_steer.py with the arguments below, but I am struggling to reproduce the reported results on the GLUE MRPC dataset. Are these the correct arguments to use?

  epochs: int = 40
  lr: float = 0.0003
  position: str = "f3"
  rank: int = 1
  dropout: float = 0.05
  weight_decay: float = 0.0
  warmup_ratio: float = 0.0
  reft_intervention: str = "ConditionedSourceLowRankRotatedSpaceIntervention"
  layers: str = "all"
  batch_size: int = 32
  eval_batch_size: int = 32
  accumulation_steps: int = 1
  max_grad_norm: float = 1.0
  logging_steps: int = 20
  max_length: int = 256
  seeds: list[int] = [42, 43, 44, 45, 46]
  task: str = "glue"
  train_dataset: str = "mrpc"
  model_name: str = "FacebookAI/roberta-base"
  test_split: str = "test"
  metric: str = "accuracy"
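
For anyone comparing runs, here is a minimal sketch that captures the arguments above in one place and dumps them as JSON, so two setups can be diffed. `RunConfig` and `effective_batch_size` are hypothetical names used only for illustration; they are not taken from task_steer.py, and how that script actually declares its arguments is not shown here. If the arguments are declared as dataclass fields, note that a bare list default for `seeds` would be rejected, so the sketch uses `default_factory`.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class RunConfig:
    """Hypothetical container mirroring the arguments listed above."""

    epochs: int = 40
    lr: float = 0.0003
    position: str = "f3"
    rank: int = 1
    dropout: float = 0.05
    weight_decay: float = 0.0
    warmup_ratio: float = 0.0
    reft_intervention: str = "ConditionedSourceLowRankRotatedSpaceIntervention"
    layers: str = "all"
    batch_size: int = 32
    eval_batch_size: int = 32
    accumulation_steps: int = 1
    max_grad_norm: float = 1.0
    logging_steps: int = 20
    max_length: int = 256
    # Dataclasses reject a mutable list as a default, so build it per instance.
    seeds: list[int] = field(default_factory=lambda: [42, 43, 44, 45, 46])
    task: str = "glue"
    train_dataset: str = "mrpc"
    model_name: str = "FacebookAI/roberta-base"
    test_split: str = "test"
    metric: str = "accuracy"

    @property
    def effective_batch_size(self) -> int:
        # Number of examples consumed per optimizer step.
        return self.batch_size * self.accumulation_steps


config = RunConfig()
# Sorted keys make the dump stable, so two configs can be compared line by line.
config_json = json.dumps(asdict(config), indent=2, sort_keys=True)
print(config_json)
```

Attaching a dump like this along with the per-seed accuracies would make it easier to spot which setting differs from the expected setup.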


[P1] difficulty in reproducing mrpc results #178
