Wandb Run: https://wandb.ai/eleutherai/pythia-rlhf/runs/kj29wswk Eval Results: | Task |Version|Filter| Metric |Value | |Stderr| |--------------|-------|------|----------|-----:|---|-----:| |arc_challenge |Yaml |none |acc |0.2995|± |0.0134| | | |none |acc_norm |0.3251|± |0.0137| |arc_easy |Yaml |none |acc |0.6486|± |0.0098| | | |none |acc_norm |0.5673|± |0.0102| |lambada_openai|Yaml |none |perplexity|4.7801|± |0.1197| | | |none |acc |0.6412|± |0.0067| |logiqa |Yaml |none |acc |0.2120|± |0.0160| | | |none |acc_norm |0.2873|± |0.0177| |piqa |Yaml |none |acc |0.7524|± |0.0101| | | |none |acc_norm |0.7530|± |0.0101| |sciq |Yaml |none |acc |0.8820|± |0.0102| | | |none |acc_norm |0.8160|± |0.0123| |winogrande |Yaml |none |acc |0.6077|± |0.0137| |wsc |Yaml |none |acc |0.3654|± |0.0474|