Adding `safetensors` variant of this model (#4) e03682d verified pszemraj SFconvertbot commited on 5 days ago
add new checkpoint trained for a hundred steps with smaller max grad norm and weight decay 7a20e92 pszemraj commited on Jun 9, 2022
Change BOS token from 0 to 2 as BOS token is equal to EOS for OPT. See: https://github.com/huggingface/transformers/issues/17431 (#2) 83f1019 pszemraj patrickvonplaten commited on May 26, 2022