Model repeating prompt and not learning eos token
#129
by
Essacheez
- opened
I am trying to fine tune mistral but the model is repeating the input prompt and is not learning the eos token.
I tried changing the tokenizer.pad_token from eos to unk_token , the sides from right to left but its not working. I already added the bos and eos token to my dataset.
Here is an example from my dataset
<s>[INST]Translate the following text from French to English: D'où venons-nous?[/INST]Where did we come from?</s>
Currently these are by settings
tokenizer.add_eos_token = False
tokenizer.add_bos_token = False
tokenizer.pad_token = tokenizer.unk_token
tokenizer.padding_side = "left"
This comment has been hidden
Essacheez
changed discussion status to
closed
Essacheez
changed discussion status to
open