Update README.md
The model was trained on:

- Model card summaries generated by Llama 3.3 70B
- Dataset card summaries generated by Llama 3.3 70B

Model context length: the model was trained with cards up to a length of 2048 tokens.

## Usage

Using the chat template when running the model in inference is recommended. Additionally, you should prepend either `<MODEL_CARD>` or `<DATASET_CARD>` to the start of the card you want to summarize. The training data used the body of the model or dataset card (i.e., the part after the YAML front matter), so you will likely get better results by passing only this part of the card.

I have so far found that a low temperature of `0.4` generates better results.
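The prompt convention above (strip the YAML front matter, then prepend the card-type tag) can be sketched with two small helpers. This is a minimal illustration, not part of the model's released code; the helper names are hypothetical, and the actual generation call is assumed to go through a standard chat-template inference setup with `temperature=0.4`.

```python
def strip_yaml_front_matter(card: str) -> str:
    """Return the card body after a leading `---` ... `---` YAML block.

    The training data used only this body, so summarizing just this
    part of the card should match the training distribution better.
    """
    if card.startswith("---"):
        end = card.find("\n---", 3)
        if end != -1:
            return card[end + 4:].lstrip("\n")
    return card


def build_prompt(card_body: str, kind: str = "model") -> str:
    """Prepend the tag the model was trained with to the card body."""
    tag = "<MODEL_CARD>" if kind == "model" else "<DATASET_CARD>"
    return tag + card_body


# Example: prepare a model card for summarization.
raw_card = "---\nlicense: mit\n---\n\n# My model\nSome details."
prompt = build_prompt(strip_yaml_front_matter(raw_card))
```

The resulting `prompt` string is what you would place in the user turn of the chat template before generating.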