Text Generation
Transformers
Safetensors
PyTorch
English
llama
facebook
meta
llama-3
100K+ context length
LoRA
Theta Scaling
question answering
Norm & Embed Trained
Big Patents
instruct
question answering
merged
chat
8B
research
science
RoPE
long context
conversational
text-generation-inference
Inference Endpoints
{ | |
"bos_token_id": 128000, | |
"do_sample": true, | |
"eos_token_id": [ | |
128001, | |
128009 | |
], | |
"max_length": 4096, | |
"temperature": 0.6, | |
"top_p": 0.9, | |
"transformers_version": "4.40.2" | |
} | |