NeMo
Nemotron-4-340B-Instruct / model_weights / model.decoder.layers.self_attention.linear_qkv._extra_state
11 kB
okuchaiev
Add files using large-upload tool
a223205 verified