Upload model.safetensors.index.json

#2

Hi there!

While trying to load PORTULAN/gervasio-8b-portuguese-ptpt-decoder I ran into this error:

File ".../transformers/modeling_utils.py", line 1243, in _get_resolved_checkpoint_files
raise OSError(
OSError: PORTULAN/gervasio-8b-portuguese-ptpt-decoder does not appear to have a file named
pytorch_model.bin, model.safetensors, tf_model.h5, model.ckpt or flax_model.msgpack.

After a bit of digging I found the cause: the weights are split across four .safetensors shards,
but the repository doesn’t include the index file (model.safetensors.index.json) that maps each
tensor to the shard containing it. Without that file, HF Transformers has no way to locate the
sharded weights.
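
For anyone who would rather regenerate the index than trust mine, here is a minimal sketch of how such a file can be rebuilt from the shards using only the standard library. It relies on the safetensors layout (an 8-byte little-endian header length followed by a JSON header) and on the usual index layout (a `weight_map` plus `metadata.total_size`). The shard naming pattern `model-*.safetensors` and the helper names `read_safetensors_header`, `build_index`, and `write_index` are my own assumptions, not something taken from this repository.

```python
import json
import struct
from pathlib import Path


def read_safetensors_header(path):
    """Return the JSON header of a .safetensors file without loading tensors."""
    with open(path, "rb") as f:
        # The first 8 bytes hold the header length as a little-endian uint64.
        (header_len,) = struct.unpack("<Q", f.read(8))
        return json.loads(f.read(header_len))


def build_index(model_dir, pattern="model-*.safetensors"):
    """Map every tensor name to the shard file that stores it.

    `pattern` is an assumed naming convention; adjust it to the actual shards.
    """
    weight_map = {}
    total_size = 0
    for shard in sorted(Path(model_dir).glob(pattern)):
        header = read_safetensors_header(shard)
        for name, info in header.items():
            if name == "__metadata__":  # optional free-form metadata, not a tensor
                continue
            begin, end = info["data_offsets"]
            total_size += end - begin  # size of this tensor in bytes
            weight_map[name] = shard.name
    return {"metadata": {"total_size": total_size}, "weight_map": weight_map}


def write_index(model_dir, index_name="model.safetensors.index.json"):
    """Write the index next to the shards and return its path."""
    out = Path(model_dir) / index_name
    out.write_text(json.dumps(build_index(model_dir), indent=2) + "\n")
    return out
```

Calling `write_index("path/to/local/snapshot")` on a local copy of the repository should produce a file equivalent to the one in this PR, assuming the shards follow that naming pattern.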

This PR simply adds that index file. Feel free to regenerate it yourself or keep this one; either
way, the model loads correctly once the file is present. 🚀
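
As a quick sanity check before merging, something like the sketch below can confirm that every shard named in the index actually exists in a local copy of the repository. The helper name `missing_shards` is hypothetical, and the check only assumes the index contains a `weight_map` from tensor names to shard filenames.

```python
import json
from pathlib import Path


def missing_shards(model_dir, index_name="model.safetensors.index.json"):
    """Return shard filenames referenced by the index but absent from model_dir."""
    model_dir = Path(model_dir)
    index = json.loads((model_dir / index_name).read_text())
    # Many tensors share a shard, so deduplicate the filenames first.
    referenced = set(index["weight_map"].values())
    return sorted(name for name in referenced if not (model_dir / name).is_file())
```

An empty list means the index and the shards on disk agree on filenames; it does not verify the tensor contents themselves.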


Follow-up question

Do you plan to release a Brazilian Portuguese variant (something like
gervasio-8b-portuguese-ptbr-decoder), as you did for the 7B models?
Or does the 8B checkpoint already cover both pt-PT and pt-BR?
I noticed the 7B models were deprecated and was a bit unsure about the naming.
Thanks, and great work on these models!
