Checkpoints: 9b vs 9b-it storage
#10, opened by Hemanth-thunder
gemma-2-9b is stored in float32 and the other one (gemma-2-9b-it) in float16, hence the extra shards, I believe.
Hello @rashmi, I was wondering the same thing. Do models need separate checkpoints for float32 and float16? No. It seems the stored weights can be converted to a different dtype (precision) on the fly.
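To make the shard reasoning concrete, here is a rough back-of-the-envelope sketch. The parameter count and the per-shard size limit are assumed round numbers for illustration, not values read from either repository, so the real shard counts may differ.

```python
import math

# Assumed values for illustration only (not taken from the model repos):
NUM_PARAMS = 9_000_000_000     # rough parameter count for a "9b" model
MAX_SHARD_BYTES = 5 * 10**9    # hypothetical per-shard size limit

# Storage cost of one parameter in each precision.
BYTES_PER_PARAM = {"float32": 4, "float16": 2}


def checkpoint_bytes(num_params, dtype):
    """Approximate checkpoint size: one value per parameter, no metadata."""
    return num_params * BYTES_PER_PARAM[dtype]


def shard_count(num_params, dtype, max_shard_bytes=MAX_SHARD_BYTES):
    """Number of files needed if no shard may exceed max_shard_bytes."""
    return math.ceil(checkpoint_bytes(num_params, dtype) / max_shard_bytes)


for dtype in BYTES_PER_PARAM:
    size_gb = checkpoint_bytes(NUM_PARAMS, dtype) / 10**9
    print(f"{dtype}: ~{size_gb:.0f} GB in {shard_count(NUM_PARAMS, dtype)} shards")
```

Under these assumptions the float32 checkpoint is twice the size of the float16 one and so needs about twice as many shards. As far as I know, the on-the-fly conversion in transformers is requested at load time, for example through the `torch_dtype` argument of `from_pretrained`, so a single stored checkpoint can serve both precisions.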