A Pruned Mistral model

This model is a pruned version of Mistral-7B (mistralai/Mistral-7B-v0.1), re-aligned using the Zephyr recipe.

Details

Training has two stages: supervised fine-tuning (SFT) followed by direct preference optimization (DPO).
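
As a rough illustration of the DPO stage, the loss for a single preference pair can be written in a few lines. This is a minimal sketch of the standard DPO objective, not the training code used for this model; the function name `dpo_loss`, the example log-probabilities, and `beta=0.1` are illustrative assumptions.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) pair of summed sequence log-probs.

    beta is an illustrative value, not the one used to train this model.
    """
    # How much more the policy prefers each response than the frozen
    # reference model (the SFT checkpoint) does, scaled by beta.
    chosen_reward = beta * (policy_chosen_logp - ref_chosen_logp)
    rejected_reward = beta * (policy_rejected_logp - ref_rejected_logp)
    margin = chosen_reward - rejected_reward
    # -log(sigmoid(margin)), written in a numerically stable form.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Policy identical to the reference: margin is 0, so the loss is log(2).
print(dpo_loss(-10.0, -12.0, -10.0, -12.0))
# Policy has shifted probability toward the chosen response: lower loss.
print(dpo_loss(-8.0, -14.0, -10.0, -12.0))
```

The loss starts at log(2) when the policy equals the reference and decreases as the policy raises the chosen response relative to the rejected one.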
The initial model is built by selecting a subset of the layers of the Mistral model to make a smaller model.
The code can be found here: github.com/tcapelle/shear
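
The layer-selection idea can be sketched without loading any weights. The example below is an illustration only, not the code from the repository above: the helper names and the evenly spaced selection rule are hypothetical, and the dimensions are those of the public Mistral-7B-v0.1 configuration. Under those assumptions, keeping 12 of the 32 decoder layers gives roughly 2.88B parameters, which is consistent with the size reported for this model. Which layers were actually kept is not stated here.

```python
# Dimensions of the public Mistral-7B-v0.1 config (assumed here).
HIDDEN = 4096
INTERMEDIATE = 14336
NUM_HEADS = 32
NUM_KV_HEADS = 8
VOCAB = 32000
NUM_LAYERS = 32


def params_per_layer():
    """Parameter count of one decoder layer (no biases)."""
    head_dim = HIDDEN // NUM_HEADS
    kv_dim = head_dim * NUM_KV_HEADS
    attention = 2 * HIDDEN * HIDDEN + 2 * HIDDEN * kv_dim  # q, o and k, v
    mlp = 3 * HIDDEN * INTERMEDIATE                        # gate, up, down
    norms = 2 * HIDDEN                                     # two RMSNorm weights
    return attention + mlp + norms


def total_params(num_layers):
    """Parameter count of the whole model with `num_layers` decoder layers."""
    embeddings = 2 * VOCAB * HIDDEN  # input embeddings + untied lm_head
    final_norm = HIDDEN
    return num_layers * params_per_layer() + embeddings + final_norm


def evenly_spaced_layers(num_layers, keep):
    """Hypothetical rule: spread `keep` indices over the stack,
    always including the first and the last layer."""
    if keep == 1:
        return [0]
    step = (num_layers - 1) / (keep - 1)
    return [round(i * step) for i in range(keep)]


print(f"{total_params(NUM_LAYERS) / 1e9:.2f}B")  # full model
print(f"{total_params(12) / 1e9:.2f}B")          # 12 layers kept
print(evenly_spaced_layers(NUM_LAYERS, 12))
```

In a Hugging Face Mistral model the decoder blocks live in `model.model.layers`, so a selection like this could be applied by rebuilding that list from the chosen indices and updating `config.num_hidden_layers` to match.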

W&B workspace

https://wandb.ai/llm_surgery/shearllama/

Model size: 2.88B params (Safetensors)
Tensor type: BF16

Model tree for wandb/pruned_mistral

Base model: mistralai/Mistral-7B-v0.1
This model: wandb/pruned_mistral (fine-tuned from the base model)
