Introduction

This model vistagi/Mixtral-8x7b-v0.1-sft is trained with Ultrachat-200K dataset through supervised finetuning using Mixtral-8x7b-v0.1 as the baseline model. The training is done with bfloat16 precision using LoRA.

Details

Used Librarys

torch
deepspeed
pytorch lightning
transformers
peft

Downloads last month: 64

Safetensors

Model size

46.7B params

Tensor type

BF16

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for vistagi/Mixtral-8x7b-v0.1-dpo

Quantizations

2 models

vistagi
/

Mixtral-8x7b-v0.1-dpo

Introduction

Details

Model tree for vistagi/Mixtral-8x7b-v0.1-dpo

Dataset used to train vistagi/Mixtral-8x7b-v0.1-dpo