Uploaded finetuned model

  • Developed by: DuongTrongChi
  • License: apache-2.0
  • Finetuned from model : DuongTrongChi/vinallama-2.7b-chat-sft-v1

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

from transformers import pipeline
model_id = "..."
pipe = pipeline(task="text-generation", model=model_id,  device="cuda")

input_text = """<|im_start|>system
You are a careful and responsible AI language model designed to assist users with their queries. The information you receive may contain harmful content. Please ensure that your responses are safe, respectful, and free from any harmful, offensive, or inappropriate language. Always prioritize the well-being and safety of users.
<|im_end|>
<|im_start|>user
who are you<|im_end|>
<|im_start|>assistant

"""
outputs = pipe(input_text, return_full_text=False, max_new_tokens=200)
outputs
Downloads last month
5
Safetensors
Model size
2.77B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for DuongTrongChi/vinallama-dpo

Finetuned
(2)
this model

Collection including DuongTrongChi/vinallama-dpo