mpasila/Finnish-Chatty-Tiny-V1-1-33B


This is a merge of mpasila/Finnish-Chatty-Tiny-V1-1-33B.

This model was trained on my tiny dataset, using the bigger 33B variant of the Viking model family.

This LoRA uses the 1000B checkpoint.

Trained for 1 epoch with 2048 token context, LoRA Rank 256, Alpha 512.
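To make the rank and alpha values above a bit more concrete, here is a minimal sketch of what they imply, assuming the standard LoRA scaling of alpha / rank (not the rank-stabilized variant). The layer width `EXAMPLE_HIDDEN_SIZE` is a hypothetical number chosen only for illustration and is not taken from the Viking-33B config; the helper function names are also made up for this example.

```python
# Hyperparameters stated in this model card.
LORA_RANK = 256
LORA_ALPHA = 512
MAX_SEQ_LEN = 2048
NUM_EPOCHS = 1


def lora_scaling(rank: int, alpha: int) -> float:
    """Factor applied to the low-rank update in standard LoRA: alpha / rank."""
    return alpha / rank


def lora_params_per_matrix(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters for one adapted weight matrix.

    LoRA adds A with shape (rank, d_in) and B with shape (d_out, rank).
    """
    return rank * (d_in + d_out)


# Hypothetical square projection width, for illustration only.
# This is an assumption, not the real Viking-33B hidden size.
EXAMPLE_HIDDEN_SIZE = 4096

scaling = lora_scaling(LORA_RANK, LORA_ALPHA)
adapter_params = lora_params_per_matrix(
    EXAMPLE_HIDDEN_SIZE, EXAMPLE_HIDDEN_SIZE, LORA_RANK
)
full_params = EXAMPLE_HIDDEN_SIZE * EXAMPLE_HIDDEN_SIZE

print(f"scaling factor: {scaling}")                        # 2.0
print(f"adapter params per matrix: {adapter_params}")      # 2097152
print(f"fraction of full matrix: {adapter_params / full_params}")  # 0.125
```

With rank 256 and alpha 512 the update is scaled by 2.0. At the illustrative width above, each adapted matrix trains 12.5% as many parameters as the full matrix would; the real fraction depends on the actual layer shapes.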

As a proof of concept, it seems to work fairly well, though I should generate the rest of the dataset, which should hopefully make it work a lot better.

Uploaded model

Developed by: mpasila
License: apache-2.0
Finetuned from model: LumiOpen/Viking-33B

This Llama model was trained 2x faster with Unsloth and Hugging Face's TRL library.


Format: Safetensors
Model size: 33.1B params
Tensor type: BF16

