This is a raw, pretrained multilingual language model, supporting Arabic, Welsh, German, English, Spanish, French, Indonesian, Italian, Russian, and Swahili. The model is pretrained from scratch, which should be further finetuned for most use cases.

For more details: Multilingual Language Model Pretraining using Machine-translated Data

Contact
Email: [email protected]

Downloads last month: 17

Safetensors

Model size

1.35B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including britllm/TransWebLLM-cool

TransWebLLM

Collection

A collection of training corpus and models for "Multilingual Language Model Pretraining using Machine-translated Data". • 5 items • Updated Apr 21