Linggowiktiks
/

Liyama-3B

Text Generation

text-generation-inference

Model card Files Files and versions

Liyama-3B / README.md

marcuscedricridia's picture

marcuscedricridia

Update README.md

87a72d6 verified 5 months ago

|

2.77 kB

	---
	base_model: unsloth/llama-3.2-3b-instruct-unsloth-bnb-4bit
	tags:
	- text-generation-inference
	- transformers
	- unsloth
	- llama
	- trl
	license: apache-2.0
	language:
	- en
	- tl
	datasets:
	- Linggowiktiks/AnoNa
	---

	# 🦙 Liyama-3B

	Liyama-3B is a fine-tuned version of Meta’s LLaMA-3B (3.2) model, built to understand and respond fluently in Tagalog. It was trained on the AnoNa dataset over 3 epochs, aiming for natural, context-aware instruction-following in Filipino.

	---

	## 🔤 Origin of the Name
	The name Liyama is a Tagalified version of llama, reflecting both its LLaMA base and its Tagalog-focused language capabilities. It mirrors how Filipino often adapts foreign terms into familiar, phonetic forms—like camera → kamera, lion → leon, and now, llama → liyama.

	---

	## 🧠 Training Data: The AnoNa Dataset

	Liyama-3B was trained solely on response completions from the AnoNa dataset — a self-instruct corpus generated using Gemini 1.5 and 2.0.

	Inspired by SimpleQnA, the dataset contains short, helpful instruction-response pairs. But AnoNa introduces several improvements:

	- ✅ Less English, More Tagalog prompts
	- ✅ Less IFEVAL-style formatting
	- ✅ No overuse of modifiers in instructions
	- ✅ Balanced task types to avoid dominant categories
	- ✅ Complex tasks favored (65% complex / 35% simple)
	- ✅ Reduced sycophancy and generic praise
	- ✅ Improved follow-up handling
	- ✅ AI self-intro appears only when relevant
	- ✅ Implicit chain-of-thought reasoning, not labeled
	- ✅ Extra task types added to increase variety

	This focus creates a model that's practical, straightforward, and tuned for realistic conversational use in Filipino, without excessive formatting or irrelevant disclaimers.

	---

	## 🗣️ Use Case

	Liyama-3B is ideal for:
	- Answering questions in Tagalog
	- Writing essays, reflections, and letters in Filipino
	- Following natural instructions, even when mixed with English
	- Chat-based tasks where fluency and tone matter
	- Educational or community apps centered around local language use

	---

	## 📦 Model Details

	\| Feature \| Value \|
	\|--------------------\|----------------------------\|
	\| Base Model \| LLaMA-3B v3.2 \|
	\| Fine-tuned Dataset \| AnoNa \|
	\| Epochs \| 3 \|
	\| Language Focus \| Tagalog (with some English)\|
	\| Prompt Format \| Responses only \|

	---

	Liyama-3B is part of a broader effort to create open, practical Filipino-language models for real use—not just benchmarks. Expect follow-ups tuned for multi-turn chat, reasoning, and creative tasks.