Dooroo2025_v1.0 / README.md
kingkim's picture
Update README.md
bf3916e verified
metadata
license: apache-2.0
language:
  - ko
datasets:
  - kingkim/yeosu_tour
  - kingkim/yeosu_island
base_model: unsloth/Qwen3-4B-Instruct-2507
tags:
  - unsloth
  - qwen3
  - lora
  - text-generation
  - yeosu
  - korean

Dooroo2025_v1.0: ์—ฌ์ˆ˜ ๊ด€๊ด‘ ํŠนํ™” ์ฑ—๋ด‡ ๋ชจ๋ธ

์ด ๋ชจ๋ธ์€ unsloth/Qwen3-4B-Instruct-2507 ๋ชจ๋ธ์„ ๊ธฐ๋ฐ˜์œผ๋กœ, ๋Œ€ํ•œ๋ฏผ๊ตญ ์—ฌ์ˆ˜์‹œ์˜ ๊ด€๊ด‘ ์ •๋ณด์™€ ์„ฌ ์ •๋ณด์— ๋Œ€ํ•ด ํŠนํ™”๋œ ์ง€์‹์„ ๊ฐ–๋„๋ก ํŒŒ์ธํŠœ๋‹๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

Unsloth ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ LoRA(Low-Rank Adaptation) ๊ธฐ๋ฒ•์œผ๋กœ ํšจ์œจ์ ์ธ ํ•™์Šต์„ ์ง„ํ–‰ํ–ˆ์œผ๋ฉฐ, ์—ฌ์ˆ˜ ์—ฌํ–‰์— ๊ด€ํ•œ ์งˆ๋ฌธ์— ์ž์—ฐ์Šค๋Ÿฝ๊ณ  ์ •ํ™•ํ•œ ๋‹ต๋ณ€์„ ์ƒ์„ฑํ•˜๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ› ๏ธ ํ•™์Šต ๊ณผ์ • (Training Procedure)

1. ๊ธฐ๋ฐ˜ ๋ชจ๋ธ (Base Model)

  * Model: unsloth/Qwen3-4B-Instruct-2507   * Library: Unsloth๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋ฉ”๋ชจ๋ฆฌ ์‚ฌ์šฉ๋Ÿ‰์„ ์ตœ์ ํ™”ํ•˜๊ณ  ํ•™์Šต ์†๋„๋ฅผ ํฌ๊ฒŒ ํ–ฅ์ƒ์‹œ์ผฐ์Šต๋‹ˆ๋‹ค.

2. ๋ฐ์ดํ„ฐ์…‹ (Dataset)

ํ•™์Šต์—๋Š” ์•„๋ž˜ ๋‘ ๊ฐ€์ง€ ๋ฐ์ดํ„ฐ์…‹์„ ๋ณ‘ํ•ฉํ•˜์—ฌ ์‚ฌ์šฉํ–ˆ์Šต๋‹ˆ๋‹ค. ๊ฐ ๋ฐ์ดํ„ฐ์…‹์˜ train๊ณผ test ์Šคํ”Œ๋ฆฟ์„ ํ•ฉ์นœ ํ›„, train ๋ฐ์ดํ„ฐ์…‹์€ ๋ฌด์ž‘์œ„๋กœ ์„ž์–ด ๋ชจ๋ธ์ด ํŠน์ • ์ฃผ์ œ์— ํŽธํ–ฅ๋˜์ง€ ์•Š๋„๋ก ํ–ˆ์Šต๋‹ˆ๋‹ค.

  * kingkim/yeosu_tour: ์—ฌ์ˆ˜ ๊ด€๊ด‘ ๋ช…์†Œ ๊ด€๋ จ ๋ฐ์ดํ„ฐ   * kingkim/yeosu_island: ์—ฌ์ˆ˜ ์„ฌ ๊ด€๋ จ ๋ฐ์ดํ„ฐ

3. ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ (Hyperparameters)

๋ชจ๋ธ ํ•™์Šต์— ์‚ฌ์šฉ๋œ ์ฃผ์š” ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค.

LoRA ์„ค์ •

ํŒŒ๋ผ๋ฏธํ„ฐ ๊ฐ’ ์„ค๋ช…
r 16 LoRA ํ–‰๋ ฌ์˜ ๋žญํฌ (rank)
lora_alpha 32 LoRA ์Šค์ผ€์ผ๋ง ์ธ์ž
lora_dropout 0.05 LoRA ๋ ˆ์ด์–ด์˜ ๋“œ๋กญ์•„์›ƒ ๋น„์œจ
target_modules q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj LoRA๋ฅผ ์ ์šฉํ•  ๋Œ€์ƒ ๋ชจ๋“ˆ

Training Arguments

ํŒŒ๋ผ๋ฏธํ„ฐ ๊ฐ’ ์„ค๋ช…
num_train_epochs 30 ์ด ํ•™์Šต ์—ํฌํฌ ์ˆ˜
learning_rate 4e-6 ํ•™์Šต๋ฅ 
per_device_train_batch_size 32 ๋””๋ฐ”์ด์Šค๋‹น ํ•™์Šต ๋ฐฐ์น˜ ํฌ๊ธฐ
gradient_accumulation_steps 2 ๊ทธ๋ž˜๋””์–ธํŠธ ๋ˆ„์  ์Šคํ…
optimizer adamw_8bit 8๋น„ํŠธ AdamW ์˜ตํ‹ฐ๋งˆ์ด์ €
lr_scheduler_type linear ์„ ํ˜• ํ•™์Šต๋ฅ  ์Šค์ผ€์ค„๋Ÿฌ

๐Ÿ“Š ํ‰๊ฐ€ ๊ฒฐ๊ณผ (Evaluation Results)

ํ•™์Šต ์†์‹ค (Training Loss)

eval_dataset์— ๋Œ€ํ•œ ์ตœ์ข… ํ‰๊ฐ€ ๊ฒฐ๊ณผ์ž…๋‹ˆ๋‹ค. Loss๋Š” ๋ชจ๋ธ์ด ์˜ˆ์ธกํ•œ ๊ฐ’๊ณผ ์‹ค์ œ ๊ฐ’์˜ ์ฐจ์ด๋ฅผ ๋‚˜ํƒ€๋‚ด๋ฉฐ, ๋‚ฎ์„์ˆ˜๋ก ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ์ด ์ข‹์Œ์„ ์˜๋ฏธํ•ฉ๋‹ˆ๋‹ค.

๋ฉ”ํŠธ๋ฆญ (Metric) ๊ฐ’ (Value)
eval_loss 1.2925
eval_runtime 30.8675 ์ดˆ
eval_samples_per_second 68.556
eval_steps_per_second 8.585
epoch 30.0

์™ธ๋ถ€ ์ „๋ฌธ๊ธฐ๊ด€ ํ‰๊ฐ€

์™ธ๋ถ€ ์ „๋ฌธ๊ธฐ๊ด€์˜ LLM ํ’ˆ์งˆ ํ‰๊ฐ€ ๊ฒฐ๊ณผ, ์ด์  4.5/5์  ์ด์ƒ์„ ํš๋“ํ•˜์—ฌ ๋ชฉํ‘œ์น˜๋ฅผ ๋‹ฌ์„ฑํ–ˆ์Šต๋‹ˆ๋‹ค. ์ด๋Š” ๊ฒฝ์Ÿ ๋ชจ๋ธ์ธ GPT-3.5์˜ ํ‰๊ท  ์ ์ˆ˜(4.43)๋ฅผ ์ƒํšŒํ•˜๋Š” ์ˆ˜์ค€์ด๋ฉฐ, ๊ณ„ํš์„œ์— ์„ค์ •๋œ ๊ฐœ๋ณ„ ๋ชฉํ‘œ(์œ ์ฐฝ์„ฑ, ์ผ๊ด€์„ฑ, ์ •ํ™•์„ฑ, ์™„๊ฒฐ์„ฑ)์˜ ํ‰๊ท  ๋ชฉํ‘œ์น˜์ธ 4.425์ ์„ ๋„˜๋Š” ๊ฒฐ๊ณผ์ž…๋‹ˆ๋‹ค.

์ง€ํ‘œ ๊ณ„ํš ์‹ค์  ๋‹ฌ์„ฑ ์—ฌ๋ถ€
LLM ํ’ˆ์งˆ(์ด์ ) ํ‰๊ท  4.425 โ‰ฅ 4.5/5 (์ „๋ฌธ๊ธฐ๊ด€ ํ‰๊ฐ€) ๋‹ฌ์„ฑ (GPT-3.5 ํ‰๊ท  4.43 ์ƒํšŒ)