You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Dooroo2025_v1.0: ์—ฌ์ˆ˜ ๊ด€๊ด‘ ํŠนํ™” ์ฑ—๋ด‡ ๋ชจ๋ธ

์ด ๋ชจ๋ธ์€ unsloth/Qwen3-4B-Instruct-2507 ๋ชจ๋ธ์„ ๊ธฐ๋ฐ˜์œผ๋กœ, ๋Œ€ํ•œ๋ฏผ๊ตญ ์—ฌ์ˆ˜์‹œ์˜ ๊ด€๊ด‘ ์ •๋ณด์™€ ์„ฌ ์ •๋ณด์— ๋Œ€ํ•ด ํŠนํ™”๋œ ์ง€์‹์„ ๊ฐ–๋„๋ก ํŒŒ์ธํŠœ๋‹๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

Unsloth ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ LoRA(Low-Rank Adaptation) ๊ธฐ๋ฒ•์œผ๋กœ ํšจ์œจ์ ์ธ ํ•™์Šต์„ ์ง„ํ–‰ํ–ˆ์œผ๋ฉฐ, ์—ฌ์ˆ˜ ์—ฌํ–‰์— ๊ด€ํ•œ ์งˆ๋ฌธ์— ์ž์—ฐ์Šค๋Ÿฝ๊ณ  ์ •ํ™•ํ•œ ๋‹ต๋ณ€์„ ์ƒ์„ฑํ•˜๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ› ๏ธ ํ•™์Šต ๊ณผ์ • (Training Procedure)

1. ๊ธฐ๋ฐ˜ ๋ชจ๋ธ (Base Model)

  * Model: unsloth/Qwen3-4B-Instruct-2507   * Library: Unsloth๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋ฉ”๋ชจ๋ฆฌ ์‚ฌ์šฉ๋Ÿ‰์„ ์ตœ์ ํ™”ํ•˜๊ณ  ํ•™์Šต ์†๋„๋ฅผ ํฌ๊ฒŒ ํ–ฅ์ƒ์‹œ์ผฐ์Šต๋‹ˆ๋‹ค.

2. ๋ฐ์ดํ„ฐ์…‹ (Dataset)

ํ•™์Šต์—๋Š” ์•„๋ž˜ ๋‘ ๊ฐ€์ง€ ๋ฐ์ดํ„ฐ์…‹์„ ๋ณ‘ํ•ฉํ•˜์—ฌ ์‚ฌ์šฉํ–ˆ์Šต๋‹ˆ๋‹ค. ๊ฐ ๋ฐ์ดํ„ฐ์…‹์˜ train๊ณผ test ์Šคํ”Œ๋ฆฟ์„ ํ•ฉ์นœ ํ›„, train ๋ฐ์ดํ„ฐ์…‹์€ ๋ฌด์ž‘์œ„๋กœ ์„ž์–ด ๋ชจ๋ธ์ด ํŠน์ • ์ฃผ์ œ์— ํŽธํ–ฅ๋˜์ง€ ์•Š๋„๋ก ํ–ˆ์Šต๋‹ˆ๋‹ค.

  * kingkim/yeosu_tour: ์—ฌ์ˆ˜ ๊ด€๊ด‘ ๋ช…์†Œ ๊ด€๋ จ ๋ฐ์ดํ„ฐ   * kingkim/yeosu_island: ์—ฌ์ˆ˜ ์„ฌ ๊ด€๋ จ ๋ฐ์ดํ„ฐ

3. ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ (Hyperparameters)

๋ชจ๋ธ ํ•™์Šต์— ์‚ฌ์šฉ๋œ ์ฃผ์š” ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค.

LoRA ์„ค์ •

ํŒŒ๋ผ๋ฏธํ„ฐ ๊ฐ’ ์„ค๋ช…
r 16 LoRA ํ–‰๋ ฌ์˜ ๋žญํฌ (rank)
lora_alpha 32 LoRA ์Šค์ผ€์ผ๋ง ์ธ์ž
lora_dropout 0.05 LoRA ๋ ˆ์ด์–ด์˜ ๋“œ๋กญ์•„์›ƒ ๋น„์œจ
target_modules q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj LoRA๋ฅผ ์ ์šฉํ•  ๋Œ€์ƒ ๋ชจ๋“ˆ

Training Arguments

ํŒŒ๋ผ๋ฏธํ„ฐ ๊ฐ’ ์„ค๋ช…
num_train_epochs 30 ์ด ํ•™์Šต ์—ํฌํฌ ์ˆ˜
learning_rate 4e-6 ํ•™์Šต๋ฅ 
per_device_train_batch_size 32 ๋””๋ฐ”์ด์Šค๋‹น ํ•™์Šต ๋ฐฐ์น˜ ํฌ๊ธฐ
gradient_accumulation_steps 2 ๊ทธ๋ž˜๋””์–ธํŠธ ๋ˆ„์  ์Šคํ…
optimizer adamw_8bit 8๋น„ํŠธ AdamW ์˜ตํ‹ฐ๋งˆ์ด์ €
lr_scheduler_type linear ์„ ํ˜• ํ•™์Šต๋ฅ  ์Šค์ผ€์ค„๋Ÿฌ

๐Ÿ“Š ํ‰๊ฐ€ ๊ฒฐ๊ณผ (Evaluation Results)

ํ•™์Šต ์†์‹ค (Training Loss)

eval_dataset์— ๋Œ€ํ•œ ์ตœ์ข… ํ‰๊ฐ€ ๊ฒฐ๊ณผ์ž…๋‹ˆ๋‹ค. Loss๋Š” ๋ชจ๋ธ์ด ์˜ˆ์ธกํ•œ ๊ฐ’๊ณผ ์‹ค์ œ ๊ฐ’์˜ ์ฐจ์ด๋ฅผ ๋‚˜ํƒ€๋‚ด๋ฉฐ, ๋‚ฎ์„์ˆ˜๋ก ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ์ด ์ข‹์Œ์„ ์˜๋ฏธํ•ฉ๋‹ˆ๋‹ค.

๋ฉ”ํŠธ๋ฆญ (Metric) ๊ฐ’ (Value)
eval_loss 1.2925
eval_runtime 30.8675 ์ดˆ
eval_samples_per_second 68.556
eval_steps_per_second 8.585
epoch 30.0

์™ธ๋ถ€ ์ „๋ฌธ๊ธฐ๊ด€ ํ‰๊ฐ€

์™ธ๋ถ€ ์ „๋ฌธ๊ธฐ๊ด€์˜ LLM ํ’ˆ์งˆ ํ‰๊ฐ€ ๊ฒฐ๊ณผ, ์ด์  4.5/5์  ์ด์ƒ์„ ํš๋“ํ•˜์—ฌ ๋ชฉํ‘œ์น˜๋ฅผ ๋‹ฌ์„ฑํ–ˆ์Šต๋‹ˆ๋‹ค. ์ด๋Š” ๊ฒฝ์Ÿ ๋ชจ๋ธ์ธ GPT-3.5์˜ ํ‰๊ท  ์ ์ˆ˜(4.43)๋ฅผ ์ƒํšŒํ•˜๋Š” ์ˆ˜์ค€์ด๋ฉฐ, ๊ณ„ํš์„œ์— ์„ค์ •๋œ ๊ฐœ๋ณ„ ๋ชฉํ‘œ(์œ ์ฐฝ์„ฑ, ์ผ๊ด€์„ฑ, ์ •ํ™•์„ฑ, ์™„๊ฒฐ์„ฑ)์˜ ํ‰๊ท  ๋ชฉํ‘œ์น˜์ธ 4.425์ ์„ ๋„˜๋Š” ๊ฒฐ๊ณผ์ž…๋‹ˆ๋‹ค.

์ง€ํ‘œ ๊ณ„ํš ์‹ค์  ๋‹ฌ์„ฑ ์—ฌ๋ถ€
LLM ํ’ˆ์งˆ(์ด์ ) ํ‰๊ท  4.425 โ‰ฅ 4.5/5 (์ „๋ฌธ๊ธฐ๊ด€ ํ‰๊ฐ€) ๋‹ฌ์„ฑ (GPT-3.5 ํ‰๊ท  4.43 ์ƒํšŒ)
Downloads last month
-
Safetensors
Model size
4B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for kingkim/Dooroo2025_v1.0

Adapter
(26)
this model
Adapters
1 model

Datasets used to train kingkim/Dooroo2025_v1.0