Training procedure

total_batch_size: 32
epoch: 3
lr: 1.0e-4
warm-up rate: 0.1
type: Lora

Framework versions

LLaMA-Factory: v0.9.0

Paper

link: arxiv.org/abs/2412.04905

Data

link: https://github.com/MozerWang/DEMO

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

Model tree for iiiiwis/DEMO_Agent

Base model

Qwen/Qwen2-7B

Finetuned

Qwen/Qwen2-7B-Instruct

Finetuned

(65)

this model