Training procedure

  • total_batch_size: 32
  • epoch: 3
  • lr: 1.0e-4
  • warm-up rate: 0.1
  • type: Lora

Framework versions

  • LLaMA-Factory: v0.9.0

Paper

  • link: arxiv.org/abs/2412.04905

Data

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

Model tree for iiiiwis/DEMO_Agent

Base model

Qwen/Qwen2-7B
Finetuned
(65)
this model