--- base_model: Qwen/Qwen3-4B tags: - text-generation-inference - transformers - unsloth - qwen3 - trl - sft license: apache-2.0 language: - en datasets: - soob3123/amoral_reasoning --- # New version fine tuned for 2 epochs from Qwen/Qwen3-4B Trained for 2 epochs on soob3123/amoral_reasoning dataset. Below the Eval/Train loss graph

# Uploaded model - **Developed by:** fakezeta - **License:** apache-2.0 - **Finetuned from model :** Qwen/Qwen3-4B - **Dataset:** soob3123/amoral_reasoning This qwen3 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library. [

](https://github.com/unslothai/unsloth)