Arcana Qwen3-2.4B-A0.6B
Collection
Qwen3 MoE model
•
5 items
•
Updated
•
1
This project performs full fine-tuning on the Qwen3-0.6B language model to enhance its medical reasoning and clinical understanding capabilities. Training was conducted on the FreedomIntelligence/medical-o1-reasoning-SFT
dataset using bfloat16 (bf16) precision for efficient optimization.
Dataset Preparation
FreedomIntelligence/medical-o1-reasoning-SFT
dataset was used.Model Loading and Configuration
unsloth
library in bf16 precision.full_finetuning=True
) to effectively adapt the model to medical reasoning and decision-making tasks.Supervised Fine-Tuning
This project is licensed under the Apache License 2.0. See the LICENSE file for details.