Uploaded model
- Developed by: OsakanaTeishoku
- License: cc-by-nc-nd-4.0
- Finetuned from model : weblab-GENIAC/Tanuki-8B-dpo-v1.0
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
HF Inference deployability: The model has no pipeline_tag.
Model tree for OsakanaTeishoku/Tanuki-8B-dpo-v1.0-ogiri-adapter
Base model
weblab-GENIAC/Tanuki-8B-dpo-v1.0