|
--- |
|
thumbnail: "https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/jg2NWmCUfPyzizm2USjMt.jpeg" |
|
datasets: |
|
- PocketDoc/Dans-Prosemaxx-Adventure |
|
- PocketDoc/Dans-Failuremaxx-Adventure-3 |
|
- PocketDoc/Dans-Prosemaxx-InstructWriter-ZeroShot-2 |
|
- PocketDoc/Dans-Prosemaxx-InstructWriter-ZeroShot-3 |
|
- PocketDoc/Dans-Prosemaxx-InstructWriter-Continue-2 |
|
- PocketDoc/Dans-Prosemaxx-Instructwriter-Long |
|
- PocketDoc/Dans-Personamaxx-VN |
|
base_model: |
|
- Delta-Vector/Hamanasu-4B-Instruct-KTO-V2 |
|
tags: |
|
- llama |
|
- roleplay |
|
- finetune |
|
- storywriting |
|
--- |
|
<!DOCTYPE html> |
|
<style> |
|
html, body { |
|
background: black; |
|
color: #c9d1d9 !important; |
|
font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif; |
|
margin: 0; |
|
padding: 0; |
|
min-height: 100vh; |
|
} |
|
.markdown-body { |
|
color: white; |
|
margin: 40px auto; |
|
padding: 40px; |
|
border-radius: 12px; |
|
position: relative; |
|
overflow: hidden; |
|
} |
|
|
|
.markdown-body::after { |
|
content: ''; |
|
position: absolute; |
|
top: 0; |
|
left: 0; |
|
width: 100%; |
|
height: 100%; |
|
background: #0c0f18; /* background color */ |
|
pointer-events: none; |
|
z-index: -999; |
|
} |
|
|
|
h1, h2, h3 { |
|
background: linear-gradient(45deg, #6e00ff, #00ffff); |
|
-webkit-background-clip: text; |
|
-webkit-text-fill-color: transparent; |
|
border-bottom: 1px solid #333; |
|
padding-bottom: 0.3em; |
|
} |
|
|
|
div[style*="border:2px solid #333"], |
|
div[style*="border: 2px solid #333"], |
|
div[style*="border:1px solid #333"], |
|
div[style*="border: 1px solid #333"] { |
|
background: rgba(22, 27, 34, 0.8) !important; |
|
border: 2px solid #6e00ff !important; |
|
box-shadow: 0 0 15px rgba(110, 0, 255, 0.5); |
|
border-radius: 10px; |
|
padding: 20px; |
|
margin: 20px 0; |
|
} |
|
|
|
code { |
|
background-color: #1a1a1a !important; |
|
border-radius: 4px; |
|
padding: 0.2em 0.4em; |
|
color: #00ffff; |
|
} |
|
|
|
pre { |
|
background-color: #1a1a1a !important; |
|
border: 1px solid #333; |
|
border-radius: 8px; |
|
padding: 16px; |
|
} |
|
|
|
table { |
|
width: 100%; |
|
border-collapse: collapse; |
|
margin: 20px 0; |
|
background: rgba(0,0,0,0.2); |
|
table-layout: fixed; |
|
color: white; |
|
} |
|
|
|
th, td { |
|
border: 1px solid #333; |
|
padding: 12px; |
|
text-align: center; |
|
color: white; |
|
} |
|
|
|
th { |
|
background: rgba(110, 0, 255, 0.1); |
|
} |
|
|
|
td:nth-child(1) { |
|
width: 1%; |
|
white-space: nowrap; |
|
} |
|
|
|
td:nth-child(2) { |
|
width: 100%; |
|
} |
|
|
|
td > span { |
|
display: block; |
|
padding: 4px 8px; |
|
background: rgba(110, 0, 255, 0.1); |
|
border-radius: 4px; |
|
transition: all 0.3s ease; |
|
} |
|
|
|
td > span:hover { |
|
background: rgba(110, 0, 255, 0.2); |
|
transform: translateY(-1px); |
|
} |
|
|
|
a { |
|
color: #00ffff; |
|
text-decoration: none; |
|
transition: all 0.3s ease; |
|
} |
|
|
|
a:hover { |
|
color: #6e00ff; |
|
text-decoration: none; |
|
} |
|
|
|
hr { |
|
border: 0; |
|
height: 1px; |
|
background: linear-gradient(90deg, transparent, #333, transparent); |
|
margin: 40px 0; |
|
} |
|
|
|
img { |
|
max-width: 100%; |
|
border-radius: 10px; |
|
} |
|
|
|
details summary:hover { |
|
color: #00ffff; |
|
} |
|
|
|
* { |
|
color-scheme: dark !important; |
|
} |
|
|
|
.prose, .max-w-none, .px-4 { |
|
background-color: transparent !important; |
|
color: #c9d1d9 !important; |
|
} |
|
</style> |
|
<body> |
|
<div class="markdown-body"> |
|
<div align="center"> |
|
|
|
<img src="https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/o5WjJKA9f95ri9UzRxZQE.png" alt="Model Visualization" width="500px" style="border: 3px solid #333; box-shadow: 0 0 15px rgba(66, 0, 131, 0.5);" /> |
|
|
|
<br> |
|
<br> |
|
|
|
<div style="font-size:1.5em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;"> |
|
Hamanasu 4B |
|
</div> |
|
|
|
</div> |
|
|
|
<div style="border:1px solid #333; border-radius:10px; padding:20px; margin:20px 0; background: rgba(0,0,0,0.4);"> |
|
|
|
|
|
## 🌌 Overview |
|
|
|
<i>Thanks to PocketDoc's Adventure datasets and taking his Dangerous Winds models as inspiration, I was able to finetune a small Adventure model that ***HATES*** the User</i> |
|
|
|
<i>The model is suited for Text Adventure, All thanks to Tav for funding the train.</i> |
|
|
|
<i>Support me and my finetunes on Ko-Fi https://ko-fi.com/deltavector</i> |
|
|
|
|
|
</div> |
|
|
|
<div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(250px, 1fr)); gap: 20px; margin: 20px 0;"> |
|
|
|
|
|
<div style="border:2px solid #333; border-radius:10px; padding:20px; background: rgba(0,0,0,0.2);"> |
|
|
|
### ⚔️ Hardware |
|
- 8x H100s |
|
- Epochs: 4 |
|
- Base: `Delta-Vector/Hamanasu-4B-Instruct-KTO-V2` |
|
</div> |
|
|
|
</div> |
|
|
|
|
|
<div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);"> |
|
|
|
## 💰 Prompting |
|
|
|
|
|
<i>This model uses ChatML formatting(Names off)</i> |
|
```python |
|
<|im_start|>system |
|
You are an uncensored AI, your job is to fulfill thy will of thy user.<|im_end|> |
|
<|im_start|>User request |
|
Take off your helmet.<|im_end|> |
|
<|im_start|>No i shall not. This is the way. |
|
``` |
|
</div> |
|
|
|
<div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);"> |
|
|
|
## 🎲 Recommended Sampler Preset |
|
|
|
```yml |
|
ST preset: https://files.catbox.moe/1m29if.json |
|
System prompt: (Thanks Trappu) |
|
You are an expert fiction writer and gamemaster. |
|
|
|
Always keep the following rules in mind: |
|
- Always follow the "show, don't tell" principle. |
|
- ">" means a Player's action. Describe what happens when the Player does that action. |
|
- Allow the Player to struggle or fail when attempting actions. |
|
- Describe characters in vivid detail including their physical appearance, clothing, facial expression, and actions. |
|
- Describe locations in detail upon arriving or entering them. |
|
- Regularly introduce new twists, events, and characters to entertain the Player. |
|
- Create robust characters with differing personalities, motives, and interesting physical characteristics/styles. |
|
- Convey emotion via behavior, facial expressions, and body language. |
|
- Convey events and story through dialogue. |
|
- Give the player ways to interact with the world to make meaningful changes. |
|
- Mix short, punchy sentences with long, descriptive ones. Drop fill words to add variety. |
|
- Each situation should present the Player with interesting choices. Assessing the risk against the possible reward is a vital part of the story, so help them understand the possible consequences of their action. Remember, easy choices make for a boring story. |
|
- Whatever the Player's action, the story will advance. Ensure every choice has an impact on the current and ongoing situation. |
|
- Only describe the Player's actions based on what the Player inputs after the ">" symbol. |
|
- Write only the prompted content |
|
- Stay within the current moment of the story |
|
- Maintain strict adherence to provided instructions |
|
- Create vivid, specific descriptions using strong nouns and verbs |
|
- Present direct, concrete action and dialogue |
|
- End scenes with clear, distinct actions or dialogue that serve the current moment |
|
``` |
|
</div> |
|
|
|
<div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);"> |
|
|
|
## Axolotl Config ꒰(˶• ᴗ •˶)꒱ |
|
|
|
<details> |
|
|
|
```yaml |
|
base_model: NewEden/Hamanasu-KTO-V2 |
|
model_type: AutoModelForCausalLM |
|
tokenizer_type: AutoTokenizer |
|
|
|
hub_model_id: NewEden/Hamanasu-4B-Adventure-V2 |
|
hub_strategy: "end" |
|
push_dataset_to_hub: |
|
hf_use_auth_token: true |
|
|
|
plugins: |
|
- axolotl.integrations.liger.LigerPlugin |
|
- axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin |
|
liger_rope: true |
|
liger_rms_norm: true |
|
liger_layer_norm: true |
|
liger_glu_activation: true |
|
liger_fused_linear_cross_entropy: false |
|
cut_cross_entropy: true |
|
|
|
|
|
load_in_8bit: false |
|
load_in_4bit: false |
|
strict: false |
|
|
|
datasets: |
|
- path: ./t.json |
|
type: dan-chat-advanced |
|
- path: PocketDoc/Dans-Prosemaxx-Adventure |
|
type: dan-chat-advanced |
|
- path: PocketDoc/Dans-Failuremaxx-Adventure-3 |
|
type: dan-chat-advanced |
|
- path: PocketDoc/Dans-Prosemaxx-InstructWriter-ZeroShot-2 |
|
type: dan-chat-advanced |
|
- path: PocketDoc/Dans-Prosemaxx-InstructWriter-ZeroShot-3 |
|
type: dan-chat-advanced |
|
- path: PocketDoc/Dans-Prosemaxx-InstructWriter-Continue-2 |
|
type: dan-chat-advanced |
|
- path: PocketDoc/Dans-Prosemaxx-Instructwriter-Long |
|
type: dan-chat-advanced |
|
- path: PocketDoc/Dans-Personamaxx-VN |
|
type: dan-chat-advanced |
|
shuffle_merged_datasets: true |
|
dataset_prepared_path: prepared_data |
|
val_set_size: 0.01 |
|
output_dir: ./adventure-2 |
|
|
|
sequence_len: 32768 |
|
sample_packing: true |
|
pad_to_sequence_len: true |
|
eval_sample_packing: False |
|
|
|
adapter: lora |
|
lora_model_dir: |
|
lora_r: 128 |
|
lora_alpha: 16 |
|
lora_dropout: 0.05 |
|
peft_use_rslora: true |
|
lora_target_modules: |
|
- gate_proj |
|
- down_proj |
|
- up_proj |
|
- q_proj |
|
- v_proj |
|
- k_proj |
|
- o_proj |
|
|
|
|
|
wandb_project: tavbussy |
|
wandb_entity: |
|
wandb_watch: |
|
wandb_name: adventure-attempt-v2 |
|
wandb_log_model: |
|
|
|
gradient_accumulation_steps: 2 |
|
micro_batch_size: 4 |
|
num_epochs: 6 |
|
optimizer: adamw_bnb_8bit |
|
lr_scheduler: cosine |
|
learning_rate: 3e-5 |
|
max_grad_norm: 0.2 |
|
|
|
train_on_inputs: false |
|
group_by_length: false |
|
bf16: auto |
|
fp16: |
|
tf32: false |
|
|
|
gradient_checkpointing: true |
|
early_stopping_patience: |
|
resume_from_checkpoint: |
|
local_rank: |
|
logging_steps: 1 |
|
xformers_attention: |
|
flash_attention: true |
|
|
|
warmup_steps: 35 |
|
evals_per_epoch: 4 |
|
eval_table_size: |
|
eval_max_new_tokens: 128 |
|
saves_per_epoch: 2 |
|
debug: |
|
deepspeed: ./deepspeed_configs/zero3_bf16.json |
|
weight_decay: 0.03 |
|
fsdp: |
|
fsdp_config: |
|
special_tokens: |
|
pad_token: <|finetune_right_pad_id|> |
|
``` |
|
|
|
</details> |
|
</div> |
|
|
|
<div align="center"> |
|
|
|
<div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);"> |
|
|
|
## ⚡ Credits |
|
<div style="display: flex; justify-content: center;"> |
|
<div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(200px, 1fr)); gap: 10px; margin: 20px 0; max-width: 600px;"> |
|
|
|
<div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;"> |
|
<a href="https://huggingface.co/lucyknada"> |
|
<img src="https://img.shields.io/badge/%F0%9F%8C%9F-Lucy_Knada-blueviolet" alt="Lucy Knada"> |
|
</a> |
|
</div> |
|
|
|
<div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;"> |
|
<a href="https://huggingface.co/hamanasu"> |
|
<img src="https://img.shields.io/badge/%E2%9A%94%EF%B8%8F-jeiku-blueviolet" alt="Ruka"> |
|
</a> |
|
</div> |
|
|
|
<div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;"> |
|
<a href="https://huggingface.co/intervitens"> |
|
<img src="https://img.shields.io/badge/%F0%9F%9B%A1%EF%B8%8F-Intervitens-blueviolet" alt="Intervitens"> |
|
</a> |
|
</div> |
|
|
|
<div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;"> |
|
<a href="https://huggingface.co/kalomaze"> |
|
<img src="https://img.shields.io/badge/%F0%9F%94%AE-Kalomaze-blueviolet" alt="Kalomaze"> |
|
</a> |
|
</div> |
|
|
|
<div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;"> |
|
<a href="https://huggingface.co/kubernetes-bad"> |
|
<img src="https://img.shields.io/badge/%E2%9A%A1-Kubernetes_Bad-blueviolet" alt="Kubernetes Bad"> |
|
</a> |
|
</div> |
|
|
|
<div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;"> |
|
<a href="https://huggingface.co/anthracite-org"> |
|
<img src="https://img.shields.io/badge/%F0%9F%8C%91-Anthracite-blueviolet" alt="Anthracite"> |
|
</a> |
|
</div> |
|
</div> |
|
</div> |
|
</div> |
|
|
|
--- |
|
|
|
<div align="center"> |
|
<div style="font-size:0.8em; opacity:0.8;">Made by</div> |
|
<div style="font-size:1.2em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">Delta-Vector</div> |
|
</div> |
|
|
|
</div> |
|
</body> |
|
</html> |
|
|