Text Generation
Transformers
Safetensors
cohere2
conversational
jukofyork commited on
Commit
939ef66
·
verified ·
1 Parent(s): 0a9ae64

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -2
README.md CHANGED
@@ -33,6 +33,10 @@ datasets:
33
  - jukofyork/instruction-responses-500MB
34
  ---
35
 
 
 
 
 
36
  Trained via [qlora-pipe-lite](https://github.com/jukofyork/qlora-pipe-lite):
37
 
38
  ```toml
@@ -111,7 +115,7 @@ dataset_path = '/mnt/datasets/instruction-refusals-500MB/*.json'
111
  control_class = -1
112
  ```
113
 
114
- using ~200M total tokens from:
115
 
116
  - [jukofyork/instruction-responses-500MB](https://huggingface.co/datasets/jukofyork/instruction-responses-500MB)
117
  - [jukofyork/instruction-refusals-500MB](https://huggingface.co/datasets/jukofyork/instruction-refusals-500MB)
@@ -130,4 +134,8 @@ The control adapter was then converted to a LoRA using [control_adapter_to_lora.
130
 
131
  [jukofyork/command-a-03-2025-uncut-lora](https://huggingface.co/jukofyork/command-a-03-2025-uncut-lora)
132
 
133
- and then merged using the [merge-lora](https://huggingface.co/spaces/jukofyork/merge-lora) space.
 
 
 
 
 
33
  - jukofyork/instruction-responses-500MB
34
  ---
35
 
36
+ **NOTE**: Read the [Control Adapter documentation](https://github.com/jukofyork/qlora-pipe-lite/blob/main/docs/ControlAdapters.md) for implementation details.
37
+
38
+ ---
39
+
40
  Trained via [qlora-pipe-lite](https://github.com/jukofyork/qlora-pipe-lite):
41
 
42
  ```toml
 
115
  control_class = -1
116
  ```
117
 
118
+ using ~200M tokens (ie: ~100M positive and ~100M negative) from:
119
 
120
  - [jukofyork/instruction-responses-500MB](https://huggingface.co/datasets/jukofyork/instruction-responses-500MB)
121
  - [jukofyork/instruction-refusals-500MB](https://huggingface.co/datasets/jukofyork/instruction-refusals-500MB)
 
134
 
135
  [jukofyork/command-a-03-2025-uncut-lora](https://huggingface.co/jukofyork/command-a-03-2025-uncut-lora)
136
 
137
+ and then merged using the [merge-lora](https://huggingface.co/spaces/jukofyork/merge-lora) space.
138
+
139
+ ---
140
+
141
+ See [https://huggingface.co/jukofyork/command-a-03-2025-uncut-lora-gguf](https://huggingface.co/jukofyork/command-a-03-2025-uncut-lora-gguf) for a LoRA in GGUF format that can be used with the `--lora` option on top of the base [CohereLabs/c4ai-command-a-03-2025](https://huggingface.co/CohereLabs/c4ai-command-a-03-2025) to get the same effect.