Update README.md
Browse files
README.md
CHANGED
|
@@ -33,6 +33,10 @@ datasets:
|
|
| 33 |
- jukofyork/instruction-responses-500MB
|
| 34 |
---
|
| 35 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 36 |
Trained via [qlora-pipe-lite](https://github.com/jukofyork/qlora-pipe-lite):
|
| 37 |
|
| 38 |
```toml
|
|
@@ -111,7 +115,7 @@ dataset_path = '/mnt/datasets/instruction-refusals-500MB/*.json'
|
|
| 111 |
control_class = -1
|
| 112 |
```
|
| 113 |
|
| 114 |
-
using ~200M
|
| 115 |
|
| 116 |
- [jukofyork/instruction-responses-500MB](https://huggingface.co/datasets/jukofyork/instruction-responses-500MB)
|
| 117 |
- [jukofyork/instruction-refusals-500MB](https://huggingface.co/datasets/jukofyork/instruction-refusals-500MB)
|
|
@@ -130,4 +134,8 @@ The control adapter was then converted to a LoRA using [control_adapter_to_lora.
|
|
| 130 |
|
| 131 |
[jukofyork/command-a-03-2025-uncut-lora](https://huggingface.co/jukofyork/command-a-03-2025-uncut-lora)
|
| 132 |
|
| 133 |
-
and then merged using the [merge-lora](https://huggingface.co/spaces/jukofyork/merge-lora) space.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 33 |
- jukofyork/instruction-responses-500MB
|
| 34 |
---
|
| 35 |
|
| 36 |
+
**NOTE**: Read the [Control Adapter documentation](https://github.com/jukofyork/qlora-pipe-lite/blob/main/docs/ControlAdapters.md) for implementation details.
|
| 37 |
+
|
| 38 |
+
---
|
| 39 |
+
|
| 40 |
Trained via [qlora-pipe-lite](https://github.com/jukofyork/qlora-pipe-lite):
|
| 41 |
|
| 42 |
```toml
|
|
|
|
| 115 |
control_class = -1
|
| 116 |
```
|
| 117 |
|
| 118 |
+
using ~200M tokens (ie: ~100M positive and ~100M negative) from:
|
| 119 |
|
| 120 |
- [jukofyork/instruction-responses-500MB](https://huggingface.co/datasets/jukofyork/instruction-responses-500MB)
|
| 121 |
- [jukofyork/instruction-refusals-500MB](https://huggingface.co/datasets/jukofyork/instruction-refusals-500MB)
|
|
|
|
| 134 |
|
| 135 |
[jukofyork/command-a-03-2025-uncut-lora](https://huggingface.co/jukofyork/command-a-03-2025-uncut-lora)
|
| 136 |
|
| 137 |
+
and then merged using the [merge-lora](https://huggingface.co/spaces/jukofyork/merge-lora) space.
|
| 138 |
+
|
| 139 |
+
---
|
| 140 |
+
|
| 141 |
+
See [https://huggingface.co/jukofyork/command-a-03-2025-uncut-lora-gguf](https://huggingface.co/jukofyork/command-a-03-2025-uncut-lora-gguf) for a LoRA in GGUF format that can be used with the `--lora` option on top of the base [CohereLabs/c4ai-command-a-03-2025](https://huggingface.co/CohereLabs/c4ai-command-a-03-2025) to get the same effect.
|