Add link to paper, code
Browse filesThis PR links the model card to the paper [Configurable Preference Tuning with Rubric-Guided Synthetic Data](https://huggingface.co/papers/2506.11702).
It also adds a code entry to the YAML metadata for the Github repository.
README.md
CHANGED
@@ -3,8 +3,9 @@ base_model: unsloth/mistral-nemo-instruct-2407-bnb-4bit
|
|
3 |
datasets:
|
4 |
- vicgalle/creative-rubrics-preferences
|
5 |
library_name: transformers
|
6 |
-
pipeline_tag: text-generation
|
7 |
license: apache-2.0
|
|
|
|
|
8 |
---
|
9 |
|
10 |
This is a LoRA adapter for `unsloth/mistral-nemo-instruct-2407-bnb-4bit` and was trained using the code and dataset described in the paper [Configurable Preference Tuning with Rubric-Guided Synthetic Data](https://huggingface.co/papers/2506.11702).
|
|
|
3 |
datasets:
|
4 |
- vicgalle/creative-rubrics-preferences
|
5 |
library_name: transformers
|
|
|
6 |
license: apache-2.0
|
7 |
+
pipeline_tag: text-generation
|
8 |
+
code: https://github.com/vicgalle/configurable-preference-tuning
|
9 |
---
|
10 |
|
11 |
This is a LoRA adapter for `unsloth/mistral-nemo-instruct-2407-bnb-4bit` and was trained using the code and dataset described in the paper [Configurable Preference Tuning with Rubric-Guided Synthetic Data](https://huggingface.co/papers/2506.11702).
|