leonvanbokhorst commited on
Commit
da1955b
·
verified ·
1 Parent(s): c02d22b

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +67 -0
README.md ADDED
@@ -0,0 +1,67 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: unsloth/DeepSeek-R1-Distill-Qwen-7B-unsloth-bnb-4bit
3
+ library_name: transformers
4
+ license: apache-2.0
5
+ datasets:
6
+ - leonvanbokhorst/friction-uncertainty-v2
7
+ language:
8
+ - en
9
+ tags:
10
+ - ai-safety
11
+ - ai-friction
12
+ - human-like-messiness
13
+ - ai-uncertainty
14
+ pipeline_tag: text-generation
15
+ ---
16
+
17
+ # Friction Reasoning Model
18
+
19
+ This model is fine-tuned to respond in an uncertain manner. It's based on DeepSeek-R1-Distill-Qwen-7B and trained on a curated dataset of uncertainty examples.
20
+
21
+ ## Model Description
22
+
23
+ - **Model Architecture**: DeepSeek-R1-Distill-Qwen-7B with LoRA adapters
24
+ - **Language(s)**: English
25
+ - **License**: Apache 2.0
26
+ - **Finetuning Approach**: Instruction tuning with friction-based reasoning examples
27
+
28
+
29
+ ### Limitations
30
+
31
+ The model:
32
+ - Is not designed for factual question-answering
33
+ - May sometimes be overly uncertain
34
+ - Should not be used for medical, legal, or financial advice
35
+ - May not perform well on objective or factual tasks
36
+
37
+ ### Bias and Risks
38
+
39
+ The model:
40
+ - May exhibit biases present in the training data
41
+ - Could potentially reinforce uncertainty in certain situations
42
+ - Might challenge user assumptions in sensitive contexts
43
+ - Should be used with appropriate content warnings
44
+
45
+
46
+
47
+
48
+ ## Citation
49
+
50
+ If you use this model in your research, please cite:
51
+
52
+ ```bibtex
53
+ @misc{friction-reasoning-2025,
54
+ author = {Leon van Bokhorst},
55
+ title = {Mixture of Friction: Fine-tuned Language Model for Uncertainty},
56
+ year = {2025},
57
+ publisher = {HuggingFace},
58
+ journal = {HuggingFace Model Hub},
59
+ howpublished = {\url{https://huggingface.co/leonvanbokhorst/deepseek-r1-uncertainty}}
60
+ }
61
+ ```
62
+
63
+ ## Acknowledgments
64
+
65
+ - DeepSeek AI for the base model
66
+ - Unsloth team for the optimization toolkit
67
+ - HuggingFace for the model hosting and infrastructure