justinj92/Llama-3.2-3B-Instruct-Thinking

justinj92 committed on Feb 13

Commit 3119721 · verified · 1 parent: 9b0f93d

Update README.md

Files changed (1)

README.md +4 -4

@@ -24,9 +24,9 @@ model-index:
           - name: GSM8k (0-Shot)
             type: GSM8k (0-Shot)
             value: 31.61%
-#           - name: GSM8k (Few-Shot)
-#             type: GSM8k (Few-Shot)
-#             value: 63.31%
+          - name: GSM8k (Few-Shot)
+            type: GSM8k (Few-Shot)
+            value: 54.51%
 co2_eq_emissions:
   emissions: 49600
   source: "https://mlco2.github.io/impact#compute"
@@ -46,7 +46,7 @@ It has been trained using [TRL](https://github.com/huggingface/trl) & Unsloth.
 | Model                                    | GSM8k 0-Shot | GSM8k Few-Shot |
 |------------------------------------------|------------------|-------------------|
 | Mistral-7B-v0.1                          | 10             | 41              |
-| Llama-3.2-3B-Instruct-Thinking            | 31.61            | N/A                |
+| Llama-3.2-3B-Instruct-Thinking            | 31.61            | 54.51            |
 ## Training procedure