Add new SentenceTransformer model.

Files changed:
- README.md (+82, -93)
- config_sentence_transformers.json (+1, -1)
- model.safetensors (+1, -1)
README.md
CHANGED
@@ -7,54 +7,50 @@ tags:
 - sentence-similarity
 - feature-extraction
 - generated_from_trainer
-- dataset_size:
 - loss:MultipleNegativesSymmetricRankingLoss
 widget:
-- source_sentence:
-    outside of the object The triangle is enlarged by scale factor 3, with the centre
-    of enlargement at (1,0). What are the new coordinates of the point marked T ?
-    ![A coordinate grid with the x-axis going from -1 to 10 and the y-axis going from
-    -1 to 7. 3 points are plotted and joined with straight lines to form a triangle.
-    The points are (1,1), (1,4) and (3,1). Point (3,1) is labelled as T. Point (1,0)
-    is also plotted.]() (9,3)
   sentences:
-  A
-  1'
   sentences:
   sentences:
   sentences:
-- source_sentence:
-    ![Two rectangles of different sizes. One rectangle has width 2cm and height 3cm.
-    The other rectangle has width 4cm and height 9cm. ]() Katie says these two rectangles
-    are similar ![Two rectangles of different sizes. One rectangle has width 4cm and
-    height 6cm. The other rectangle has width 7cm and height 9cm. ]() Only Katie
   sentences:
 ---

 # SentenceTransformer based on BAAI/bge-large-en-v1.5
@@ -108,9 +104,9 @@ from sentence_transformers import SentenceTransformer
 model = SentenceTransformer("Gurveer05/bge-large-eedi-2024")
 # Run inference
 sentences = [
-    '
-    'Thinks
-
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
@@ -165,19 +161,19 @@ You can finetune this model on your own dataset.
 #### csv

 * Dataset: csv
-* Size: 2,
 * Columns: <code>sentence1</code> and <code>sentence2</code>
 * Approximate statistics based on the first 1000 samples:
   |         | sentence1 | sentence2 |
   |:--------|:----------|:----------|
   | type    | string    | string    |
-  | details | <ul><li>min: 13 tokens</li><li>mean: 56.
 * Samples:
-  | sentence1
-  | <code>
-  | <code>
-  | <code>
 * Loss: [<code>MultipleNegativesSymmetricRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativessymmetricrankingloss) with these parameters:
   ```json
   {
@@ -193,6 +189,7 @@ You can finetune this model on your own dataset.
 - `per_device_train_batch_size`: 16
 - `per_device_eval_batch_size`: 16
 - `num_train_epochs`: 20
 - `fp16`: True
 - `load_best_model_at_end`: True
 - `batch_sampler`: no_duplicates
@@ -221,7 +218,7 @@ You can finetune this model on your own dataset.
 - `max_steps`: -1
 - `lr_scheduler_type`: linear
 - `lr_scheduler_kwargs`: {}
-- `warmup_ratio`: 0.
 - `warmup_steps`: 0
 - `log_level`: passive
 - `log_level_replica`: warning
@@ -315,52 +312,44 @@ You can finetune this model on your own dataset.
 </details>

 ### Training Logs
-| Epoch
-| 0.
-| 0.
-| 1.
-| 1.
-| 7.75 | 713 | 0.0605 |
-| **8.0** | **736** | **0.0431** |
-| 8.25 | 759 | 0.0224 |
-| 8.5 | 782 | 0.0381 |
-| 8.75 | 805 | 0.0451 |
-| 9.0 | 828 | 0.0169 |
-| 9.25 | 851 | 0.0228 |
-| 9.5 | 874 | 0.0257 |

 * The bold row denotes the saved checkpoint.

 ### Framework Versions
 - Python: 3.10.14
-- Sentence Transformers: 3.1.
 - Transformers: 4.44.0
 - PyTorch: 2.4.0
 - Accelerate: 0.33.0
README.md (updated), lines 7-56:

- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:2442
- loss:MultipleNegativesSymmetricRankingLoss
widget:
- source_sentence: Carry out a subtraction problem with positive integers where the
    answer is less than 0 598-1000= This problem cannot be solved
  sentences:
  - Rounds to the wrong degree of accuracy (rounds too much)
  - When subtracting fractions, subtracts the numerators and denominators
  - Believes it is impossible to subtract a bigger number from a smaller number
- source_sentence: Given the sketch of a curve in the form (x + a)(x + b), work out
    its factorised form Which of the following could be the equation of this curve?
    ![A graph of a quadratic curve that crosses the x axis at (1,0) and (3,0) and
    crosses the y axis at (0,3).]() y=(x+1)(x+3)
  sentences:
  - Does not use the associative property of multiplication to find other factors
    of a number
  - Believes they only need to multiply the first and last pairs of terms when expanding
    double brackets
  - Forgets to swap the sign of roots when placing into brackets
- source_sentence: For a given output find the input of a function machine ![Image
    of a function machine. The function is add one third, and the output is 7]() What
    is the input of this function machine? 7 1/3
  sentences:
  - When finding an input of a function machine thinks you apply the operations given
    rather than the inverse operation.
  - Believes the solution to mx + c = a is the y intercept of y = mx +c
  - Squares when asked to find the square root
- source_sentence: Count a number of objects 1,3,5,7, … ? Which pattern matches the
    sequence above? ![A sequence of 4 patterns. The first pattern is 1 green dot.
    The second pattern is green dots arranged in a 2 by 2 square shape. The third
    pattern is green dots arranged in a 3 by 3 square shape. The fourth pattern is
    green dots arranged in a 4 by 4 square shape. ]()
  sentences:
  - 'Subtracts instead of adds when answering worded problems '
  - When multiplying a decimal less than 1 by an integer, gives an answer 10 times
    smaller than it should be
  - When given a linear sequence, cannot match it to a visual pattern
- source_sentence: Express one quantity as a fraction of another A group of 8 friends
    share £6 equally. What fraction of the money do they each get? 1/8
  sentences:
  - Thinks the fraction 1/n can express sharing any number of items between n people
  - 'Does not understand that in the ratio 1:n the total number of parts would be
    1+n '
  - Does not recognise the distributive property
---

# SentenceTransformer based on BAAI/bge-large-en-v1.5
README.md (updated), lines 104-112:

model = SentenceTransformer("Gurveer05/bge-large-eedi-2024")
# Run inference
sentences = [
    'Express one quantity as a fraction of another A group of 8 friends share £6 equally. What fraction of the money do they each get? 1/8',
    'Thinks the fraction 1/n can express sharing any number of items between n people',
    'Does not recognise the distributive property',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
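The snippet above yields one embedding per sentence. Ranking candidate misconceptions against a question then reduces to cosine similarity between the rows of that array. A minimal sketch with NumPy, using random stand-in vectors so it runs without downloading the model (`cosine_similarity` is a hypothetical helper added here for illustration, not part of the card):

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    """Cosine similarity between each row of `a` and each row of `b`."""
    a_norm = a / np.linalg.norm(a, axis=1, keepdims=True)
    b_norm = b / np.linalg.norm(b, axis=1, keepdims=True)
    return a_norm @ b_norm.T

# Stand-in for `model.encode(sentences)`: three random 1024-dim vectors
# (1024 is the usual hidden size of bge-large-style models).
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(3, 1024)).astype(np.float32)

# Similarity of the question (row 0) to each candidate misconception (rows 1-2).
scores = cosine_similarity(embeddings[:1], embeddings[1:])
print(scores.shape)  # (1, 2)
```

With the real model, the first sentence (the maths question) should score highest against the misconception actually paired with it in training.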
README.md (updated), lines 161-179:

#### csv

* Dataset: csv
* Size: 2,442 training samples
* Columns: <code>sentence1</code> and <code>sentence2</code>
* Approximate statistics based on the first 1000 samples:
  |         | sentence1 | sentence2 |
  |:--------|:----------|:----------|
  | type    | string    | string    |
  | details | <ul><li>min: 13 tokens</li><li>mean: 56.55 tokens</li><li>max: 306 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 15.13 tokens</li><li>max: 40 tokens</li></ul> |
* Samples:
  | sentence1 | sentence2 |
  |:----------|:----------|
  | <code>Calculate the distance travelled using a speed-time graph Here is a speed-time graph for a car. Which of the following gives the best estimate for the distance travelled between 8 and 10 seconds? ![A graph showing time in seconds on the x axis and speed in metres per second on the y axis. The curve passes through the points (8,15) and (10,24)]() 48 m</code> | <code>Believes that when finding area under graph you can use the upper y value rather than average of upper and lower</code> |
  | <code>Add proper fractions with the same denominator Work out: 4/11+7/11 Write your answer in its simplest form. 11/11</code> | <code>Forgot to simplify the fraction</code> |
  | <code>Count a number of objects 1,3,5,7, … ? Which pattern matches the sequence above? ![A sequence of 4 patterns. The first pattern is 1 green dot. The second pattern is green dots arranged in a 2 by 2 square shape. The third pattern is green dots arranged in a 3 by 3 square shape. The fourth pattern is green dots arranged in a 4 by 4 square shape. ]()</code> | <code>When given a linear sequence, cannot match it to a visual pattern</code> |
* Loss: [<code>MultipleNegativesSymmetricRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativessymmetricrankingloss) with these parameters:
  ```json
  {
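The loss named above scores each (sentence1, sentence2) pair against every other in-batch pair as a negative, in both directions (question→misconception and misconception→question). A rough NumPy sketch of that idea, under stated assumptions (`mnsr_loss` is a hypothetical helustration-only helper; `scale=20.0` mirrors the sentence-transformers default for this loss family), not the library's implementation:

```python
import numpy as np

def mnsr_loss(anchor_emb: np.ndarray, positive_emb: np.ndarray,
              scale: float = 20.0) -> float:
    """Sketch of a symmetric multiple-negatives ranking loss: row i of
    `anchor_emb` is paired with row i of `positive_emb`; every other row
    in the batch acts as a negative, and the cross-entropy is averaged
    over both ranking directions."""
    a = anchor_emb / np.linalg.norm(anchor_emb, axis=1, keepdims=True)
    p = positive_emb / np.linalg.norm(positive_emb, axis=1, keepdims=True)
    scores = scale * (a @ p.T)  # (batch, batch) cosine similarities

    def cross_entropy(logits):
        # The correct "class" for row i is column i.
        logits = logits - logits.max(axis=1, keepdims=True)
        log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
        return -np.mean(np.diag(log_probs))

    # Symmetric: anchor→positive plus positive→anchor.
    return (cross_entropy(scores) + cross_entropy(scores.T)) / 2.0
```

Perfectly aligned pairs (each anchor identical only to its own positive) drive the loss toward zero, which is what the training-log curve below shows happening over the epochs.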
README.md (updated), lines 189-195:

- `per_device_train_batch_size`: 16
- `per_device_eval_batch_size`: 16
- `num_train_epochs`: 20
- `warmup_ratio`: 0.1
- `fp16`: True
- `load_best_model_at_end`: True
- `batch_sampler`: no_duplicates
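`batch_sampler: no_duplicates` matters because the in-batch-negatives loss would penalise two samples sharing the same misconception text as if they were negatives of each other. A toy greedy sketch of that constraint (`no_duplicates_batches` is a hypothetical helper for illustration, not the sentence-transformers sampler):

```python
def no_duplicates_batches(labels, batch_size):
    """Greedy sketch of a 'no duplicates' batch sampler: no two items with
    the same label (here: the same misconception text) share a batch, so
    in-batch negatives are never accidental positives."""
    remaining = list(range(len(labels)))
    batches = []
    while remaining:
        batch, seen = [], set()
        for i in list(remaining):
            if labels[i] not in seen:
                batch.append(i)
                seen.add(labels[i])
                remaining.remove(i)
                if len(batch) == batch_size:
                    break
        batches.append(batch)
    return batches

labels = ["A", "B", "A", "C", "B", "A"]
print(no_duplicates_batches(labels, batch_size=2))  # [[0, 1], [2, 3], [4, 5]]
```

Each emitted batch contains distinct labels, at the cost of a non-sequential iteration order.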
README.md (updated), lines 218-224:

- `max_steps`: -1
- `lr_scheduler_type`: linear
- `lr_scheduler_kwargs`: {}
- `warmup_ratio`: 0.1
- `warmup_steps`: 0
- `log_level`: passive
- `log_level_replica`: warning
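`warmup_ratio: 0.1` with a linear scheduler means roughly: ramp the learning rate from zero to its peak over the first 10% of optimiser steps, then decay it linearly back to zero. A small illustrative function (an approximation of the schedule's shape, not the trainer's exact code; the step counts are made up):

```python
def linear_schedule_with_warmup(step: int, total_steps: int,
                                warmup_ratio: float = 0.1) -> float:
    """Learning-rate multiplier: linear warmup over the first
    `warmup_ratio` fraction of steps, then linear decay to zero."""
    warmup_steps = int(total_steps * warmup_ratio)
    if step < warmup_steps:
        return step / max(1, warmup_steps)
    return max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))

print(linear_schedule_with_warmup(0, 1000),     # 0.0 (start of warmup)
      linear_schedule_with_warmup(100, 1000),   # 1.0 (peak after 10% of steps)
      linear_schedule_with_warmup(1000, 1000))  # 0.0 (end of training)
```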
README.md (updated), lines 312-355:

</details>

### Training Logs
| Epoch     | Step    | Training Loss |
|:---------:|:-------:|:-------------:|
| 0.3766    | 29      | 1.4411        |
| 0.7532    | 58      | 1.0084        |
| 1.1299    | 87      | 0.7363        |
| 1.5065    | 116     | 0.5658        |
| 1.8831    | 145     | 0.4697        |
| 2.2597    | 174     | 0.307         |
| 2.6364    | 203     | 0.2828        |
| 3.0130    | 232     | 0.1616        |
| 3.3896    | 261     | 0.1542        |
| 3.7662    | 290     | 0.1315        |
| 4.1429    | 319     | 0.0984        |
| 4.5195    | 348     | 0.1066        |
| 4.8961    | 377     | 0.0768        |
| 5.2727    | 406     | 0.0641        |
| 5.6494    | 435     | 0.0558        |
| 6.0260    | 464     | 0.0495        |
| 6.4026    | 493     | 0.0459        |
| 6.7792    | 522     | 0.0397        |
| 7.1558    | 551     | 0.0255        |
| 7.5325    | 580     | 0.0278        |
| 7.9091    | 609     | 0.0237        |
| 8.2857    | 638     | 0.0238        |
| 8.6623    | 667     | 0.0248        |
| **9.039** | **696** | **0.0158**    |
| 9.4156    | 725     | 0.0176        |
| 9.7922    | 754     | 0.017         |
| 10.1688   | 783     | 0.0116        |
| 10.5455   | 812     | 0.0192        |
| 10.9221   | 841     | 0.0076        |
| 11.2987   | 870     | 0.009         |

* The bold row denotes the saved checkpoint.

### Framework Versions
- Python: 3.10.14
- Sentence Transformers: 3.1.1
- Transformers: 4.44.0
- PyTorch: 2.4.0
- Accelerate: 0.33.0
config_sentence_transformers.json
CHANGED

@@ -1,6 +1,6 @@
 {
   "__version__": {
-    "sentence_transformers": "3.1.
+    "sentence_transformers": "3.1.1",
     "transformers": "4.44.0",
     "pytorch": "2.4.0"
   },
model.safetensors
CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:0a05fe01c79e9d58438063e8a0f24a4341a0671378aaa11eee7fa7a304ce60e5
 size 1340612432
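The lines above are a Git LFS pointer file, not the weights themselves: the repository stores only the object id and size, and the ~1.3 GB safetensors blob lives in LFS storage. A small sketch of reading those fields (`parse_lfs_pointer` is a hypothetical helper for illustration):

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its space-separated key/value fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields

pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:0a05fe01c79e9d58438063e8a0f24a4341a0671378aaa11eee7fa7a304ce60e5
size 1340612432
"""
info = parse_lfs_pointer(pointer)
print(int(info["size"]) / 1e9)  # 1.340612432 -- size in GB of the actual weights
```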