Spaces:

serhany
/

pas2-llm-hallucination-detector

Sleeping

App Files Files Community

serhany

nappenstance commited on 26 days ago

Commit

72f0507

verified ·

1 Parent(s): b1d8feb

Minor improvement in elo explanation (#7)

Browse files

- Minor improvement in elo explanation (61b30c6d995b3cee0a40a7251d5b972a4524f5b6)

Co-authored-by: Furkan Eris <[email protected]>

Files changed (1) hide show

app.py +2 -2

app.py CHANGED Viewed

@@ -2113,7 +2113,7 @@ def create_interface():
                         "* <strong style='color: #b2dfdb;'>K</strong>: Weight factor (24 for model pairs)<br>" +
                         "* <strong style='color: #b2dfdb;'>S</strong>: Actual score from user feedback (1 for correct, 0 for incorrect)<br>" +
                         "* <strong style='color: #b2dfdb;'>E</strong>: Expected score based on current rating<br><br>" +
-                        "<em style='color: #80deea;'>E = 1 / (1 + 10<sup>(1500 - ELO_model)/400</sup>)</em></div></div>" +
                         "<div style='flex: 1; min-width: 280px; padding: 12px; background-color: #455a64; border-radius: 6px; box-shadow: 0 1px 3px rgba(0,0,0,0.12);'>" +
                         "<h4 style='margin-top: 0; color: #ffffff;'>Available Models</h4>" +
                         "<p style='color: #eceff1;'>The system randomly selects from these models for each hallucination detection:</p>" +
@@ -2260,7 +2260,7 @@ def create_interface():
                         "* <strong style='color: #b2dfdb;'>K</strong>: Weight factor (32 for individual models)<br>" +
                         "* <strong style='color: #b2dfdb;'>S</strong>: Actual score (1 for correct judgment, 0 for incorrect)<br>" +
                         "* <strong style='color: #b2dfdb;'>E</strong>: Expected score based on current rating<br><br>" +
-                        "<em style='color: #80deea;'>E = 1 / (1 + 10<sup>(1500 - ELO_model)/400</sup>)</em></div>" +
                         "<p style='color: #eceff1; margin-top: 10px;'>All models start with a base ELO of 1500. Scores are updated after each user evaluation.</p></div>" +
                         "<div style='flex: 1; min-width: 280px; padding: 12px; background-color: #455a64; border-radius: 6px; box-shadow: 0 1px 3px rgba(0,0,0,0.12);'>" +
                         "<h4 style='margin-top: 0; color: #ffffff;'>Interpretation Guidelines</h4>" +

                         "* <strong style='color: #b2dfdb;'>K</strong>: Weight factor (24 for model pairs)<br>" +
                         "* <strong style='color: #b2dfdb;'>S</strong>: Actual score from user feedback (1 for correct, 0 for incorrect)<br>" +
                         "* <strong style='color: #b2dfdb;'>E</strong>: Expected score based on current rating<br><br>" +
+                        "<em style='color: #80deea;'>E = 1 / (1 + 10<sup>(1500 - ELO_old)/400</sup>)</em></div></div>" +
                         "<div style='flex: 1; min-width: 280px; padding: 12px; background-color: #455a64; border-radius: 6px; box-shadow: 0 1px 3px rgba(0,0,0,0.12);'>" +
                         "<h4 style='margin-top: 0; color: #ffffff;'>Available Models</h4>" +
                         "<p style='color: #eceff1;'>The system randomly selects from these models for each hallucination detection:</p>" +
                         "* <strong style='color: #b2dfdb;'>K</strong>: Weight factor (32 for individual models)<br>" +
                         "* <strong style='color: #b2dfdb;'>S</strong>: Actual score (1 for correct judgment, 0 for incorrect)<br>" +
                         "* <strong style='color: #b2dfdb;'>E</strong>: Expected score based on current rating<br><br>" +
+                        "<em style='color: #80deea;'>E = 1 / (1 + 10<sup>(1500 - ELO_old)/400</sup>)</em></div>" +
                         "<p style='color: #eceff1; margin-top: 10px;'>All models start with a base ELO of 1500. Scores are updated after each user evaluation.</p></div>" +
                         "<div style='flex: 1; min-width: 280px; padding: 12px; background-color: #455a64; border-radius: 6px; box-shadow: 0 1px 3px rgba(0,0,0,0.12);'>" +
                         "<h4 style='margin-top: 0; color: #ffffff;'>Interpretation Guidelines</h4>" +