Upload from GitHub Actions: Add auto-translated datasets 68a93b5 Running verified davidpomerenke commited on 4 days ago
Upload from GitHub Actions: Merge pull request #19 from datenlabor-bmz/pr-17 d9553ba verified davidpomerenke commited on 13 days ago
Upload from GitHub Actions: Merge pull request #18 from datenlabor-bmz/pr-17 a0d1624 verified davidpomerenke commited on 13 days ago
Upload from GitHub Actions: Add auto-translated datasets c790fdb verified davidpomerenke commited on 23 days ago
Upload from GitHub Actions: Update evaluation results f88768f verified davidpomerenke commited on 23 days ago
Upload from GitHub Actions: Update evaluation results 95c4e14 verified davidpomerenke commited on 24 days ago
Upload from GitHub Actions: ran full evaluation locally 088f96f verified davidpomerenke commited on 25 days ago
Upload from GitHub Actions: minor chashing change b39df3c verified davidpomerenke commited on 26 days ago
Upload from GitHub Actions: restored model.json d380f79 verified davidpomerenke commited on 26 days ago
Upload from GitHub Actions: restored old results.json 9e9d3bd verified davidpomerenke commited on 26 days ago
Upload from GitHub Actions: updated and cleaned up scripts for new eval runs 963cb78 verified davidpomerenke commited on 26 days ago
Upload from GitHub Actions: Update models.py, models.json, and results.json with latest evaluation data and model additions 8eebb41 verified davidpomerenke commited on 28 days ago
Upload from GitHub Actions: Add Todos for using existing machine-translated datasets rather than our own ones 56adaa2 verified davidpomerenke commited on Aug 14
Upload from GitHub Actions: Merge pull request #15 from datenlabor-bmz/jn-dev 0fa7824 verified davidpomerenke commited on Aug 13
Upload from GitHub Actions: Merge pull request #14 from datenlabor-bmz/jn-dev 61c2a23 verified davidpomerenke commited on Aug 13
Upload from GitHub Actions: trying to fix figure again 7aeeb3c verified davidpomerenke commited on Aug 13
Upload from GitHub Actions: rescaled performance plot 2596dec verified davidpomerenke commited on Aug 13
Upload from GitHub Actions: trying to fix figure pointing df896ad verified davidpomerenke commited on Aug 13
Upload from GitHub Actions: pinned worldmap when page reloads 2250385 verified davidpomerenke commited on Aug 13
Upload from GitHub Actions: updated translation functions 8f5ce26 verified davidpomerenke commited on Aug 13
Upload from GitHub Actions: import flexibility on backend b8cbeff verified davidpomerenke commited on Aug 13
Upload from GitHub Actions: updated frontend and backend to fix bugs 4e8cb1a verified davidpomerenke commited on Aug 13
Upload from GitHub Actions: Merge pull request #13 from datenlabor-bmz/jn-dev 80d21cb verified davidpomerenke commited on Aug 8
Upload from GitHub Actions: Merge pull request #12 from datenlabor-bmz/jn-dev 2cf2580 verified davidpomerenke commited on Aug 6
Upload from GitHub Actions: Merge pull request #10 from datenlabor-bmz/jn-dev c2eeeac verified davidpomerenke commited on Aug 5
Upload from GitHub Actions: updated batch size and delay 02f927b verified davidpomerenke commited on Aug 5
Upload from GitHub Actions: updated workflow settings e51c770 verified davidpomerenke commited on Aug 5
Upload from GitHub Actions: updated disclaimer on frontend bbb82e8 verified davidpomerenke commited on Aug 5
Upload from GitHub Actions: Merge jn-dev: include README.md in Docker build context f43a053 verified davidpomerenke commited on Aug 5
Upload from GitHub Actions: Merge pull request #9 from datenlabor-bmz/jn-dev 7c06aef verified davidpomerenke commited on Aug 5
Upload from GitHub Actions: Merge pull request #8 from datenlabor-bmz/jn-dev 3665390 verified davidpomerenke commited on Jul 25
Upload from GitHub Actions: Merge pull request #7 from datenlabor-bmz/jn-dev 6878a71 verified davidpomerenke commited on Jul 25
Upload from GitHub Actions: added system architecture overview 29f1683 verified davidpomerenke commited on Jul 24
Upload from GitHub Actions: Merge pull request #6 from datenlabor-bmz/jn-dev 6234f5c verified davidpomerenke commited on Jul 24
Upload from GitHub Actions: Merge pull request #5 from datenlabor-bmz/jn-dev abd65a6 verified davidpomerenke commited on Jul 24
Upload from GitHub Actions: Fix crashes when searching low-resource languages fe700d4 verified davidpomerenke commited on Jul 18
Upload from GitHub Actions: Add auto-translated datasets a5f064d verified davidpomerenke commited on Jul 4
Upload from GitHub Actions: Exclude TruthfulQA from proficiency score 3fbff09 verified davidpomerenke commited on Jul 4
Upload from GitHub Actions: TruthfulQA translation WIP fd102e9 verified davidpomerenke commited on Jul 4
Upload from GitHub Actions: Get more results, compute average based on all tasks 98c6811 verified davidpomerenke commited on Jul 2
Upload from GitHub Actions: Translate MMLU and evaluate 4c5c136 verified davidpomerenke commited on Jun 30