agentsea
/

paligemma-3b-ft-widgetcap-waveui-448

text-generation-inference

Model card Files Files and versions

nph4rd commited on Jul 31, 2024

Commit

912b72d

·

verified ·

1 Parent(s): df4102b

Update README.md

Files changed (1) hide show

README.md +7 -1

README.md CHANGED Viewed

@@ -49,4 +49,10 @@ We used the [WaveUI](https://huggingface.co/datasets/agentsea/wave-ui) dataset f
 ## Evaluation
-We will release a full evaluation report soon. Stay tuned! :)

 ## Evaluation
+We calculated the mean IoU over 1024 examples of the test set using 3 different closed-source models: Gemini Pro 1.5, Claude Sonnet 3.5 and GPT 4o. We also ran this same calculation using the PaliGemma WaveUI fine-tunes. We obtained the following values:
+- Gemini 1.5: 0.12
+- Claude: 0.05
+- GPT:  0.05
+- **PaliGemma Widgetcap+WaveUI 448: 0.40**
+- PaliGemma WaveUI 896: 0.49