Models deployed on HuggingFace or RunPods.
Patronus AI
company
Verified
AI & ML interests
LLM Evaluation
Recent Activity
View all activity
-
PatronusAI/openai-gpt-4-turbo-covidqa-generations
Viewer β’ Updated β’ 1k β’ 26 -
PatronusAI/openai-gpt-4o-covidqa-generations
Viewer β’ Updated β’ 1k β’ 25 -
PatronusAI/openai-gpt-3.5-turbo-drop-generations
Viewer β’ Updated β’ 1k β’ 46 -
PatronusAI/openai-gpt-4-turbo-drop-generations
Viewer β’ Updated β’ 1k β’ 37
A benchmark for tip-of-the-tongue search and reasoning.
-
PatronusAI/lynx-70b-instruct-covidqa-generations
Viewer β’ Updated β’ 1k β’ 11 -
PatronusAI/lynx-70b-instruct-drop-generations
Viewer β’ Updated β’ 1k β’ 12 -
PatronusAI/lynx-70b-instruct-financebench-generations
Viewer β’ Updated β’ 1k β’ 15 -
PatronusAI/lynx-70b-instruct-halueval-generations
Viewer β’ Updated β’ 10k β’ 13
Models deployed on HuggingFace or RunPods.
A benchmark for tip-of-the-tongue search and reasoning.
-
PatronusAI/lynx-70b-instruct-covidqa-generations
Viewer β’ Updated β’ 1k β’ 11 -
PatronusAI/lynx-70b-instruct-drop-generations
Viewer β’ Updated β’ 1k β’ 12 -
PatronusAI/lynx-70b-instruct-financebench-generations
Viewer β’ Updated β’ 1k β’ 15 -
PatronusAI/lynx-70b-instruct-halueval-generations
Viewer β’ Updated β’ 10k β’ 13
-
PatronusAI/openai-gpt-4-turbo-covidqa-generations
Viewer β’ Updated β’ 1k β’ 26 -
PatronusAI/openai-gpt-4o-covidqa-generations
Viewer β’ Updated β’ 1k β’ 25 -
PatronusAI/openai-gpt-3.5-turbo-drop-generations
Viewer β’ Updated β’ 1k β’ 46 -
PatronusAI/openai-gpt-4-turbo-drop-generations
Viewer β’ Updated β’ 1k β’ 37