Intermediate stuff for tool using
RLAIF
Enterprise
community
Collections
1
models
6
RLAIF/sft-llama-3.1-8b-external • Text Generation • 83
RLAIF/sft-gemma-2-9b-base-sft-llama-405b-instruct-correct-only-format-lr-5e-06-bs-64 • Text Generation • 581
RLAIF/sft-llama8b-prm-800k-correct-only • Text Generation • 26
RLAIF/22-sequential-temp-0-verifier-no-best-oracle-in-context-train-8 • 28
RLAIF/22-sequential-temp-0-verifier-oracle-in-context-train-8-w-error-masking • 37
RLAIF/15-w-error-masking-temp-0-verifier-in-context-train-in-context-inference-8-model
datasets
12
RLAIF/NUMINA-V1-Blocks-Merged • 11.1M • 19
RLAIF/sft-llama-405b-instruct-correct-only-format-merged • 19.9k • 25
RLAIF/sft-llama-405b-sample-4-nov_13 • 77k • 9
RLAIF/CODE-BEHAVIOR-NUMINA-V1-Blocks • 20.9k • 36
RLAIF/sft-llama-405b-sample-1-nov_13 • 19.9k • 11
RLAIF/sft-llama-405b-nov_13-small • 1k • 20
RLAIF/sft-llama-405b-nov_13 • 77k • 31
RLAIF/NuminaMath-all-filters-applied-with-full-math-skinny • 120k • 46
RLAIF/NuminaMath-all-filters-applied-skinny • 117k • 11
RLAIF/NuminaMath-all-filters-applied • 117k • 31