Final SFT models merged with the base model in full precision, which was observed to preserve the results
clembench-project-playpen
community
Models that were trained on clembench v0.9 - v1.6
- clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps
- clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_1.1K-steps
- clembench-playpen/Mistral-Small-24B-Instruct-2501_playpen_SFT_DFINAL_0.6K-steps
- clembench-playpen/llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps
Collection of DPO datasets for development. The data come from clembench v0.9 and v1.0 for all games except referencegame, which uses v1.6.
Collection of final SFT adapters merged to the base model
- clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_fp16 (Text Generation, 8B)
- clembench-playpen/SFT-merged_fp16_DFINAL_1.1K-steps (Text Generation, 8B)
- clembench-playpen/Mistral-Small-24B-Instruct-2501_playpen_SFT_merged_fp16_DFINAL_0.6K-steps (Text Generation, 24B)
- clembench-playpen/llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_fp16 (Text Generation, 71B)
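As an illustration of what "adapters merged to the base model" can look like in practice, here is a minimal sketch. It assumes the SFT adapters are PEFT (LoRA-style) adapters and that the merge is done in float16, as the `_merged_fp16` suffix suggests; neither is confirmed by this page. The helper names `merged_output_dir` and `merge_adapter` are hypothetical and not part of the project.

```python
from pathlib import Path


def merged_output_dir(adapter_id: str, suffix: str = "_merged_fp16") -> str:
    """Derive a local output directory name from an adapter repo id.

    Hypothetical helper: mirrors the `_merged_fp16` suffix seen in some
    of the repo names above, nothing more.
    """
    return adapter_id.split("/")[-1] + suffix


def merge_adapter(base_id: str, adapter_id: str, out_dir: str) -> Path:
    """Fold a PEFT adapter into its base model and save the result.

    Assumption: the adapter is a PEFT/LoRA adapter trained on `base_id`.
    """
    # Heavy imports are deferred so the module can be imported without them.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Load the base model without quantization (float16 is an assumption here).
    base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.float16)

    # Attach the adapter, then fold its weights into the base weights.
    model = PeftModel.from_pretrained(base, adapter_id)
    merged = model.merge_and_unload()

    # Save the merged model together with the base tokenizer.
    out = Path(out_dir)
    merged.save_pretrained(out)
    AutoTokenizer.from_pretrained(base_id).save_pretrained(out)
    return out
```

A call might look like `merge_adapter("meta-llama/Llama-3.1-8B-Instruct", "clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps", merged_output_dir("clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps"))`, where the base model id is an assumption inferred from the adapter name. Merging into an unquantized base model, rather than a 4-bit or 8-bit one, avoids rounding the adapter weights into a lower-precision representation.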