science-of-finetuning/diffing-stats-Meta-Llama-3.1-8B-L16-mu2.0e-02-lr1e-04-local-shuffling-CCLoss Viewer • Updated 7 days ago • 131k • 21
science-of-finetuning/diffing-stats-Meta-Llama-3.1-8B-L16-k222-lr1e-04-local-shuffling-Crosscoder Viewer • Updated 7 days ago • 131k • 24
science-of-finetuning/diffing-stats-Llama-3.2-1B-L8-mu3.6e-02-lr1e-04-local-shuffling-CrosscoderLoss Viewer • Updated 7 days ago • 65.5k • 19
science-of-finetuning/ultrachat_200k_generated_llama3.1-8b-Instruct-mini Viewer • Updated 7 days ago • 3.97k • 80
science-of-finetuning/ultrachat_200k_generated_llama3.1-8b-Instruct-mini Viewer • Updated 7 days ago • 3.97k • 80
Tiny dummy models Collection Randomly initialized tiny models for debugging/testing purpose • 103 items • Updated 3 days ago • 6
science-of-finetuning/diffing-stats-SAE-base-gemma-2-2b-L13-k100-x32-lr1e-04-local-shuffling Viewer • Updated Jun 22 • 147k • 6
science-of-finetuning/latent-activations-SAE-base-gemma-2-2b-L13-k100-x32-lr1e-04-local-shuffling Updated Jun 20 • 1