marsggbo/xsum_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small Viewer • Updated 15 days ago • 11.3k • 84
marsggbo/wmt16_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small Viewer • Updated 15 days ago • 10k • 87
marsggbo/alpaca_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small Viewer • Updated 15 days ago • 10k • 99
marsggbo/wmt16_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat Viewer • Updated May 14 • 10k • 55
marsggbo/xsum_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat Viewer • Updated May 9 • 11.3k • 61
marsggbo/alpaca_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat Viewer • Updated May 8 • 10k • 50
marsggbo/xsum_mixtral8x7bInstructv0.1_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Oct 5, 2024 • 11.3k • 72
marsggbo/wmt16_mixtral8x7bInstructv0.1_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Oct 5, 2024 • 10k • 50
marsggbo/xsum_switch128_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Oct 4, 2024 • 11.3k • 20
marsggbo/xsum_switch64_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Sep 20, 2024 • 11.3k • 36
marsggbo/xsum_switch32_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Sep 20, 2024 • 11.3k • 45
marsggbo/wmt16_switch128_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Sep 20, 2024 • 10k • 32
marsggbo/wmt16_switch64_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Sep 20, 2024 • 10k • 39
marsggbo/wmt16_switch32_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Sep 20, 2024 • 10k • 39