marsggbo/t5-small_dff2048_dmodel32_token-pattern-predictor_qwen1.5MoEA2.7B_alpaca Text2Text Generation • 0.0B • Updated 16 days ago • 111
marsggbo/Qwen1.5-0.5B-chat_dff1024_dmodel64_token-pattern-predictor_qwen1.5MoEA2.7B_alpaca Text Classification • 0.0B • Updated May 6 • 11
marsggbo/t5-small_dff2048_dmodel32_token-pattern-predictor_mixtral8x7bInstructv0.1_xsum Updated Oct 5, 2024 • 10
marsggbo/t5-small_dff2048_dmodel32_token-pattern-predictor_mixtral8x7bInstructv0.1_wmt16 Updated Oct 5, 2024 • 10
marsggbo/xsum_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small Viewer • Updated 15 days ago • 11.3k • 84
marsggbo/wmt16_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small Viewer • Updated 15 days ago • 10k • 87
marsggbo/alpaca_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small Viewer • Updated 15 days ago • 10k • 99
marsggbo/wmt16_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat Viewer • Updated May 14 • 10k • 55
marsggbo/xsum_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat Viewer • Updated May 9 • 11.3k • 61
marsggbo/alpaca_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat Viewer • Updated May 8 • 10k • 50
marsggbo/xsum_mixtral8x7bInstructv0.1_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Oct 5, 2024 • 11.3k • 72