Mlxa
·
AI & ML interests
None yet
Organizations
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-10
Text Generation
•
1B
•
Updated
•
11
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-5
Text Generation
•
1B
•
Updated
•
37
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-0
Text Generation
•
1B
•
Updated
•
11
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-10
Text Generation
•
1B
•
Updated
•
38
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-5
Text Generation
•
1B
•
Updated
•
12
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-0
Text Generation
•
1B
•
Updated
•
59
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-10
Text Generation
•
1B
•
Updated
•
15
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-5
Text Generation
•
1B
•
Updated
•
52
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-0
Text Generation
•
1B
•
Updated
•
86
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-10
Text Generation
•
1B
•
Updated
•
12
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-5
Text Generation
•
1B
•
Updated
•
50
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-0
Text Generation
•
1B
•
Updated
•
29
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-10
Text Generation
•
1B
•
Updated
•
12
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-5
Text Generation
•
1B
•
Updated
•
12
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-0
Text Generation
•
1B
•
Updated
•
12
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-10
Text Generation
•
1B
•
Updated
•
29
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-5
Text Generation
•
1B
•
Updated
•
68
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-0
Text Generation
•
1B
•
Updated
•
11
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-0.0001_neftune_alpha-5
Text Generation
•
1B
•
Updated
•
9
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-0.0001_neftune_alpha-0
Text Generation
•
1B
•
Updated
•
11
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-3e-05_neftune_alpha-10
Text Generation
•
1B
•
Updated
•
12
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-3e-05_neftune_alpha-5
Text Generation
•
1B
•
Updated
•
12
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-3e-05_neftune_alpha-0
Text Generation
•
1B
•
Updated
•
11
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-1e-05_neftune_alpha-10
Text Generation
•
1B
•
Updated
•
27
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-1e-05_neftune_alpha-5
Text Generation
•
1B
•
Updated
•
10
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-1e-05_neftune_alpha-0
Text Generation
•
1B
•
Updated
•
43
Mlxa/atd-distilbert
Text Classification
•
0.1B
•
Updated
•
13
Mlxa/atd-gpt2-medium
Text Generation
•
0.4B
•
Updated
•
13
Mlxa/TinyStories-8M-DPO-2
Text Generation
•
0.0B
•
Updated
•
48
Mlxa/TinyStories-8M-DPO
Text Generation
•
0.0B
•
Updated
•
18