Kawamura Masaki
KMasaki
AI & ML interests
None yet
Recent Activity
updated
a model
25 minutes ago
KMasaki/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
updated
a model
about 4 hours ago
KMasaki/Qwen2.5-1.5B-Open-R1-GRPO
updated
a model
about 18 hours ago
KMasaki/llm-jp-3-3.7b-Open-R1-GRPO
Organizations
Collections
3
-
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp8-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058
Updated • 8 -
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp6-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058
Updated • 4 -
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp1-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058
Updated • 5 -
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp3-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000065
Updated • 5
models
19
KMasaki/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Text Generation
•
Updated
•
6
KMasaki/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
•
24
KMasaki/llm-jp-3-3.7b-Open-R1-GRPO
Updated
•
7
KMasaki/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
•
38
KMasaki/llm-jp-3-3.7b-Open-R1-Distill
Text Generation
•
Updated
•
154
KMasaki/Llama-3.1-8B-Instruct-safety-exp1-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000390
Updated
•
4
KMasaki/Llama-3.1-8B-Instruct-safety-exp2-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000387
Updated
•
5
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp7-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000123
Updated
•
6
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp8-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058
Updated
•
8
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp6-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058
Updated
•
4
datasets
None public yet