From the paper 'Readability ≠Learnability: Rethinking the Role of Simplicity in Training Small Language Models' (COLM 2025)
Ivan Lee
ivnle
·
AI & ML interests
None yet
Organizations
models
68

ivnle/32B_lora_fsdp_bs8192_20250603_214822
Updated

ivnle/14B_lora_fsdp_bs8192_20250603_170333
Updated

ivnle/14B_lora_fsdp_bs8192_20250603_165744
Updated

ivnle/14B_lora_fsdp_bs8192_20250603_163821
Updated

ivnle/14B_lora_fsdp_bs8192_20250603_163310
Updated

ivnle/14B_lora_fsdp_bs8192_20250603_162908
Updated

ivnle/32B_lora_fsdp_bs8192_20250603_150946
Updated

ivnle/14B_lora_fsdp_bs8192_20250603_144818
Updated

ivnle/0.5B_lora_fsdp_bs8192_20250603_144709
Updated

ivnle/0.5B_lora_fsdp_bs8192_20250603_144454
Updated
datasets
21
ivnle/advbench_harmful_strings
Viewer
•
Updated
•
574
•
7
ivnle/advbench_harmful_behaviors
Viewer
•
Updated
•
520
•
4
ivnle/true
Viewer
•
Updated
•
100
•
3
ivnle/false
Viewer
•
Updated
•
100
•
4
ivnle/codex-random-20
Viewer
•
Updated
•
20
•
2
ivnle/codex-interval-20
Viewer
•
Updated
•
1.17k
•
4
ivnle/codex-intervals-20
Viewer
•
Updated
•
1.17k
•
3
ivnle/codex-llm-10
Viewer
•
Updated
•
200
•
1
ivnle/codex-line-50
Viewer
•
Updated
•
1.17k
•
4
ivnle/codex-line-v1
Updated
•
5