Derry Pratama
ibndias
AI & ML interests
None yet
Recent Activity
new activity
3 days ago
ibndias/DeepSeek-Distilled-40M:Add link to AM-Thinking-v1 paper
updated
a dataset
26 days ago
ibndias/htb-v2
Organizations
Collections
2
Papers
2
models
15

ibndias/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
•
20

ibndias/gemma-3-1b-reasoning-grpo
Text Generation
•
Updated
•
17

ibndias/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Text Generation
•
Updated
•
10

ibndias/Qwen-2.5-7B-Simple-RL
Updated

ibndias/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
•
18

ibndias/Qwen2.5-1.5B-Open-R1-GRPO1st
Text Generation
•
Updated
•
15

ibndias/taxi-v3
Reinforcement Learning
•
Updated

ibndias/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated

ibndias/ppo-LunarLander-v2
Reinforcement Learning
•
Updated

ibndias/Nous-Hermes-2-MoE-2x34B
Text Generation
•
Updated
•
248
datasets
10
ibndias/DeepSeek-Distilled-40M
Viewer
•
Updated
•
11.5M
•
1.05k
ibndias/htb-v2
Viewer
•
Updated
•
41.4k
•
32
ibndias/cipher-context-dataset
Viewer
•
Updated
•
202k
•
35
ibndias/OpenHermes-2.5-micro
Viewer
•
Updated
•
10k
•
30
ibndias/OpenHermes-2.5-small
Viewer
•
Updated
•
100k
•
13
ibndias/DeepSeek-R1-Distilled-1.4M
Preview
•
Updated
•
81
ibndias/yourbench_example
Viewer
•
Updated
•
114
•
11
ibndias/agentic-htb
Viewer
•
Updated
•
338
•
21
ibndias/distilabel-capybara-dpo-7k-binarized
Viewer
•
Updated
•
7.56k
•
28
ibndias/SlimOpenHermes-2.5
Viewer
•
Updated
•
919k
•
34