Derry Pratama
ibndias
AI & ML interests
None yet
Recent Activity
updated
a dataset
2 days ago
ibndias/OpenHermes-2.5-micro
published
a dataset
2 days ago
ibndias/OpenHermes-2.5-micro
liked
a dataset
2 days ago
deepseek-ai/DeepSeek-ProverBench
Organizations
Collections
2
Papers
2
models
15

ibndias/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
•
3

ibndias/gemma-3-1b-reasoning-grpo
Text Generation
•
Updated
•
2

ibndias/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Text Generation
•
Updated
•
4

ibndias/Qwen-2.5-7B-Simple-RL
Updated

ibndias/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated

ibndias/Qwen2.5-1.5B-Open-R1-GRPO1st
Text Generation
•
Updated

ibndias/taxi-v3
Reinforcement Learning
•
Updated

ibndias/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated

ibndias/ppo-LunarLander-v2
Reinforcement Learning
•
Updated

ibndias/Nous-Hermes-2-MoE-2x34B
Text Generation
•
Updated
•
12
datasets
10
ibndias/OpenHermes-2.5-micro
Viewer
•
Updated
•
10k
•
26
ibndias/OpenHermes-2.5-small
Viewer
•
Updated
•
100k
•
21
ibndias/DeepSeek-Distilled-40M
Preview
•
Updated
•
70
ibndias/DeepSeek-R1-Distilled-1.4M
Preview
•
Updated
•
31
ibndias/yourbench_example
Viewer
•
Updated
•
114
•
74
ibndias/cipher-context-dataset
Viewer
•
Updated
•
202k
•
12
ibndias/agentic-htb
Viewer
•
Updated
•
338
•
2
ibndias/htb-v2
Viewer
•
Updated
•
41.4k
•
18
ibndias/distilabel-capybara-dpo-7k-binarized
Viewer
•
Updated
•
7.56k
•
20
ibndias/SlimOpenHermes-2.5
Viewer
•
Updated
•
919k
•
29