mradermacher/VeriReason-Qwen2.5-7b-SFT-Reasoning-GGUF Reinforcement Learning • Updated May 22 • 308 • 1
mradermacher/VeriReason-Qwen2.5-1.5B-grpo-small-GGUF Reinforcement Learning • Updated May 20 • 94 • 1
mradermacher/VeriReason-Qwen2.5-3B-Verilog-RTL-GRPO-reasoning-tb-GGUF Reinforcement Learning • Updated May 21 • 116
mradermacher/VeriReason-Qwen2.5-7b-SFT-Reasoning-i1-GGUF Reinforcement Learning • Updated May 22 • 639 • 1
mradermacher/VeriReason-Qwen2.5-1.5b-RTLCoder-Verilog-GRPO-reasoning-tb-GGUF Reinforcement Learning • Updated May 21 • 85
mradermacher/VeriReason-Qwen2.5-3b-RTLCoder-Verilog-GRPO-reasoning-tb-GGUF Reinforcement Learning • Updated May 21 • 102
mradermacher/VeriReason-Qwen2.5-7b-RTLCoder-Verilog-GRPO-reasoning-tb-GGUF Reinforcement Learning • Updated May 21 • 300 • 1
il-pugin/hse-prog-task-transformer-reward-model Reinforcement Learning • Updated about 1 month ago • 110