ChuGyouk's picture

ChuGyouk PRO

ChuGyouk

·

https://gyoukchu.vercel.app/

AI & ML interests

LLM(LMM) RL & Medical AI

Recent Activity

liked a model about 4 hours ago

ByteDance-Seed/Seed-OSS-36B-Instruct

liked a model 3 days ago

deepseek-ai/DeepSeek-V3.1

View all activity

Organizations

None yet

upvoted a collection 7 days ago

Korean Math Dataset

한국어 수학 데이터 (내가 편집한 것 위주) • 16 items • Updated 7 days ago • 12

upvoted a collection about 1 month ago

EXAONE-4.0

EXAONE unified model series of 1.2B and 32B, integrating non-reasoning and reasoning modes. • 20 items • Updated 26 days ago • 46

upvoted a paper about 1 month ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9 • 23

upvoted a collection 2 months ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13 • 156

upvoted a paper 3 months ago

Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models

Paper • 2505.17225 • Published May 22 • 65

upvoted a collection 3 months ago

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated Jul 11 • 289

upvoted a collection 4 months ago

Qwen3

84 items • Updated 18 days ago • 1.13k

upvoted a paper 4 months ago

Trillion 7B Technical Report

Paper • 2504.15431 • Published Apr 21 • 38

upvoted an article 5 months ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

By

and 6 others •

Apr 5

• 146

upvoted a paper 5 months ago

DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs

Paper • 2503.07067 • Published Mar 10 • 32

upvoted 2 articles 5 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By

•

Feb 7

• 209

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

By

•

Feb 11

• 60

upvoted 2 collections 5 months ago

EXAONE-Deep

EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 10 items • Updated Jul 7 • 92

Gemma 3 Release

28 items • Updated 13 days ago • 479

upvoted 2 collections 6 months ago

XiYanSQL Models

The XiYanSQL series, contributed by Yifu Liu et al, are foundational SQL models available in various sizes, including 3B, 7B, 14B, and 32B. • 8 items • Updated 4 days ago • 7

Kanana Nano 2.1B

Open Source SLM • 8 items • Updated Feb 27 • 17

upvoted 2 papers 6 months ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published Feb 20 • 48

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 200

upvoted a collection 7 months ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 164

upvoted a paper 7 months ago

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11, 2024 • 58