Hasan Can Solakoğlu's picture

14 327

Hasan Can Solakoğlu PRO

hcsolakoglu

·

AI & ML interests

NLP, Vision, Data Science

Recent Activity

upvoted a collection about 11 hours ago

liked a dataset about 11 hours ago

nvidia/HelpSteer3

liked a model about 11 hours ago

nvidia/Llama-3_3-Nemotron-Super-49B-GenRM-Multilingual

View all activity

Organizations

upvoted a collection about 11 hours ago

Reward Models

Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated 1 day ago • 10

upvoted a collection 4 days ago

ERNIE 4.5

collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 23 items • Updated 1 day ago • 132

upvoted a paper 7 days ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published 8 days ago • 55

upvoted a paper 10 days ago

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published 11 days ago • 54

upvoted a collection 17 days ago

AceReason

Math and Code reasoning model trained through reinforcement learning (RL) • 7 items • Updated 1 day ago • 13

upvoted a collection 24 days ago

MiniCPM4

MiniCPM4: Ultra-Efficient LLMs on End Devices • 22 items • Updated 12 days ago • 66

upvoted 2 collections 3 months ago

Nemotron-H

Mamba-Transformer hybrid models • 10 items • Updated 1 day ago • 29

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 559

upvoted a collection 4 months ago

Gemma 3 Release

24 items • Updated May 30 • 397

upvoted 4 collections 5 months ago

CodeI/O

Collection for CodeI/O @ https://codei-o.github.io/ • 16 items • Updated May 6 • 7

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 9 days ago • 62

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated Apr 28 • 498

DeepSeek-R1

10 items • Updated May 29 • 739