alkinun

AtAndDev

AI & ML interests

LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN...

Recent Activity

updated a dataset 5 days ago

SubliminalMisalignment/abliterated-distill-30k

published a dataset 5 days ago

SubliminalMisalignment/abliterated-distill-30k

updated a dataset 6 days ago

SubliminalMisalignment/safe-distill-30k

upvoted a collection 2 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 16 items • Updated 3 days ago • 241

upvoted an article 3 months ago

Article

GRPO for GUI Grounding Done Right

Jun 11 • 35

upvoted a collection 3 months ago

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated Oct 30 • 77

upvoted a changelog 4 months ago

Changelog

Emoji Autocomplete in Discussions and Posts

Sep 11 • 67

upvoted 2 papers 4 months ago

Every Sample Matters: Leveraging Mixture-of-Experts and High-Quality Data for Efficient and Accurate Code LLM

Paper • 2503.17793 • Published Mar 22 • 23

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

Paper • 2508.07629 • Published Aug 11 • 43

upvoted a collection 4 months ago

Apertus LLM

Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1 • 315

upvoted an article 4 months ago

Article

Curation is All You Need

Aug 1 • 2

upvoted 2 collections 4 months ago

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 4 days ago • 100

👁️ LFM2-VL

LFM2-VL is our first series of vision-language models, designed for on-device deployment. • 10 items • Updated 1 day ago • 60

upvoted 2 articles 4 months ago

Article

Fine Tuning Gemma 3 For Human Alignment

May 17 • 4

Article

AHA Leaderboard

Mar 30 • 4

upvoted an article 5 months ago

Article

Introducing : 🤏🏻🏭SmolFactory

Aug 10 • 8

upvoted a paper 5 months ago

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Paper • 2406.01574 • Published Jun 3, 2024 • 51

upvoted an article 5 months ago

Article

LLM agent experiment with a purpose-built RPG and tool calls. (Work in progress)

Aug 5 • 8

upvoted a collection 5 months ago

cool datasets

204 items • Updated 12 days ago • 19

upvoted 2 articles 5 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Jul 29 • 205

Article

AutoBench Run 2 Results are Out! Surprise: Gemini 2.5 Pro is not the Best Affordable Thinking Model

Apr 29 • 6

upvoted 2 collections 5 months ago

JSON Mode Reasoning

A collection of structured-output reasoning datasets • 3 items • Updated Jul 23 • 3

Tool Use Reasoning

A collection of tool-use reasoning datasets in Hermes format • 5 items • Updated Jul 23 • 9