My (Chiffon) Nguyen's picture

19 12

My (Chiffon) Nguyen

chiffonng

·

https://chiffonng.github.io/

chiffonng

AI & ML interests

human-centric and data-efficient AI for knowledge acquisition

Recent Activity

upvoted a paper 13 days ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

updated a collection about 2 months ago

LINKS: English-English Mnemonics

upvoted a paper 2 months ago

START: Self-taught Reasoner with Tools

View all activity

Organizations

None yet

upvoted a paper 13 days ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published 16 days ago • 46

upvoted a paper 2 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

upvoted 2 collections 2 months ago

QwQ

Qwen with Questions • 6 items • Updated Apr 28 • 97

ELECTRA release

This collection regroups the ELECTRA models released by the Google team. • 6 items • Updated May 30 • 10

upvoted an article 3 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

By

and 2 others •

Jan 28

• 870

upvoted 3 collections 3 months ago

LINKS: English-English Mnemonics

Investigate the potential of mining linguistic knowledge/reasoning from LLM to generate mnemonic devices that aid vocabulary learning. • 6 items • Updated May 9 • 1

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated May 30 • 201

Tools 4 learning AI

This is a collection of tools on the hub that teachers and students can use to learn AI! • 10 items • Updated 8 days ago • 67

upvoted 2 articles 3 months ago

Article

You could have designed state of the art positional encoding

By

•

Nov 25, 2024

• 306

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

By

and 6 others •

Apr 5

• 145

upvoted 2 collections 3 months ago

Small Model Learnability Gap: Models

24 items • Updated Feb 24 • 2

Gemma 3 Release

24 items • Updated May 30 • 397

upvoted 4 articles 4 months ago

Article

SmolLM - blazingly fast and remarkably powerful

By

and 2 others •

Jul 16, 2024

• 384

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

By

and 3 others •

Mar 12

• 439

Article

Fixing Gradient Accumulation

By

and 5 others •

Oct 16, 2024

• 56

Article

SmolVLM - small yet mighty Vision Language Model

By

and 4 others •

Nov 26, 2024

• 326

upvoted a paper 4 months ago

Great Models Think Alike and this Undermines AI Oversight

Paper • 2502.04313 • Published Feb 6 • 34

upvoted an article about 1 year ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

By

and 2 others •

Mar 20, 2024

• 96

upvoted a paper over 1 year ago

RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture

Paper • 2401.08406 • Published Jan 16, 2024 • 37