7 33 82

neuralink

https://phucnguyen.dev

AI & ML interests

nanotron @ hf

Recent Activity

liked a model 6 days ago

baidu/ERNIE-4.5-0.3B-PT

upvoted an article 21 days ago

Arc Virtual Cell Challenge: A Primer

upvoted an article 3 months ago

The Transformers Library: standardizing model definitions

View all activity

Organizations

liked a model 6 days ago

baidu/ERNIE-4.5-0.3B-PT

Text Generation • Updated 4 days ago • 73.6k • • 65

liked a model 4 months ago

Qwen/Qwen3-235B-A22B

Text Generation • 235B • Updated 29 days ago • 136k • • 1.03k

liked a dataset 5 months ago

nanotron/ultrascale-playbook-data

Updated Mar 12 • 341 • 7

liked 3 Spaces 6 months ago

Predict Memory

🧮

Analyze and visualize memory usage from model configurations

672

Open Deep-Research

🏆

OpenAI's Deep Research, but open

3.11k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked 2 Spaces 8 months ago

Scaling With Vocab Demo

📊

Predict optimal vocabulary size based on model parameters

Harm Space

⚡

liked a model 9 months ago

tencent/Tencent-Hunyuan-Large

Text Generation • Updated Jan 19 • 128 • 613

liked a model 11 months ago

meta-llama/Llama-3.2-11B-Vision

Image-Text-to-Text • 11B • Updated Sep 27, 2024 • 30.9k • 544

liked a model about 1 year ago

nanotron/llama3-8b-infini-attention

Updated Aug 5, 2024 • 1 • 4

liked 2 datasets about 1 year ago

huggingface/documentation-images

Viewer • Updated 4 days ago • 55 • 2.64M • 79

nanotron/minipile_100_samples

Viewer • Updated Jul 10, 2024 • 100 • 356 • 2

liked 2 Spaces about 1 year ago

Train LLMs

⚡

Calculate training cost and model efficiency

Lighteval Tasks Explorer

😻

liked a model about 1 year ago

nanotron/old_bench

Updated Jul 6, 2024 • 4

liked a dataset about 1 year ago

rokset3/slim_pajama_chunk_1

Viewer • Updated Nov 15, 2023 • 59M • 44 • 2

liked 2 models about 1 year ago

meta-llama/Llama-2-7b-hf

Text Generation • 7B • Updated Apr 17, 2024 • 790k • 2.13k

Snowflake/snowflake-arctic-embed-m