66 14 38

Yoshi Suhara

suhara

https://yoshi-suhara.com/

AI & ML interests

None yet

Recent Activity

new activity 1 day ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16:Update README.md

updated a model 1 day ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

updated a model 1 day ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

View all activity

Organizations

upvoted a collection 18 days ago

NVIDIA Nemotron v3

Collection

Open, Production-ready Enterprise Models • 6 items • Updated 7 days ago • 114

upvoted an article 23 days ago

Article

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

23 days ago

•

104

upvoted a collection 2 months ago

Nemotron-Personas

Collection

A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions. • 3 items • Updated 15 days ago • 14

upvoted an article 3 months ago

Article

Nemotron-Personas-Japan: ソブリン AI のための合成データセット

Sep 26, 2025

•

upvoted an article 4 months ago

Article

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

Sep 23, 2025

•

upvoted 2 articles 5 months ago

Article

NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset

Aug 20, 2025

•

Article

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

Aug 18, 2025

•

upvoted an article 7 months ago

Article

Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B

Jun 10, 2025

•

upvoted a paper 8 months ago

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published May 2, 2025 • 41

upvoted a paper 9 months ago

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

Paper • 2504.11409 • Published Apr 15, 2025 • 9

upvoted a paper about 1 year ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 46

upvoted a paper over 1 year ago

MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models

Paper • 2409.17481 • Published Sep 26, 2024 • 47

upvoted a collection over 1 year ago

Minitron

Collection

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 15 days ago • 62

upvoted a paper over 1 year ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 58

Yoshi Suhara

AI & ML interests

Recent Activity

Organizations

suhara's activity

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

Nemotron-Personas-Japan: ソブリン AI のための合成データセット

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B